Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
NAACL'2021: A Frustratingly Easy Approach for Entity and Relation Extraction https://arxiv.org/abs/2010.12812
Full text geoparsing as a Python library
The BiLSTM-CRF model implementation in Tensorflow, for sequence labeling tasks.
基于自然语言理解与机器学习的聊天机器人,支持多用户并发及自定义多轮对话
THUOCL(THU Open Chinese Lexicon)中文词库
Crawl BookCorpus
Python AI assistant 🧠
A Lite Bert For Self-Supervised Learning Language Representations
:heavy_check_mark: 微信上的定时提醒 - Cron on WeChat
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
Library for faster pinned CPU <-> GPU transfer in Pytorch
Python bindings to libpostal for fast international address parsing/normalization
Deep neural network framework for multi-label text classification
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms...
A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
A Modern C++ Data Sciences Toolkit
An AI-powered Personal Identifiable Information (PII) scanner.
Natural language detection library for Go
A curated list of resources for NLP (Natural Language Processing) for Korean
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For ac...
Automatically split your PyTorch models on multiple GPUs for training & inference
This repository contains my full work and notes on Coursera's NLP Specialization (Natural Language Processing) taught by the instructor Younes Bensoud...
Repo for Chinese Medical ChatGLM 基于中文医学知识的ChatGLM指令微调
Quickly format your notes with ChatGPT in Obsidian
Python framework for AI workflows and pipelines with chain of thought reasoning, external tools, and memory.
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典)
A Neural Framework for MT Evaluation
A fast, low-resource Natural Language Processing and Text Correction library written in Rust.
A Vietnamese natural language processing toolkit (NAACL 2018)
Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word norm...
Homer, a text analyser in Python, can help make your text more clear, simple and useful for your readers.
Examples and libraries for "Natural Language Processing in Action" book
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
LexNLP by LexPredict
Active Learning for Text Classification in Python
An open platform for artificial intelligence, chat bots, virtual agents, social media automation, and live chat automation.
Pretrained ELECTRA Model for Korean
⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
BabyAI platform. A testbed for training agents to understand and execute language commands.
Language, Knowledge, Cognition
SpaCy 中文模型 | Models for SpaCy that support Chinese
专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models,...
:black_circle: A spaCy pipeline and model for NLP on unstructured legal text.
汉语现代诗歌语料库整理,3489诗人,81.7K诗歌,15.43M字。持续扩充...
Build chatbots and conversational experiences using React