Most popular NLP repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article, "Computing Machinery and Intelligence," that proposed a measure of machine intelligence now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.

transfer-learning-conv-ai

🦄 State-of-the-Art Conversational AI with Transfer Learning

420   1655   1655  

news-please

news-please - an integrated web crawler and information extractor for...

379   1655   1655  

RL4LMs

A modular RL library to fine-tune language models to human preferences

155   1646   1646  

sense2vec

🦆 Contextually-keyed word vectors

239   1645   1645  

magnitude

A fast, efficient universal vector embedding utility package.

119   1644   1644  

Research

Novel deep learning research works with PaddlePaddle

804   1629   1629  

pet

This repository contains the code for "Exploiting Cloze Questions for...

281   1628   1628  

awesome_Chinese_medical_NLP

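A collection of public resources for Chinese medical NLP: terminology sets / corpora / word vectors / pre-trained models / knowledge graphs / named...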

315   1623   1623  

TextInfoExp

Natural language processing experiments (sougou dataset): TF-IDF, text classification, clustering, word vectors, sentiment...

776   1615   1615  

Chinese-XLNet

Pre-trained Chinese XLNet model

278   1575   1575  

Recognizers-Text

Microsoft.Recognizers.Text provides recognition and resolution of numb...

423   1574   1574  

Transformers-Recipe

🧠 A study guide to learn about Transformers

149   1568   1568  

usaddress

🇺🇸 A Python library for parsing unstructured United States address s...

302   1559   1559  

deepsparse

Inference runtime offering GPU-class performance on CPUs and APIs to i...

95   1556   1556  

delta

DELTA is a deep learning based natural language and speech processing...

299   1551   1551  

TigerBot

TigerBot: A multi-language multi-task LLM

150   1545   1545  

Keras-TextClassification

Chinese long-text classification, short-sentence classification, multi-label classification, and sentence-pair similarity (Chinese Text Cla...

398   1541   1541  

awesome-ai-ml-dl

Awesome Artificial Intelligence, Machine Learning and Deep Learning as...

363   1539   1539  

jiant

jiant is an NLP toolkit

287   1534   1534  

deepdoctection

A Repo For Document AI

66   1528   1528  

StudyBook

Study E-Book(ComputerVision DeepLearning MachineLearning Math NLP Pyth...

122   1523   1523  

bi-att-flow

Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarc...

690   1515   1515  

nlp-lang

This project is a basic package that wraps the tools commonly used in most NLP projects

498   1499   1499  

tensorflow-nlp

NLP and Text Generation Experiments in TensorFlow 2.x / 1.x

425   1487   1487  

similarity

similarity: Text similarity calculation toolkit for Java...

333   1486   1486  

nlpcda

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpc...

162   1478   1478  

nlp_xiaojiang

Natural language processing (NLP), Xiaojiang robot (retrieval-based chit-chat chatbot), BERT sentence vectors, similar...

393   1473   1473  

setfit

Efficient few-shot learning with Sentence Transformers

158   1468   1468  

BotSharp

The Open Source Chatbot Framework in .NET

336   1466   1466  

eda_nlp

Data augmentation for NLP, presented at EMNLP 2019

306   1455   1455  

nlp_paper_summaries

✍️ A carefully curated list of NLP paper summaries

248   1453   1453  

scispacy

A full spaCy pipeline and models for scientific/biomedical documents.

196   1428   1428  

TextBrewer

A PyTorch-based knowledge distillation toolkit for natural language pr...

229   1428   1428  

refinery

The data scientist's open-source choice to scale, assess and maintain...

70   1417   1417  

QA-Survey-CN

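The natural language processing research team at Beihang University's Big Data Advanced Innovation Center has carried out research on intelligent question answering...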

240   1412   1412  

vvedenie-mashinnoe-obuchenie

📝 A collection of resources on machine learning

332   1388   1388  

Dragonfire

The open-source virtual assistant for Ubuntu-based Linux distributions

215   1388   1388  

paperai

📄 🤖 Semantic search and workflows for medical/scientific papers

105   1377   1377  

awesome-document-understanding

A curated list of resources for Document Understanding (DU) topic

159   1370   1370  

entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity...

236   1350   1350  

MNBVC

MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus. ...

83   1347   1347  

search-index

A persistent, network resilient, full text search library for the brow...

154   1344   1344  

nlp-tutorial

A list of NLP(Natural Language Processing) tutorials

269   1343   1343  

TurboTransformers

A fast and user-friendly runtime for transformer inference (Bert, Albe...

182   1341   1341  

konlpy

Python package for Korean natural language processing.

338   1337   1337  

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Expla...

100   1325   1325  

duckling_old

Deprecated in favor of https://github.com/facebook/duckling

224   1323   1323  

bootcamp

Dealing with all unstructured data, such as reverse image search, audi...

477   1308   1308  

tribuo

Tribuo - A Java machine learning library

178   1307   1307  

scibert

A BERT model for scientific text.

202   1305   1305