Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

graph4nlp

Graph4nlp is the library for the easy use of Graph Neural Networks for...

198   1596   1596  

magnitude

A fast, efficient universal vector embedding utility package.

115   1592   1592  

Chinese-XLNet

Pre-Trained Chinese XLNet(中文XLNet预训练模型)

278   1575   1575  

Recognizers-Text

Microsoft.Recognizers.Text provides recognition and resolution of numb...

423   1574   1574  

Medical_NLP

Medical NLP Competition, dataset, large models, paper 医疗NLP领域...

342   1558   1558  

deepsparse

Inference runtime offering GPU-class performance on CPUs and APIs to i...

95   1556   1556  

pet

This repository contains the code for "Exploiting Cloze Questions for...

281   1553   1553  

delta

DELTA is a deep learning based natural language and speech processing...

299   1551   1551  

TigerBot

TigerBot: A multi-language multi-task LLM

150   1545   1545  

Keras-TextClassification

中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Cla...

398   1541   1541  

jiant

jiant is an nlp toolkit

287   1534   1534  

deepdoctection

A Repo For Document AI

66   1528   1528  

StudyBook

Study E-Book(ComputerVision DeepLearning MachineLearning Math NLP Pyth...

122   1523   1523  

awesome-ai-ml-dl

Awesome Artificial Intelligence, Machine Learning and Deep Learning as...

362   1520   1520  

bi-att-flow

Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarc...

690   1515   1515  

sense2vec

🦆 Contextually-keyed word vectors

239   1515   1515  

nlp-lang

这个项目是一个基本包.封装了大多数nlp项目中常用工具

498   1499   1499  

tensorflow-nlp

NLP and Text Generation Experiments in TensorFlow 2.x / 1.x

425   1487   1487  

similarity

similarity: Text similarity calculation Toolkit for Java. 文本相似度计...

333   1486   1486  

nlpcda

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpc...

162   1478   1478  

nlp_xiaojiang

自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似...

393   1473   1473  

setfit

Efficient few-shot learning with Sentence Transformers

158   1468   1468  

BotSharp

The Open Source Chatbot Framework in .NET

336   1466   1466  

eda_nlp

Data augmentation for NLP, presented at EMNLP 2019

306   1455   1455  

nlp_paper_summaries

✍️ A carefully curated list of NLP paper summaries

248   1453   1453  

transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for...

142   1452   1452  

usaddress

:us: a python library for parsing unstructured United States address s...

289   1435   1435  

Transformers-Recipe

🧠 A study guide to learn about Transformers

135   1430   1430  

scispacy

A full spaCy pipeline and models for scientific/biomedical documents.

196   1428   1428  

TextBrewer

A PyTorch-based knowledge distillation toolkit for natural language pr...

229   1428   1428  

refinery

The data scientist's open-source choice to scale, assess and maintain...

70   1417   1417  

QA-Survey-CN

北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研...

240   1412   1412  

vvedenie-mashinnoe-obuchenie

:memo: Подборка ресурсов по машинному обучению

332   1388   1388  

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

214   1384   1384  

paperai

📄 🤖 Semantic search and workflows for medical/scientific papers

105   1377   1377  

NLP-Knowledge-Graph

自然语言处理、知识图谱、对话系统三大技术研究与应用。

346   1373   1373  

awesome-document-understanding

A curated list of resources for Document Understanding (DU) topic

159   1370   1370  

spacy-models

💫 Models for the spaCy Natural Language Processing (NLP) library

293   1353   1353  

entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity...

236   1350   1350  

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。...

83   1347   1347  

search-index

A persistent, network resilient, full text search library for the brow...

154   1344   1344  

nlp-tutorial

A list of NLP(Natural Language Processing) tutorials

269   1343   1343  

TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albe...

182   1341   1341  

konlpy

Python package for Korean natural language processing.

338   1337   1337  

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Expla...

100   1325   1325  

duckling_old

Deprecated in favor of https://github.com/facebook/duckling

224   1323   1323  

bootcamp

Dealing with all unstructured data, such as reverse image search, audi...

477   1308   1308  

tribuo

Tribuo - A Java machine learning library

178   1307   1307  

scibert

A BERT model for scientific text.

202   1305   1305  

TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense

179   1305   1305