Most popular NLP repositories and open source projects

Natural language processing (NLP) is a field of computer science concerned with the interaction between computers and human language. In 1950, Alan Turing published an article that proposed a criterion of intelligence, now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

Medical_NLP

Medical NLP competitions, datasets, large models, papers. The medical NLP field...

342   1558   1558  

deepsparse

Inference runtime offering GPU-class performance on CPUs and APIs to i...

95   1556   1556  

pet

This repository contains the code for "Exploiting Cloze Questions for...

281   1553   1553  

delta

DELTA is a deep learning based natural language and speech processing...

299   1551   1551  

TigerBot

TigerBot: A multi-language multi-task LLM

150   1545   1545  

Keras-TextClassification

Chinese long-text classification, short-sentence classification, multi-label classification, two-sentence similarity (Chinese Text Cla...

398   1541   1541  

spago

Self-contained Machine Learning and Natural Language Processing librar...

81   1537   1537  

jiant

jiant is an NLP toolkit

287   1534   1534  

deepdoctection

A Repo For Document AI

66   1528   1528  

StudyBook

Study E-Book(ComputerVision DeepLearning MachineLearning Math NLP Pyth...

122   1523   1523  

bi-att-flow

Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarc...

690   1515   1515  

sense2vec

🦆 Contextually-keyed word vectors

239   1515   1515  

tensorflow-nlp

NLP and Text Generation Experiments in TensorFlow 2.x / 1.x

425   1487   1487  

nlpcda

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpc...

162   1478   1478  

nlp_xiaojiang

Natural language processing (NLP), Xiaojiang robot (retrieval-based chit-chat chatbot), BERT sentence vectors, similar...

393   1473   1473  

setfit

Efficient few-shot learning with Sentence Transformers

158   1468   1468  

BotSharp

The Open Source Chatbot Framework in .NET

336   1466   1466  

eda_nlp

Data augmentation for NLP, presented at EMNLP 2019

306   1455   1455  

nlp_paper_summaries

✍️ A carefully curated list of NLP paper summaries

248   1453   1453  

transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for...

142   1452   1452  

nlp-lang

This project is a basic package that wraps the tools commonly used in most NLP projects.

499   1442   1442  

usaddress

:us: a python library for parsing unstructured United States address s...

289   1435   1435  

Transformers-Recipe

🧠 A study guide to learn about Transformers

135   1430   1430  

scispacy

A full spaCy pipeline and models for scientific/biomedical documents.

196   1428   1428  

TextBrewer

A PyTorch-based knowledge distillation toolkit for natural language pr...

229   1428   1428  

QA-Survey-CN

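The natural language processing research team at the Beihang University Big Data Advanced Innovation Center has carried out intelligent question answering research...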

240   1412   1412  

vvedenie-mashinnoe-obuchenie

:memo: A collection of machine learning resources

332   1388   1388  

NLP-Knowledge-Graph

Research and applications of three major technologies: natural language processing, knowledge graphs, and dialogue systems.

346   1373   1373  

spacy-models

💫 Models for the spaCy Natural Language Processing (NLP) library

293   1353   1353  

entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity...

236   1350   1350  

MNBVC

MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus. ...

83   1347   1347  

search-index

A persistent, network resilient, full text search library for the brow...

154   1344   1344  

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

215   1344   1344  

nlp-tutorial

A list of NLP(Natural Language Processing) tutorials

269   1343   1343  

TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albe...

182   1341   1341  

konlpy

Python package for Korean natural language processing.

338   1337   1337  

duckling_old

Deprecated in favor of https://github.com/facebook/duckling

224   1323   1323  

bootcamp

Dealing with all unstructured data, such as reverse image search, audi...

477   1308   1308  

scibert

A BERT model for scientific text.

202   1305   1305  

awesome-ai-ml-dl

Awesome Artificial Intelligence, Machine Learning and Deep Learning as...

331   1305   1305  

TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense

179   1305   1305  

tika-python

Tika-Python is a Python binding to the Apache Tika™ REST services allo...

229   1302   1302  

Chinese-ELECTRA

Pre-trained Chinese ELECTRA

173   1298   1298  

nlp_overview

Overview of Modern Deep Learning Techniques Applied to Natural Languag...

199   1294   1294  

jieba-php

"Jieba" Chinese word segmentation: aiming to be the best PHP Chinese word segmentation component. / "Jieba" (Chine...

258   1267   1267  

spacy-transformers

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

161   1264   1264  

text_gcn

Graph Convolutional Networks for Text Classification. AAAI 2019

426   1259   1259  

gnes

GNES is Generic Neural Elastic Search, a cloud-native semantic search...

217   1258   1258  

opennlp

Apache OpenNLP

427   1253   1253  

detext

DeText: A Deep Neural Text Understanding Framework for Ranking and Cla...

140   1248   1248