Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Chinese-ELECTRA

Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)

172   1424   1424  

QA-Survey-CN

北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研...

240   1412   1412  

projects

🪐 End-to-end NLP workflows from prototype to production

466   1395   1395  

spacy-transformers

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

172   1392   1392  

vvedenie-mashinnoe-obuchenie

:memo: Подборка ресурсов по машинному обучению

332   1388   1388  

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

215   1388   1388  

nlg-eval

Evaluation code for various unsupervised automated metrics for Natural...

225   1388   1388  

nlp-tutorial

A list of NLP(Natural Language Processing) tutorials

264   1378   1378  

paperai

📄 🤖 Semantic search and workflows for medical/scientific papers

105   1377   1377  

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Expla...

100   1364   1364  

jieba-php

"結巴"中文分詞:做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chine...

257   1361   1361  

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。...

83   1347   1347  

search-index

A persistent, network resilient, full text search library for the brow...

154   1344   1344  

TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albe...

182   1341   1341  

konlpy

Python package for Korean natural language processing.

338   1337   1337  

rpaframework

Collection of open-source libraries and tools for Robotic Process Auto...

247   1332   1332  

duckling_old

Deprecated in favor of https://github.com/facebook/duckling

224   1323   1323  

tribuo

Tribuo - A Java machine learning library

178   1307   1307  

scibert

A BERT model for scientific text.

202   1305   1305  

tika-python

Tika-Python is a Python binding to the Apache Tika™ REST services allo...

229   1302   1302  

obsei

Obsei is a low code AI powered automation tool. It can be used in vari...

173   1300   1300  

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

103   1296   1296  

basaran

Basaran is an open-source alternative to the OpenAI text completion AP...

80   1295   1295  

nlp_overview

Overview of Modern Deep Learning Techniques Applied to Natural Languag...

199   1294   1294  

wink-nlp

Developer friendly Natural Language Processing ✨

61   1292   1292  

lingua-go

The most accurate natural language detection library for Go, suitable...

69   1269   1269  

gnes

GNES is Generic Neural Elastic Search, a cloud-native semantic search...

210   1267   1267  

natasha

Solves basic Russian NLP tasks, API for lower level Natasha projects

109   1262   1262  

textrank

TextRank implementation for Python 3.

259   1262   1262  

ktrain

ktrain is a Python library that makes deep learning and AI more access...

266   1261   1261  

text_gcn

Graph Convolutional Networks for Text Classification. AAAI 2019

426   1259   1259  

opennlp

Apache OpenNLP

427   1253   1253  

detext

DeText: A Deep Neural Text Understanding Framework for Ranking and Cla...

140   1248   1248  

zemberek-nlp

NLP tools for Turkish.

216   1241   1241  

one-pixel-attack-keras

Keras implementation of "One pixel attack for fooling deep neural netw...

213   1230   1230  

bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair En...

102   1217   1217  

awesome-relation-extraction

📖 A curated list of awesome resources dedicated to Relation Extractio...

136   1213   1213  

fastText_multilingual

Multilingual word vectors in 78 languages

121   1200   1200  

hmtl

🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural n...

147   1195   1195  

Repo-2017

My first Python repo with codes in Machine Learning, NLP and Deep Lear...

678   1193   1193  

transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classificati...

256   1182   1182  

nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim...

790   1175   1175  

PPLM

Plug and Play Language Model implementation. Allows to steer topic and...

205   1150   1150  

Deep-Learning-Experiments

Videos, notes and experiments to understand deep learning

767   1142   1142  

KoBERT

Korean BERT pre-trained cased (KoBERT)

346   1133   1133  

question_generation

Neural question generation using transformers

350   1131   1131  

awesome-transformer-nlp

A curated list of NLP resources focused on Transformer networks, atten...

131   1106   1106  

datumbox-framework

Datumbox is an open-source Machine Learning framework written in Java...

290   1087   1087  

nlp-library

curated collection of papers for the nlp practitioner 📖👩‍🔬

91   1074   1074  

contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine co...

134   1067   1067