Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

nlp-tutorial

A list of NLP(Natural Language Processing) tutorials

269   1343   1343  

TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albe...

182   1341   1341  

konlpy

Python package for Korean natural language processing.

338   1337   1337  

rpaframework

Collection of open-source libraries and tools for Robotic Process Auto...

247   1332   1332  

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Expla...

100   1325   1325  

duckling_old

Deprecated in favor of https://github.com/facebook/duckling

224   1323   1323  

tribuo

Tribuo - A Java machine learning library

178   1307   1307  

scibert

A BERT model for scientific text.

202   1305   1305  

TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense

179   1305   1305  

tika-python

Tika-Python is a Python binding to the Apache Tika™ REST services allo...

229   1302   1302  

nlp_overview

Overview of Modern Deep Learning Techniques Applied to Natural Languag...

199   1294   1294  

wink-nlp

Developer friendly Natural Language Processing ✨

61   1285   1285  

gnes

GNES is Generic Neural Elastic Search, a cloud-native semantic search...

209   1267   1267  

jieba-php

"結巴"中文分詞:做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chine...

258   1267   1267  

spacy-transformers

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

161   1264   1264  

natasha

Solves basic Russian NLP tasks, API for lower level Natasha projects

109   1262   1262  

ktrain

ktrain is a Python library that makes deep learning and AI more access...

266   1261   1261  

text_gcn

Graph Convolutional Networks for Text Classification. AAAI 2019

426   1259   1259  

obsei

Obsei is a low code AI powered automation tool. It can be used in vari...

167   1256   1256  

opennlp

Apache OpenNLP

427   1253   1253  

detext

DeText: A Deep Neural Text Understanding Framework for Ranking and Cla...

140   1248   1248  

one-pixel-attack-keras

Keras implementation of "One pixel attack for fooling deep neural netw...

213   1230   1230  

DataProfiler

What's in your data? Extract schema, statistics and entities from data...

125   1228   1228  

textrank

TextRank implementation for Python 3.

262   1194   1194  

awesome-relation-extraction

📖 A curated list of awesome resources dedicated to Relation Extractio...

136   1192   1192  

Repo-2017

My first Python repo with codes in Machine Learning, NLP and Deep Lear...

678   1191   1191  

underthesea

Underthesea - Vietnamese NLP Toolkit

256   1182   1182  

transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classificati...

256   1182   1182  

hmtl

🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural n...

146   1176   1176  

fastText_multilingual

Multilingual word vectors in 78 languages

125   1173   1173  

Deep-Learning-Experiments

Videos, notes and experiments to understand deep learning

767   1142   1142  

bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair En...

93   1136   1136  

KoBERT

Korean BERT pre-trained cased (KoBERT)

346   1133   1133  

nltk_data

NLTK Data

958   1133   1133  

projects

🪐 End-to-end NLP workflows from prototype to production

450   1121   1121  

datumbox-framework

Datumbox is an open-source Machine Learning framework written in Java...

290   1087   1087  

awesome-transformer-nlp

A curated list of NLP resources focused on Transformer networks, atten...

130   1082   1082  

nlp-library

curated collection of papers for the nlp practitioner 📖👩‍🔬

91   1074   1074  

docspell

Assist in organizing your piles of documents, resulting from scanners,...

83   1074   1074  

nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim...

778   1070   1070  

contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine co...

134   1067   1067  

xmnlp

xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转...

183   1066   1066  

PPLM

Plug and Play Language Model implementation. Allows to steer topic and...

187   1064   1064  

nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

258   1064   1064  

zemberek-nlp

NLP tools for Turkish.

207   1061   1061  

PyTorchText

1st Place Solution for Zhihu Machine Learning Challenge . Implementati...

368   1059   1059  

nlp-with-ruby

Curated List: Practical Natural Language Processing done in Ruby

68   1059   1059  

learn-nlp-with-transformers

we want to create a repo to illustrate usage of transformers in chines...

221   1055   1055  

nlp

:memo: This repository recorded my NLP journey.

323   1052   1052  

languagemodels

Explore large language models on any computer with 512MB of RAM

74   1031   1031