Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

refinery

The data scientist's open-source choice to scale, assess and maintain...

72   1452   1452  

DAMO-ConvAI

DAMO-ConvAI: The official repository which contains the codebase for A...

228   1450   1450  

awesome-document-understanding

A curated list of resources for Document Understanding (DU) topic

162   1449   1449  

iOS_ML

List of Machine Learning, AI, NLP solutions for iOS. The most recent v...

153   1431   1431  

practical-nlp-code

Official Repository for Code associated with 'Practical Natural Langua...

640   1402   1402  

projects

🪐 End-to-end NLP workflows from prototype to production

466   1395   1395  

spacy-transformers

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

172   1392   1392  

nlg-eval

Evaluation code for various unsupervised automated metrics for Natural...

225   1388   1388  

nlp-tutorial

A list of NLP(Natural Language Processing) tutorials

264   1378   1378  

Speech-Emotion-Analyzer

The neural network model is capable of detecting five different male/f...

436   1370   1370  

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Expla...

100   1364   1364  

jieba-php

"結巴"中文分詞:做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chine...

257   1361   1361  

awesome-text-summarization

The guide to tackle with the Text Summarization

207   1310   1310  

hazm

Persian NLP Toolkit

195   1305   1305  

pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

223   1302   1302  

obsei

Obsei is a low code AI powered automation tool. It can be used in vari...

173   1300   1300  

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

103   1296   1296  

basaran

Basaran is an open-source alternative to the OpenAI text completion AP...

80   1295   1295  

superlinked

Superlinked is a Python framework for AI Engineers building high-perfo...

96   1294   1294  

wink-nlp

Developer friendly Natural Language Processing ✨

61   1292   1292  

lingua-go

The most accurate natural language detection library for Go, suitable...

69   1269   1269  

textrank

TextRank implementation for Python 3.

259   1262   1262  

bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair En...

102   1217   1217  

awesome-relation-extraction

📖 A curated list of awesome resources dedicated to Relation Extractio...

136   1213   1213  

fastText_multilingual

Multilingual word vectors in 78 languages

121   1200   1200  

extractous

Fast and efficient unstructured data extraction. Written in Rust with...

54   1200   1200  

hmtl

🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural n...

147   1195   1195  

Repo-2017

My first Python repo with codes in Machine Learning, NLP and Deep Lear...

678   1193   1193  

natural-language-processing

Resources for "Natural Language Processing" Coursera course.

1952   1193   1193  

tidytext

Text mining using tidy tools :sparkles::page_facing_up::sparkles:

182   1190   1190  

budou

Budou is an automatic organizer tool for beautiful line breaking in CJ...

55   1178   1178  

nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim...

790   1175   1175  

PPLM

Plug and Play Language Model implementation. Allows to steer topic and...

205   1150   1150  

seqeval

A Python framework for sequence labeling evaluation(named-entity recog...

133   1148   1148  

question_generation

Neural question generation using transformers

350   1131   1131  

KnowledgeEditingPapers

Must-read Papers on Knowledge Editing for Large Language Models.

78   1128   1128  

LLM-RL-Visualized

🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/R...

123   1126   1126  

FreeML

A List of Data Science/Machine Learning Resources (Mostly Free)

517   1118   1118  

vlms-zero-to-hero

This series will take you on a journey from the fundamentals of NLP an...

101   1117   1117  

wtpsplit

Toolkit to segment text into sentences or other semantic units in a ro...

69   1115   1115  

awesome-transformer-nlp

A curated list of NLP resources focused on Transformer networks, atten...

131   1106   1106  

awesome-grounding

awesome grounding: A curated list of research papers in visual groundi...

102   1091   1091  

TextBox

TextBox 2.0 is a text generation library with pre-trained language mod...

116   1090   1090  

TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

149   1079   1079  

nlp-with-ruby

Curated List: Practical Natural Language Processing done in Ruby

68   1061   1061  

pythainlp

Thai natural language processing in Python

282   1060   1060  

RAGHub

A community-driven collection of RAG (Retrieval-Augmented Generation)...

101   1049   1049  

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.  ...

53   1044   1044  

insuranceqa-corpus-zh

:helicopter: 保险行业语料库,聊天机器人

345   1040   1040  

Treasure-of-Transformers

💁 Awesome Treasure of Transformers Models for Natural Language proces...

217   1032   1032