Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of machine intelligence now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

nlp.js

An NLP library for building bots, with entity extraction, sentiment an...

620   6363   6363  

awesome-self-supervised-learning

A curated list of awesome self-supervised methods

836   6254   6254  

ML-Course-Notes

🎓 Sharing machine learning course / lecture notes.

816   6143   6143  

ERNIE

Official implementations for various pre-training models of ERNIE-fami...

1252   5898   5898  

ai-deadlines

:alarm_clock: AI conference deadline countdowns

1008   5797   5797  

AI-Job-Notes

Job-hunting guide for AI algorithm positions (covering preparation guides, coding-practice guides, referrals, AI company lists, and other materials)

655   5525   5525  

datascience

This repository is a compilation of free resources for learning Data S...

526   5104   5104  

ltp

Language Technology Platform

1048   5075   5075  

marqo

Unified embedding generation and search engine. Also available on clou...

201   4813   4813  

nlpaug

Data augmentation for NLP

466   4537   4537  

practical-pytorch

Go to https://github.com/pytorch/tutorials - this repo is deprecated a...

1115   4462   4462  

argilla

Argilla is a collaboration tool for AI engineers and domain experts to...

420   4422   4422  

autotrain-advanced

🤗 AutoTrain Advanced

559   4347   4347  

libpostal

A C library for parsing/normalizing street addresses around the world....

432   4214   4214  

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Cra...

288   4109   4109  

FLAML

A fast library for AutoML and tuning. Join our Discord: https://discor...

528   4095   4095  

Data-science

Collection of useful data science topics along with articles, videos,...

1030   4086   4086  

pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment...

1117   3962   3962  

arXivTimes

Repository to research and share machine learning articles

202   3910   3910  

MatchZoo

Facilitating the design, comparison and sharing of deep text matching...

898   3853   3853  

olivia

💁‍♀️ Your new best friend powered by an artificial neural network

355   3693   3693  

textract

extract text from any document. no muss. no fuss.

533   3546   3546  

lit

The Learning Interpretability Tool: Interactively analyze ML models to...

358   3532   3532  

OpenPrompt

An Open-Source Framework for Prompt-Learning.

390   3530   3530  

zhihu

This repo contains the source code in my personal column (https://zhua...

2161   3440   3440  

catalyst

Accelerated deep learning R&D

393   3342   3342  

spark-nlp

State of the Art Natural Language Processing

661   3321   3321  

vale

:pencil: A syntax-aware linter for prose built with speed and extensib...

121   3265   3265  

nlp-roadmap

ROADMAP(Mind Map) and KEYWORD for students those who have interest in...

520   3252   3252  

TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data aug...

415   3127   3127  

torchscale

Foundation Architecture for (M)LLMs

215   3067   3067  

prose

:book: A Golang library for text processing, including tokenization, p...

159   3000   3000  

nlp_tasks

Natural Language Processing Tasks and References

565   2998   2998  

DeepNLP-models-Pytorch

Pytorch implementations of various Deep NLP models in cs-224n(Stanford...

659   2954   2954  

MITIE

MITIE: library and tools for information extraction

538   2937   2937  

pyhanlp

Chinese word segmentation

789   2905   2905  

fastNLP

fastNLP: A Modularized and Extensible NLP Framework. Currently still i...

448   2866   2866  

thinc

🔮 A refreshing functional take on deep learning, compatible with your...

280   2840   2840  

promptsource

Toolkit for creating, sharing and using natural language prompts.

364   2810   2810  

knockknock

🚪✊ Knock Knock: Get notified when your training ends with only two ad...

235   2805   2805  

pythoncode-tutorials

The Python Code Tutorials

1978   2776   2776  

rebiber

A simple tool to update bib entries with their official information (e...

162   2773   2773  

MTBook

The book "Machine Translation: Foundations and Models" by Xiao Tong and Zhu Jingbo - Machine Translation: Foundati...

762   2762   2762  

machine-learning

Made to help machine learning beginners and those preparing a study group, this is a r...

873   2716   2716  

Top-AI-Conferences-Paper-with-Code

MLNLP: This repository is a collection of AI top conferences papers (e...

603   2615   2615  

gluon-nlp

NLP made easy

530   2559   2559  

huggingface_hub

The official Python client for the Huggingface Hub.

664   2499   2499  

ml-course

Open Machine Learning course

1151   2479   2479  

UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Mode...

443   2475   2475  

texar

Toolkit for Machine Learning, Natural Language Processing, and Text Ge...

372   2388   2388