Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of machine intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Mode...

443   2475   2475  

aws-machine-learning-university-accelerated-nlp

Machine Learning University: Accelerated Natural Language Processing C...

624   2414   2414  

texar

Toolkit for Machine Learning, Natural Language Processing, and Text Ge...

371   2388   2388  

turkce-yapay-zeka-kaynaklari

Deep learning and machine learning done in Turkey...

455   2374   2374  

spacy-course

👩‍🏫 Advanced NLP with spaCy: A free online course

375   2355   2355  

decaNLP

The Natural Language Decathlon: A Multitask Challenge for NLP

469   2349   2349  

CodeSearchNet

Datasets, tools, and benchmarks for representation learning of code.

404   2346   2346  

practical-machine-learning-with-python

Master the essential skills needed to recognize and solve complex real...

1657   2340   2340  

scattertext

Beautiful visualizations of how language differs among document types.

290   2305   2305  

DeepInterests

Deeply Interesting

417   2294   2294  

JioNLP

A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use...

299   2244   2244  

PyTorch-NLP

Basic Utilities for PyTorch Natural Language Processing (NLP)

253   2222   2222  

uda

Unsupervised Data Augmentation (UDA)

311   2194   2194  

lazynlp

Library to scrape and clean web pages to create massive datasets.

311   2179   2179  

pytextrank

Python implementation of TextRank algorithms ("textgraphs") for phrase...

333   2173   2173  

Awesome-Rust-MachineLearning

This repository is a list of machine learning libraries written in Rus...

119   2161   2161  

ML

A high-level machine learning and deep learning library for the PHP la...

194   2152   2152  

textacy

NLP, before and after spaCy

256   2079   2079  

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeN...

297   2045   2045  

ecco

Explain, analyze, and visualize NLP language models. Ecco creates inte...

174   2043   2043  

PyTorchNLPBook

Code and data accompanying Natural Language Processing with PyTorch pu...

812   2042   2042  

ABigSurvey

A collection of 1000+ survey papers on Natural Language Processing (NL...

246   2026   2026  

The-NLP-Pandect

A comprehensive reference for all topics related to Natural Language P...

276   1981   1981  

tensorflow-1.4-billion-password-analysis

Deep Learning model to analyze a large corpus of clear text passwords.

394   1944   1944  

sling

SLING - A natural language frame semantics parser

266   1932   1932  

NCRFpp

NCRF++, a Neural Sequence Labeling Toolkit. Easy to use for any sequence l...

447   1893   1893  

awesome-semi-supervised-learning

😎 An up-to-date & curated list of awesome semi-supervised learning pa...

229   1846   1846  

spago

Self-contained Machine Learning and Natural Language Processing librar...

88   1815   1815  

ABSA-PyTorch

Aspect-Based Sentiment Analysis, PyTorch Implementations...

503   1791   1791  

Awesome-FL

Comprehensive and timely academic information on federated learning (p...

199   1786   1786  

awesome-embedding-models

A curated list of awesome embedding models tutorials, projects and com...

252   1783   1783  

kaggle-CrowdFlower

1st Place Solution for CrowdFlower Product Search Results Relevance Co...

674   1751   1751  

WikiSQL

A large annotated semantic parsing corpus for developing natural langu...

327   1742   1742  

hunspell

The most popular spellchecking library.

222   1733   1733  

spacy-models

💫 Models for the spaCy Natural Language Processing (NLP) library

305   1724   1724  

adapter-transformers

Huggingface Transformers + Adapters = ❤️

283   1721   1721  

lightning-bolts

Toolbox of models, callbacks, and datasets for AI/ML researchers.

320   1719   1719  

bert_score

BERT score for text generation

228   1714   1714  

Senta

Baidu's open-source Sentiment Analysis System.

358   1690   1690  

transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for...

153   1688   1688  

graph4nlp

Graph4nlp is the library for the easy use of Graph Neural Networks for...

204   1687   1687  

stocksight

Stock market analyzer and predictor using Elasticsearch, Twitter, News...

404   1667   1667  

language

Shared repository for open-sourced projects from the Google AI Languag...

352   1664   1664  

Chinese-XLNet

Pre-Trained Chinese XLNet

281   1650   1650  

RL4LMs

A modular RL library to fine-tune language models to human preferences

155   1646   1646  

sense2vec

🦆 Contextually-keyed word vectors

239   1645   1645  

magnitude

A fast, efficient universal vector embedding utility package.

119   1644   1644  

DAT8

General Assembly's 2015 Data Science course in Washington, DC

1066   1610   1610  

Transformers-Recipe

🧠 A study guide to learn about Transformers

157   1604   1604  

awesome-ai-ml-dl

Awesome Artificial Intelligence, Machine Learning and Deep Learning as...

362   1566   1566