Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science concerned with how computers process, analyze, and generate human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence," which proposed a measure of machine intelligence now called the Turing test. More recent techniques, such as deep learning, have produced state-of-the-art results in language modeling, parsing, and many other natural-language tasks.

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorF...

21305 forks, 106636 stars

d2l-zh

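"Dive into Deep Learning": aimed at Chinese readers, runnable, and open to discussion. The Chinese and English editions are used in more than 60 countries...

9267 forks, 44920 stars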

bert

TensorFlow code and pre-trained models for BERT

9287 forks, 34715 stars

Made-With-ML

Learn how to responsibly develop, deploy and maintain production machi...

5477 forks, 33501 stars

HanLP

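Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing...

8444 forks, 29571 stars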

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

4180 forks, 26575 stars

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science...

3324 forks, 23895 stars

NLP-progress

Repository to track the progress in Natural Language Processing (NLP),...

3556 forks, 21465 stars

d2l-en

Interactive deep learning book with multi-framework code, math, and di...

3753 forks, 18378 stars

rasa

💬 Open source machine learning framework to automate text- and voice...

4388 forks, 16652 stars

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, ea...

2243 forks, 16631 stars

Dive-into-DL-PyTorch

This project changes the MXNet implementation in the original book "Dive into Deep Learning"...

5111 forks, 16198 stars

lectures

Oxford Deep NLP 2017 course

3633 forks, 15507 stars

awesome-nlp

📖 A curated list of resources dedicated to Natural Language Proce...

2509 forks, 14910 stars

gensim

Topic Modelling for Humans

4366 forks, 14472 stars

Awesome-pytorch-list

A comprehensive list of PyTorch-related content on GitHub, such as diff...

2790 forks, 14197 stars

Ciphey

⚡ Automatically decrypt encryptions without knowing the key or cipher,...

845 forks, 13641 stars

flair

A very simple framework for state-of-the-art Natural Language Processi...

2026 forks, 12931 stars

nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

3768 forks, 12758 stars

nltk

NLTK Source

2760 forks, 12091 stars

allennlp

An open-source NLP research library, built on PyTorch.

2258 forks, 11526 stars

clip-as-service

🏄 Embed/reason/rank images and sentences with CLIP models

2016 forks, 11441 stars

deep-learning-drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Lear...

2814 forks, 11102 stars

ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

1284 forks, 10643 stars

stanford-tensorflow-tutorials

This repository contains code examples for Stanford's course: Tens...

4383 forks, 10263 stars

haystack

🔍 Haystack is an open source NLP framework to interact with your d...

1255 forks, 9360 stars

CoreNLP

Stanford CoreNLP: A Java suite of core NLP tools.

2693 forks, 9082 stars

ludwig

Data-centric declarative deep learning framework

1046 forks, 9015 stars

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and cu...

1089 forks, 8920 stars

TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech...

1119 forks, 8617 stars

pattern

Web mining module for Python, with tools for scraping, natural languag...

1599 forks, 8512 stars

languagetool

Style and Grammar Checker for 25+ Languages

1039 forks, 8370 stars

doccano

Open source annotation tool for machine learning practitioners.

1576 forks, 7970 stars

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All...

1810 forks, 7656 stars

machine_learning_examples

A collection of machine learning examples and tutorials.

6111 forks, 7538 stars

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Producti...

610 forks, 7207 stars

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

904 forks, 6863 stars

models

Officially maintained, supported by PaddlePaddle, including CV, NLP, S...

2963 forks, 6800 stars

stanza

Official Stanford NLP Python Library for Many Human Languages

859 forks, 6697 stars

WantWords

An open-source online reverse dictionary.

587 forks, 6654 stars

mycroft-core

Mycroft Core, the Mycroft Artificial Intelligence platform.

1269 forks, 6275 stars

nlp-recipes

Natural Language Processing Best Practices & Examples

898 forks, 6159 stars

DocsGPT

GPT-powered chat for documentation, chat with your documents

573 forks, 5987 stars

ERNIE

Official implementations for various pre-training models of ERNIE-fami...

1252 forks, 5898 stars

nlp.js

An NLP library for building bots, with entity extraction, sentiment an...

588 forks, 5720 stars

ML-Course-Notes

🎓 Sharing machine learning course / lecture notes.

740 forks, 5519 stars

autogluon

AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data

712 forks, 5492 stars

awesome-self-supervised-learning

A curated list of awesome self-supervised methods

796 forks, 5480 stars

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

695 forks, 5350 stars

Deep-Learning-Interview-Book

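Deep learning interview handbook (covering mathematics, machine learning, deep learning, computer vision, natural language processing...

1133 forks, 5328 stars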