Most popular NLP repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and human language interact. In 1950, Alan Turing published the article "Computing Machinery and Intelligence," which proposed a measure of intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorF...

21305   106636   106636  

ailearning

AiLearning: Data Analysis + Machine Learning in Practice + Linear Algebra + PyTorch + NLTK + TF2

11206   35857   35857  

bert

TensorFlow code and pre-trained models for BERT

9287   34715   34715  

Made-With-ML

Learn how to responsibly develop, deploy and maintain production machi...

5477   33501   33501  

HanLP

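Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing...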

8444   29571   29571  

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

4180   26575   26575  

jina

🔮 Multimodal AI services & pipelines with cloud-native stack: gRPC, Ku...

2150   18734   18734  

mindsdb

MindsDB is a Server for Artificial Intelligence Logic. Enabling develo...

2195   17178   17178  

rasa

💬 Open source machine learning framework to automate text- and voice...

4388   16652   16652  

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, ea...

2243   16631   16631  

lectures

Oxford Deep NLP 2017 course

3633   15507   15507  

AI-For-Beginners

12 Weeks, 24 Lessons, AI for All!

2507   15317   15317  

awesome-nlp

📖 A curated list of resources dedicated to Natural Language Proce...

2509   14910   14910  

gensim

Topic Modelling for Humans

4366   14472   14472  

Awesome-pytorch-list

A comprehensive list of PyTorch-related content on GitHub, such as diff...

2790   14197   14197  

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated...

2086   13982   13982  

ML-NLP

This project is about machine learning, deep learning, and NLP interviews...

4332   13944   13944  

Virgilio

Your new Mentor for Data Science E-Learning.

2477   13456   13456  

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and...

1920   13389   13389  

flair

A very simple framework for state-of-the-art Natural Language Processi...

2026   12931   12931  

500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code

500 AI Machine learning Deep learning Computer vision NLP Projects wit...

3801   12827   12827  

nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

3768   12758   12758  

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment (Chinese LLaMA & Alpaca...

1284   12266   12266  

nltk

NLTK Source

2760   12091   12091  

PaddleHub

Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models...

2069   11926   11926  

allennlp

An open-source NLP research library, built on PyTorch.

2258   11526   11526  

clip-as-service

🏄 Embed/reason/rank images and sentences with CLIP models

2016   11441   11441  

DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to t...

2871   11190   11190  

compromise

modest natural-language processing

680   10775   10775  

ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

1284   10643   10643  

botpress

The open-source hub to build & deploy GPT/LLM Agents ⚡️

1466   10554   10554  

stanford-tensorflow-tutorials

This repository contains code examples for Stanford's course: Tens...

4383   10263   10263  

PaddleNLP

👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, suppo...

2537   9607   9607  

haystack

🔍 Haystack is an open source NLP framework to interact with your d...

1255   9360   9360  

CoreNLP

Stanford CoreNLP: A Java suite of core NLP tools.

2693   9082   9082  

TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech...

1119   8617   8617  

Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm...

1334   8528   8528  

nlp_chinese_corpus

Large Scale Chinese Corpus for NLP

1493   8496   8496  

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All...

1810   7656   7656  

text_classification

All kinds of text classification models and more with deep learning

2602   7585   7585  

pycaret

An open-source, low-code machine learning library in Python

1616   7458   7458  

Awesome-Chinese-NLP

A curated list of resources for Chinese NLP

1697   7338   7338  

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Producti...

610   7207   7207  

NeMo

NeMo: a toolkit for conversational AI

1664   7206   7206  

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

1725   7030   7030  

models

Officially maintained, supported by PaddlePaddle, including CV, NLP, S...

2963   6800   6800  

FinGPT

Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥...

1640   6770   6770  

stanza

Official Stanford NLP Python Library for Many Human Languages

859   6697   6697  

Ai-Learn

AI learning roadmap: nearly 200 hands-on case studies and projects, with free accompanying course materials...

1634   6659   6659  

WantWords

An open-source online reverse dictionary.

587   6654   6654