Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

transformers

🤗 Transformers: the model-definition framework for state-of-the-art m...

29981   148192   148192  

Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade M...

6507   41824   41824  

bert

TensorFlow code and pre-trained models for BERT

9692   39410   39410  

AI-For-Beginners

12 Weeks, 24 Lessons, AI for All!

7562   39328   39328  

ailearning

AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2

11206   35857   35857  

HanLP

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析...

10739   35494   35494  

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

4558   32152   32152  

500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code

500 AI Machine learning Deep learning Computer vision NLP Projects wit...

5982   25925   25925  

serve

☁️ Build multimodal AI applications with cloud-native stack

2233   21683   21683  

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated...

2906   21663   21663  

haystack

AI orchestration framework to build customizable, production-ready LLM...

2219   21169   21169  

rasa

💬 Open source machine learning framework to automate text- and voic...

4833   20513   20513  

datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, e...

2898   20487   20487  

awesome-nlp

:book: A curated list of resources dedicated to Natural Language Proce...

2622   17396   17396  

mindsdb

MindsDB is a Server for Artificial Intelligence Logic. Enabling develo...

2195   17178   17178  

ML-NLP

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中...

4619   16996   16996  

ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

2026   16802   16802  

gensim

Topic Modelling for Humans

4404   16131   16131  

Awesome-pytorch-list

A comprehensive list of pytorch related content on github,such as diff...

2816   16045   16045  

lectures

Oxford Deep NLP 2017 course

3580   15857   15857  

FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥...

2207   15714   15714  

nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

3959   14696   14696  

DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to t...

3357   14422   14422  

flair

A very simple framework for state-of-the-art Natural Language Processi...

2120   14253   14253  

nltk

NLTK Source

2938   14230   14230  

Virgilio

Your new Mentor for Data Science E-Learning.

2481   14149   14149  

botpress

The open-source hub to build & deploy GPT/LLM Agents ⚡️

2026   13812   13812  

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and...

1920   13389   13389  

PaddleHub

Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models...

2072   12803   12803  

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences wit...

2076   12716   12716  

unstructured

Convert documents to structured data effortlessly. Unstructured is ope...

1013   12319   12319  

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca...

1284   12266   12266  

allennlp

An open-source NLP research library, built on PyTorch.

2238   11870   11870  

Ai-Learn

人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基...

2494   11541   11541  

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

675   11054   11054  

compromise

modest natural-language processing

680   10775   10775  

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM...

677   10673   10673  

text-generation-inference

Large Language Model Text Generation Inference

1220   10386   10386  

stanford-tensorflow-tutorials

This repository contains code examples for the Stanford's course: Tens...

4289   10366   10366  

go-openai

OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go

1611   10049   10049  

Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系...

1395   10030   10030  

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Exampl...

771   10006   10006  

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Product...

952   9979   9979  

CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence seg...

2717   9948   9948  

PaddleNLP

👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, sup...

2537   9607   9607  

TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech...

1169   9416   9416  

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All...

2037   9333   9333  

pycaret

An open-source, low-code machine learning library in Python

1800   9257   9257  

nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

1493   8496   8496  

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

845   8189   8189