Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

transformers

🤗 Transformers: the model-definition framework for state-of-the-art m...

29981   148192   148192  

d2l-zh

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家...

11797   71575   71575  

Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade M...

6507   41824   41824  

bert

TensorFlow code and pre-trained models for BERT

9692   39410   39410  

HanLP

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析...

10739   35494   35494  

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

4558   32152   32152  

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science...

3781   28208   28208  

d2l-en

Interactive deep learning book with multi-framework code, math, and di...

4712   26549   26549  

NLP-progress

Repository to track the progress in Natural Language Processing (NLP),...

3618   22931   22931  

Resume-Matcher

Improve your resumes with Resume Matcher. Get insights, keyword sugges...

4288   21877   21877  

haystack

AI orchestration framework to build customizable, production-ready LLM...

2219   21169   21169  

rasa

💬 Open source machine learning framework to automate text- and voic...

4833   20513   20513  

datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, e...

2898   20487   20487  

Ciphey

⚡ Automatically decrypt encryptions without knowing the key or cipher...

1287   19791   19791  

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language...

1575   19013   19013  

Dive-into-DL-PyTorch

本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改...

5433   19002   19002  

awesome-nlp

:book: A curated list of resources dedicated to Natural Language Proce...

2622   17396   17396  

DocsGPT

DocsGPT is an open-source genAI tool that helps users get reliable ans...

1773   16937   16937  

ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

2026   16802   16802  

gensim

Topic Modelling for Humans

4404   16131   16131  

Awesome-pytorch-list

A comprehensive list of pytorch related content on github,such as diff...

2816   16045   16045  

lectures

Oxford Deep NLP 2017 course

3580   15857   15857  

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and c...

1473   15582   15582  

nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

3959   14696   14696  

flair

A very simple framework for state-of-the-art Natural Language Processi...

2120   14253   14253  

nltk

NLTK Source

2938   14230   14230  

camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Sc...

1497   13792   13792  

languagetool

Style and Grammar Checker for 25+ Languages

1452   13425   13425  

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences wit...

2076   12716   12716  

deep-learning-drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Lear...

2959   12651   12651  

CV

✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习...

1484   12425   12425  

unstructured

Convert documents to structured data effortlessly. Unstructured is ope...

1013   12319   12319  

MOSS

An open-source tool-augmented conversational language model from Fudan...

1143   12070   12070  

allennlp

An open-source NLP research library, built on PyTorch.

2238   11870   11870  

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Langu...

916   11723   11723  

ludwig

Low-code framework for building custom LLMs, neural networks, and othe...

1219   11555   11555  

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

1276   11153   11153  

stanford-tensorflow-tutorials

This repository contains code examples for the Stanford's course: Tens...

4289   10366   10366  

doccano

Open source annotation tool for machine learning practitioners.

1809   10206   10206  

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Exampl...

771   10006   10006  

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Product...

952   9979   9979  

CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence seg...

2717   9948   9948  

TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech...

1169   9416   9416  

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All...

2037   9333   9333  

autogluon

Fast and Accurate ML in 3 Lines of Code

1046   9263   9263  

pattern

Web mining module for Python, with tools for scraping, natural languag...

1578   8832   8832  

openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI in...

2679   8702   8702  

machine_learning_examples

A collection of machine learning examples and tutorials.

6405   8671   8671  

Deep-Learning-Interview-Book

深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理...

1365   8439   8439  

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

835   7594   7594