Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, Tensor...

28538   142433   142433  

d2l-zh

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家...

11514   68083   68083  

Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade M...

6078   38362   38362  

bert

TensorFlow code and pre-trained models for BERT

9287   34715   34715  

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

4488   31316   31316  

HanLP

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析...

8444   29571   29571  

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science...

3739   27867   27867  

d2l-en

Interactive deep learning book with multi-framework code, math, and di...

4583   25472   25472  

NLP-progress

Repository to track the progress in Natural Language Processing (NLP),...

3621   22825   22825  

haystack

AI orchestration framework to build customizable, production-ready LLM...

2120   20161   20161  

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, e...

2802   19940   19940  

rasa

💬 Open source machine learning framework to automate text- and voic...

4753   19909   19909  

Ciphey

⚡ Automatically decrypt encryptions without knowing the key or cipher...

1202   18778   18778  

awesome-nlp

:book: A curated list of resources dedicated to Natural Language Proce...

2600   17071   17071  

ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

1979   16462   16462  

Dive-into-DL-PyTorch

本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改...

5111   16198   16198  

gensim

Topic Modelling for Humans

4391   15942   15942  

Awesome-pytorch-list

A comprehensive list of pytorch related content on github,such as diff...

2822   15762   15762  

lectures

Oxford Deep NLP 2017 course

3566   15705   15705  

DocsGPT

DocsGPT is an open-source genAI tool that helps users get reliable ans...

1656   15502   15502  

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and c...

1440   14602   14602  

flair

A very simple framework for state-of-the-art Natural Language Processi...

2111   14129   14129  

nltk

NLTK Source

2918   13971   13971  

nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

3768   12758   12758  

deep-learning-drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Lear...

2953   12533   12533  

allennlp

An open-source NLP research library, built on PyTorch.

2258   11526   11526  

clip-as-service

🏄 Embed/reason/rank images and sentences with CLIP models

2016   11441   11441  

ludwig

Low-code framework for building custom LLMs, neural networks, and othe...

1204   11410   11410  

unstructured

Open source libraries and APIs to build custom preprocessing pipelines...

895   10770   10770  

stanford-tensorflow-tutorials

This repository contains code examples for the Stanford's course: Tens...

4297   10340   10340  

doccano

Open source annotation tool for machine learning practitioners.

1768   9896   9896  

CoreNLP

Stanford CoreNLP: A Java suite of core NLP tools.

2693   9082   9082  

pattern

Web mining module for Python, with tools for scraping, natural languag...

1587   8797   8797  

CV

✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习...

1107   8790   8790  

autogluon

Fast and Accurate ML in 3 Lines of Code

987   8626   8626  

TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech...

1119   8617   8617  

Resume-Matcher

Resume Matcher is an open source, free tool to improve your resume. It...

3166   8602   8602  

machine_learning_examples

A collection of machine learning examples and tutorials.

6385   8560   8560  

languagetool

Style and Grammar Checker for 25+ Languages

1039   8370   8370  

Deep-Learning-Interview-Book

深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理...

1351   8094   8094  

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All...

1810   7656   7656  

stanza

Stanford NLP Python library for tokenization, sentence segmentation, N...

900   7419   7419  

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

808   7289   7289  

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Product...

610   7207   7207  

models

Officially maintained, supported by PaddlePaddle, including CV, NLP, S...

2894   6923   6923  

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

904   6863   6863  

WantWords

An open-source online reverse dictionary.

587   6654   6654  

mycroft-core

Mycroft Core, the Mycroft Artificial Intelligence platform.

1287   6574   6574  

nlp-recipes

Natural Language Processing Best Practices & Examples

917   6405   6405  

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

873   6364   6364