Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science concerned with how computers process and interact with human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a criterion of intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

prose

:book: A Golang library for text processing, including tokenization, p...

167   3067   3067  

flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention mode...

230   3024   3024  

nlp_tasks

Natural Language Processing Tasks and References

544   3016   3016  

Baichuan-13B

A 13B large language model developed by Baichuan Intelligent Technolog...

237   2968   2968  

DeepNLP-models-Pytorch

PyTorch implementations of various Deep NLP models in cs-224n (Stanford...

653   2951   2951  

MITIE

MITIE: library and tools for information extraction

539   2949   2949  

promptsource

Toolkit for creating, sharing and using natural language prompts.

369   2921   2921  

rebiber

A simple tool to update bib entries with their official information (e...

164   2888   2888  

pythoncode-tutorials

The Python Code Tutorials

1980   2883   2883  

thinc

🔮 A refreshing functional take on deep learning, compatible with your...

283   2869   2869  

huggingface_hub

The official Python client for the Hugging Face Hub.

776   2833   2833  

knockknock

🚪✊Knock Knock: Get notified when your training ends with only two ad...

233   2819   2819  

MTBook

Machine Translation: Foundations and Models (《机器翻译:基础与模型》), by Xiao Tong and Zhu Jingbo - Machine Translation: Foundati...

762   2772   2772  

machine-learning

Made to help machine learning beginners and those preparing for a study group: a r...

894   2772   2772  

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learnin...

368   2750   2750  

Top-AI-Conferences-Paper-with-Code

MLNLP: This repository is a collection of AI top conferences papers (e...

605   2633   2633  

LLMAgentPapers

Must-read Papers on LLM Agents.

151   2614   2614  

gluon-nlp

NLP made easy

528   2562   2562  

EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

308   2499   2499  

EasyLM

Large language models (LLMs) made easy. EasyLM is a one-stop solution...

261   2496   2496  

aws-machine-learning-university-accelerated-nlp

Machine Learning University: Accelerated Natural Language Processing C...

625   2417   2417  

texar

Toolkit for Machine Learning, Natural Language Processing, and Text Ge...

371   2388   2388  

spacy-course

👩‍🏫 Advanced NLP with spaCy: A free online course

377   2380   2380  

turkce-yapay-zeka-kaynaklari

Carried out in Turkey: deep learning and machine learning (...

456   2375   2375  

CodeSearchNet

Datasets, tools, and benchmarks for representation learning of code.

404   2353   2353  

decaNLP

The Natural Language Decathlon: A Multitask Challenge for NLP

469   2348   2348  

stocksight

Stock market analyzer and predictor using Elasticsearch, Twitter, News...

483   2344   2344  

practical-machine-learning-with-python

Master the essential skills needed to recognize and solve complex real...

1657   2342   2342  

RL4LMs

A modular RL library to fine-tune language models to human preferences

200   2336   2336  

scattertext

Beautiful visualizations of how language differs among document types.

289   2308   2308  

hunspell

The most popular spellchecking library.

255   2307   2307  

DeepInterests

Deeply interesting

417   2295   2295  

textacy

NLP, before and after spaCy

248   2230   2230  

GLiNER

Generalist and Lightweight Model for Named Entity Recognition (Extract...

197   2223   2223  

PyTorch-NLP

Basic Utilities for PyTorch Natural Language Processing (NLP)

254   2222   2222  

uda

Unsupervised Data Augmentation (UDA)

311   2196   2196  

lazynlp

Library to scrape and clean web pages to create massive datasets.

311   2193   2193  

AIGC-Interview-Book

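[Three Years of Interviews, Five Years of Mock Exams] An interview handbook for AIGC algorithm engineers. Covers AIGC, traditional deep learning, ...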

255   2189   2189  

pytextrank

Python implementation of TextRank algorithms ("textgraphs") for phrase...

333   2188   2188  

Awesome-Rust-MachineLearning

This repository is a list of machine learning libraries written in Rus...

120   2162   2162  

ML

A high-level machine learning and deep learning library for the PHP la...

193   2149   2149  

ABSA-PyTorch

Aspect Based Sentiment Analysis, PyTorch Implementations. Aspect-based...

529   2084   2084  

P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning acr...

206   2054   2054  

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeN...

297   2048   2048  

OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended...

269   2044   2044  

ecco

Explain, analyze, and visualize NLP language models. Ecco creates inte...

174   2044   2044  

PyTorchNLPBook

Code and data accompanying Natural Language Processing with PyTorch pu...

813   2043   2043  

ABigSurvey

A collection of 1000+ survey papers on Natural Language Processing (NL...

246   2027   2027  

The-NLP-Pandect

A comprehensive reference for all topics related to Natural Language P...

283   2021   2021  

Senta

Baidu's open-source Sentiment Analysis System.

370   1979   1979