Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

language-detection

A language detection library for PHP. Detects the language from a give...

86   827   827  

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF me...

61   822   822  

causal-text-papers

Curated research at the intersection of causal inference and natural l...

100   802   802  

catalyst

🚀 Catalyst is a C# Natural Language Processing library built for spee...

83   801   801  

OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize...

115   781   781  

papermage

library supporting NLP and CV research on scientific papers

62   778   778  

Paper-Reading

📖 Paper reading list in dialogue systems and natural language generat...

150   774   774  

AI-Notes

:books: [.md & .ipynb] Series of Artificial Intelligence & Deep Learni...

240   770   770  

Instruction-Tuning-Papers

Reading list of Instruction-tuning. A trend starts from Natrural-Instr...

24   768   768  

AI-Job-Recommend

国内公司人工智能方向(含机器学习、深度学习、计算机视觉和自然语言处理)...

85   767   767  

lingua

The most accurate natural language detection library for Java and the...

72   765   765  

asreview

Active learning for systematic reviews

143   764   764  

trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multili...

103   761   761  

awesome-persian-nlp-ir

Curated List of Persian Natural Language Processing and Information Re...

115   759   759  

PIXIU

This repository introduces PIXIU, an open-source resource featuring th...

99   753   753  

keras-attention

Visualizing RNNs using the attention mechanism

244   751   751  

texar-pytorch

Integrating the Best of TF into PyTorch, for Machine Learning, Natural...

113   746   746  

OpenAttack

An Open-Source Package for Textual Adversarial Attack.

131   741   741  

FewRel

A Large-Scale Few-Shot Relation Extraction Dataset

164   739   739  

spacy-stanza

💥 Use the latest Stanza (StanfordNLP) research models directly in spa...

62   739   739  

primeqa

The prime repository for state-of-the-art Multilingual Question Answer...

57   736   736  

PromptKG

PromptKG Family: a Gallery of Prompt Learning & KG-related research wo...

75   733   733  

Failed-ML

Compilation of high-profile real-world examples of failed machine lear...

49   732   732  

deeplearning-guide

An evolving guide to learning Deep Learning effectively.

134   716   716  

talisman

Straightforward fuzzy matching, information retrieval and NLP building...

47   716   716  

Opus-MT

Open neural machine translation models and web services

77   714   714  

Awesome-Papers-Autonomous-Agent

A collection of recent papers on building autonomous agent. Two topics...

59   713   713  

SkyChat-Chinese-Chatbot-GPT3

SkyChat是一款基于中文GPT-3 api的聊天机器人项目。它可以像chatGPT一样,...

72   709   709  

efaqa-corpus-zh

❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库

87   708   708  

SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

49   701   701  

spacy-layout

📚 Process PDFs, Word documents and more with spaCy

51   700   700  

chat

基于自然语言理解与机器学习的聊天机器人,支持多用户并发及自定义多轮对话

217   700   700  

advanced-machine-learning-engineer-roadmap-2024

A Full Stack ML (Machine Learning) Roadmap involves learning the neces...

91   688   688  

Courses-

Answers for Quizzes & Assignments that I have taken

701   686   686  

SpeedTorch

Library for faster pinned CPU <-> GPU transfer in Pytorch

40   685   685  

Me_Bot

Build a bot that speaks like you!

68   685   685  

DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transf...

172   682   682  

CS224n

CS224n: Natural Language Processing with Deep Learning Assignments Win...

274   679   679  

Awesome-Korean-NLP

A curated list of resources for NLP (Natural Language Processing) for...

116   661   661  

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training...

42   658   658  

obsidian-ava

Quickly format your notes with ChatGPT in Obsidian

17   654   654  

calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long...

80   648   648  

seqGAN

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Ad...

150   647   647  

AI-Job-Resume

AI 算法岗简历模板

87   647   647  

Books

My book list

383   644   644  

COMET

A Neural Framework for MT Evaluation

94   643   643  

open-korean-text

Open Korean Text Processor - An Open-source Korean Text Processor

97   641   641  

nlprule

A fast, low-resource Natural Language Processing and Text Correction l...

39   641   641  

BERT4doc-Classification

Code and source for paper ``How to Fine-Tune BERT for Text Classificat...

101   639   639  

VnCoreNLP

A Vietnamese natural language processing toolkit (NAACL 2018)

153   635   635