Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

spark-nlp

State of the Art Natural Language Processing

661   3321   3321  

CLUEDatasetSearch

搜索所有中文NLP数据集,附常用英文NLP数据集

548   3294   3294  

vale

:pencil: A syntax-aware linter for prose built with speed and extensib...

121   3265   3265  

Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)], Llama-7B tuned with C...

299   3256   3256  

nlp-roadmap

ROADMAP(Mind Map) and KEYWORD for students those who have interest in...

520   3252   3252  

AiLearning-Theory-Applying

快速上手AI理论及应用实战:基础知识、Transformer、NLP、ML、DL、竞赛。含...

453   3215   3215  

sumy

Module for automatic summarization of text documents and HTML pages.

512   3196   3196  

TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data aug...

415   3127   3127  

prose

:book: A Golang library for text processing, including tokenization, p...

159   3000   3000  

nlp_tasks

Natural Language Processing Tasks and References

565   2998   2998  

Jiagu

Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名...

596   2981   2981  

DeepNLP-models-Pytorch

Pytorch implementations of various Deep NLP models in cs-224n(Stanford...

659   2954   2954  

nlp-architect

A model library for exploring state-of-the-art deep learning topologie...

467   2910   2910  

texthero

Text preprocessing, representation and visualization from zero to hero...

239   2905   2905  

awesome-deeplearning-resources

Deep Learning and deep reinforcement learning research papers and some...

667   2893   2893  

SimCSE

EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings...

455   2877   2877  

neuralcoref

✨Fast Coreference Resolution in spaCy with Neural Networks

478   2871   2871  

daily-interview

Datawhale成员整理的面经,内容包括机器学习,CV,NLP,推荐,开发等,欢迎...

450   2861   2861  

ml-surveys

📋 Survey papers summarizing advances in deep learning, NLP, CV, graph...

292   2847   2847  

thinc

🔮 A refreshing functional take on deep learning, compatible with your...

280   2840   2840  

promptsource

Toolkit for creating, sharing and using natural language prompts.

364   2810   2810  

knockknock

🚪✊Knock Knock: Get notified when your training ends with only two ad...

235   2805   2805  

rust-bert

Rust native ready-to-use NLP pipelines and transformer-based models (B...

221   2804   2804  

GPT2-chitchat

GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思...

669   2789   2789  

eli5

A library for debugging/inspecting machine learning classifiers and ex...

331   2771   2771  

text-generation-inference

Large Language Model Text Generation Inference

249   2743   2743  

paper-qa

LLM Chain for answering questions from documents with citations

245   2741   2741  

lingvo

Lingvo

434   2737   2737  

llm-foundry

LLM training code for MosaicML foundation models

263   2638   2638  

gse

Go efficient multilingual NLP and text segmentation; support English,...

217   2624   2624  

aeneas

aeneas is a Python/C library and a set of tools to automagically synch...

246   2622   2622  

Familia

A Toolkit for Industrial Topic Modeling

612   2606   2606  

textlint

The pluggable natural language linter for text and markdown.

158   2594   2594  

sentiment

AFINN-based sentiment analysis for Node.js.

318   2592   2592  

gluon-nlp

NLP made easy

530   2559   2559  

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型集合

290   2537   2537  

papers

Summaries of machine learning papers

442   2503   2503  

Getting-Things-Done-with-Pytorch

Jupyter Notebook tutorials on solving real-world problems with Machine...

648   2412   2412  

Kashgari

Kashgari is a production-level NLP Transfer learning framework built o...

437   2392   2392  

text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现...

249   2384   2384  

awesome-ChatGPT-repositories

A curated list of resources dedicated to open source GitHub repositori...

278   2374   2374  

spacy-course

👩‍🏫 Advanced NLP with spaCy: A free online course

375   2355   2355  

rasa_core

Rasa Core is now part of the Rasa repo: An open source machine learnin...

1010   2340   2340  

awesome-DeepLearning

深度学习入门课、资深课、特色课、学术案例、产业实践案例、深度学习知识百...

768   2314   2314  

scattertext

Beautiful visualizations of how language differs among document types.

292   2289   2289  

awesome-chatgpt

🧠 A curated list of awesome ChatGPT resources, including libraries, S...

170   2283   2283  

CodeSearchNet

Datasets, tools, and benchmarks for representation learning of code.

399   2282   2282  

Medical_NLP

Medical NLP Competition, dataset, large models, paper

422   2262   2262  

awesome-sentence-embedding

A curated list of pretrained sentence and word embedding models

262   2248   2248  

JioNLP

中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocess...

299   2244   2244