Most popular NLP repositories and open-source projects

Natural language processing (NLP) is a field of computer science that studies the interactions between computers and human language. In 1950, Alan Turing published an article, "Computing Machinery and Intelligence," that proposed a measure of intelligence now called the Turing test. More recent techniques, such as deep learning, have produced state-of-the-art results in language modeling, parsing, and many other natural-language tasks.

xmnlp

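xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text conversion...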

183   1066   1066  

nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

258   1064   1064  

nlp-with-ruby

Curated List: Practical Natural Language Processing done in Ruby

68   1061   1061  

PyTorchText

1st Place Solution for Zhihu Machine Learning Challenge. Implementati...

368   1059   1059  

learn-nlp-with-transformers

we want to create a repo to illustrate usage of transformers in chines...

221   1055   1055  

nlp

:memo: This repository recorded my NLP journey.

323   1052   1052  

whatlang-rs

Natural language detection library for Rust. Try demo online: https://...

113   1033   1033  

Treasure-of-Transformers

💁 Awesome Treasure of Transformers Models for Natural Language proces...

217   1032   1032  

languagemodels

Explore large language models on any computer with 512MB of RAM

74   1031   1031  

Awesome-Chinese-LLM

A collection of open-source Chinese large language models, with smaller-scale, privately deployable, low-training-cost models...

107   1028   1028  

books

A collection of books covering C & C++, git, Java, Keras, Linux, NLP, Python, ...

301   1020   1020  

Summarization-Papers

Summarization Papers

146   1014   1014  

nlp-notebooks

A collection of notebooks for Natural Language Processing from NLP Tow...

381   1005   1005  

tutorials

AI-related tutorials. Access any of them for free → https://towardsai....

364   1005   1005  

gpt-2-Pytorch

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

228   1003   1003  

KGQA-Based-On-medicine

An intelligent question-answering system based on a medical knowledge graph

277   998   998  

bigscience

Central place for the engineering/scaling WG: documentation, SLURM scr...

100   990   990  

GPT2-NewsTitle

Chinese NewsTitle Generation Project by GPT2. With extremely detailed comments, a Chinese GPT...

164   986   986  

lingua-rs

The most accurate natural language detection library for Rust, suitabl...

47   983   983  

clean-text

🧹 Python package for text cleaning

80   982   982  

QANet

A Tensorflow implementation of QANet for machine reading comprehension

303   981   981  

nlp-paper

Papers from the field of natural language processing (with reading notes), model reproductions, data processing, and more (代...

167   979   979  

Prompt4ReasoningPapers

[ACL 2023] Reasoning with Language Model Prompting: A Survey

68   978   978  

YouTokenToMe

Unsupervised text tokenizer focused on computational efficiency

107   972   972  

plato-research-dialogue-system

This is the Plato Research Dialogue System, a flexible platform for de...

196   968   968  

awesome-knowledge-graph

A curated list of Knowledge Graph related learning materials, database...

96   964   964  

rasa-ui

Rasa UI is a frontend for the Rasa Framework

332   962   962  

data-science-portfolio

Portfolio of data science projects completed by me for academic, self...

424   959   959  

bert_language_understanding

Pre-training of Deep Bidirectional Transformers for Language Understan...

214   954   954  

bolt

Bolt is a deep learning library with high performance and heterogeneou...

163   954   954  

wikipedia2vec

A tool for learning vector representations of words and entities from...

102   953   953  

budoux

23   945   945  

chatgpt-comparison-detection

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

80   943   943  

kogpt

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

135   935   935  

awesome-huggingface

🤗 A list of wonderful open-source projects & applications integrated...

86   934   934  

gector

Official implementation of the papers "GECToR – Grammatical Error Corr...

221   931   931  

keras-hub

Pretrained model hub for Keras 3.

291   921   921  

jcseg

Jcseg is a light weight NLP framework developed with Java. Provide CJK...

213   921   921  

TextGAN-PyTorch

TextGAN is a PyTorch framework for Generative Adversarial Networks (GA...

209   916   916  

weibo-analysis-and-visualization

Scrape Weibo data with Python, then analyze and visualize the Weibo text: LDA (tree map), relationship graphs, ...

143   914   914  

iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI a...

323   909   909  

Transformers-for-NLP-2nd-Edition

Transformer models from BERT to GPT-4, environments from Hugging Face...

344   908   908  

awesome-sentiment-analysis

😀😄😂😭 A curated list of Sentiment Analysis methods, implementations...

166   899   899  

pyresparser

A simple resume parser used for extracting information from resumes

439   899   899  

self-attentive-parser

High-accuracy NLP parser with models for 11 languages.

160   892   892  

pointer_summarizer

pytorch implementation of "Get To The Point: Summarization with Pointe...

248   886   886  

K-BERT

Source code of K-BERT (AAAI2020)

203   884   884  

KGQA_HLM

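A knowledge-graph-based system for visualizing character relationships in Dream of the Red Chamber (《红楼梦》) and answering questions about them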

266   884   884  

Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential an...

127   867   867  

transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP task...

193   857   857