Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型...

107   1028   1028  

whatlang-rs

Natural language detection library for Rust. Try demo online: https://...

112   1026   1026  

Treasure-of-Transformers

💁 Awesome Treasure of Transformers Models for Natural Language proces...

216   1023   1023  

basaran

Basaran is an open-source alternative to the OpenAI text completion AP...

55   1006   1006  

tutorials

AI-related tutorials. Access any of them for free → https://towardsai....

364   1005   1005  

KGQA-Based-On-medicine

基于医药知识图谱的智能问答系统

277   998   998  

books

整理一些书籍 ,包含 C&C++ 、git 、Java、Keras 、Linux 、NLP 、Python 、...

298   997   997  

bigscience

Central place for the engineering/scaling WG: documentation, SLURM scr...

100   990   990  

GPT2-NewsTitle

Chinese NewsTitle Generation Project by GPT2.带有超级详细注释的中文GPT...

164   986   986  

nlp-notebooks

A collection of notebooks for Natural Language Processing from NLP Tow...

377   984   984  

QANet

A Tensorflow implementation of QANet for machine reading comprehension

303   981   981  

nlp-paper

自然语言处理领域下的相关论文(附阅读笔记),复现模型以及数据处理等(代...

167   979   979  

clean-text

🧹 Python package for text cleaning

79   975   975  

plato-research-dialogue-system

This is the Plato Research Dialogue System, a flexible platform for de...

196   968   968  

question_generation

Neural question generation using transformers

324   967   967  

awesome-knowledge-graph

A curated list of Knowledge Graph related learning materials, database...

96   964   964  

rasa-ui

Rasa UI is a frontend for the Rasa Framework

332   962   962  

data-science-portfolio

Portfolio of data science projects completed by me for academic, self...

424   959   959  

bert_language_understanding

Pre-training of Deep Bidirectional Transformers for Language Understan...

214   954   954  

bolt

Bolt is a deep learning library with high performance and heterogeneou...

163   954   954  

lingua-go

The most accurate natural language detection library for Go, suitable...

55   945   945  

budoux

23   945   945  

chatgpt-comparison-detection

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

80   943   943  

Prompt4ReasoningPapers

[ACL 2023] Reasoning with Language Model Prompting: A Survey

69   939   939  

kogpt

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

135   935   935  

awesome-huggingface

🤗 A list of wonderful open-source projects & applications integrated...

86   934   934  

torchdistill

A coding-free framework built on PyTorch for reproducible deep learnin...

100   929   929  

keras-hub

Pretrained model hub for Keras 3.

291   920   920  

Summarization-Papers

Summarization Papers

139   919   919  

weibo-analysis-and-visualization

使用python抓取微博数据并对微博文本分析和可视化,LDA(树图)、关系图、...

143   914   914  

awesome-sentiment-analysis

😀😄😂😭 A curated list of Sentiment Analysis methods, implementations...

166   899   899  

iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI a...

326   896   896  

YouTokenToMe

Unsupervised text tokenizer focused on computational efficiency

81   889   889  

pointer_summarizer

pytorch implementation of "Get To The Point: Summarization with Pointe...

248   886   886  

K-BERT

Source code of K-BERT (AAAI2020)

203   884   884  

KGQA_HLM

基于知识图谱的《红楼梦》人物关系可视化及问答系统

266   884   884  

jcseg

Jcseg is a light weight NLP framework developed with Java. Provide CJK...

216   880   880  

Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential an...

127   867   867  

wikipedia2vec

A tool for learning vector representations of words and entities from...

97   866   866  

gpt-2-Pytorch

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

211   866   866  

soynlp

한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이...

184   853   853  

aiva

AIVA (A.I. Virtual Assistant): General-purpose virtual assistant for d...

597   840   840  

seq2seq-chatbot

Chatbot in 200 lines of code using TensorLayer

313   839   839  

wit

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multili...

36   838   838  

Chatito

🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recogn...

163   837   837  

bert4torch

An elegent pytorch implement of transformers

104   837   837  

MemN2N-tensorflow

"End-To-End Memory Networks" in Tensorflow

249   827   827  

WEB_KG

爬取百度百科中文页面,抽取三元组信息,构建中文知识图谱

188   823   823  

datacamp-python-data-science-track

All the slides, accompanying code and exercises all stored in this rep...

523   823   823  

ChatIE

The online version is temporarily unavailable because we cannot afford...

67   822   822