Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

insuranceqa-corpus-zh

:helicopter: 保险行业语料库,聊天机器人

334   928   928  

Summarization-Papers

Summarization Papers

139   919   919  

notes

Learn about Machine Learning and Artificial Intelligence

237   916   916  

seqeval

A Python framework for sequence labeling evaluation(named-entity recog...

120   908   908  

SoulverCore

A powerful Swift framework for evaluating natural language math expres...

35   906   906  

iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI a...

326   896   896  

my-cs-degree

A CS degree with a focus on full-stack ML engineering, 2020

141   890   890  

YouTokenToMe

Unsupervised text tokenizer focused on computational efficiency

81   889   889  

awesome-ai-awesomeness

A curated list of awesome awesomeness about artificial intelligence

118   888   888  

jcseg

Jcseg is a light weight NLP framework developed with Java. Provide CJK...

216   880   880  

ML-University

Machine Learning Open Source University

110   880   880  

bertsearch

Elasticsearch with BERT for advanced document search.

203   874   874  

hazm

Python library for digesting Persian text.

156   872   872  

skweak

skweak: A software toolkit for weak supervision applied to NLP tasks

70   870   870  

torchMoji

😇A pyTorch implementation of the DeepMoji model: state-of-the-art dee...

176   869   869  

wikipedia2vec

A tool for learning vector representations of words and entities from...

97   866   866  

gpt-2-Pytorch

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

211   866   866  

quanteda

An R package for the Quantitative Analysis of Textual Data

187   851   851  

clean-text

🧹 Python package for text cleaning

71   836   836  

DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

77   829   829  

Coursera

Quiz & Assignment of Coursera

649   828   828  

datacamp-python-data-science-track

All the slides, accompanying code and exercises all stored in this rep...

523   823   823  

GNN4NLP-Papers

A list of recent papers about Graph Neural Network methods applied in...

131   815   815  

DL-NLP-Readings

My Reading Lists of Deep Learning and Natural Language Processing

271   814   814  

TextGAN-PyTorch

TextGAN is a PyTorch framework for Generative Adversarial Networks (GA...

190   800   800  

self-attentive-parser

High-accuracy NLP parser with models for 11 languages.

150   799   799  

gector

Official implementation of the papers "GECToR – Grammatical Error Corr...

202   793   793  

pythainlp

Thai Natural Language Processing in Python.

239   791   791  

text2vec

Fast vectorization, topic modeling, distances and GloVe word embedding...

130   785   785  

catalyst

🚀 Catalyst is a C# Natural Language Processing library built for spee...

79   776   776  

Paper-Reading

📖 Paper reading list in dialogue systems and natural language generat...

150   774   774  

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dicti...

31   769   769  

language-detection

A language detection library for PHP. Detects the language from a give...

81   766   766  

AI-Notes

:books: [.md & .ipynb] Series of Artificial Intelligence & Deep Learni...

242   765   765  

AI-Job-Recommend

国内公司人工智能方向(含机器学习、深度学习、计算机视觉和自然语言处理)...

85   760   760  

huggingface_hub

All the open source things related to the Hugging Face Hub.

181   758   758  

trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multili...

103   745   745  

keras-attention

Visualizing RNNs using the attention mechanism

248   735   735  

texar-pytorch

Integrating the Best of TF into PyTorch, for Machine Learning, Natural...

119   733   733  

transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP task...

176   731   731  

Failed-ML

Compilation of high-profile real-world examples of failed machine lear...

48   727   727  

multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Bud...

177   717   717  

autotrain-advanced

🤗 AutoTrain Advanced

52   710   710  

SkyChat-Chinese-Chatbot-GPT3

SkyChat是一款基于中文GPT-3 api的聊天机器人项目。它可以像chatGPT一样,...

72   706   706  

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Prob...

316   701   701  

lingua-rs

The most accurate natural language detection library for Rust, suitabl...

26   696   696  

deep-learning-guide

An evolving guide to learning Deep Learning effectively.

133   686   686  

holiday-cn

📅🇨🇳中国法定节假日数据 自动每日抓取国务院公告

92   686   686  

Me_Bot

Build a bot that speaks like you!

68   684   684  

FewRel

A Large-Scale Few-Shot Relation Extraction Dataset

153   684   684