Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

long-range-arena

Long Range Arena for Benchmarking Efficient Transformers

63   594   594  

NLP_Quickbook

NLP in Python with Deep Learning

231   591   591  

Sentence-VAE

PyTorch Re-Implementation of "Generating Sentences from a Continuous S...

153   590   590  

awesome-open-data-centric-ai

Curated list of open source tooling for data-centric AI on unstructure...

28   590   590  

Awesome-Simultaneous-Translation

Paper list of simultaneous translation / streaming translation, includ...

8   588   588  

weixin_public_corpus

微信公众号语料库

164   587   587  

Macropodus

自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中...

96   587   587  

attention-networks-for-classification

Hierarchical Attention Networks for Document Classification in PyTorch

136   586   586  

Building-a-Simple-Chatbot-in-Python-using-NLTK

Building a Simple Chatbot from Scratch in Python (using NLTK)

571   586   586  

datefinder

Find dates inside text using Python and get back datetime objects

160   584   584  

chat-bubble

Simple chatbot UI for the Web with JSON scripting 👋🤖🤙

172   584   584  

jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

33   582   582  

xlnet-Pytorch

Simple XLNet implementation with Pytorch Wrapper

106   581   581  

text_mining_resources

Resources for learning about Text Mining and Natural Language Processi...

198   581   581  

nlp-paper

NLP Paper

128   580   580  

R-Net

Tensorflow Implementation of R-Net

210   578   578  

text2sql-data

A collection of datasets that pair questions with SQL queries.

113   576   576  

lite-transformer

[ICLR 2020] Lite Transformer with Long-Short Range Attention

77   576   576  

DensePhrases

ACL'2021: Learning Dense Representations of Phrases at Scale; EMNLP'20...

76   574   574  

neuspell

NeuSpell: A Neural Spelling Correction Toolkit

95   573   573  

DocProduct

Medical Q&A with Deep Language Models

157   571   571  

JamSpell

Modern spell checking library - accurate, fast, multi-language

95   567   567  

xgen

Salesforce open-source LLMs with 8k sequence length.

29   565   565  

voice-builder

An opensource text-to-speech (TTS) voice building tool

132   563   563  

stanford-openie-python

Stanford Open Information Extraction made simple!

101   561   561  

OpenHowNet

Core Data of HowNet and OpenHowNet Python API

87   561   561  

ML-paper-notes

:notebook: Notes and summaries of various ML, Computer Vision & NLP pa...

79   560   560  

ai-study

人工智能学习资料超全整理,包含机器学习基础ML、深度学习基础DL、计算机视...

78   559   559  

Transformers.jl

Julia Implementation of Transformer models

81   557   557  

vocabulary

[Not Maintained anymore] Python Module to get Meanings, Synonyms and w...

77   556   556  

pinferencia

Python + Inference - Model Deployment library in Python. Simplest mode...

85   555   555  

LMaaS-Papers

Awesome papers on Language-Model-as-a-Service (LMaaS)

32   554   554  

Sherlock

Natural-language event parser for Javascript

34   554   554  

tensorflow-nlp-tutorial

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM...

286   552   552  

japanese-pretrained-models

Code for producing Japanese pretrained models provided by rinna Co., L...

40   543   543  

QA

使用深度学习算法实现的中文问答系统

234   542   542  

awesome-nlp-sentiment-analysis

:book: 收集NLP领域相关的数据集、论文、开源实现,尤其是情感分析、情绪原...

82   541   541  

NLP_bahasa_resources

A Curated List of Dataset and Usable Library Resources for NLP in Baha...

142   541   541  

rebel

REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 20...

73   540   540  

happy-transformer

Happy Transformer makes it easy to fine-tune and perform inference wit...

69   538   538  

firefox-translations

Firefox Translations is a webextension that enables client side transl...

44   538   538  

Mengzi

Mengzi Pretrained Models

63   537   537  

m3tl

BERT for Multitask Learning

126   537   537  

tock

Tock, the open source conversational AI toolkit.

138   537   537  

Wordless

An Integrated Corpus Tool With Multilingual Support for the Study of L...

82   536   536  

nlp-notebook

NLP 领域常见任务的实现,包括新词发现、以及基于pytorch的词向量、中文文...

112   533   533  

chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B

144   531   531  

headlines

Automatically generate headlines to short articles

152   527   527  

text_summurization_abstractive_methods

Multiple implementations for abstractive text summurization , using go...

219   526   526  

poplar

A web-based annotation tool for natural language processing (NLP)

140   524   524