Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

ML-paper-notes

:notebook: Notes and summaries of various ML, Computer Vision & NLP pa...

79   542   542  

awesome-nlp-sentiment-analysis

:book: 收集NLP领域相关的数据集、论文、开源实现,尤其是情感分析、情绪原...

82   541   541  

firefox-translations

Firefox Translations is a webextension that enables client side transl...

44   538   538  

BotLibre

An open platform for artificial intelligence, chat bots, virtual agent...

217   537   537  

m3tl

BERT for Multitask Learning

126   537   537  

weixin_public_corpus

微信公众号语料库

167   536   536  

Wordless

An Integrated Corpus Tool With Multilingual Support for the Study of L...

82   536   536  

pinferencia

Python + Inference - Model Deployment library in Python. Simplest mode...

87   534   534  

nlprule

A fast, low-resource Natural Language Processing and Text Correction l...

39   531   531  

chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B

144   531   531  

MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing...

50   530   530  

text_mining_resources

Resources for learning about Text Mining and Natural Language Processi...

199   529   529  

headlines

Automatically generate headlines to short articles

152   527   527  

Deep-Semantic-Similarity-Model

My Keras implementation of the Deep Semantic Similarity Model (DSSM)/C...

189   526   526  

awesome-tensorflow-2

👉 Tensorflow 2.x resources such as tutorial, blog, code and videos

104   523   523  

hands-on-nltk-tutorial

The hands-on NLTK tutorial for NLP in Python

230   522   522  

EventExtractionPapers

A list of NLP resources focused on event extraction task

90   519   519  

Mengzi

Mengzi Pretrained Models

60   518   518  

graphbrain

Language, Knowledge, Cognition

56   517   517  

bigbird

Transformers for Longer Sequences

96   516   516  

awesome-sentiment-analysis

Repository with all what is necessary for sentiment analysis and relat...

103   514   514  

prompt-tuning

Original Implementation of Prompt Tuning from Lester, et al, 2021

51   511   511  

Daily-DeepLearning

🔥机器学习/深度学习/Python/算法面试/自然语言处理教程/剑指offer/machine...

137   510   510  

LightAutoML

Fast and customizable framework for automatic ML model creation (AutoM...

31   510   510  

python-stanford-corenlp

Python interface to CoreNLP using a bidirectional server-client interf...

110   509   509  

datasets-server

Lightweight web API for visualizing and exploring all types of dataset...

37   509   509  

machine-learning-articles

Monthly Series - Top 10 Machine Learning Articles

40   508   508  

chat-bubble

Simple chatbot UI for the Web with JSON scripting 👋🤖🤙

153   506   506  

awesome-data-annotation

A list of tools for annotating data, managing annotations, etc.

60   504   504  

keras-nlp

Modular Natural Language Processing workflows with Keras

137   504   504  

OmniNet

Official Pytorch implementation of "OmniNet: A unified architecture fo...

61   501   501  

attn2d

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequenc...

75   497   497  

VnCoreNLP

A Vietnamese natural language processing toolkit (NAACL 2018)

134   497   497  

Building-a-Simple-Chatbot-in-Python-using-NLTK

Building a Simple Chatbot from Scratch in Python (using NLTK)

550   496   496  

text_summurization_abstractive_methods

Multiple implementations for abstractive text summurization , using go...

213   495   495  

Goopt

🔍 Search Engine for a Procedural Simulation of the Web with GPT-3.

36   493   493  

Event-Extraction

基于法律裁判文书的事件抽取及其应用,包括数据的分词、词性标注、命名实体...

129   491   491  

CPM-Live

Live Training for Open-source Big Models

34   490   490  

matchbox

Write PyTorch code at the level of individual examples, then run it ef...

25   488   488  

allennlp-models

Officially supported AllenNLP models

158   488   488  

tomotopy

Python package of Tomoto, the Topic Modeling Tool

57   487   487  

chinese_text_cnn

TextCNN Pytorch实现 中文文本分类 情感分析

109   487   487  

poplar

A web-based annotation tool for natural language processing (NLP)

132   486   486  

subreddit-analyzer

A comprehensive Data and Text Mining workflow for submissions and comm...

43   485   485  

mexican-government-report

Text Mining on the 2019 Mexican Government Report, covering from extra...

85   481   481  

code_search

Code For Medium Article: "How To Create Natural Language Semantic Sear...

138   480   480  

Deta_Parser

快速中文分词分析word segmentation

97   479   479  

Sherlock

Natural-language event parser for Javascript

34   479   479  

php-text-analysis

PHP Text Analysis is a library for performing Information Retrieval (I...

83   477   477  

fastT5

⚡ boost inference speed of T5 models by 5x & reduce the model size by...

63   477   477