Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

gpttools

gpttools extends gptstudio for package development to help you documen...

19   244   244  

chinese_ulmfit

中文ULMFiT 情感分析 文本分类

38   243   243  

spacy-lookup

Named Entity Recognition based on dictionaries

38   242   242  

nlp_profiler

A simple NLP library allows profiling datasets with one or more text c...

37   242   242  

backprop

Backprop makes it simple to use, finetune, and deploy state-of-the-art...

11   242   242  

gpt-2-tensorflow2.0

OpenAI GPT2 pre-training and sequence prediction implementation in Ten...

78   242   242  

text2text

Text2Text: Crosslingual NLP/G toolkit

31   242   242  

nlplot

Visualization Module for Natural Language Processing

11   241   241  

Siamese-LSTM

Siamese LSTM for evaluating semantic similarity between sentences of t...

68   241   241  

AIND-NLP

Coding exercises for the Natural Language Processing concentration, pa...

383   241   241  

spacy-services

💫 REST microservices for various spaCy-related tasks

75   240   240  

caml-mimic

multilabel classification of EHR notes

109   240   240  

cnn-text-classification-tf-chinese

CNN for Chinese Text Classification in Tensorflow

111   239   239  

dmn-tensorflow

Dynamic Memory Networks (https://arxiv.org/abs/1603.01417) in Tensorfl...

86   239   239  

monpa

MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型

25   239   239  

GermanWordEmbeddings

Toolkit to obtain and preprocess German text corpora, train models and...

51   239   239  

pyrouge

A Python wrapper for the ROUGE summarization evaluation package

70   238   238  

openfoodfacts-ai

This is a tracking repo for all our AI projects. 🍕 🤖🍼

54   238   238  

prosodic

Prosodic: a metrical-phonological parser, written in Python. For Engli...

40   237   237  

webanno

🆕 Work continues on INCEpTION 👉 https://github.com/inception-project...

96   236   236  

fairseq-gec

Source code for paper: Improving Grammatical Error Correction via Pre-...

68   236   236  

open-sesame

A frame-semantic parsing system based on a softmax-margin SegRNN.

67   236   236  

tableQA

AI Tool for querying natural language on tabular data.

44   235   235  

bnlp

BNLP is a natural language processing toolkit for Bengali Language.

49   234   234  

SummerTime

An open-source text summarization toolkit for non-experts. EMNLP'2021...

24   234   234  

nlp_classification

Implementing nlp papers relevant to classification with PyTorch, gluon...

41   231   231  

MetaLearning4NLP-Papers

A list of recent papers about Meta / few-shot learning methods applied...

25   231   231  

onnxt5

Summarization, translation, sentiment-analysis, text-generation and mo...

30   231   231  

machine-learning

从零基础开始机器学习之旅

88   230   230  

nlp_learning

结合python一起学习自然语言处理 (nlp): 语言模型、HMM、PCFG、Word2vec、...

91   230   230  

PIE

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Cod...

39   230   230  

mindflow

🧠 code-awareness

20   230   230  

Persian-Swear-Words

Persian Swear Dataset - you can use in your production to filter unwan...

32   229   229  

vert-papers

This repository contains code and datasets related to entity/knowledge...

87   229   229  

headliner

🏖 Easy training and deployment of seq2seq models.

41   228   228  

pyfasttext

Yet another Python binding for fastText

31   228   228  

turkish-stemmer-python

:snake: Turkish Language Stemmer for Python

31   228   228  

SOHU_competition

Sohu's 2018 content recognition competition 1st solution(搜狐内容识别...

75   227   227  

DL-for-Chatbot

Deep Learning / NLP tutorial for Chatbot Developers

64   227   227  

4675-scifi

chinese NLP corpus of chinese science fiction,chinese science fiction...

37   226   226  

shared_colab_notebooks

A Repo to store the Google Colaboratory Notebooks that I have created...

59   225   225  

TextDescriptives

A Python library for calculating a large variety of metrics from text

19   225   225  

cs224n-2017-winter

All lecture notes, slides and assignments from CS224n: Natural Languag...

118   225   225  

vec4ir

Word Embeddings for Information Retrieval

41   225   225  

fastPunct

Punctuation restoration and spell correction experiments.

34   225   225  

TextCluster

短文本聚类预处理模块 Short text cluster

57   224   224  

FedNLP

FedNLP: An Industry and Research Integrated Platform for Federated Lea...

44   223   223  

LemmInflect

A python module for English lemmatization and inflection.

23   223   223  

ocrpy

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

11   223   223  

ml-projects

ML based projects such as Spam Classification, Time Series Analysis, T...

106   222   222