Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

jProcessing

Japanese Natural Langauge Processing Libraries

31 142 142

RBERT

Implementation of BERT in R

17 142 142

indra

INDRA (Integrated Network and Dynamical Reasoning Assembler) is an aut...

55 142 142

Lango

Language Lego

15 141 141

hubot-natural

Natural Language Processing Chatbot for RocketChat

44 140 140

are-16-heads-really-better-than-1

Code for the paper "Are Sixteen Heads Really Better than One?"

14 140 140

UnilmChatchitRobot

Unilm for Chinese Chitchat Robot.基于Unilm模型的夸夸式闲聊机器人项目。

27 140 140

NL2SQL-RULE

Content Enhanced BERT-based Text-to-SQL Generation https://arxiv.org/a...

42 139 139

matilda

MATILDA: Multi-AnnoTator multi-language Interactive Lightweight Dialog...

31 138 138

w2n

Convert number words (eg. twenty one) to numeric digits (21)

62 138 138

getlang

Natural language detection package in pure Go

20 138 138

MnemonicReader

A PyTorch implementation of Mnemonic Reader for the Machine Comprehens...

40 137 137

Echo

Python package containing all custom layers used in Neural Networks (C...

29 137 137

kaggle-quora-dup

Solution to Kaggle's Quora Duplicate Question Detection Competition

51 137 137

RDRPOSTagger

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

49 137 137

NLPnote

Gitbook Address: https://app.gitbook.com/@nlpgroup/s/nlpnote/

69 137 137

Scattertext-PyData

Notebooks for the Seattle PyData 2017 talk on Scattertext

50 136 136

lingo

package lingo provides the data structures and algorithms required for...

16 136 136

clojure-dsl-resources

A curated list of Clojure resources for dealing with domain-specific l...

2 136 136

BREDS

"Bootstrapping Relationship Extractors with Distributional Semantics"...

37 135 135

spokestack-python

Spokestack is a library that allows a user to easily incorporate a voi...

13 135 135

TIA

Your Advanced Twitter stalking tool

18 135 135

python-sutime

Python wrapper for Stanford CoreNLP's SUTime

40 135 135

steppy

Lightweight, Python library for fast and reproducible experimentation...

32 134 134

Twitter-Sentiment-Analysis

This script can tell you the sentiments of people regarding to any eve...

105 134 134

compling_nlp_hse_course

Материалы курса по компьютерной лингвистике Школы Лингвистики НИУ ВШЭ

67 134 134

ID-CNN-CWS

Source codes and corpora of paper "Iterated Dilated Convolutions for C...

41 133 133

FusionNet-NLI

An example for applying FusionNet to Natural Language Inference

38 133 133

fnc-1-baseline

A baseline implementation for FNC-1

110 133 133

abstractive_summarizer

Abstractive Text Summarization using Transformer

50 133 133

word-checker

🇨🇳🇬🇧Chinese and English word spelling corrector.(中文易错别字检测，中...

35 133 133

nlp_estimator_tutorial

Educational material on using the TensorFlow Estimator framework for t...

53 132 132

ruijin_round1

瑞金医院MMC人工智能辅助构建知识图谱大赛初赛

30 132 132

Lenta.Ru-News-Dataset

Corpus of Russian news articles collected from Lenta.Ru

20 132 132

Question-Answering

TensorFlow implementation of Match-LSTM and Answer pointer for the pop...

70 131 131

chinese-law-bert-similarity

bert chinese similarity

31 131 131

NegBio

:newspaper: High-performance tool for negation and uncertainty detecti...

35 131 131

TAKG

The official implementation of ACL 2019 paper "Topic-Aware Neural Keyp...

31 130 130

ake-datasets

Large, curated set of benchmark datasets for evaluating automatic keyp...

26 130 130

nlp-gym

NLPGym - A toolkit to develop RL agents to solve NLP tasks.

12 130 130

R-text-data

List of textual data sources to be used for text mining in R

14 130 130

emotion_dataset

:smile: Dataset for Emotion Classification

16 130 130

neural-question-generation

Pytorch implementation of Paragraph-level Neural Question Generation...

31 129 129

JapaneseTokenizers

aim to use JapaneseTokenizer as easy as possible

21 128 128

mongolian-nlp

Useful resources for Mongolian NLP

33 128 128

keras-gpt-2

Load GPT-2 checkpoint and generate texts

32 127 127

Hierarchical-Attention-Network

Implementation of Hierarchical Attention Networks in PyTorch

28 125 125

python-duckling

Python wrapper for wit.ai's Duckling Clojure library

24 125 125

vtext

Simple NLP in Rust with Python bindings

12 125 125

turkish-deasciifier

Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for...

20 125 125