Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1261)

ZZZ-RETIRED__openstt
ZZZ-RETIRED__openstt MycroftAI

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

145
ml-nlp-paper-discussions
ml-nlp-paper-discussions dair-ai

📄 A repo containing notes and discussions for our weekly NLP/ML paper discussions.

145
Guide-to-Swift-Strings-Sample-Code
Guide-to-Swift-Strings-Sample-Code Flight-School Swift

Xcode Playground Sample Code for the Flight School Guide to Swift Strings

145
pubmed-rct
pubmed-rct Franck-Dernoncourt

PubMed 200k RCT dataset: a large dataset for sequential sentence classification.

145
summarus
summarus IlyaGusev Python

Models for automatic abstractive summarization

145
ASTRA
ASTRA microsoft Python

Self-training with Weak Supervision (NAACL 2021)

145
fcc_nn_research
fcc_nn_research j2kao Jupyter Notebook

(somewhat) cleaned-up notebooks used in researching public comments for FCC Proceeding 17-108 (Net Neutrality Repeal)

143
BREDS
BREDS davidsbatista Python

"Bootstrapping Relationship Extractors with Distributional Semantics" (Batista et al., 2015) in EMNLP'15 - Python implementation

143
jProcessing
jProcessing kevincobain2000 OpenEdge ABL

Japanese Natural Langauge Processing Libraries

142
Scattertext-PyData
Scattertext-PyData JasonKessler HTML

Notebooks for the Seattle PyData 2017 talk on Scattertext

142
indra
indra sorgerlab Python

INDRA (Integrated Network and Dynamical Reasoning Assembler) is an automated model assembly system interfacing with NLP systems and databases to colle...

142
dialogflow-ruby-client
dialogflow-ruby-client dialogflow Ruby

Ruby SDK for Dialogflow

141
Lango
Lango ayoungprogrammer Python

Language Lego

141
hubot-natural
hubot-natural RocketChat CoffeeScript

Natural Language Processing Chatbot for RocketChat

140
are-16-heads-really-better-than-1
are-16-heads-really-better-than-1 pmichel31415 Shell

Code for the paper "Are Sixteen Heads Really Better than One?"

140
UnilmChatchitRobot
UnilmChatchitRobot liucongg Python

Unilm for Chinese Chitchat Robot.基于Unilm模型的夸夸式闲聊机器人项目。

140
spokestack-python
spokestack-python spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

139
NL2SQL-RULE
NL2SQL-RULE guotong1988 Python

Content Enhanced BERT-based Text-to-SQL Generation https://arxiv.org/abs/1910.07179

139
matilda
matilda Wluper JavaScript

MATILDA: Multi-AnnoTator multi-language Interactive Lightweight Dialogue Annotator

138
fnc-1-baseline
fnc-1-baseline FakeNewsChallenge Python

A baseline implementation for FNC-1

138
w2n
w2n akshaynagpal Python

Convert number words (eg. twenty one) to numeric digits (21)

138
getlang
getlang rylans Go

Natural language detection package in pure Go

138
stanza-old
stanza-old stanfordnlp Python

Stanford NLP group's shared Python tools.

137
MnemonicReader
MnemonicReader HKUST-KnowComp Python

A PyTorch implementation of Mnemonic Reader for the Machine Comprehension task

137
Echo
Echo digantamisra98 Python

Python package containing all custom layers used in Neural Networks (Compatible with PyTorch, TensorFlow and MegEngine)

137
kaggle-quora-dup
kaggle-quora-dup aerdem4 Python

Solution to Kaggle's Quora Duplicate Question Detection Competition

137
RDRPOSTagger
RDRPOSTagger datquocnguyen HTML

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

137
NLPnote
NLPnote SpringMagnolia

Gitbook Address: https://app.gitbook.com/@nlpgroup/s/nlpnote/

137
clojure-dsl-resources
clojure-dsl-resources simongray

A curated list of Clojure resources for dealing with domain-specific languages.

136
NLP
NLP DataForScience Jupyter Notebook

Natural Language Processing For Everyone

136
python-sutime
python-sutime FraBle Python

Python wrapper for Stanford CoreNLP's SUTime

135
steppy
steppy minerva-ml Python

Lightweight, Python library for fast and reproducible experimentation :microscope:

134
ID-CNN-CWS
ID-CNN-CWS hankcs Python

Source codes and corpora of paper "Iterated Dilated Convolutions for Chinese Word Segmentation"

133
FusionNet-NLI
FusionNet-NLI hsinyuan-huang Python

An example for applying FusionNet to Natural Language Inference

133
word-checker
word-checker houbb Java

🇨🇳🇬🇧Chinese and English word spelling corrector.(中文易错别字检测,中文拼写检测纠正。英文单词拼写校验工具)

133
nlp_estimator_tutorial
nlp_estimator_tutorial eisenjulian Jupyter Notebook

Educational material on using the TensorFlow Estimator framework for text classification

132
ruijin_round1
ruijin_round1 beader Jupyter Notebook

瑞金医院MMC人工智能辅助构建知识图谱大赛初赛

132
Lenta.Ru-News-Dataset
Lenta.Ru-News-Dataset yutkin Python

Corpus of Russian news articles collected from Lenta.Ru

132
clam
clam proycon Python

Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your command line app...

132
Question-Answering
Question-Answering MurtyShikhar Python

TensorFlow implementation of Match-LSTM and Answer pointer for the popular SQuAD dataset.

131
chinese-law-bert-similarity
chinese-law-bert-similarity xiongma Python

bert chinese similarity

131
NegBio
NegBio ncbi-nlp Python

:newspaper: High-performance tool for negation and uncertainty detection in radiology reports

131
awesome-bert-japanese
awesome-bert-japanese himkt

📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information

131
TAKG
TAKG yuewang-cuhk Python

The official implementation of ACL 2019 paper "Topic-Aware Neural Keyphrase Generation for Social Media Language"

130
nlp-gym
nlp-gym rajcscw Python

NLPGym - A toolkit to develop RL agents to solve NLP tasks.

130
R-text-data
R-text-data EmilHvitfeldt

List of textual data sources to be used for text mining in R

130
emotion_dataset
emotion_dataset dair-ai

:smile: Dataset for Emotion Classification

130
neural-question-generation
neural-question-generation seanie12 Python

Pytorch implementation of Paragraph-level Neural Question Generation with Maxout Pointer and Gated Self-attention Networks

129
JapaneseTokenizers
JapaneseTokenizers Kensuke-Mitsuzawa Python

aim to use JapaneseTokenizer as easy as possible

128
phrase-at-scale
phrase-at-scale kavgan Python

Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages oth...

128