Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1261)

genie-toolkit
genie-toolkit stanford-oval TypeScript

The Genie open source kit for voice assistant (formerly known as Almond)

169
Snowball
Snowball davidsbatista Python

Implementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)

168
Awesome-Text-Classification
Awesome-Text-Classification fendouai

Awesome-Text-Classification Projects,Papers,Tutorial .

167
neural-paraphrase-generation
neural-paraphrase-generation iamaaditya Python

Neural Paraphrase Generation

167
torchtext-summary
torchtext-summary atnlp Jupyter Notebook

torchtext使用总结,从零开始逐步实现了torchtext文本预处理过程,包括截断补长,词表构建,使用预训练词向量,构建可用于PyTorch的可迭代数据等步骤。并结合Pyt...

167
Improved-Dynamic-Memory-Networks-DMN-plus
Improved-Dynamic-Memory-Networks-DMN-plus ethancaballero Python

Theano Implementation of DMN+ (Improved Dynamic Memory Networks) from the paper by Xiong, Merity, & Socher at MetaMind, http://arxiv.org/abs/1603.0141...

166
NLPMetrics
NLPMetrics gcunhase Jupyter Notebook

Python code for various NLP metrics

166
transformer-abstractive-summarization
transformer-abstractive-summarization rojagtap Jupyter Notebook

Abstractive Text Summarization using Transformer

166
Semantic
Semantic ownthink Python

语义理解/口语理解,项目包含有词法分析:中文分词、词性标注、命名实体识别;口语理解:领域分类、槽填充、意图识别。

164
BERT-GPU
BERT-GPU guotong1988 Python

multi-gpu pre-training in one machine for BERT from scratch without horovod

164
prenlp
prenlp lyeoni Python

Preprocessing Library for Natural Language Processing

164
words_counted
words_counted abitdodgy Ruby

A Ruby natural language processor.

163
What-I-Have-Read
What-I-Have-Read xcfcode

Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers

163
transformers-ru
transformers-ru vlarine Jupyter Notebook

A list of pretrained Transformer models for the Russian language.

162
spacy-udpipe
spacy-udpipe TakeLab Python

spaCy + UDPipe

162
Twitter-Sentiment-Analysis
Twitter-Sentiment-Analysis the-javapocalypse Python

This script can tell you the sentiments of people regarding to any events happening in the world by analyzing tweets related to that event

162
Open-IE-Papers
Open-IE-Papers NPCai

Open Information Extraction (OpenIE) and Open Relation Extraction (ORE) papers and data.

161
webml-polyfill
webml-polyfill intel Python

Deprecated, the Web Neural Network Polyfill project has been moved to https://github.com/webmachinelearning/webnn-polyfill

161
RBERT
RBERT jonathanbratt R

Implementation of BERT in R

161
quickai
quickai geekjr Python

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

161
converse
converse maxent-ai Jupyter Notebook

Conversational text Analysis using various NLP techniques

161
elastic_transformers
elastic_transformers md-experiments Jupyter Notebook

Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers

160
espial
espial Uzay-G Python

Espial is an engine for automated organization and discovery of personal knowledge

160
vmf_vae_nlp
vmf_vae_nlp jiacheng-xu Python

Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"

159
twitter-sentiment-cnn
twitter-sentiment-cnn danielegrattarola Python

An implementation in TensorFlow of a convolutional neural network (CNN) to perform sentiment classification on tweets.

158
DateTimeSeer
DateTimeSeer p-v Java

A painless way to pick future time.

157
Abstractive-Summarization
Abstractive-Summarization JRC1995 Jupyter Notebook

Implementation of abstractive summarization using LSTM in the encoder-decoder architecture with local attention.

157
lingo
lingo chewxy Go

package lingo provides the data structures and algorithms required for natural language processing

156
norwegian-nlp-resources
norwegian-nlp-resources web64

Norwegian NLP Resources

156
ELMo-chinese
ELMo-chinese rokid Python

Deep contextualized word representations for Chinese

155
WordEmbeddings-Elmo-Fasttext-Word2Vec
WordEmbeddings-Elmo-Fasttext-Word2Vec PrashantRanjan09 Python

Using pre trained word embeddings (Fasttext, Word2Vec)

155
wp2txt
wp2txt yohasebe Ruby

A command-line toolkit to extract text content and category data from Wikipedia dump files

155
Dan-Jurafsky--Chris-Manning--NLP
Dan-Jurafsky--Chris-Manning--NLP Anwarvic Java

My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.

155
fake-news
fake-news mihail911 Jupyter Notebook

Building a fake news detector from initial ideation to model deployment

155
fine-grained-sentiment
fine-grained-sentiment prrao87 Python

A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.

154
OptiPrompt
OptiPrompt princeton-nlp Python

NAACL'2021: Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240

153
BERTem
BERTem zhpmatrix Jupyter Notebook

论文实现(ACL2019):《Matching the Blanks: Distributional Similarity for Relation Learning》

152
semanaly
semanaly zqhZY Python

semantic analysis using word2vector, doc2vector,lstm and other method. mainly for text similarity analysis.

152
whatthelang
whatthelang indix Python

Lightning Fast Language Prediction 🚀

152
SpamMessage
SpamMessage hrwhisper Python

中文垃圾短信识别(手写分类器)

152
LexiconNER
LexiconNER v-mipeng Python

Lexicon-based Named Entity Recognition

152
open-solution-toxic-comments
open-solution-toxic-comments minerva-ml Python

Open solution to the Toxic Comment Classification Challenge

151
PALM
PALM PaddlePaddle Python

a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and Multi-task Learning Framework.

151
TIA
TIA umitkaanusta Jupyter Notebook

Your Advanced Twitter stalking tool

151
azimuth
azimuth ServiceNow Python

Helping AI practitioners better understand their datasets and models in text classification. From ServiceNow.

151
lazy
lazy cagataycali JavaScript

Lazy, AI chatbot service.

149
browsecloud
browsecloud microsoft TypeScript

A web app to create and browse text visualizations for automated customer listening.

148
Question-Pairs-Matching
Question-Pairs-Matching LittletreeZou Python

第三届魔镜杯 智能客服问题相似性算法设计 第12名解决方案

147
stog
stog sheng-z Python

AMR Parsing as Sequence-to-Graph Transduction

147
ake-datasets
ake-datasets boudinfl Shell

Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.

146