Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Python code for various NLP metrics
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings...
The "tl;dr" on a few notable transformer papers (pre-2022).
The Genie open source kit for voice assistant (formerly known as Almond)
Implementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
Building a fake news detector from initial ideation to model deployment
Abstractive Text Summarization using Transformer
Theano Implementation of DMN+ (Improved Dynamic Memory Networks) from the paper by Xiong, Merity, & Socher at MetaMind, http://arxiv.org/abs/1603.0141...
Awesome-Text-Classification Projects,Papers,Tutorial .
Neural Paraphrase Generation
torchtext使用总结,从零开始逐步实现了torchtext文本预处理过程,包括截断补长,词表构建,使用预训练词向量,构建可用于PyTorch的可迭代数据等步骤。并结合Pyt...
spaCy + UDPipe
My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
This script can tell you the sentiments of people regarding to any events happening in the world by analyzing tweets related to that event
语义理解/口语理解,项目包含有词法分析:中文分词、词性标注、命名实体识别;口语理解:领域分类、槽填充、意图识别。
multi-gpu pre-training in one machine for BERT from scratch without horovod
A Ruby natural language processor.
Preprocessing Library for Natural Language Processing
Implementation of BERT in R
A list of pretrained Transformer models for the Russian language.
QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.
Open Information Extraction (OpenIE) and Open Relation Extraction (ORE) papers and data.
Deprecated, the Web Neural Network Polyfill project has been moved to https://github.com/webmachinelearning/webnn-polyfill
Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers
Conversational text Analysis using various NLP techniques
Espial is an engine for automated organization and discovery of personal knowledge
Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"
package lingo provides the data structures and algorithms required for natural language processing
A painless way to pick future time.
Implementation of abstractive summarization using LSTM in the encoder-decoder architecture with local attention.
An implementation in TensorFlow of a convolutional neural network (CNN) to perform sentiment classification on tweets.
Norwegian NLP Resources
Deep contextualized word representations for Chinese
Using pre trained word embeddings (Fasttext, Word2Vec)
A command-line toolkit to extract text content and category data from Wikipedia dump files
semantic analysis using word2vector, doc2vector,lstm and other method. mainly for text similarity analysis.
A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
NAACL'2021: Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240
论文实现(ACL2019):《Matching the Blanks: Distributional Similarity for Relation Learning》
Lightning Fast Language Prediction 🚀
中文垃圾短信识别(手写分类器)
Lexicon-based Named Entity Recognition
Open solution to the Toxic Comment Classification Challenge
a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and Multi-task Learning Framework.
Helping AI practitioners better understand their datasets and models in text classification. From ServiceNow.
Your Advanced Twitter stalking tool
Lazy, AI chatbot service.
A web app to create and browse text visualizations for automated customer listening.
Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.