Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
Random fun with statistical language models.
A toolkit for Vietnamese word segmentation
Some JavaScript works published as demos, mostly ML or DS
Notes and solutions to complement the official NLTK book
Mining Argument Structures with Expressive Inference (Linear and LSTM Engines)
The greynir.is Icelandic natural language processing API and website.
Predict the price trend of individual stocks using deep learning and natural language processing
Sentiment analyzer for your tweets.
Trained T5 and T5-large models for generating keywords from text
A paper list of pre-trained language models (PLMs).
Repository for NLI models (EMNLP 2018)
pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference
Neutron: A PyTorch-based implementation of Transformer and its variants.
Reimplementation of a BERT-based model (Shi et al, 2019), currently the state-of-the-art for English SRL. This model also implements predicate disambi...
GenieNLP: A versatile codebase for any NLP task
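GTS Engine: A powerful NLU Training System. GTS-Engine is an out-of-the-box, high-performance natural language understanding engine focused on few-shot tasks; using only a small number of samples, it can automatically...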
PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxiv.org/pdf/221...
Quickly download Chinese datasets, process them, and run data analysis and visual analysis: a one-stop solution for data problems
Reading Comprehension Experiments repository.
ITP A2Z Course for Fall 2018
Tool for Evaluating Adversarial Perturbations on Text
Compare six baseline deep learning models on TrecQA
Deep learning research implemented on notebooks using PyTorch.
A keyphrase extractor for Persian
Deep Open Intent Classification with Adaptive Decision Boundary (AAAI 2021)
This repository has implementations of data augmentation for NLP for Japanese.
A repo containing code covering various aspects of deep learning in PyTorch. Great for beginners and intermediate practitioners in the field
Multi-layer Recurrent Neural Networks for character-level language models, implemented in TensorFlow
Python code to detect emotions from text
A simple Python implementation of the Maximal Marginal Relevance (MMR) baseline system for text summarization.
Text classification with Sparse Composite Document Vectors.
This repository contains A-Z techniques of Natural Language Processing to get started in NLP.
Essential NLP & ML, short & fast pure Python code
Various utilities regarding Levenshtein transducers. (Java)
Chatbot in Russian with speech recognition using PocketSphinx and speech synthesis using RHVoice. The AttentionSeq2Seq model is used. Implemented using...
Azure Search Cognitive Skill to extract technical and business skills from text
Notes for Multi-hop Reading Comprehension and open-domain question answering
Fast English word segmentation in Rust
[KDD'22] Learned Token Pruning for Transformers
This is a data pipeline (ETL) for Twitter using the Elastic Stack: Elasticsearch, Logstash, and Kibana (version 6.1)
Language-Adversarial Training for Cross-Lingual Text Classification (TACL)
Podium: a framework-agnostic Python NLP library for data loading and preprocessing
This is our team's solution report; the solution placed in the top 10% (305/3307) in this competition.
Natural language processing (NLP) newsletter right on GitHub
Named Entities Recognition Annotator Tool for Europeana Newspapers
Deep Recurrent Generative Decoder for Abstractive Text Summarization in DyNet
Japanese IOB2 tagged corpus for Named Entity Recognition.
KitanaQA: Adversarial training and data augmentation for neural question-answering models
Extracting relevant information from resume using deep learning.