Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Assignmends done for Udacity's Deep Learning MOOC with Vincent Vanhoucke
A simple markup language to write novel with types.
A very simple, bare-bones, inefficient, implementation of skip-gram word2vec from scratch with Python
The official tool for transforming doccano format into common dataset formats.
Pujangga - Indonesian Natural Language Processing Tool with REST API, an Interface for InaNLP and Deeplearning4j's Word2Vec
Knowledge Base Question Answering using memory networks
Knowledge extraction from web data
PyTorch implementation of our paper "Imposing Label-Relational Inductive Bias for Extremely Fine-Grained Entity Typing" (NAACL19)
An open information extraction system that provides compact extractions
Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.
[GSOC] Greek language support for spacy.io python NLP software
A Recurrent Neural Network implemented from scratch (using only numpy) in Python.
Implementation of the multi feed-forward network architecture by Parikh et al. (2016) for Natural Language Inference.
Label Embedding Network
A PyTorch implementation of Transformer in "Attention is All You Need"
Go Bindings for BERT NLP Models
spark-based library that helps construct and query knowledge graphs from unstructured and structured data
🤹♀️ Query spaCy's linguistic annotations using GraphQL
:surfer: 依存关系分析,NLP,自然语言处理
Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data
A full-process dialogue system that can be deployed online
A baseline for WenTianSearch
State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).
Word2Vec implementation using numpy
Must-read papers on Natural Language Processing (NLP)
A spaCy wrapper for DBpedia Spotlight
TorchFlare is a simple, beginner-friendly, and easy-to-use PyTorch Framework train your models effortlessly.
Biterm Topic Modelling for Short Text with R
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Term extraction for Russian language
Course materials for Introduction to Computational Literary Analysis, taught at UC Berkeley in Summer 2018, 2019, and 2020, at Columbia University in...
[ACL 2019]: Interconnected Question Generation with Coreference Alignment and Conversation Flow Modeling
Arabic support for textblob
Geolocating twitter users by the content of their tweets
This repo supports various cross-lingual transfer learning & multilingual NLP models.
Tons of fun with text and recurrent neural networks! Let your computer read a book and tell you its own story. 🤣
STriP Net: Semantic Similarity of Scientific Papers (S3P) Network
A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.
openai/whisper + extra features
Multinomial Adversarial Networks for Multi-Domain Text Classification (NAACL 2018)
PyTorch Implementation of NBA game summary generator.
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sen...
[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
:eyeglasses: Platform to automatically detect what user might be interested in buying in near future
Toolkit for Auditing and Mitigating Bias and Fairness of Machine Learning Systems 🔎🤖🧰
⛔ [NOT MAINTAINED] A web-based annotator for closed-domain question answering datasets with SQuAD format.
classy is a simple-to-use library for building high-performance Machine Learning models in NLP.
Korean Easy Data Augmentation
This is a repo of basic Machine Learning what I learn. More to go...