Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.
Build unigram and bigram language models, implement Laplace smoothing and use the models to compute the perplexity of test corpora.
📂 Additional lookup tables and data resources for spaCy
자연어 처리와 관련한 여러 튜토리얼 저장소
Adapt Transformer-based language models to new text domains
PHP Client for Google Natural Language with Extras
remove signature blocks from emails
A Python package for gender classification.
হাতেকলমে ন্যাচারাল ল্যাঙ্গুয়েজ প্রসেসিং (এনএলপি) - শুরুর ধারণা
대량의 네이버 뉴스 기사를 수집하는 라이브러리입니다.
Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
OpusFilter - Parallel corpus processing toolkit
Natural language parser for recurring events
Cornell Touchdown natural language navigation and spatial reasoning dataset.
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.
Converts spoken words into text form.
Code for ACL'2021 paper WARP 🌀 Word-level Adversarial ReProgramming. Outperforming `GPT-3` on SuperGLUE Few-Shot text classification. https://aclanth...
Sentence Embeddings in NLI with Iterative Refinement Encoders
TETRE: a Toolkit for Exploring Text for Relation Extraction
Get phonetic spellings and syllable counts for any english word. Works with made-up and non-dictionary words
✍️ An intelligent system that takes a document and classifies different writing styles within the document using stylometric techniques.
Image Recognition and Information Extraction from Image Documents using Keras and Watson NLU
Lemmatization for Turkish Language
Implementation of CRF layer in Keras.
[ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.
PHP wrapper for the Stanford Natural Language Processing library. Supports POSTagger and CRFClassifier.
code and supplementary materials for a series of Medium articles about the BERT model
A simple yet strong implementation of neural machine translation in pytorch
Source code for our "TitleStylist" paper at ACL 2020
Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuning. State-of-...
🧪 Cutting-edge experimental spaCy components and features
Yelp round-10 review comments classification using deep learning (LSTM and CNN) and natural language processing.
Transition-based UCCA Parser
:books: social networks from novels
A practical guide to topic mining and interactive visualizations
Welcome, to this Open Source Repository regarding FREE ARTIFICIAL INTELLIGENCE RESOURCE. Get Benefit from the free resources mention & kindly five STA...
DaCy: The State of the Art Danish NLP pipeline using SpaCy
An React client library for Speechly API
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg...
Unannotated Spanish 3 Billion Words Corpora
Semantically be able to search through a database of videos (using generated summaries)
Tracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense...
Lyrics generation with GPT2-based Transformer
My notes.
Summarise text by finding relevant sentences and keywords using the Textrank algorithm
Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing s...
Python Rule Processing Engine 🏺