Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Explains nlp building blocks in a simple manner.
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.
gpttools extends gptstudio for package development to help you document code, write tests, or even explain code
中文ULMFiT 情感分析 文本分类
Named Entity Recognition based on dictionaries
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profile...
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0
Text2Text: Crosslingual NLP/G toolkit
Siamese LSTM for evaluating semantic similarity between sentences of the Quora Question Pairs Dataset.
Coding exercises for the Natural Language Processing concentration, part of Udacity's AIND program.
Visualization Module for Natural Language Processing
💫 REST microservices for various spaCy-related tasks
multilabel classification of EHR notes
CNN for Chinese Text Classification in Tensorflow
Dynamic Memory Networks (https://arxiv.org/abs/1603.01417) in Tensorflow
MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型
Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.
A Python wrapper for the ROUGE summarization evaluation package
This is a tracking repo for all our AI projects. 🍕 🤖🍼
Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the end of the l...
Source code for paper: Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
A frame-semantic parsing system based on a softmax-margin SegRNN.
Deep Learning / NLP tutorial for Chatbot Developers
AI Tool for querying natural language on tabular data.
BNLP is a natural language processing toolkit for Bengali Language.
An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo
Implementing nlp papers relevant to classification with PyTorch, gluonnlp
A list of recent papers about Meta / few-shot learning methods applied in NLP areas.
Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
从零基础开始机器学习之旅
🧠 code-awareness
结合python一起学习自然语言处理 (nlp): 语言模型、HMM、PCFG、Word2vec、完形填空式阅读理解任务、朴素贝叶斯分类器、TFIDF、PCA、SVD
Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Seq...
Persian Swear Dataset - you can use in your production to filter unwanted content. دیتاست کلمات نامناسب و بد فارسی برای فیلتر کردن متن ها
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) pr...
🏖 Easy training and deployment of seq2seq models.
Yet another Python binding for fastText
:snake: Turkish Language Stemmer for Python
Sohu's 2018 content recognition competition 1st solution(搜狐内容识别大赛第一名解决方案)
chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科...
All lecture notes, slides and assignments from CS224n: Natural Language Processing with Deep Learning class by Stanford
Word Embeddings for Information Retrieval
Punctuation restoration and spell correction experiments.
A Python library for calculating a large variety of metrics from text
短文本聚类预处理模块 Short text cluster
FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Researc...
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
A python module for English lemmatization and inflection.