Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article that proposed a measure of machine intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.

Repositories (1261)

nlp_made_easy
nlp_made_easy Kyubyong Jupyter Notebook

Explains NLP building blocks in a simple manner.

244
concise-concepts
concise-concepts davidberenstein1957 Python

This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

244
gpttools
gpttools JamesHWade R

gpttools extends gptstudio for package development to help you document code, write tests, or even explain code.

244
chinese_ulmfit
chinese_ulmfit PracticingMan Python

Chinese ULMFiT: sentiment analysis and text classification

243
spacy-lookup
spacy-lookup mpuig Python

Named Entity Recognition based on dictionaries

242
nlp_profiler
nlp_profiler neomatrix369 Python

A simple NLP library that allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profile...

242
backprop
backprop backprop-ai Python

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

242
gpt-2-tensorflow2.0
gpt-2-tensorflow2.0 akanyaani Python

OpenAI GPT-2 pre-training and sequence prediction implementation in TensorFlow 2.0

242
text2text
text2text artitw Python

Text2Text: Crosslingual NLP/G toolkit

242
Siamese-LSTM
Siamese-LSTM likejazz Python

Siamese LSTM for evaluating semantic similarity between sentences of the Quora Question Pairs Dataset.

241
AIND-NLP
AIND-NLP udacity Jupyter Notebook

Coding exercises for the Natural Language Processing concentration, part of Udacity's AIND program.

241
nlplot
nlplot takapy0210 Python

Visualization Module for Natural Language Processing

241
spacy-services
spacy-services explosion Python

💫 REST microservices for various spaCy-related tasks

240
caml-mimic
caml-mimic jamesmullenbach Python

Multilabel classification of EHR notes

240
cnn-text-classification-tf-chinese
cnn-text-classification-tf-chinese indiejoseph Python

CNN for Chinese Text Classification in TensorFlow

239
dmn-tensorflow
dmn-tensorflow therne Python

Dynamic Memory Networks (https://arxiv.org/abs/1603.01417) in TensorFlow

239
monpa
monpa monpa-team Python

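MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition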

239
GermanWordEmbeddings
GermanWordEmbeddings devmount Jupyter Notebook

Toolkit to obtain and preprocess German text corpora, train models, and evaluate them with generated test sets. Built with Gensim and TensorFlow.

239
pyrouge
pyrouge bheinzerling Python

A Python wrapper for the ROUGE summarization evaluation package

238
openfoodfacts-ai
openfoodfacts-ai openfoodfacts Python

This is a tracking repo for all our AI projects. 🍕 🤖🍼

238
prosodic
prosodic quadrismegistus Python

Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.

237
webanno
webanno webanno Java

🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the end of the l...

236
fairseq-gec
fairseq-gec kanyun-inc Python

Source code for paper: Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data

236
open-sesame
open-sesame swabhs Python

A frame-semantic parsing system based on a softmax-margin SegRNN.

236
DL-for-Chatbot
DL-for-Chatbot j-min Jupyter Notebook

Deep Learning / NLP tutorial for Chatbot Developers

235
tableQA
tableQA abhijithneilabraham Python

AI Tool for querying natural language on tabular data.

235
bnlp
bnlp sagorbrur Jupyter Notebook

BNLP is a natural language processing toolkit for Bengali Language.

234
SummerTime
SummerTime Yale-LILY Python

An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo

234
nlp_classification
nlp_classification seopbo Python

Implementations of NLP papers relevant to classification with PyTorch and GluonNLP

231
MetaLearning4NLP-Papers
MetaLearning4NLP-Papers ha-lins

A list of recent papers about Meta / few-shot learning methods applied in NLP areas.

231
onnxt5
onnxt5 abelriboulot Python

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

231
machine-learning
machine-learning jacksu Jupyter Notebook

A machine learning journey starting from scratch

230
mindflow
mindflow mindflowai Python

🧠 code-awareness

230
nlp_learning
nlp_learning SeanLee97 Python

Learn natural language processing (NLP) with Python: language models, HMM, PCFG, Word2vec, cloze-style reading comprehension tasks, naive Bayes classifier, TF-IDF, PCA, SVD

230
PIE
PIE awasthiabhijeet Macaulay2

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Seq...

230
Persian-Swear-Words
Persian-Swear-Words amirshnll Jupyter Notebook

Persian Swear Dataset - you can use it in production to filter unwanted content. A dataset of inappropriate and bad Persian words for filtering text.

229
vert-papers
vert-papers microsoft Python

This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) pr...

229
headliner
headliner spring-media Python

🏖 Easy training and deployment of seq2seq models.

228
pyfasttext
pyfasttext vrasneur Python

Yet another Python binding for fastText

228
turkish-stemmer-python
turkish-stemmer-python otuncelli Python

🐍 Turkish Language Stemmer for Python

228
SOHU_competition
SOHU_competition zhanzecheng Jupyter Notebook

First-place solution for Sohu's 2018 content recognition competition

227
4675-scifi
4675-scifi guhhhhaa

Chinese NLP corpus of Chinese science fiction: about 4675 Chinese science fiction novels...

226
cs224n-2017-winter
cs224n-2017-winter maxim5 HTML

All lecture notes, slides and assignments from CS224n: Natural Language Processing with Deep Learning class by Stanford

225
vec4ir
vec4ir lgalke Python

Word Embeddings for Information Retrieval

225
fastPunct
fastPunct notAI-tech Python

Punctuation restoration and spell correction experiments.

225
TextDescriptives
TextDescriptives HLasse Python

A Python library for calculating a large variety of metrics from text

225
TextCluster
TextCluster RandyPen Python

Short text clustering preprocessing module

224
FedNLP
FedNLP FedML-AI

FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Researc...

223
ocrpy
ocrpy maxent-ai Jupyter Notebook

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

223
LemmInflect
LemmInflect bjascob Python

A Python module for English lemmatization and inflection.

223