Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

bertsearch

Elasticsearch with BERT for advanced document search.

203   874   874  

hazm

Python library for digesting Persian text.

156   872   872  

notes

Learn about Machine Learning and Artificial Intelligence

231   870   870  

skweak

skweak: A software toolkit for weak supervision applied to NLP tasks

70   870   870  

torchMoji

😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep...

176   869   869  

wikipedia2vec

A tool for learning vector representations of words and entities from...

97   866   866  

gpt-2-Pytorch

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

211   866   866  

my-cs-degree

A CS degree with a focus on full-stack ML engineering, 2020

140   860   860  

nlp-notebooks

A collection of notebooks for Natural Language Processing from NLP Tow...

344   847   847  

clean-text

🧹 Python package for text cleaning

71   836   836  

Coursera

Quiz & Assignment of Coursera

649   828   828  

awesome-document-understanding

A curated list of resources for Document Understanding (DU) topic

111   828   828  

GNN4NLP-Papers

A list of recent papers about Graph Neural Network methods applied in...

131   815   815  

DL-NLP-Readings

My Reading Lists of Deep Learning and Natural Language Processing

271   814   814  

iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI a...

316   809   809  

TextGAN-PyTorch

TextGAN is a PyTorch framework for Generative Adversarial Networks (GA...

190   800   800  

self-attentive-parser

High-accuracy NLP parser with models for 11 languages.

150   799   799  

gector

Official implementation of the papers "GECToR – Grammatical Error Corr...

202   793   793  

pythainlp

Thai Natural Language Processing in Python.

239   791   791  

text2vec

Fast vectorization, topic modeling, distances and GloVe word embedding...

130   785   785  

quanteda

An R package for the Quantitative Analysis of Textual Data

182   776   776  

Paper-Reading

📖 Paper reading list in dialogue systems and natural language generati...

150   774   774  

language-detection

A language detection library for PHP. Detects the language from a give...

81   766   766  

DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

70   761   761  

huggingface_hub

All the open source things related to the Hugging Face Hub.

181   758   758  

AI-Job-Recommend

国内公司人工智能方向(含机器学习、深度学习、计算机视觉和自然语言处理)...

90   747   747  

keras-attention

Visualizing RNNs using the attention mechanism

248   735   735  

texar-pytorch

Integrating the Best of TF into PyTorch, for Machine Learning, Natural...

119   733   733  

transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP task...

176   731   731  

AI-Series

:books: [.md & .ipynb] Series of Artificial Intelligence & Deep Learni...

245   726   726  

Treasure-of-Transformers

💁 Awesome Treasure of Transformers Models for Natural Language process...

154   718   718  

multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Bud...

177   717   717  

autotrain-advanced

🤗 AutoTrain Advanced

52   710   710  

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Prob...

316   701   701  

lingua-rs

The most accurate natural language detection library for Rust, suitabl...

26   696   696  

deep-learning-guide

An evolving guide to learning Deep Learning effectively.

133   686   686  

holiday-cn

📅🇨🇳中国法定节假日数据 自动每日抓取国务院公告

92   686   686  

FewRel

A Large-Scale Few-Shot Relation Extraction Dataset

153   684   684  

trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multili...

91   683   683  

spacy-stanza

💥 Use the latest Stanza (StanfordNLP) research models directly in spaC...

51   681   681  

awesome-grounding

awesome grounding: A curated list of research papers in visual groundi...

84   680   680  

booknlp

BookNLP, a natural language processing pipeline for books

70   679   679  

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

108   678   678  

Me_Bot

Build a bot that speaks like you!

70   675   675  

CS224n

CS224n: Natural Language Processing with Deep Learning Assignments Win...

278   673   673  

ML-University

Machine Learning Open Source University

92   670   670  

SpeedTorch

Library for faster pinned CPU <-> GPU transfer in Pytorch

40   661   661  

talisman

Straightforward fuzzy matching, information retrieval and NLP building...

47   661   661  

portuguese-bert

Portuguese pre-trained BERT models

115   659   659  

SoulverCore

A powerful Swift framework for evaluating mathematical expressions

21   656   656