Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

cleanNLP

R package providing annotators and a normalized data model for natural...

36   196   196  

ernie

Simple State-of-the-Art BERT-Based Sentence Classification with Keras...

27   195   195  

CRF-Layer-on-the-Top-of-BiLSTM

The CRF Layer was implemented by using Chainer 2.0. Please see more de...

53   195   195  

Deep-Survey-Text-Classification

The project surveys 16+ Natural Language Processing (NLP) research pap...

58   194   194  

sentence-similarity

This repository contains various ways to calculate sentence vector sim...

36   194   194  

Deep-math-machine-learning.ai

A blog which talks about machine learning, deep learning algorithms an...

180   194   194  

DeepToxic

top 1% solution to toxic comment classification challenge on Kaggle.

70   194   194  

kairon

Conversational AI Platform to build effective Proactive Digital Assist...

58   194   194  

Tree-Transformer

Implementation of the paper Tree Transformer

31   193   193  

gpt-j

A GPT-J API to use with python3 to generate text, blogs, code, and mor...

51   193   193  

SyferText

A privacy preserving NLP framework

50   192   192  

NewsRecommender

A news recommendation system tailored for user communities

86   192   192  

konoha

🌿 An easy-to-use Japanese Text Processing tool, which makes it possibl...

19   192   192  

Automated-Fact-Checking-Resources

Links to conference/journal publications in automated fact-checking (r...

22   192   192  

natml-unity

High performance, cross-platform machine learning for Unity Engine. Re...

20   191   191  

PyContinual

PyContinual (An Easy and Extendible Framework for Continual Learning)

48   190   190  

textvec

Text vectorization tool to outperform TFIDF for classification tasks

26   188   188  

edenai-apis

Eden AI: simplify the use and deployment of AI technologies by providi...

21   188   188  

glad

Global-Locally Self-Attentive Dialogue State Tracker

48   186   186  

NLPre

Python library for Natural Language Preprocessing (NLPre)

32   186   186  

blueprints-text

Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis...

109   186   186  

ngx-dynamic-dashboard-framework

This is a JSON driven angular x based dashboard framework that is insp...

96   185   185  

GPT2

PyTorch Implementation of OpenAI GPT-2

44   185   185  

Recurrent-Convolutional-Neural-Network-Text-Classifier

My (slightly modified) Keras implementation of the Recurrent Convoluti...

70   184   184  

KB-InfoBot

A dialogue bot for information access

66   184   184  

naturalcc

NaturalCC: An Open-Source Toolkit for Code Intelligence

28   184   184  

Kevinpro-NLP-demo

All NLP you Need Here. 目前包含15个NLP demo的pytorch实现(大量代码借...

37   184   184  

hntitlenator

Test your HN title against a neural network

13   183   183  

OPUS-MT-train

Training open neural machine translation models

33   183   183  

SwiftyChrono

A natural language date parser in Swift (ported from chrono.js)

46   182   182  

awesome-hungarian-nlp

A curated list of NLP resources for Hungarian

17   182   182  

nel

Entity linking framework

40   181   181  

files2rouge

Calculating ROUGE score between two files (line-by-line)

51   181   181  

goodreads

code samples for the goodreads datasets

47   181   181  

jupyterlab-prodigy

🧬 A JupyterLab extension for annotating data with Prodigy

25   181   181  

googleLanguageR

R client for the Google Translation API, Google Cloud Natural Language...

39   180   180  

lineflow

:zap:A Lightweight NLP Data Loader for All Deep Learning Frameworks in...

9   178   178  

Books

My book list

142   178   178  

multihead-siamese-nets

Implementation of Siamese Neural Networks built upon multihead attenti...

42   177   177  

covid-papers-browser

Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖

26   176   176  

swagaf

Repository for paper "SWAG: A Large-Scale Adversarial Dataset for Grou...

38   175   175  

LiveActionMap

An attempt to map the areas with active conflict in Ukraine using open...

17   175   175  

spacymoji

💙 Emoji handling and meta data for spaCy with custom extension attribu...

17   174   174  

nuspell

🖋️ Fast and safe spellchecking C++ library

22   174   174  

TabularSemanticParsing

Translating natural language questions to a structured query language

55   173   173  

question_generation

It is a question-generator model. It takes text and an answer as input...

59   172   172  

persian-stopwords

Persian (Farsi) Stop Words List

116   171   171  

66Days__NaturalLanguageProcessing

I am sharing my Journey of 66DaysofData in Natural Language Processing...

58   170   170  

deeplearning.ai

93   169   169  

visdial-rl

PyTorch code for Learning Cooperative Visual Dialog Agents using Deep...

37   168   168