Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Turkish-Bert-NLP-Pipeline

Bert-base NLP pipeline for Turkish, Ner, Sentiment Analysis, Question...

22   146   146  

EMPaper

This is a repository for sharing papers in the field of empathetic con...

17   145   145  

classy-classification

This repository contains an easy and intuitive approach to few-shot cl...

9   145   145  

dialogflow-ruby-client

Ruby SDK for Dialogflow

30   144   144  

KoSentenceBERT-ETRI

Sentence Embeddings using Siamese ETRI KoBERT-Networks

24   144   144  

NeuralDialog-LaRL

PyTorch implementation of latent space reinforcement learning for E2E...

26   143   143  

stanza-old

Stanford NLP group's shared Python tools.

38   142   142  

RBERT

Implementation of BERT in R

17   142   142  

PersianQA

Persian (Farsi) Question Answering Dataset (+ Models)

10   142   142  

label-studio-transformers

Label data using HuggingFace's transformers and automatically get a pr...

28   142   142  

gpt2-dialogue-generation-pytorch

The PyTorch implementation of fine-tuning the GPT-2(Generative Pre-Tra...

21   142   142  

awesome-ai-services

An overview of the AI-as-a-service landscape

20   141   141  

TwitterNER

Twitter named entity extraction for WNUT 2016 http://noisy-text.github...

32   140   140  

jury

Comprehensive NLP Evaluation System

12   140   140  

Data-Science-and-Machine-Learning-Projects-Dojo

collections of data science, machine learning and data visualization p...

59   140   140  

Deep-Lyrics

Lyrics Generator aka Character-level Language Modeling with Multi-laye...

25   139   139  

MachineLearningWithPython

Get started with Machine Learning with Python - An introduction with P...

84   138   138  

CocoaAI

🤖 The Cocoa Artificial Intelligence Lab

13   137   137  

Scattertext-PyData

Notebooks for the Seattle PyData 2017 talk on Scattertext

50   136   136  

lingo

package lingo provides the data structures and algorithms required for...

16   136   136  

GAIN

Source code for EMNLP 2020 paper: Double Graph Based Reasoning for Doc...

28   136   136  

nested-ner-tacl2020-transformers

Implementation of Nested Named Entity Recognition using BERT

26   136   136  

mgpt

Multilingual Generative Pretrained Model

8   136   136  

spokestack-python

Spokestack is a library that allows a user to easily incorporate a voi...

13   135   135  

linguistic-style-transfer

Neural network parametrized objective to disentangle and transfer styl...

32   135   135  

detecting-scientific-claim

Extracting scientific claims from biomedical abstracts (powered by All...

19   135   135  

MT-DNN

Multi-Task Deep Neural Networks for Natural Language Understanding

27   135   135  

docprompting

Data and code for "DocPrompting: Generating Code by Retrieving the Doc...

8   135   135  

compling_nlp_hse_course

Материалы курса по компьютерной лингвистике Школы Лингвистики НИУ ВШЭ

67   134   134  

fnc-1-baseline

A baseline implementation for FNC-1

110   133   133  

abstractive_summarizer

Abstractive Text Summarization using Transformer

50   133   133  

kor

Where Erwin's cat feels alive 😽

7   133   133  

augmenty

Augmenty is an augmentation library based on spaCy for augmenting text...

8   132   132  

clicr

Machine reading comprehension on clinical case reports

40   131   131  

character-mining

Mining individual characters in multiparty dialogue

20   130   130  

ake-datasets

Large, curated set of benchmark datasets for evaluating automatic keyp...

26   130   130  

iNeuron-Full-Stack-Data-Science-Assignments

This Repository consists of Assignments and projects of the iNeuron Fu...

147   130   130  

natural-language-preprocessings

Some recipes of natural language pre-processing

27   129   129  

cotk

Conversational Toolkit. An Open-Source Toolkit for Fast Development an...

27   129   129  

sling

SLING - A natural language frame semantics parser

11   129   129  

NLPCC-WordSeg-Weibo

NLPCC 2016 微博分词评测项目

45   128   128  

spf

Cornell Semantic Parsing Framework

13   128   128  

mongolian-nlp

Useful resources for Mongolian NLP

33   128   128  

data-science-tutorials

Python Tutorials for Data Science

33   128   128  

Distill-BERT-Textgen

Research code for ACL 2020 paper: "Distilling Knowledge Learned in BER...

17   127   127  

nlp_workshop_odsc_europe20

Extensive tutorials for the Advanced NLP Workshop in Open Data Science...

65   127   127  

Ask2Transformers

A Framework for Textual Entailment based Zero Shot text classification

13   127   127  

extend

Entity Disambiguation as text extraction (ACL 2022)

6   127   127  

keita

My personal toolkit for PyTorch development.

12   126   126  

asari

Japanese sentiment analyzer implemented in Python.

18   126   126