Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Machine-Learning

Awesome list (courses, books, videos etc.) and implementation of Machi...

15   61   61  

Deep-Learning-Pytorch

A repo containing code covering various aspects of deep learning on Py...

20   60   60  

char-rnn-tensorflow

Multi-layer Recurrent Neural Networks for character-level language mod...

13   60   60  

Emotion-Detector

A python code to detect emotions from text

56   60   60  

NLP-Extractive-NEWS-summarization-using-MMR

A simple python implementation of the Maximal Marginal Relevance (MMR)...

39   60   60  

SCDV

Text classification with Sparse Composite Document Vectors.

18   60   60  

Zero-to-Hero-in-NLP

This repository contains A-Z techniques of Natural Language Processing...

22   60   60  

grasp

Essential NLP & ML, short & fast pure Python code

16   60   60  

SkillsExtractorCognitiveSearch

Azure Search Cognitive Skill to extract technical and business skills...

43   60   60  

Multi-hopRC

:notebook_with_decorative_cover: notes for Multi-hop Reading Comprehen...

8   60   60  

instant-segment

Fast English word segmentation in Rust

3   60   60  

LTP

[KDD'22] Learned Token Pruning for Transformers

10   60   60  

ELK_twitter

This is a data pipeline for Twitter (ETL) using the elastic stack Elas...

27   59   59  

ucto

Unicode tokeniser. Ucto tokenizes text files: it separates words from...

13   59   59  

adan

Language-Adversarial Training for Cross-Lingual Text Classification (T...

13   59   59  

podium

Podium: a framework agnostic Python NLP library for data loading and p...

1   59   59  

rita-dsl

A Domain Specific Language (DSL) for building language patterns. These...

3   59   59  

Free-Artificial-Intelligence-Resources

Welcome, to this Open Source Repository regarding FREE ARTIFICIAL INTE...

23   59   59  

MAX-Text-Sentiment-Classifier

Detect the sentiment captured in short pieces of text

30   59   59  

Kaggle-Quora-Question-Pairs

This is our team's solution report, which achieves top 10% (305/3307)...

15   58   58  

nlp_newsletter

Natural language processing (NLP) newsletter right on GitHub

3   58   58  

europeananp-ner

Named Entities Recognition Annotator Tool for Europeana Newspapers

8   58   58  

li_emnlp_2017

Deep Recurrent Generative Decoder for Abstractive Text Summarization i...

17   58   58  

IOB2Corpus

Japanese IOB2 tagged corpus for Named Entity Recognition.

19   58   58  

KitanaQA

KitanaQA: Adversarial training and data augmentation for neural questi...

10   58   58  

ResumeFilter

Extracting relevant information from resume using deep learning.

33   58   58  

VideoSearchEngine

Semantically be able to search through a database of videos (using gen...

22   58   58  

joint-lstm-parser

Transition-based joint syntactic dependency parser and semantic role l...

8   57   57  

innatis

A Rasa NLU component library

13   57   57  

Voice_ChatBot

Chatbot in russian with speech recognition using PocketSphinx and spee...

18   57   57  

crfsuite

Labelling Sequential Data in Natural Language Processing with R - usin...

10   56   56  

nlg-games

A curated list of digital games that use Natural Language Generation t...

6   56   56  

nodejs-bot-platform

Node.js bot and personal assistants platform. Boilerplate, start point...

6   55   55  

code-intelligence

ML-Powered Developer Tools, using Kubeflow

20   55   55  

ml.manash.me

(Deprecated) Visit this one -> https://bn.mlmodel.fit/

42   55   55  

KERMIT

🐸 KERMIT - A lightweight library to encode and interpret Universal Sy...

7   55   55  

zeugma

📝Natural language processing (NLP) utils: word embeddings (Word2Vec, G...

4   54   54  

PLE

Label Noise Reduction in Entity Typing (KDD'16)

15   53   53  

spark-nkp

Natural Korean Processor for Apache Spark

17   53   53  

legi.py

Outils de manipulation des archives LEGI (lois françaises)

19   53   53  

py-nltools

A collection of basic python modules for spoken natural language proce...

15   53   53  

mtnt

Code for the collection and analysis of the MTNT dataset

4   53   53  

liblevenshtein-java

Various utilities regarding Levenshtein transducers. (Java)

19   53   53  

text-classification-baseline

Pipeline for fast building text classification TF-IDF + LogReg baselin...

5   53   53  

notes

The notes for Math, Machine Learning, Deep Learning and Research paper...

13   52   52  

thot

Thot toolkit for statistical machine translation

12   52   52  

CS224n-solutions

Solutions for CS224n course from Stanford University: Natural Language...

23   52   52  

PyLDA

A Latent Dirichlet Allocation implementation in Python.

21   52   52  

GreynirPackage

The Greynir NLP parser for Icelandic, packaged for PyPI

10   52   52  

AdvancedIndexing-PyTorch

(Batched) advanced indexing for PyTorch.

2   52   52