Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

cs224n

Stanford CS224n: Natural Language Processing with Deep Learning, Winte...

41   101   101  

DeezyMatch

A Flexible Deep Learning Approach to Fuzzy String Matching

29   101   101  

ATIS.keras

Spoken Language Understanding(SLU)/Slot Filling in Keras

41   100   100  

stminsights

A Shiny Application for Inspecting Structural Topic Models

14   100   100  

learn-deep-learning

AI Summer's complete catalog of articles

24   100   100  

DeepLearning.AI-TensorFlow-Developer-Course

DeepLearning.AI TensorFlow Developer Professional Certificate -Courser...

82   100   100  

tf-idf-python

Term frequency–inverse document frequency for Chinese novel/documents...

35   99   99  

DeepAligned-Clustering

Discovering New Intents with Deep Aligned Clustering (AAAI 2021)

17   99   99  

pytreebank

:rage::innocent: Stanford Sentiment Treebank loader in Python

23   98   98  

NLP_Toolkit

Library of state-of-the-art models (PyTorch) for NLP tasks

25   98   98  

Resume-Job-Description-Matching

The purpose of this project was to defeat the current Application Trac...

74   98   98  

recon

Recon NER, Debug and correct annotated Named Entity Recognition (NER)...

2   98   98  

Lbl2Vec

Lbl2Vec learns jointly embedded label, document and word vectors to re...

21   98   98  

neural-vqa-attention

:question: Attention-based Visual Question Answering in Torch

32   97   97  

EN-FA-CS-Dictionary

:speech_balloon: An English-Persian Dictionary of Computer Science and...

12   97   97  

ml-classify-text-js

Machine learning based text classification in JavaScript using n-grams...

11   97   97  

KoSentenceBERT-SKT

Sentence Embeddings using Siamese SKT KoBERT-Networks

26   97   97  

text_analytics

Basic text analytics and natural language processing in Python

45   97   97  

ruimtehol

R package to Embed All the Things! using StarSpace

11   96   96  

dialog-nlu

Tensorflow and Keras implementation of the state of the art researches...

40   96   96  

textblob-de

German language support for TextBlob.

13   96   96  

estnltk

Open source tools for Estonian natural language processing

18   96   96  

KenLM-training

Training an n-gram based Language Model using KenLM toolkit for Deep S...

17   96   96  

CoLAKE

COLING'2020: CoLAKE: Contextualized Language and Knowledge Embedding

20   96   96  

Enso

Enso: An Open Source Library for Benchmarking Embeddings + Transfer Le...

12   95   95  

practical-open

Oxford Deep NLP 2017 course - Open practical

73   95   95  

parsinlu

A comprehensive suite of high-level NLP tasks for Persian language

15   95   95  

Dual-Contrastive-Learning

Code for our paper "Dual Contrastive Learning: Text Classification via...

21   95   95  

tensorflow-font2char2word2sent2doc

TensorFlow implementation of Hierarchical Attention Networks for Docum...

31   94   94  

NLP

This is where I put all my work in Natural Language Processing

48   94   94  

practical-3

Oxford Deep NLP 2017 course - Practical 3: Text Classification with R...

75   94   94  

embedbase

The open source database for ChatGPT

5   94   94  

Arch-Data-Science

Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning,...

4   93   93  

Semantic-Texual-Similarity-Toolkits

Semantic Textual Similarity (STS) measures the degree of equivalence i...

24   93   93  

Twitter-Sentiment-Analysis-Classical-Approach-VS-Deep-Learning

This project's aim, is to explore the world of Natural Language Proces...

27   93   93  

Copycat-abstractive-opinion-summarizer

ACL 2020 Unsupervised Opinion Summarization as Copycat-Review Generati...

30   93   93  

Aspect-Based-Sentiment-Analysis

A paper list for aspect based sentiment analysis.

19   93   93  

emdr2

Code and Models for the paper "End-to-End Training of Multi-Document R...

10   93   93  

doc2vec-api

document embedding and machine learning script for beginners

35   92   92  

SimpleDNN

SimpleDNN is a machine learning lightweight open-source library writte...

8   92   92  

teanaps

자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.

10   92   92  

asent

Asent is a python library for performing efficient and transparent sen...

12   92   92  

deep-learning

Assignmends done for Udacity's Deep Learning MOOC with Vincent Vanhouc...

69   91   91  

dexter

Let your talking do the code

19   91   91  

word2vec-from-scratch-with-python

A very simple, bare-bones, inefficient, implementation of skip-gram wo...

46   91   91  

multiplex-plot

Multiplex: visualizations that tell stories—A Python library to create...

12   91   91  

wink-nlp-utils

NLP Functions for amplifying negations, managing elisions, creating ng...

10   91   91  

doccano-transformer

The official tool for transforming doccano format into common dataset...

27   91   91  

presidio-research

This package features data-science related tasks for developing new re...

40   91   91  

PaperScraper

A web scraping tool to systematically extract the text of scientific p...

39   91   91