Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article proposing a test of machine intelligence, now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

Lbl2Vec

Lbl2Vec learns jointly embedded label, document and word vectors to re...

28   186   186  

KB-InfoBot

A dialogue bot for information access

64   185   185  

covid-papers-browser

Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠...

27   184   184  

speechly

Client libraries, examples and demos of Speechly API for the Web.

18   184   184  

minecraft-mcp-server

A Minecraft MCP Server powered by Mineflayer API. It allows to control...

18   184   184  

project-lakechain

⚡ Cloud-native, AI-powered, document processing pipelines on AWS.

28   184   184  

nervaluate

Full named-entity (i.e., not tag/token) evaluation metrics based on Se...

23   183   183  

BunkaTopics

🗺️ Data Cleaning and Textual Data Visualization 🗺️

17   183   183  

Recurrent-Convolutional-Neural-Network-Text-Classifier

My (slightly modified) Keras implementation of the Recurrent Convoluti...

65   183   183  

Hands-On-Natural-Language-Processing-with-Python

This repository is for my students of Udemy. You can find all lecture...

244   183   183  

hntitlenator

Test your HN title against a neural network

13   182   182  

multihead-siamese-nets

Implementation of Siamese Neural Networks built upon multihead attenti...

43   182   182  

extend

Entity Disambiguation as text extraction (ACL 2022)

13   182   182  

LangChain-Chat-with-Your-Data

Explore LangChain and build powerful chatbots that interact with your...

110   181   181  

lineflow

⚡ A Lightweight NLP Data Loader for All Deep Learning Frameworks in...

9   181   181  

spacymoji

💙 Emoji handling and meta data for spaCy with custom extension attrib...

20   181   181  

nel

Entity linking framework

37   180   180  

TextING

[ACL 2020] Tensorflow implementation for "Every Document Owns Its Stru...

58   180   180  

pytorch-pos-tagging

A tutorial on how to implement models for part-of-speech tagging using...

27   180   180  

artificial-intelligence

AI projects in python, mostly Jupyter notebooks.

47   179   179  

swagaf

Repository for paper "SWAG: A Large-Scale Adversarial Dataset for Grou...

40   178   178  

Resume-Job-Description-Matching

The purpose of this project was to defeat the current Application Trac...

86   177   177  

nlp-class

A Natural Language Processing course taught by Professor Ghassemi

67   177   177  

gpt2-dialogue-generation-pytorch

The PyTorch implementation of fine-tuning the GPT-2(Generative Pre-tra...

24   176   176  

Natural-Language-Processing-NLP-Roadmap

A simple roadmap to Natural Language Processing (NLP)

22   176   176  

char-cnn-text-classification-pytorch

Character-level Convolutional Neural Networks for text classification...

47   176   176  

AI-NLP-Paper-Readings

This is my reading list for my PhD in AI, NLP, Deep Learning and more.

26   175   175  

easy-bert

A Dead Simple BERT API for Python and Java (https://github.com/google-...

45   175   175  

LiveActionMap

An attempt to map the areas with active conflict in Ukraine using twit...

15   175   175  

deep-learning-for-nlp-lectures

Deep Learning for Natural Language Processing - Lectures 2023

30   174   174  

LLM-Drop

The official implementation of the paper "What Matters in Transformers...

22   174   174  

diffusion-of-thoughts

[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Tho...

15   174   174  

emeltal

Local ML voice chat using high-end models.

12   174   174  

FXDesktopSearch

A JavaFX based desktop search application.

41   174   174  

pymetamap

Python wrapper for MetaMap

62   174   174  

metaknowledge

A Python library for doing bibliometric and network analysis in scienc...

34   174   174  

chars2vec

Character-based word embeddings model based on RNN for handling real w...

38   173   173  

Turkish-Bert-NLP-Pipeline

Bert-base NLP pipeline for Turkish, Ner, Sentiment Analysis, Question...

21   173   173  

lexpredict-contraxsuite

LexPredict ContraxSuite

65   172   172  

cep

CEP is a software platform designed for users that want to learn or ra...

22   172   172  

lighthouse

[EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducib...

9   171   171  

qb

QANTA Quiz Bowl AI

48   171   171  

character-mining

Mining individual characters in multiparty dialogue

25   171   171  

question_generation

It is a question-generator model. It takes text and an answer as input...

58   170   170  

visdial-rl

PyTorch code for Learning Cooperative Visual Dialog Agents using Deep...

39   170   170  

learn-to-select-data

Code for Learning to select data for transfer learning with Bayesian O...

43   170   170  

NLP-pretrained-model

A collection of Natural language processing pre-trained models.

29   170   170  

huspacy

HuSpaCy: industrial-strength Hungarian natural language processing

16   170   170  

imodelsX

Interpret text data using LLMs (scikit-learn compatible).

28   169   169  

monkeylearn-python

Official Python client for the MonkeyLearn API. Build and consume mach...

44   169   169