Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

OPUS-MT-train

Training open neural machine translation models

45   371   371  

KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inferenc...

34   368   368  

attention-mechanisms

Implementations for a family of attention mechanisms, suitable for all...

83   362   362  

melusine

📧 Melusine: Use python to automatize your email processing workflow

58   361   361  

tacred-relation

PyTorch implementation of the position-aware attention model for relat...

97   360   360  

ABSAPapers

Worth-reading papers and related awesome resources on aspect-based sen...

62   360   360  

MixText

MixText: Linguistically-Informed Interpolation of Hidden Space for Sem...

62   358   358  

adam_qas

ADAM - A Question Answering System. Inspired from IBM Watson

106   356   356  

tweetnlp

TweetNLP for all the NLP enthusiasts working on Twitter! The Python li...

34   356   356  

multimodal-sentiment-analysis

Attention-based multimodal fusion for sentiment analysis

74   355   355  

Artificial-Intelligence-And-Data-Science-Pro

Regularly Updated | Collection of the best Data Science and AI Materi...

147   354   354  

MultiMed

[LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 202...

36   354   354  

coursera-natural-language-processing-specialization

Programming assignments from all courses in the Coursera Natural Langu...

334   352   352  

AwesomeFakeNews

This repository contains recent research on fake news.

78   352   352  

NNDIAL

NNDial is an open source toolkit for building end-to-end trainable tas...

104   350   350  

awesome-nlprojects

List of projects related to Natural Language Processing (NLP) that mak...

89   349   349  

megabots

🤖 State-of-the-art, production ready LLM apps made mega-easy, so you...

34   348   348  

tensorlayer-tricks

How to use TensorLayer

62   347   347  

polish-nlp-resources

Pre-trained models and language resources for Natural Language Process...

32   347   347  

Entity-Linking-Recent-Trends

Recent trends of Entity Linking, Disambiguation, and Representation.

18   346   346  

displacy

:boom: displaCy.js: An open-source NLP visualiser for the modern web

75   345   345  

100-Days-of-NLP

120   344   344  

pyss3

A Python package implementing a new interpretable machine learning mod...

44   343   343  

BMList

A List of Big Models

14   343   343  

awesome-list-of-awesomes

A curated list of all the Awesome --Topic Name-- lists I've found till...

47   341   341  

dataset

darija <-> english dataset

117   337   337  

PyTorch-Beam-Search-Decoding

PyTorch implementation of beam search decoding for seq2seq models

64   337   337  

GPT2

PyTorch Implementation of OpenAI GPT-2

65   337   337  

hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Langua...

52   336   336  

ner

Named Entity Recognition

64   335   335  

chakin

Simple downloader for pre-trained word vectors

48   334   334  

ChemDataExtractor

Automatically extract chemical information from scientific documents

120   334   334  

cherche

Neural Search

15   333   333  

data_management_LLM

Collection of training data management explorations for large language...

31   331   331  

chatbot_ner

chatbot_ner: Named Entity Recognition for chatbots.

133   330   330  

Dynamic-memory-networks-in-Theano

Implementation of Dynamic memory networks by Kumar et al. http://arxiv...

108   329   329  

WhatsAppInfoBot

A Framework to Build Bots

99   327   327  

OpenUE

[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text

59   327   327  

CS224N-2019

My completed solutions for CS224N 2021 & 2019

123   326   326  

NLPython

This repository contains the code related to Natural Language Processi...

206   323   323  

voltaserve

⚡️ Reality OS for Creators

17   322   322  

Binder

[ICLR 2023] Code for the paper "Binding Language Models in Symbolic La...

36   321   321  

cybertron

Cybertron: the home planet of the Transformers in Go

27   319   319  

byteNet-tensorflow

ByteNet for character-level language modelling

67   319   319  

conllu

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it in...

52   317   317  

MINERVA

Meandering In Networks of Entities to Reach Verisimilar Answers

89   317   317  

rasa-chatbot-templates

RASA chatbot use case boilerplate

197   317   317  

PyContinual

PyContinual (An Easy and Extendible Framework for Continual Learning)

66   317   317  

dostoevsky

Sentiment analysis library for russian language

35   316   316  

book-nlp

Natural language processing pipeline for book-length documents (archiv...

48   315   315