Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article that proposed a test of machine intelligence, now called the Turing test. More recent techniques, such as deep learning, have advanced language modeling, parsing, and other natural-language tasks.

this-word-does-not-exist

This Word Does Not Exist

84   1021   1021  

Summarization-Papers

Summarization Papers

146   1014   1014  

nlp-notebooks

A collection of notebooks for Natural Language Processing from NLP Tow...

381   1005   1005  

gpt-2-Pytorch

Simple text generator with OpenAI GPT-2 PyTorch implementation

228   1003   1003  

awesome-language-agents

List of language agents based on paper "Cognitive Architectures for La...

67   1001   1001  

ThoughtSource

A central, open resource for data and tools related to chain-of-though...

80   987   987  

lingua-rs

The most accurate natural language detection library for Rust, suitabl...

47   983   983  

clean-text

🧹 Python package for text cleaning

80   982   982  

Prompt4ReasoningPapers

[ACL 2023] Reasoning with Language Model Prompting: A Survey

68   978   978  

SoulverCore

A powerful Swift framework for evaluating natural language math expres...

40   973   973  

YouTokenToMe

Unsupervised text tokenizer focused on computational efficiency

107   972   972  

Llama-2-Open-Source-LLM-CPU-Inference

Running Llama 2 and other Open-Source LLMs on CPU Inference Locally fo...

210   966   966  

LLM-Blender

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework...

88   957   957  

wikipedia2vec

A tool for learning vector representations of words and entities from...

102   953   953  

P-tuning

A novel method to tune language models. Codes and datasets for paper `...

114   936   936  

awesome-ai-awesomeness

A curated list of awesome awesomeness about artificial intelligence

118   931   931  

gector

Official implementation of the papers "GECToR – Grammatical Error Corr...

221   931   931  

conformal-prediction

Lightweight, useful implementation of conformal prediction on real dat...

110   929   929  

skweak

skweak: A software toolkit for weak supervision applied to NLP tasks

77   926   926  

torchMoji

😇 A PyTorch implementation of the DeepMoji model: state-of-the-art dee...

191   925   925  

jcseg

Jcseg is a lightweight NLP framework developed with Java. Provide CJK...

213   921   921  

keras-hub

Pretrained model hub for Keras 3.

291   921   921  

notes

Learn about Machine Learning and Artificial Intelligence

237   918   918  

TextGAN-PyTorch

TextGAN is a PyTorch framework for Generative Adversarial Networks (GA...

209   916   916  

iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI a...

323   909   909  

GNN4NLP-Papers

A list of recent papers about Graph Neural Network methods applied in...

138   909   909  

Transformers-for-NLP-2nd-Edition

Transformer models from BERT to GPT-4, environments from Hugging Face...

344   908   908  

multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Bud...

207   907   907  

ML-University

Machine Learning Open Source University

115   906   906  

Coursera

Quiz & Assignment of Coursera

662   903   903  

pyresparser

A simple resume parser used for extracting information from resumes

439   899   899  

bertsearch

Elasticsearch with BERT for advanced document search.

203   899   899  

self-attentive-parser

High-accuracy NLP parser with models for 11 languages.

160   892   892  

my-cs-degree

A CS degree with a focus on full-stack ML engineering, 2020

140   889   889  

factool

FacTool: Factuality Detection in Generative AI

67   887   887  

mindnlp

Easy-to-use and high-performance NLP and LLM framework based on MindSp...

256   883   883  

imbalanced-regression

[ICML 2021, Long Talk] Delving into Deep Imbalanced Regression

140   883   883  

awesome-llm-role-playing-with-persona

Awesome-llm-role-playing-with-persona: a curated list of resources for...

45   876   876  

text2vec

Fast vectorization, topic modeling, distances and GloVe word embedding...

132   866   866  

quanteda

An R package for the Quantitative Analysis of Textual Data

188   863   863  

transformers-tutorials

GitHub repo with tutorials to fine-tune transformers for different NLP task...

193   857   857  

booknlp

BookNLP, a natural language processing pipeline for books

110   857   857  

datacamp-python-data-science-track

All the slides, accompanying code, and exercises stored in this rep...

525   853   853  

DL-NLP-Readings

My Reading Lists of Deep Learning and Natural Language Processing

260   850   850  

Natural-Language-Processing-Specialization

This repo contains my coursework, assignments, and slides for Natural...

704   847   847  

portuguese-bert

Portuguese pre-trained BERT models

129   841   841  

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

118   840   840  

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dicti...

34   837   837  

DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

76   829   829  

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Prob...

334   827   827