Most popular NLP repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers process, analyze, and generate human language. In 1950, Alan Turing published "Computing Machinery and Intelligence", which proposed a measure of intelligence now called the Turing test. More modern techniques, such as deep learning, have produced state-of-the-art results in language modeling, parsing, and many other natural-language tasks.

TextGAN-PyTorch

TextGAN is a PyTorch framework for Generative Adversarial Networks (GA...

190   800   800  

CodeT5

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

146   800   800  

TextClassification-Keras

Text classification models implemented in Keras, including: FastText,...

188   799   799  

self-attentive-parser

High-accuracy NLP parser with models for 11 languages.

150   799   799  

gector

Official implementation of the papers "GECToR – Grammatical Error Corr...

202   793   793  

inltk

Natural Language Toolkit for Indic Languages aims to provide out of th...

165   791   791  

bigscience

Central place for the engineering/scaling WG: documentation, SLURM scr...

75   790   790  

cltk

The Classical Language Toolkit

318   789   789  

StarryDivineSky

A curated selection of 10K+ projects, including machine learning, deep learning, NLP, GNN, recommender systems, biomedicine, ...

112   784   784  

Bert-Multi-Label-Text-Classification

This repo contains a PyTorch implementation of a pretrained BERT model...

193   783   783  

catalyst

🚀 Catalyst is a C# Natural Language Processing library built for spee...

79   776   776  

Paper-Reading

📖 Paper reading list in dialogue systems and natural language generat...

150   774   774  

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dicti...

31   769   769  

language-detection

A language detection library for PHP. Detects the language from a give...

81   766   766  

OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

59   758   758  

lstm-char-cnn-tensorflow

in progress

248   757   757  

awesome-qa

😎 A curated list of the Question Answering (QA)

105   757   757  

dbpedia-spotlight

DBpedia Spotlight is a tool for automatically annotating mentions of D...

201   748   748  

RLSeq2Seq

Deep Reinforcement Learning For Sequence to Sequence Models

162   748   748  

trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multili...

103   745   745  

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese (100 GB of Chinese pre-training text)

69   745   745  

awesome_deep_learning_interpretability

Highly cited and top-conference papers from recent years on the interpretability of neural network models in deep learning (with code)

123   741   741  

MiNLP

XiaoMi Natural Language Processing Toolkits

85   740   740  

tensorflow-tutorial

TensorFlow and Deep Learning Tutorials

210   734   734  

transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP task...

176   731   731  

pywsd

Python Implementations of Word Sense Disambiguation (WSD) Technologies...

133   719   719  

PURE

NAACL'2021: A Frustratingly Easy Approach for Entity and Relation Extr...

114   712   712  

naacl_transfer_learning_tutorial

Repository of code for the tutorial on Transfer Learning in NLP held a...

124   709   709  

mordecai

Full text geoparsing as a Python library

97   709   709  

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Prob...

316   701   701  

lingua-rs

The most accurate natural language detection library for Rust, suitabl...

26   696   696  

THUOCL

THUOCL (THU Open Chinese Lexicon), a Chinese lexicon

185   694   694  

bookcorpus

Crawl BookCorpus

94   694   694  

Python-ai-assistant

Python AI assistant 🧠

202   694   694  

albert_pytorch

A Lite BERT for Self-Supervised Learning of Language Representations

152   690   690  

nlp-pytorch-zh

Chinese translation of "Natural Language Processing with PyTorch"

180   688   688  

PatrickStar

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP...

58   686   686  

pypostal

Python bindings to libpostal for fast international address parsing/no...

82   685   685  

magpie

Deep neural network framework for multi-label text classification

192   683   683  

spacy-stanza

💥 Use the latest Stanza (StanfordNLP) research models directly in spa...

51   681   681  

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

108   678   678  

Annotated-Semantic-Relationships-Datasets

A collection of public and free annotated datasets of relationships b...

132   671   671  

meta

A Modern C++ Data Sciences Toolkit

264   670   670  

nboost

NBoost is a scalable, search-api-boosting platform for deploying trans...

69   663   663  

SpeedTorch

Library for faster pinned CPU <-> GPU transfer in PyTorch

40   661   661  

detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic...

87   659   659  

Deeplearning.ai-Natural-Language-Processing-Specialization

This repository contains my full work and notes on Coursera's NLP Spec...

483   655   655  

chat

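A chatbot based on natural language understanding and machine learning, supporting concurrent multi-user access and customizable multi-turn dialogue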

218   655   655  

Med-ChatGLM

Repo for Chinese Medical ChatGLM: instruction fine-tuning of ChatGLM based on Chinese medical knowledge

93   655   655  

griptape

Python framework for AI workflows and pipelines with chain of thought...

29   649   649