Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

openai-kotlin

OpenAI API client for Kotlin with multiplatform and coroutines capabil...

84   821   821  

LightAutoML

LAMA - automatic model creation framework

93   815   815  

BERT-keras

Keras implementation of BERT with pre-trained weights

195   813   813  

NLP-Tutorials

Simple implementations of NLP models. Tutorials are written in Chinese...

300   810   810  

awesome-gcn

resources for graph convolutional networks (图卷积神经网络相关资源)

131   808   808  

lightNLP

基于Pytorch和torchtext的自然语言处理深度学习框架。

217   805   805  

MiNLP

XiaoMi Natural Language Processing Toolkits

90   801   801  

TextGAN-PyTorch

TextGAN is a PyTorch framework for Generative Adversarial Networks (GA...

190   800   800  

CodeT5

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

146   800   800  

TextClassification-Keras

Text classification models implemented in Keras, including: FastText,...

188   799   799  

self-attentive-parser

High-accuracy NLP parser with models for 11 languages.

150   799   799  

gector

Official implementation of the papers "GECToR – Grammatical Error Corr...

202   793   793  

inltk

Natural Language Toolkit for Indic Languages aims to provide out of th...

165   791   791  

cltk

The Classical Language Toolkit

318   789   789  

lstm-char-cnn-tensorflow

in progress

241   784   784  

StarryDivineSky

精选了10K+项目,包括机器学习、深度学习、NLP、GNN、推荐系统、生物医药、...

112   784   784  

Bert-Multi-Label-Text-Classification

This repo contains a PyTorch implementation of a pretrained BERT model...

193   783   783  

catalyst

🚀 Catalyst is a C# Natural Language Processing library built for spee...

79   776   776  

Paper-Reading

📖 Paper reading list in dialogue systems and natural language generat...

150   774   774  

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dicti...

31   769   769  

language-detection

A language detection library for PHP. Detects the language from a give...

81   766   766  

Daily-DeepLearning

🔥机器学习/深度学习/Python/大模型/多模态/LLM/deeplearning/Python/Algor...

156   762   762  

OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

59   758   758  

awesome-qa

😎 A curated list of the Question Answering (QA)

105   757   757  

awesome_deep_learning_interpretability

深度学习近年来关于神经网络模型解释性的相关高引用/顶会论文(附带代码)

122   751   751  

dbpedia-spotlight

DBpedia Spotlight is a tool for automatically annotating mentions of D...

201   748   748  

RLSeq2Seq

Deep Reinforcement Learning For Sequence to Sequence Models

162   748   748  

trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multili...

103   745   745  

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

69   745   745  

tensorflow-tutorial

TensorFlow and Deep Learning Tutorials

209   732   732  

transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP task...

176   731   731  

pywsd

Python Implementations of Word Sense Disambiguation (WSD) Technologies...

133   719   719  

PURE

NAACL'2021: A Frustratingly Easy Approach for Entity and Relation Extr...

114   712   712  

naacl_transfer_learning_tutorial

Repository of code for the tutorial on Transfer Learning in NLP held a...

124   709   709  

mordecai

Full text geoparsing as a Python library

97   709   709  

sequence-labeling-BiLSTM-CRF

The BiLSTM-CRF model implementation in Tensorflow, for sequence labeli...

256   706   706  

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Prob...

316   701   701  

chat

基于自然语言理解与机器学习的聊天机器人,支持多用户并发及自定义多轮对话

217   699   699  

lingua-rs

The most accurate natural language detection library for Rust, suitabl...

26   696   696  

THUOCL

THUOCL(THU Open Chinese Lexicon)中文词库

185   694   694  

bookcorpus

Crawl BookCorpus

94   694   694  

Python-ai-assistant

Python AI assistant 🧠

202   694   694  

albert_pytorch

A Lite Bert For Self-Supervised Learning Language Representations

152   690   690  

nlp-pytorch-zh

《Natural Language Processing with PyTorch》中文翻译

180   688   688  

PatrickStar

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP...

58   686   686  

pypostal

Python bindings to libpostal for fast international address parsing/no...

82   685   685  

WeCron

:heavy_check_mark: 微信上的定时提醒 - Cron on WeChat

117   684   684  

magpie

Deep neural network framework for multi-label text classification

192   683   683  

spacy-stanza

💥 Use the latest Stanza (StanfordNLP) research models directly in spa...

51   681   681  

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

108   678   678