Most popular NLP repositories and open source projects

Natural language processing (NLP) is a field of computer science concerned with the interaction between computers and human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a test of machine intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

chat

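A chatbot based on natural language understanding and machine learning, supporting concurrent multi-user access and custom multi-turn dialogue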

217   700   700  

THUOCL

THUOCL (THU Open Chinese Lexicon), a Chinese lexicon

185   694   694  

bookcorpus

Crawl BookCorpus

94   694   694  

Python-ai-assistant

Python AI assistant 🧠

202   694   694  

albert_pytorch

A Lite BERT for Self-Supervised Learning of Language Representations

152   690   690  

nlp-pytorch-zh

Chinese translation of "Natural Language Processing with PyTorch"

180   688   688  

PatrickStar

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP...

58   686   686  

SpeedTorch

Library for faster pinned CPU <-> GPU transfer in PyTorch

40   685   685  

pypostal

Python bindings to libpostal for fast international address parsing/no...

82   685   685  

WeCron

✔ Scheduled reminders on WeChat - Cron on WeChat

117   684   684  

magpie

Deep neural network framework for multi-label text classification

192   683   683  

DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transf...

172   682   682  

MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing...

58   678   678  

nboost

NBoost is a scalable, search-api-boosting platform for deploying trans...

69   673   673  

Annotated-Semantic-Relationships-Datasets

A collection of public and free annotated datasets of relationships b...

132   671   671  

meta

A Modern C++ Data Sciences Toolkit

264   670   670  

Octopii

An AI-powered Personally Identifiable Information (PII) scanner.

58   668   668  

whatlanggo

Natural language detection library for Go

66   664   664  

Awesome-Korean-NLP

A curated list of resources for NLP (Natural Language Processing) for...

116   661   661  

detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic...

87   659   659  

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training...

42   658   658  

Med-ChatGLM

Repo for Chinese Medical ChatGLM: instruction fine-tuning of ChatGLM on Chinese medical knowledge

93   655   655  

Deeplearning.ai-Natural-Language-Processing-Specialization

This repository contains my full work and notes on Coursera's NLP Spec...

483   655   655  

obsidian-ava

Quickly format your notes with ChatGPT in Obsidian

17   654   654  

griptape

Python framework for AI workflows and pipelines with chain of thought...

29   649   649  

seqGAN

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Ad...

150   647   647  

mynlp

A production-grade, high-performance, modular, extensible Chinese NLP toolkit. (Chinese word segmentation, averaged...

86   646   646  

COMET

A Neural Framework for MT Evaluation

94   643   643  

nlprule

A fast, low-resource Natural Language Processing and Text Correction l...

39   641   641  

VnCoreNLP

A Vietnamese natural language processing toolkit (NAACL 2018)

153   635   635  

word_forms

Accurately generate all possible forms of an English word, e.g. "electio...

72   634   634  

ekphrasis

Ekphrasis is a text processing tool, geared towards text from social n...

92   634   634  

homer

Homer, a text analyser in Python, can help make your text more clear,...

37   634   634  

nlpia

Examples and libraries for "Natural Language Processing in Action" boo...

264   631   631  

RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question an...

116   625   625  

small-text

Active Learning for Text Classification in Python

71   621   621  

lexpredict-lexnlp

LexNLP by LexPredict

163   621   621  

BotLibre

An open platform for artificial intelligence, chat bots, virtual agent...

229   620   620  

KoELECTRA

Pretrained ELECTRA Model for Korean

137   620   620  

cdQA

⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering Sys...

191   616   616  

babyai

BabyAI platform. A testbed for training agents to understand and execu...

140   614   614  

graphbrain

Language, Knowledge, Cognition

70   614   614  

Chinese_models_for_SpaCy

Chinese models for spaCy | Models for spaCy that support Chinese

112   612   612  

SmoothNLP

Focused on explainable NLP techniques. An NLP Toolset With A Focus on Explainable Infer...

114   612   612  

indonlu

The first-ever vast natural language processing benchmark for Indonesi...

204   612   612  

Blackstone

⚫ A spaCy pipeline and model for NLP on unstructured lega...

97   611   611  

poetry

A curated corpus of modern Chinese poetry: 3,489 poets, 81.7K poems, 15.43M characters. Continuously expanding...

78   610   610  

ifopt

An Eigen-based, light-weight C++ Interface to Nonlinear Programming So...

142   606   606  

DeepNLP-Course

Deep NLP Course

162   601   601  

articulate

A platform for building conversational interfaces with intelligent age...

151   596   596