Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

175   627   627  

RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question ans...

116   625   625  

lexpredict-lexnlp

LexNLP by LexPredict

163   621   621  

babyai

BabyAI platform. A testbed for training agents to understand and execu...

140   614   614  

awesome-huggingface

🤗 A list of wonderful open-source projects & applications integrated w...

46   613   613  

Chinese_models_for_SpaCy

SpaCy 中文模型 | Models for SpaCy that support Chinese

112   612   612  

SmoothNLP

专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Infer...

114   612   612  

Blackstone

:black_circle: A spaCy pipeline and model for NLP on unstructured lega...

97   611   611  

Natural-Language-Processing-Specialization

This repo contains my coursework, assignments, and Slides for Natural...

596   611   611  

poetry

汉语现代诗歌语料库整理,3489诗人,81.7K诗歌,15.43M字。持续扩充...

78   610   610  

seqGAN

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Ad...

147   606   606  

ifopt

An Eigen-based, light-weight C++ Interface to Nonlinear Programming So...

142   606   606  

DeepNLP-Course

Deep NLP Course

162   601   601  

weibo-analysis-and-visualization

使用python抓取微博数据并对微博文本分析和可视化,LDA(树图)、关系图、...

118   600   600  

cdQA

⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering Syst...

193   599   599  

articulate

A platform for building conversational interfaces with intelligent age...

151   596   596  

long-range-arena

Long Range Arena for Benchmarking Efficient Transformers

63   594   594  

whatlanggo

Natural language detection library for Go

62   591   591  

primeqa

The prime repository for state-of-the-art Multilingual Question Answer...

50   590   590  

awesome-open-data-centric-ai

Curated list of open source tooling for data-centric AI on unstructure...

28   590   590  

nlpia

Examples and libraries for "Natural Language Processing in Action" boo...

251   587   587  

Macropodus

自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中...

96   587   587  

attention-networks-for-classification

Hierarchical Attention Networks for Document Classification in PyTorch

136   586   586  

obsidian-ava

Quickly format your notes with ChatGPT in Obsidian

15   586   586  

R-Net

Tensorflow Implementation of R-Net

215   585   585  

word_forms

Accurately generate all possible forms of an English word e.g "electio...

69   585   585  

datefinder

Find dates inside text using Python and get back datetime objects

160   584   584  

jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

33   582   582  

nlp-paper

NLP Paper

128   580   580  

lingua

The most accurate natural language detection library for Java and the...

53   580   580  

lite-transformer

[ICLR 2020] Lite Transformer with Long-Short Range Attention

77   576   576  

DensePhrases

ACL'2021: Learning Dense Representations of Phrases at Scale; EMNLP'20...

76   574   574  

Sentence-VAE

PyTorch Re-Implementation of "Generating Sentences from a Continuous S...

156   573   573  

neuspell

NeuSpell: A Neural Spelling Correction Toolkit

95   573   573  

OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize...

71   572   572  

OpenAttack

An Open-Source Package for Textual Adversarial Attack.

112   568   568  

JamSpell

Modern spell checking library - accurate, fast, multi-language

95   567   567  

xgen

Salesforce open-source LLMs with 8k sequence length.

29   565   565  

xlnet-Pytorch

Simple XLNet implementation with Pytorch Wrapper

102   564   564  

voice-builder

An opensource text-to-speech (TTS) voice building tool

132   563   563  

lingua-py

The most accurate natural language detection library for Python, suita...

26   563   563  

stanford-openie-python

Stanford Open Information Extraction made simple!

101   561   561  

OpenHowNet

Core Data of HowNet and OpenHowNet Python API

87   561   561  

vocabulary

[Not Maintained anymore] Python Module to get Meanings, Synonyms and w...

77   556   556  

catalyst

🚀 Catalyst is a C# Natural Language Processing library built for speed...

64   556   556  

KoELECTRA

Pretrained ELECTRA Model for Korean

137   553   553  

DocProduct

Medical Q&A with Deep Language Models

155   546   546  

NLP_Quickbook

NLP in Python with Deep Learning

226   545   545  

japanese-pretrained-models

Code for producing Japanese pretrained models provided by rinna Co., L...

40   543   543  

QA

使用深度学习算法实现的中文问答系统

234   542   542