Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

mynlp

一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感...

86   646   646  

WeCron

:heavy_check_mark: 微信上的定时提醒 - Cron on WeChat

110   642   642  

pyresparser

A simple resume parser used for extracting information from resumes

333   637   637  

Awesome-Korean-NLP

A curated list of resources for NLP (Natural Language Processing) for...

117   635   635  

ekphrasis

Ekphrasis is a text processing tool, geared towards text from social n...

92   634   634  

homer

Homer, a text analyser in Python, can help make your text more clear,...

37   634   634  

nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

175   627   627  

RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question an...

116   625   625  

nlpia

Examples and libraries for "Natural Language Processing in Action" boo...

266   623   623  

lexpredict-lexnlp

LexNLP by LexPredict

163   621   621  

cdQA

⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering Sys...

191   617   617  

babyai

BabyAI platform. A testbed for training agents to understand and execu...

140   614   614  

Chinese_models_for_SpaCy

SpaCy 中文模型 | Models for SpaCy that support Chinese

112   612   612  

SmoothNLP

专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Infer...

114   612   612  

Blackstone

:black_circle: A spaCy pipeline and model for NLP on unstructured lega...

97   611   611  

Natural-Language-Processing-Specialization

This repo contains my coursework, assignments, and Slides for Natural...

596   611   611  

poetry

汉语现代诗歌语料库整理,3489诗人,81.7K诗歌,15.43M字。持续扩充...

78   610   610  

seqGAN

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Ad...

147   606   606  

ifopt

An Eigen-based, light-weight C++ Interface to Nonlinear Programming So...

142   606   606  

DeepNLP-Course

Deep NLP Course

162   601   601  

weibo-analysis-and-visualization

使用python抓取微博数据并对微博文本分析和可视化,LDA(树图)、关系图、...

118   600   600  

articulate

A platform for building conversational interfaces with intelligent age...

151   596   596  

BotLibre

An open platform for artificial intelligence, chat bots, virtual agent...

227   595   595  

long-range-arena

Long Range Arena for Benchmarking Efficient Transformers

63   594   594  

graphbrain

Language, Knowledge, Cognition

68   592   592  

whatlanggo

Natural language detection library for Go

62   591   591  

Sentence-VAE

PyTorch Re-Implementation of "Generating Sentences from a Continuous S...

153   590   590  

primeqa

The prime repository for state-of-the-art Multilingual Question Answer...

50   590   590  

awesome-open-data-centric-ai

Curated list of open source tooling for data-centric AI on unstructure...

28   590   590  

Macropodus

自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中...

96   587   587  

attention-networks-for-classification

Hierarchical Attention Networks for Document Classification in PyTorch

136   586   586  

Building-a-Simple-Chatbot-in-Python-using-NLTK

Building a Simple Chatbot from Scratch in Python (using NLTK)

571   586   586  

obsidian-ava

Quickly format your notes with ChatGPT in Obsidian

15   586   586  

R-Net

Tensorflow Implementation of R-Net

215   585   585  

word_forms

Accurately generate all possible forms of an English word e.g "electio...

69   585   585  

datefinder

Find dates inside text using Python and get back datetime objects

160   584   584  

chat-bubble

Simple chatbot UI for the Web with JSON scripting 👋🤖🤙

172   584   584  

jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

33   582   582  

nlp-paper

NLP Paper

128   580   580  

lingua

The most accurate natural language detection library for Java and the...

53   580   580  

lite-transformer

[ICLR 2020] Lite Transformer with Long-Short Range Attention

77   576   576  

DensePhrases

ACL'2021: Learning Dense Representations of Phrases at Scale; EMNLP'20...

76   574   574  

neuspell

NeuSpell: A Neural Spelling Correction Toolkit

95   573   573  

OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize...

71   572   572  

OpenAttack

An Open-Source Package for Textual Adversarial Attack.

112   568   568  

DocProduct

Medical Q&A with Deep Language Models

158   567   567  

JamSpell

Modern spell checking library - accurate, fast, multi-language

95   567   567  

xgen

Salesforce open-source LLMs with 8k sequence length.

29   565   565  

xlnet-Pytorch

Simple XLNet implementation with Pytorch Wrapper

102   564   564  

voice-builder

An opensource text-to-speech (TTS) voice building tool

132   563   563