Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

datacamp-python-data-science-track

All the slides, accompanying code and exercises all stored in this rep...

525   853   853  

Natural-Language-Processing-Specialization

This repo contains my coursework, assignments, and Slides for Natural...

704   847   847  

aiva

AIVA (A.I. Virtual Assistant): General-purpose virtual assistant for d...

597   840   840  

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

118   840   840  

seq2seq-chatbot

Chatbot in 200 lines of code using TensorLayer

313   839   839  

wit

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multili...

36   838   838  

Chatito

🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recogn...

163   837   837  

bert4torch

An elegent pytorch implement of transformers

104   837   837  

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dicti...

34   837   837  

MemN2N-tensorflow

"End-To-End Memory Networks" in Tensorflow

249   827   827  

language-detection

A language detection library for PHP. Detects the language from a give...

86   827   827  

WEB_KG

爬取百度百科中文页面,抽取三元组信息,构建中文知识图谱

188   823   823  

ChatIE

The online version is temporarily unavailable because we cannot afford...

67   822   822  

openai-kotlin

OpenAI API client for Kotlin with multiplatform and coroutines capabil...

84   821   821  

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Prob...

333   820   820  

LightAutoML

LAMA - automatic model creation framework

93   815   815  

BERT-keras

Keras implementation of BERT with pre-trained weights

195   813   813  

NLP-Tutorials

Simple implementations of NLP models. Tutorials are written in Chinese...

300   810   810  

awesome-gcn

resources for graph convolutional networks (图卷积神经网络相关资源)

131   808   808  

lightNLP

基于Pytorch和torchtext的自然语言处理深度学习框架。

217   805   805  

MiNLP

XiaoMi Natural Language Processing Toolkits

90   801   801  

catalyst

🚀 Catalyst is a C# Natural Language Processing library built for spee...

83   801   801  

CodeT5

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

146   800   800  

TextClassification-Keras

Text classification models implemented in Keras, including: FastText,...

188   799   799  

inltk

Natural Language Toolkit for Indic Languages aims to provide out of th...

165   791   791  

cltk

The Classical Language Toolkit

318   789   789  

lstm-char-cnn-tensorflow

in progress

241   784   784  

StarryDivineSky

精选了10K+项目,包括机器学习、深度学习、NLP、GNN、推荐系统、生物医药、...

112   784   784  

Bert-Multi-Label-Text-Classification

This repo contains a PyTorch implementation of a pretrained BERT model...

193   783   783  

OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize...

115   781   781  

Paper-Reading

📖 Paper reading list in dialogue systems and natural language generat...

150   774   774  

lingua

The most accurate natural language detection library for Java and the...

72   765   765  

Daily-DeepLearning

🔥机器学习/深度学习/Python/大模型/多模态/LLM/deeplearning/Python/Algor...

156   762   762  

trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multili...

103   761   761  

OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

59   758   758  

awesome-qa

😎 A curated list of the Question Answering (QA)

105   757   757  

awesome_deep_learning_interpretability

深度学习近年来关于神经网络模型解释性的相关高引用/顶会论文(附带代码)

122   751   751  

dbpedia-spotlight

DBpedia Spotlight is a tool for automatically annotating mentions of D...

201   748   748  

RLSeq2Seq

Deep Reinforcement Learning For Sequence to Sequence Models

162   748   748  

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

69   745   745  

OpenAttack

An Open-Source Package for Textual Adversarial Attack.

131   741   741  

spacy-stanza

💥 Use the latest Stanza (StanfordNLP) research models directly in spa...

62   739   739  

primeqa

The prime repository for state-of-the-art Multilingual Question Answer...

57   736   736  

PromptKG

PromptKG Family: a Gallery of Prompt Learning & KG-related research wo...

75   733   733  

tensorflow-tutorial

TensorFlow and Deep Learning Tutorials

209   732   732  

pywsd

Python Implementations of Word Sense Disambiguation (WSD) Technologies...

133   719   719  

PURE

NAACL'2021: A Frustratingly Easy Approach for Entity and Relation Extr...

114   712   712  

naacl_transfer_learning_tutorial

Repository of code for the tutorial on Transfer Learning in NLP held a...

124   709   709  

mordecai

Full text geoparsing as a Python library

97   709   709  

sequence-labeling-BiLSTM-CRF

The BiLSTM-CRF model implementation in Tensorflow, for sequence labeli...

256   706   706