Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Bert-Multi-Label-Text-Classification

This repo contains a PyTorch implementation of a pretrained BERT model...

193   783   783  

Paper-Reading

📖 Paper reading list in dialogue systems and natural language generati...

150   774   774  

language-detection

A language detection library for PHP. Detects the language from a give...

81   766   766  

OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

59   758   758  

lstm-char-cnn-tensorflow

in progress

248   757   757  

dbpedia-spotlight

DBpedia Spotlight is a tool for automatically annotating mentions of D...

201   748   748  

RLSeq2Seq

Deep Reinforcement Learning For Sequence to Sequence Models

162   748   748  

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

69   745   745  

MiNLP

XiaoMi Natural Language Processing Toolkits

85   740   740  

awesome-qa

😎 A curated list of the Question Answering (QA)

111   737   737  

tensorflow-tutorial

TensorFlow and Deep Learning Tutorials

212   736   736  

transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP task...

176   731   731  

pywsd

Python Implementations of Word Sense Disambiguation (WSD) Technologies...

133   719   719  

Treasure-of-Transformers

💁 Awesome Treasure of Transformers Models for Natural Language process...

154   718   718  

PURE

NAACL'2021: A Frustratingly Easy Approach for Entity and Relation Extr...

114   712   712  

naacl_transfer_learning_tutorial

Repository of code for the tutorial on Transfer Learning in NLP held a...

124   709   709  

mordecai

Full text geoparsing as a Python library

97   709   709  

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Prob...

316   701   701  

sensitive-word

👮‍♂️The sensitive word tool for java.(基于 DFA 算法实现的高性能 java...

158   701   701  

lingua-rs

The most accurate natural language detection library for Rust, suitabl...

26   696   696  

THUOCL

THUOCL(THU Open Chinese Lexicon)中文词库

185   694   694  

bookcorpus

Crawl BookCorpus

94   694   694  

Python-ai-assistant

Python AI assistant 🧠

202   694   694  

albert_pytorch

A Lite Bert For Self-Supervised Learning Language Representations

152   690   690  

nlp-pytorch-zh

《Natural Language Processing with PyTorch》中文翻译

180   688   688  

PatrickStar

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP...

58   686   686  

pypostal

Python bindings to libpostal for fast international address parsing/no...

82   685   685  

trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multili...

91   683   683  

spacy-stanza

💥 Use the latest Stanza (StanfordNLP) research models directly in spaC...

51   681   681  

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

108   678   678  

magpie

Deep neural network framework for multi-label text classification

192   677   677  

awesome_deep_learning_interpretability

深度学习近年来关于神经网络模型解释性的相关高引用/顶会论文(附带代码)

119   675   675  

Annotated-Semantic-Relationships-Datasets

A collections of public and free annotated datasets of relationships b...

132   671   671  

meta

A Modern C++ Data Sciences Toolkit

264   670   670  

nboost

NBoost is a scalable, search-api-boosting platform for deploying trans...

69   663   663  

SpeedTorch

Library for faster pinned CPU <-> GPU transfer in Pytorch

40   661   661  

detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic...

87   659   659  

Deeplearning.ai-Natural-Language-Processing-Specialization

This repository contains my full work and notes on Coursera's NLP Spec...

483   655   655  

chat

基于自然语言理解与机器学习的聊天机器人,支持多用户并发及自定义多轮对话

218   655   655  

Med-ChatGLM

Repo for Chinese Medical ChatGLM 基于中文医学知识的ChatGLM指令微调

93   655   655  

datacamp-python-data-science-track

All the slides, accompanying code and exercises all stored in this rep...

490   650   650  

griptape

Python framework for AI workflows and pipelines with chain of thought...

29   649   649  

awesome-ChatGPT-repositories

A curated list of resources dedicated to open source GitHub repositori...

86   647   647  

mynlp

一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感...

86   646   646  

WeCron

:heavy_check_mark: 微信上的定时提醒 - Cron on WeChat

110   642   642  

books

整理一些书籍 ,包含 C&C++ 、git 、Java、Keras 、Linux 、NLP 、Python 、...

245   641   641  

pyresparser

A simple resume parser used for extracting information from resumes

333   637   637  

Awesome-Korean-NLP

A curated list of resources for NLP (Natural Language Processing) for...

117   635   635  

ekphrasis

Ekphrasis is a text processing tool, geared towards text from social n...

92   634   634  

homer

Homer, a text analyser in Python, can help make your text more clear,...

37   634   634