Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

SentimentAnalysis

Sentiment analysis neural network trained by fine-tuning BERT, ALBERT,...

43   333   333  

troll

Language sentiment analysis and neural networks... for trolls.

13   332   332  

chatbot_ner

chatbot_ner: Named Entity Recognition for chatbots.

133   330   330  

camel_tools

A suite of Arabic natural language processing tools developed by the C...

68   330   330  

deep_srl

Code and pre-trained model for: Deep Semantic Role Labeling: What Work...

76   329   329  

Issue-Label-Bot

Code For The Issue Label Bot, an App that automatically labels issues...

84   329   329  

paraphrase-id-tensorflow

Various models and code (Manhattan LSTM, Siamese LSTM + Matching Layer...

72   328   328  

OpenUE

[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text

59   327   327  

spanish-word-embeddings

Spanish word embeddings computed with different methods and from diffe...

77   326   326  

Seq2seq-Chatbot-for-Keras

This repository contains a new generative model of chatbot based on se...

98   325   325  

qa_match

A simple effective ToolKit for short text matching

83   324   324  

tldrstory

📊 Semantic search for headlines and story text

26   324   324  

gsdmm

GSDMM: Short text clustering

92   324   324  

PyKoSpacing

Automatic Korean word spacing with Python

106   322   322  

KR-WordRank

비지도학습 방법으로 한국어 텍스트에서 단어/키워드를 자동으로 추출하는...

57   321   321  

code-switching-papers

A curated list of research papers and resources on code-switching

40   321   321  

chatgpt

Interface to ChatGPT from R

37   320   320  

MLDemo

This repo is all the machine learning related project codes and their...

136   319   319  

NLPGNN

1. Use BERT, ALBERT and GPT2 as tensorflow2.0's layer. 2. Implement...

62   318   318  

dliss-tutorial

Tutorial for International Summer School on Deep Learning, 2019

60   317   317  

pytextclassifier

pytextclassifier is a toolkit for text classification. 文本分类,LR,X...

54   317   317  

tner

Language model fine-tuning on NER with an easy interface and cross-dom...

33   316   316  

electra_pytorch

Pretrain and finetune ELECTRA with fastai and huggingface. (Results of...

41   314   314  

NLP_Datasets

My NLP datasets for Russian language

50   313   313  

kg-baseline-pytorch

2019百度的关系抽取比赛,使用Pytorch实现苏神的模型,F1在dev集可达到0.75...

56   313   313  

OpenAI-CLIP

Simple implementation of OpenAI CLIP model in PyTorch.

50   313   313  

StruMatchDL

Codes for ICML 2022 paper: Matching Structure for Dual Learning

2   311   311  

stringi

Fast and portable character string processing in R (with the Unicode I...

49   311   311  

mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

60   308   308  

stopwords

Default English stopword lists from many different sources

128   307   307  

insight

Repository for Project Insight: NLP as a Service

46   306   306  

Transformers_for_Text_Classification

基于Transformers的文本分类

66   305   305  

fugashi

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and mo...

26   305   305  

HSCRF-pytorch

ACL 2018: Hybrid semi-Markov CRF for Neural Sequence Labeling (http://...

70   304   304  

bert_distill

BERT distillation(基于BERT的蒸馏实验 )

83   304   304  

nlp_newsletter

📰Natural language processing (NLP) newsletter

20   303   303  

sparsezoo

Neural network model repository for highly sparse and sparse-quantized...

20   302   302  

prodigy-openai-recipes

✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-...

26   302   302  

textpipe

Textpipe: clean and extract metadata from text

27   301   301  

Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito

52   301   301  

bert-sklearn

a sklearn wrapper for Google's BERT model

70   301   301  

news-emotion

📉 金融文本情感分析模型

126   300   300  

RasaTalk

A chatbot framework for Rasa NLU

86   300   300  

rasa_nlu_gq

turn natural language into structured data(支持中文,自定义了N种模型,...

99   298   298  

BERT-of-Theseus

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressi...

38   298   298  

lda

LDA topic modeling for node.js

48   297   297  

XPretrain

Multi-modality pre-training

15   297   297  

simple-effective-text-matching-pytorch

A pytorch implementation of the ACL2019 paper "Simple and Effective Te...

54   296   296  

multi-criteria-cws

Simple Solution for Multi-Criteria Chinese Word Segmentation

85   295   295  

NER-pytorch

LSTM+CRF NER

102   295   295