Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

OpenGPT

A framework for creating grounded instruction based datasets and train...

25   215   215  

udpipe

R package for Tokenization, Parts of Speech Tagging, Lemmatization and...

33   215   215  

Speech_Signal_Processing_and_Classification

Front-end speech processing aims at extracting proper features from sh...

61   215   215  

bert-chainer

Chainer implementation of "BERT: Pre-training of Deep Bidirectional Tr...

41   214   214  

open-sesame

A frame-semantic parsing system based on a softmax-margin SegRNN.

65   214   214  

radish

C++ model train&inference framework

36   214   214  

PIE

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Cod...

40   214   214  

KoGPT2-FineTuning

🔥 Korean GPT-2, KoGPT2 FineTuning cased. 한국어 가사 데이터 학습 🔥

60   213   213  

graph-convolution-nlp

Graph Convolution Network for NLP

36   213   213  

Awesome-NLP-Resources

This repository contains landmark research papers in Natural Language...

54   213   213  

triviaqa

Code for the TriviaQA reading comprehension dataset

40   212   212  

Persian-NER

پیکره بزرگ شناسایی موجودیت‌های نامدار فارسی برچسب خورده

19   212   212  

blog

My Tech Blog: about Rust / Golang / Python / Flutter / Blockchain etc.

18   212   212  

python-bpe

Byte Pair Encoding for Python!

38   211   211  

laserembeddings

LASER multilingual sentence embeddings as a pip package

27   211   211  

sharingan

Tool to extract news articles from newspaper and give the context abou...

26   211   211  

unify-emotion-datasets

A Survey and Experiments on Annotated Corpora for Emotion Classificati...

46   211   211  

fixy

Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çöz...

18   211   211  

turkish-stemmer-python

:snake: Turkish Language Stemmer for Python

31   211   211  

indonesian-NLP-resources

data resource untuk NLP bahasa indonesia

53   209   209  

NSP-BERT

The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Thr...

39   209   209  

AdaSeq

AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence...

19   209   209  

emailGPT

a quick and easy interface to generate emails with ChatGPT

30   208   208  

RLHF

Implementation of Chinese ChatGPT

23   208   208  

bert_for_corrector

基于bert进行中文文本纠错

47   207   207  

doc-han-att

Hierarchical Attention Networks for Chinese Sentiment Classification

55   207   207  

ml_things

This is where I put things I find useful that speed up my work with Ma...

60   206   206  

vnlp

State-of-the-art, lightweight NLP tools for Turkish language. Develope...

17   206   206  

gpt-j

A GPT-J API to use with python3 to generate text, blogs, code, and mor...

53   206   206  

Competition_CAIL

2018中国‘法研杯’法律智能挑战赛(CAIL2018)个人作品

61   205   205  

cutlet

Japanese to romaji converter in Python

19   205   205  

numerizer

A Python module to convert natural language numerics into ints and flo...

23   205   205  

mauve

Package to compute Mauve, a similarity score between neural text and h...

17   205   205  

programming-book-3

Programming books 3: Python、 Machine-Learning、 Deep-Learning、 NLP

82   205   205  

vaporetto

🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

8   203   203  

spacy-clausie

Implementation of the ClausIE information extraction system for python...

30   201   201  

transition-amr-parser

SoTA Abstract Meaning Representation (AMR) parsing with word-node alig...

42   201   201  

examples

Analyze the unstructured data with Towhee, such as reverse image searc...

62   201   201  

text-emotion-classification

Archived - not answering issues

82   200   200  

FlowQA

Implementation of conversational QA model: FlowQA (with slight improve...

58   199   199  

node-postal

NodeJS bindings to libpostal for fast international address parsing/no...

33   199   199  

markup

A web-based document annotation tool, powered by GPT-4 :rocket:

32   199   199  

exercises_thushv_dot_com

Here lies all the exercises I implement and share in my website

158   198   198  

nl2sql

阿里天池首届中文NL2SQL挑战赛top6

49   198   198  

KcELECTRA

🤗 Korean Comments ELECTRA: 한국어 댓글로 학습한 ELECTRA 모델

22   198   198  

eudex

A blazingly fast phonetic reduction/hashing algorithm.

12   198   198  

cadmium

Natural Language Processing (NLP) library for Crystal

16   198   198  

arXivNotes

IssuesにNLP(自然言語処理)に関連するの論文を読んだまとめを書いていま...

8   197   197  

datastories-semeval2017-task4

Deep-learning model presented in "DataStories at SemEval-2017 Task 4:...

63   197   197  

tensorflow-ml-nlp

텐서플로우와 머신러닝으로 시작하는 자연어처리(로지스틱회귀부터 트랜스...

110   197   197