Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

KoGPT2-FineTuning

🔥 Korean GPT-2, KoGPT2 FineTuning cased. 한국어 가사 데이터 학습 🔥

60   213   213  

hmni

📛 Fuzzy Name Matching with Machine Learning

43   213   213  

triviaqa

Code for the TriviaQA reading comprehension dataset

40   212   212  

Persian-NER

پیکره بزرگ شناسایی موجودیت‌های نامدار فارسی برچسب خورده

19   212   212  

blog

My Tech Blog: about Rust / Golang / Python / Flutter / Blockchain etc.

18   212   212  

sharingan

Tool to extract news articles from newspaper and give the context abou...

26   211   211  

unify-emotion-datasets

A Survey and Experiments on Annotated Corpora for Emotion Classificati...

46   211   211  

turkish-stemmer-python

:snake: Turkish Language Stemmer for Python

31   211   211  

python-bpe

Byte Pair Encoding for Python!

38   211   211  

laserembeddings

LASER multilingual sentence embeddings as a pip package

27   211   211  

indonesian-NLP-resources

data resource untuk NLP bahasa indonesia

53   209   209  

text-segmentation

Implementation of the paper: Text Segmentation as a Supervised Learnin...

56   209   209  

NSP-BERT

The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Thr...

39   209   209  

AdaSeq

AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence...

19   209   209  

emailGPT

a quick and easy interface to generate emails with ChatGPT

30   208   208  

RLHF

Implementation of Chinese ChatGPT

23   208   208  

doc-han-att

Hierarchical Attention Networks for Chinese Sentiment Classification

55   207   207  

bert_for_corrector

基于bert进行中文文本纠错

47   207   207  

ml_things

This is where I put things I find useful that speed up my work with Ma...

60   206   206  

vnlp

State-of-the-art, lightweight NLP tools for Turkish language. Develope...

17   206   206  

Competition_CAIL

2018中国‘法研杯’法律智能挑战赛(CAIL2018)个人作品

61   205   205  

cutlet

Japanese to romaji converter in Python

19   205   205  

numerizer

A Python module to convert natural language numerics into ints and flo...

23   205   205  

mauve

Package to compute Mauve, a similarity score between neural text and h...

17   205   205  

programming-book-3

Programming books 3: Python、 Machine-Learning、 Deep-Learning、 NLP

82   205   205  

Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline fo...

44   204   204  

udpipe

R package for Tokenization, Parts of Speech Tagging, Lemmatization and...

31   203   203  

vaporetto

🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

8   203   203  

spacy-clausie

Implementation of the ClausIE information extraction system for python...

30   201   201  

transition-amr-parser

SoTA Abstract Meaning Representation (AMR) parsing with word-node alig...

42   201   201  

examples

Analyze the unstructured data with Towhee, such as reverse image searc...

62   201   201  

text-emotion-classification

Archived - not answering issues

82   200   200  

FlowQA

Implementation of conversational QA model: FlowQA (with slight improve...

58   199   199  

fixy

Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çöz...

20   199   199  

node-postal

NodeJS bindings to libpostal for fast international address parsing/no...

33   199   199  

markup

A web-based document annotation tool, powered by GPT-4 :rocket:

32   199   199  

eudex

A blazingly fast phonetic reduction/hashing algorithm.

12   198   198  

cadmium

Natural Language Processing (NLP) library for Crystal

16   198   198  

exercises_thushv_dot_com

Here lies all the exercises I implement and share in my website

158   198   198  

nl2sql

阿里天池首届中文NL2SQL挑战赛top6

49   198   198  

KcELECTRA

🤗 Korean Comments ELECTRA: 한국어 댓글로 학습한 ELECTRA 모델

22   198   198  

arXivNotes

IssuesにNLP(自然言語処理)に関連するの論文を読んだまとめを書いていま...

8   197   197  

tensorflow-ml-nlp

텐서플로우와 머신러닝으로 시작하는 자연어처리(로지스틱회귀부터 트랜스...

110   197   197  

Awesome-NLP-Resources

This repository contains landmark research papers in Natural Language...

53   197   197  

bi-lstm-crf

A PyTorch implementation of the BI-LSTM-CRF model.

49   197   197  

wrench

WRENCH: Weak supeRvision bENCHmark

27   197   197  

displacy-ent

:boom: displaCy-ent.js: An open-source named entity visualiser for the...

43   196   196  

dkpro-core

Collection of software components for natural language processing (NLP...

71   196   196  

THUCTC

An Efficient Chinese Text Classifier

64   196   196  

akaza

Yet another Japanese IME for IBus/Linux

6   196   196