Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

FinNLP-Progress

NLP progress in Fintech. A repository to track the progress in Natural...

47   328   328  

Entity-Linking-Recent-Trends

Recent trends of Entity Linking, Disambiguation, and Representation.

20   327   327  

conformal-prediction

Lightweight, useful implementation of conformal prediction on real dat...

38   327   327  

korean-hate-speech

Korean HateSpeech Dataset

37   326   326  

textaugment

TextAugment: Text Augmentation Library

56   324   324  

chatbot_ner

chatbot_ner: Named Entity Recognition for chatbots.

133   321   321  

attention-mechanisms

Implementations for a family of attention mechanisms, suitable for all...

81   320   320  

byteNet-tensorflow

ByteNet for character-level language modelling

71   317   317  

MixText

MixText: Linguistically-Informed Interpolation of Hidden Space for Sem...

64   317   317  

TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

31   317   317  

DataScience_ArtificialIntelligence_Utils

Examples of Data Science projects and Artificial Intelligence use case...

238   316   316  

ner

Named Entity Recognition

65   315   315  

ABSAPapers

Worth-reading papers and related awesome resources on aspect-based sen...

57   315   315  

BMList

A List of Big Models

10   312   312  

NLPython

This repository contains the code related to Natural Language Processi...

206   309   309  

PyTorch-Beam-Search-Decoding

PyTorch implementation of beam search decoding for seq2seq models

64   309   309  

ML-ProjectKart

🙌Kart of 210+ projects based on machine learning, deep learning, compu...

196   305   305  

hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Langua...

41   303   303  

Opus-MT

Open neural machine translation models and web services

49   302   302  

rebel

REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 20...

47   302   302  

pyss3

A Python package implementing a new interpretable machine learning mod...

41   300   300  

awesome-list-of-awesomes

A curated list of all the Awesome --Topic Name-- lists I've found till...

42   297   297  

OpenUE

OpenUE是一个轻量级知识图谱抽取工具 (An Open Toolkit for Universal Ext...

59   295   295  

insight

Repository for Project Insight: NLP as a Service

44   294   294  

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training...

14   294   294  

book-nlp

Natural language processing pipeline for book-length documents (archiv...

52   293   293  

pycantonese

Cantonese Linguistics and NLP

36   293   293  

cherche

📑 Neural Search

12   292   292  

deep-learning-nlp-rl-papers

Recent Deep Learning papers in NLU and RL

49   291   291  

SWEM

The Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love...

53   290   290  

conllu

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it in...

51   290   290  

CS224N-2019

My completed implementation solutions for CS224N 2021 & 2019

126   290   290  

NonAutoregGenProgress

Tracking the progress in non-autoregressive generation (translation, t...

30   290   290  

awesome-nlprojects

List of projects related to Natural Language Processing (NLP) that mak...

92   287   287  

languagecrunch

LanguageCrunch NLP server docker image

29   286   286  

dostoevsky

Sentiment analysis library for russian language

30   283   283  

lda

LDA topic modeling for node.js

43   281   281  

pytorch-transformers-classification

Based on the Pytorch-Transformers library by HuggingFace. To be used a...

100   281   281  

WordGCN

ACL 2019: Incorporating Syntactic and Semantic Information in Word Emb...

63   281   281  

bert-sklearn

a sklearn wrapper for Google's BERT model

70   279   279  

BOND

BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant S...

35   279   279  

extreme-bert

ExtremeBERT is a toolkit that accelerates the pretraining of customize...

15   279   279  

MINERVA

Meandering In Networks of Entities to Reach Verisimilar Answers

82   277   277  

cliport

CLIPort: What and Where Pathways for Robotic Manipulation

46   277   277  

nlp-tutorial

Tutorial: Natural Language Processing in Python

153   276   276  

bist-parser

Graph-based and Transition-based dependency parsers based on BiLSTMs

98   275   275  

stringi

Fast and portable character string processing in R (with the Unicode I...

43   275   275  

recurrent-entity-networks

TensorFlow implementation of "Tracking the World State with Recurrent...

68   274   274  

Good-Papers

I try my best to keep updated cutting-edge knowledge in Machine Learni...

57   273   273  

COMET

A Neural Framework for MT Evaluation

44   271   271