Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of machine intelligence now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

Contrastive-Learning-NLP-Papers

Paper List for Contrastive Learning for Natural Language Processing

38   336   336  

German-NLP

Curated list of open-access/open-source/off-the-shelf resources and to...

51   334   334  

Dynamic-memory-networks-in-Theano

Implementation of Dynamic memory networks by Kumar et al. http://arxiv...

111   333   333  

AwesomeFakeNews

This repository contains recent research on fake news.

79   332   332  

low-resource-languages

Resources for conservation, development, and documentation of low reso...

58   332   332  

efaqa-corpus-zh

❤️Emotional First Aid Dataset, a psychological counseling Q&A and chatbot corpus

54   332   332  

chakin

Simple downloader for pre-trained word vectors

48   331   331  

ner

Named Entity Recognition

63   331   331  

FinNLP-Progress

NLP progress in Fintech. A repository to track the progress in Natural...

47   328   328  

Entity-Linking-Recent-Trends

Recent trends of Entity Linking, Disambiguation, and Representation.

20   327   327  

conformal-prediction

Lightweight, useful implementation of conformal prediction on real dat...

38   327   327  

korean-hate-speech

Korean HateSpeech Dataset

37   326   326  

textaugment

TextAugment: Text Augmentation Library

56   324   324  

chatbot_ner

chatbot_ner: Named Entity Recognition for chatbots.

133   321   321  

attention-mechanisms

Implementations for a family of attention mechanisms, suitable for all...

81   320   320  

WhatsAppInfoBot

A Framework to Build Bots

97   319   319  

byteNet-tensorflow

ByteNet for character-level language modelling

71   317   317  

MixText

MixText: Linguistically-Informed Interpolation of Hidden Space for Sem...

64   317   317  

TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

31   317   317  

ABSAPapers

Worth-reading papers and related awesome resources on aspect-based sen...

57   315   315  

BMList

A List of Big Models

10   312   312  

NLPython

This repository contains the code related to Natural Language Processi...

206   309   309  

PyTorch-Beam-Search-Decoding

PyTorch implementation of beam search decoding for seq2seq models

64   309   309  

stringi

Fast and portable character string processing in R (with the Unicode I...

47   309   309  

NonAutoregGenProgress

Tracking the progress in non-autoregressive generation (translation, t...

28   307   307  

ML-ProjectKart

🙌Kart of 210+ projects based on machine learning, deep learning, comp...

196   305   305  

hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Langua...

41   303   303  

Opus-MT

Open neural machine translation models and web services

49   302   302  

rebel

REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 20...

47   302   302  

awesome-list-of-awesomes

A curated list of all the Awesome --Topic Name-- lists I've found till...

42   297   297  

OpenUE

OpenUE is a lightweight knowledge graph extraction tool (An Open Toolkit for Universal Ext...

59   295   295  

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training...

14   294   294  

insight

Repository for Project Insight: NLP as a Service

44   294   294  

lda

LDA topic modeling for node.js

49   294   294  

book-nlp

Natural language processing pipeline for book-length documents (archiv...

52   293   293  

pycantonese

Cantonese Linguistics and NLP

36   293   293  

cherche

📑 Neural Search

12   292   292  

deep-learning-nlp-rl-papers

Recent Deep Learning papers in NLU and RL

49   291   291  

SWEM

The Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love...

53   290   290  

conllu

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it in...

51   290   290  

CS224N-2019

My completed implementation solutions for CS224N 2021 & 2019

126   290   290  

awesome-nlprojects

List of projects related to Natural Language Processing (NLP) that mak...

92   287   287  

languagecrunch

LanguageCrunch NLP server docker image

29   286   286  

dostoevsky

Sentiment analysis library for the Russian language

30   283   283  

Good-Papers

I try my best to keep updated cutting-edge knowledge in Machine Learni...

57   281   281  

pytorch-transformers-classification

Based on the Pytorch-Transformers library by HuggingFace. To be used a...

100   281   281  

WordGCN

ACL 2019: Incorporating Syntactic and Semantic Information in Word Emb...

63   281   281  

bert-sklearn

A sklearn wrapper for Google's BERT model

70   279   279  

BOND

BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant S...

35   279   279  

extreme-bert

ExtremeBERT is a toolkit that accelerates the pretraining of customize...

15   279   279