Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

xk-time

xk-time 是时间转换,时间计算,时间格式化,时间解析,日历,时间cron表达...

81   279   279  

NLP-Vietnamese-progress

Repository to track the progress in Vietnamese Natural Language Proces...

69   279   279  

BOND

BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant S...

35   279   279  

extreme-bert

ExtremeBERT is a toolkit that accelerates the pretraining of customize...

15   279   279  

Data-Science-EBooks

Data Science E-books, Interview Resources and Cheat-sheets

119   278   278  

nlp-tutorial

Tutorial: Natural Language Processing in Python

153   276   276  

fancy-nlp

NLP for human. A fast and easy-to-use natural language processing (NLP...

41   276   276  

Text-Summarization-Repo

텍스트 요약 분야의 주요 연구 주제, Must-read Papers, 이용 가능한 model...

41   276   276  

PyTorch-Batch-Attention-Seq2seq

PyTorch implementation of batched bi-RNN encoder and attention-decoder...

49   275   275  

stringi

Fast and portable character string processing in R (with the Unicode I...

43   275   275  

Taisite-Platform

最强接口测试平台

135   275   275  

dodrio

Exploring attention weights in transformer-based models with linguisti...

28   275   275  

THUTag

A Package of Keyphrase Extraction and Social Tag Suggestion

82   273   273  

gobbli

Deep learning with text doesn't have to be scary.

25   272   272  

AHANLP

啊哈自然语言处理包,提供包括分词、依存句法分析、语义角色标注、自动摘要...

91   272   272  

StarrySky

精选了千余项目,包括机器学习、深度学习、NLP、GNN、推荐系统、生物医药、...

47   272   272  

COMET

A Neural Framework for MT Evaluation

44   271   271  

browser-ml-inference

Edge Inference in Browser with Transformer NLP model

50   271   271  

awesome-semantic-search

A curated list of awesome resources related to Semantic Search🔎 and...

22   269   269  

SAPConversationalAI

✨ 🤖 🤖 Build your own conversational bot on our Collaborative Bot Platf...

67   268   268  

BRIO

ACL 2022: BRIO: Bringing Order to Abstractive Summarization

41   268   268  

stopwords

Default English stopword lists from many different sources

131   267   267  

chatgpt

Interface to ChatGPT from R

38   267   267  

awesome-bioie

🧫 A curated list of resources relevant to doing Biomedical Information...

29   266   266  

squirrel-core

A Python library that enables ML teams to share, load, and transform d...

6   266   266  

VSUA-Captioning

Code for "Aligning Linguistic Words and Visual Semantic Units for Imag...

24   264   264  

100-Days-of-NLP

106   264   264  

DeepResearch

This repository is the collection of research papers in Deep learning...

108   263   263  

tensorflow-ml-nlp-tf2

텐서플로2와 머신러닝으로 시작하는 자연어처리 (로지스틱회귀부터 BERT와...

134   262   262  

nlp-tutorial

自然语言处理(NLP)教程,包括:词向量,词法分析,预训练语言模型,文本...

50   262   262  

LagouJob

Job data mining repo for lagou.com

129   261   261  

summarizer

A Reddit bot that summarizes news articles written in Spanish or Engli...

31   260   260  

pytorch-question-answering

Important paper implementations for Question Answering using PyTorch

49   260   260  

toxic

Toxic Comment Classification Challenge

75   259   259  

textnets

Text analysis with networks.

18   259   259  

weibo_terminator_workflow

Update Version of weibo_terminator, This is Workflow Version aim at Ge...

78   258   258  

negspacy

spaCy pipeline object for negating concepts in text

33   258   258  

genie-server

The home server version of Almond

40   258   258  

NLP-Natural-Language-Processing

Projects and useful articles / links

51   258   258  

engtagger

English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagg...

49   256   256  

scoper

Fuzzy and semantic search for captioned YouTube videos.

15   255   255  

Indic-BERT-v1

Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages an...

40   255   255  

Semantic-Retrieval-Models

A curated list of awesome papers for Semantic Retrieval (TOIS Accepted...

25   255   255  

vibrato

🎤 vibrato: Viterbi-based accelerated tokenizer

12   255   255  

rnn.wgan

Code for training and evaluation of the model from "Language Generatio...

76   254   254  

nlp-labelling

Labelling platform for text using weak supervision.

18   253   253  

LasUIE

Universal Information Extraction, codes for the NeurIPS-2022 paper: Un...

3   253   253  

papers_we_read

Summaries for exciting works in the field of Deep Learning.

32   252   252  

character-based-cnn

Implementation of character based convolutional neural network

54   252   252  

KOMORAN

Korean Morphological Analyzer by shineware

59   252   252