Most popular NLP repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers process and analyze human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a test of machine intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

discopy

The Python toolkit for computing with string diagrams.

61   283   283  

behemoth

Behemoth is an open source platform for large scale document analysis...

60   283   283  

RNNSharp

RNNSharp is a toolkit for deep recurrent neural networks which is widely...

91   283   283  

pyate

PYthon Automated Term Extraction

37   283   283  

multifit

The code to reproduce results from the paper "MultiFiT: Efficient Multi-li...

56   282   282  

Customer-Chatbot

Chinese intelligent customer-service chatbot demo with two parts, chit-chat and domain-specific Q&A; supports custom components (Chi...

110   282   282  

pixel

Research code for pixel-based encoders of language (PIXEL)

19   282   282  

BOND

BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant S...

35   279   279  

extreme-bert

ExtremeBERT is a toolkit that accelerates the pretraining of customize...

15   279   279  

bert-sklearn

A sklearn wrapper for Google's BERT model

70   279   279  

xk-time

xk-time provides time conversion, time calculation, time formatting, time parsing, calendars, time cron expressions...

81   279   279  

NLP-Vietnamese-progress

Repository to track the progress in Vietnamese Natural Language Proces...

69   279   279  

Data-Science-EBooks

Data Science E-books, Interview Resources and Cheat-sheets

119   278   278  

Text-Summarization-Repo

Major research topics in text summarization, must-read papers, available model...

41   276   276  

nlp-tutorial

Tutorial: Natural Language Processing in Python

153   276   276  

fancy-nlp

NLP for humans. A fast and easy-to-use natural language processing (NLP...

41   276   276  

PyTorch-Batch-Attention-Seq2seq

PyTorch implementation of batched bi-RNN encoder and attention-decoder...

49   275   275  

Taisite-Platform

The most powerful API testing platform

135   275   275  

dodrio

Exploring attention weights in transformer-based models with linguisti...

28   275   275  

THUTag

A Package of Keyphrase Extraction and Social Tag Suggestion

82   273   273  

gobbli

Deep learning with text doesn't have to be scary.

25   272   272  

AHANLP

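Aha natural language processing package, providing features including word segmentation, dependency parsing, semantic role labeling, automatic summarization...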

91   272   272  

Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline fo...

53   272   272  

browser-ml-inference

Edge Inference in Browser with Transformer NLP model

50   271   271  

awesome-semantic-search

A curated list of awesome resources related to Semantic Search🔎 and...

22   269   269  

BRIO

ACL 2022: BRIO: Bringing Order to Abstractive Summarization

41   268   268  

stopwords

Default English stopword lists from many different sources

131   267   267  

awesome-bioie

🧫 A curated list of resources relevant to doing Biomedical Informatio...

29   266   266  

squirrel-core

A Python library that enables ML teams to share, load, and transform d...

6   266   266  

100-Days-of-NLP

106   264   264  

VSUA-Captioning

Code for "Aligning Linguistic Words and Visual Semantic Units for Imag...

24   264   264  

DeepResearch

This repository is a collection of research papers in deep learning...

108   263   263  

hmni

📛 Fuzzy Name Matching with Machine Learning

51   263   263  

tensorflow-ml-nlp-tf2

Getting started with natural language processing using TensorFlow 2 and machine learning (from logistic regression to BERT and...

134   262   262  

nlp-tutorial

Natural language processing (NLP) tutorial, covering word vectors, lexical analysis, pre-trained language models, text...

50   262   262  

LagouJob

Job data mining repo for lagou.com

129   261   261  

summarizer

A Reddit bot that summarizes news articles written in Spanish or Engli...

31   260   260  

pytorch-question-answering

Important paper implementations for Question Answering using PyTorch

49   260   260  

toxic

Toxic Comment Classification Challenge

75   259   259  

textnets

Text analysis with networks.

18   259   259  

text-segmentation

Implementation of the paper: Text Segmentation as a Supervised Learnin...

57   258   258  

NLP-Natural-Language-Processing

Projects and useful articles / links

51   258   258  

weibo_terminator_workflow

Updated version of weibo_terminator. This is the workflow version, aimed at Ge...

78   258   258  

negspacy

spaCy pipeline object for negating concepts in text

33   258   258  

genie-server

The home server version of Almond

40   258   258  

engtagger

English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagg...

49   256   256  

scoper

Fuzzy and semantic search for captioned YouTube videos.

15   255   255  

Indic-BERT-v1

Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages an...

40   255   255  

spaczz

Fuzzy matching and more functionality for spaCy.

28   255   255  

Semantic-Retrieval-Models

A curated list of awesome papers for Semantic Retrieval (TOIS Accepted...

25   255   255