Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

sentence-splitter

Text to sentence splitter using heuristic algorithm by Philipp Koehn a...

27   180   180  

ParseLawDocuments

对收集的法律文档进行一系列分析,包括根据规范自动切分、案件相似度计算、...

57   179   179  

MultiModalStory-demo

FairyTailor: Multimodal Generative Framework for Storytelling

16   179   179  

python_autocomplete

Use Transformers and LSTMs to learn Python source code

41   178   178  

KR-BERT

KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTo...

20   178   178  

Python_Natural_Language_Processing

This repository consists of a complete guide on natural language proce...

170   178   178  

multihead-siamese-nets

Implementation of Siamese Neural Networks built upon multihead attenti...

42   177   177  

mindflow

🧠 AI-powered CLI git wrapper, boilerplate code generator, chat history...

15   177   177  

tokenizers

Fast, Consistent Tokenization of Natural Language Text

26   176   176  

A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling

PyTorch Implementation of "A Hierarchical Latent Structure for Variati...

46   176   176  

nlp_research

NLP research:基于tensorflow的nlp深度学习项目,支持文本分类/句子匹配/...

48   176   176  

AI-Competition-Collections

AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验...

23   176   176  

text-detector

Tool which allow you to detect and translate text.

41   175   175  

APE

Parser for Attempto Controlled English (ACE)

28   175   175  

ner-slot_filling

中文自然语言的实体抽取和意图识别(Natural Language Understanding),可...

45   175   175  

LiveActionMap

An attempt to map the areas with active conflict in Ukraine using open...

17   175   175  

ruijin_round2

瑞金医院MMC人工智能辅助构建知识图谱大赛复赛

56   174   174  

spacymoji

💙 Emoji handling and meta data for spaCy with custom extension attribu...

17   174   174  

pytorch-acnn-model

code of Relation Classification via Multi-Level Attention CNNs

31   172   172  

keras-xlnet

Implementation of XLNet that can load pretrained checkpoints

26   171   171  

Hash-Embeddings

PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to t...

26   171   171  

ML-DL-scripts

The repository provides usefull python scripts for ML and data analysi...

78   171   171  

Magento-Chatbot

Magento Chatbot Integration with Telegram, Messenger, Whatsapp, WeChat...

63   170   170  

RobBERT

A Dutch RoBERTa-based language model

26   170   170  

SolrTextTagger

A text tagger based on Lucene / Solr, using FST technology

36   169   169  

VDCNN

Implementation of Very Deep Convolutional Neural Network for Text Clas...

39   169   169  

All4NLP

All For NLP, especially Chinese.

32   169   169  

wefe

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a fra...

11   169   169  

tldr-transformers

The "tl;dr" on a few notable transformer papers (pre-2022).

4   169   169  

genie-toolkit

The Genie open source kit for voice assistant (formerly known as Almon...

30   169   169  

Improved-Dynamic-Memory-Networks-DMN-plus

Theano Implementation of DMN+ (Improved Dynamic Memory Networks) from...

64   168   168  

Snowball

Implementation with some extensions of the paper "Snowball: Extracting...

40   168   168  

danlp

DaNLP is a repository for Natural Language Processing resources for th...

31   168   168  

Awesome-Text-Classification

Awesome-Text-Classification Projects,Papers,Tutorial .

32   167   167  

neural-paraphrase-generation

Neural Paraphrase Generation

56   167   167  

torchtext-summary

torchtext使用总结,从零开始逐步实现了torchtext文本预处理过程,包括截断...

42   167   167  

neuro

🔮 Neuro.js is machine learning library for building AI assistants and...

33   167   167  

NLPMetrics

Python code for various NLP metrics

30   166   166  

NLP-pretrained-model

A collection of Natural language processing pre-trained models.

29   166   166  

Semantic

语义理解/口语理解,项目包含有词法分析:中文分词、词性标注、命名实体识...

56   164   164  

BERT-GPU

multi-gpu pre-training in one machine for BERT from scratch without ho...

52   164   164  

What-I-Have-Read

Paper Lists, Notes and Slides, Focus on NLP. For summarization, pleas...

16   163   163  

transformers-ru

A list of pretrained Transformer models for the Russian language.

8   162   162  

dna2vec

dna2vec: Consistent vector representations of variable-length k-mers

60   162   162  

Open-IE-Papers

Open Information Extraction (OpenIE) and Open Relation Extraction (ORE...

17   161   161  

spacy-js

🎀 JavaScript API for spaCy with Python REST API

24   161   161  

xatkit

The simplest way to build all types of smart chatbots and digital assi...

22   161   161  

converse

Conversational text Analysis using various NLP techniques

16   161   161  

espial

Espial is an engine for automated organization and discovery of person...

4   160   160