Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

tokenizers

Fast, Consistent Tokenization of Natural Language Text

25   185   185  

sandbox-topically

Topic modeling helpers using managed language models from Cohere. Name...

16   185   185  

uniem

unified embedding model

11   184   184  

Guyu

Chinese GPT2: pre-training and fine-tuning framework for text generati...

42   183   183  

coreference-resolution

Efficient and clean PyTorch reimplementation of "End-to-end Neural Cor...

61   183   183  

dna2vec

dna2vec: Consistent vector representations of variable-length k-mers

60   183   183  

spacy-universal-sentence-encoder

Google USE (Universal Sentence Encoder) for spaCy

12   183   183  

Recurrent-Convolutional-Neural-Network-Text-Classifier

My (slightly modified) Keras implementation of the Recurrent Convoluti...

65   183   183  

hntitlenator

Test your HN title against a neural network

13   182   182  

multihead-siamese-nets

Implementation of Siamese Neural Networks built upon multihead attenti...

43   182   182  

gpt4-playground

Clone of OpenAI's ChatGPT and Playground environments to enable experi...

74   182   182  

spacymoji

💙 Emoji handling and meta data for spaCy with custom extension attrib...

20   181   181  

FinBERT

BERT for Finance : UC Berkeley MIDS w266 Final Project

62   181   181  

R-NET-in-Keras

Open R-NET (hy` առնետ 🐁) implementation and detailed analysis: https:...

90   181   181  

2017-CCF-BDCI-AIJudge

2017-CCF-BDCI-让AI当法官(初赛):7th/415 (Top 1.68%)

79   180   180  

sentence-splitter

Text to sentence splitter using heuristic algorithm by Philipp Koehn a...

27   180   180  

ParseLawDocuments

对收集的法律文档进行一系列分析,包括根据规范自动切分、案件相似度计算、...

57   179   179  

MultiModalStory-demo

FairyTailor: Multimodal Generative Framework for Storytelling

16   179   179  

python_autocomplete

Use Transformers and LSTMs to learn Python source code

41   178   178  

KR-BERT

KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTo...

20   178   178  

Python_Natural_Language_Processing

This repository consists of a complete guide on natural language proce...

170   178   178  

Magento-Chatbot

Magento Chatbot Integration with Telegram, Messenger, Whatsapp, WeChat...

61   177   177  

xatkit

The simplest way to build all types of smart chatbots and digital assi...

22   177   177  

A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling

PyTorch Implementation of "A Hierarchical Latent Structure for Variati...

46   176   176  

nlp_research

NLP research:基于tensorflow的nlp深度学习项目,支持文本分类/句子匹配/...

48   176   176  

AI-Competition-Collections

AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验...

23   176   176  

ner-slot_filling

中文自然语言的实体抽取和意图识别(Natural Language Understanding),可...

45   175   175  

AI-NLP-Paper-Readings

This is my reading list for my PhD in AI, NLP, Deep Learning and more.

26   175   175  

easy-bert

A Dead Simple BERT API for Python and Java (https://github.com/google-...

45   175   175  

text-detector

Tool which allow you to detect and translate text.

41   175   175  

APE

Parser for Attempto Controlled English (ACE)

28   175   175  

ruijin_round2

瑞金医院MMC人工智能辅助构建知识图谱大赛复赛

56   174   174  

awesome-data-science-viz

:boom: :chart_with_upwards_trend: A curated list of data science, anal...

27   174   174  

pymetamap

Python wraper for MetaMap

62   174   174  

LiveActionMap

An attempt to map the areas with active conflict in Ukraine using twit...

15   173   173  

pytorch-acnn-model

code of Relation Classification via Multi-Level Attention CNNs

31   172   172  

keras-xlnet

Implementation of XLNet that can load pretrained checkpoints

26   171   171  

Hash-Embeddings

PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to t...

26   171   171  

ML-DL-scripts

The repository provides usefull python scripts for ML and data analysi...

78   171   171  

NLP-pretrained-model

A collection of Natural language processing pre-trained models.

29   170   170  

RobBERT

A Dutch RoBERTa-based language model

26   170   170  

VDCNN

Implementation of Very Deep Convolutional Neural Network for Text Clas...

39   169   169  

All4NLP

All For NLP, especially Chinese.

32   169   169  

monkeylearn-python

Official Python client for the MonkeyLearn API. Build and consume mach...

44   169   169  

turkish-morphology

A two-level morphological analyzer for Turkish.

27   169   169  

wefe

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a fra...

11   169   169  

tldr-transformers

The "tl;dr" on a few notable transformer papers (pre-2022).

4   169   169  

genie-toolkit

The Genie open source kit for voice assistant (formerly known as Almon...

30   169   169  

SolrTextTagger

A text tagger based on Lucene / Solr, using FST technology

36   169   169