Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

tokenizers

Fast, Consistent Tokenization of Natural Language Text

25   185   185  

sandbox-topically

Topic modeling helpers using managed language models from Cohere. Name...

16   185   185  

uniem

unified embedding model

11   184   184  

Guyu

Chinese GPT2: pre-training and fine-tuning framework for text generati...

42   183   183  

coreference-resolution

Efficient and clean PyTorch reimplementation of "End-to-end Neural Cor...

61   183   183  

dna2vec

dna2vec: Consistent vector representations of variable-length k-mers

60   183   183  

spacy-universal-sentence-encoder

Google USE (Universal Sentence Encoder) for spaCy

12   183   183  

Recurrent-Convolutional-Neural-Network-Text-Classifier

My (slightly modified) Keras implementation of the Recurrent Convoluti...

65   183   183  

hntitlenator

Test your HN title against a neural network

13   182   182  

multihead-siamese-nets

Implementation of Siamese Neural Networks built upon multihead attenti...

43   182   182  

gpt4-playground

Clone of OpenAI's ChatGPT and Playground environments to enable experi...

74   182   182  

spacymoji

💙 Emoji handling and meta data for spaCy with custom extension attrib...

20   181   181  

FinBERT

BERT for Finance : UC Berkeley MIDS w266 Final Project

62   181   181  

R-NET-in-Keras

Open R-NET (hy` առնետ 🐁) implementation and detailed analysis: https:...

90   181   181  

2017-CCF-BDCI-AIJudge

2017-CCF-BDCI-让AI当法官(初赛):7th/415 (Top 1.68%)

79   180   180  

sentence-splitter

Text to sentence splitter using heuristic algorithm by Philipp Koehn a...

27   180   180  

ParseLawDocuments

对收集的法律文档进行一系列分析,包括根据规范自动切分、案件相似度计算、...

57   179   179  

MultiModalStory-demo

FairyTailor: Multimodal Generative Framework for Storytelling

16   179   179  

python_autocomplete

Use Transformers and LSTMs to learn Python source code

41   178   178  

KR-BERT

KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTo...

20   178   178  

Python_Natural_Language_Processing

This repository consists of a complete guide on natural language proce...

170   178   178  

Magento-Chatbot

Magento Chatbot Integration with Telegram, Messenger, Whatsapp, WeChat...

61   177   177  

xatkit

The simplest way to build all types of smart chatbots and digital assi...

22   177   177  

A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling

PyTorch Implementation of "A Hierarchical Latent Structure for Variati...

46   176   176  

nlp_research

NLP research:基于tensorflow的nlp深度学习项目,支持文本分类/句子匹配/...

48   176   176  

ML

此仓库将介绍Deep Learning 所需要的基础知识以及NLP方面的模型原理到项目...

55   176   176  

AI-Competition-Collections

AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验...

23   176   176  

ner-slot_filling

中文自然语言的实体抽取和意图识别(Natural Language Understanding),可...

45   175   175  

AI-NLP-Paper-Readings

This is my reading list for my PhD in AI, NLP, Deep Learning and more.

26   175   175  

easy-bert

A Dead Simple BERT API for Python and Java (https://github.com/google-...

45   175   175  

LiveActionMap

An attempt to map the areas with active conflict in Ukraine using twit...

15   175   175  

text-detector

Tool which allow you to detect and translate text.

41   175   175  

APE

Parser for Attempto Controlled English (ACE)

28   175   175  

ruijin_round2

瑞金医院MMC人工智能辅助构建知识图谱大赛复赛

56   174   174  

awesome-data-science-viz

:boom: :chart_with_upwards_trend: A curated list of data science, anal...

27   174   174  

pymetamap

Python wraper for MetaMap

62   174   174  

pytorch-acnn-model

code of Relation Classification via Multi-Level Attention CNNs

31   172   172  

keras-xlnet

Implementation of XLNet that can load pretrained checkpoints

26   171   171  

Hash-Embeddings

PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to t...

26   171   171  

ML-DL-scripts

The repository provides usefull python scripts for ML and data analysi...

78   171   171  

NLP-pretrained-model

A collection of Natural language processing pre-trained models.

29   170   170  

RobBERT

A Dutch RoBERTa-based language model

26   170   170  

VDCNN

Implementation of Very Deep Convolutional Neural Network for Text Clas...

39   169   169  

All4NLP

All For NLP, especially Chinese.

32   169   169  

monkeylearn-python

Official Python client for the MonkeyLearn API. Build and consume mach...

44   169   169  

turkish-morphology

A two-level morphological analyzer for Turkish.

27   169   169  

wefe

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a fra...

11   169   169  

tldr-transformers

The "tl;dr" on a few notable transformer papers (pre-2022).

4   169   169  

genie-toolkit

The Genie open source kit for voice assistant (formerly known as Almon...

30   169   169