Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

bi-lstm-crf

A PyTorch implementation of the BI-LSTM-CRF model.

49   197   197  

wrench

WRENCH: Weak supeRvision bENCHmark

27   197   197  

akaza

Yet another Japanese IME for IBus/Linux

6   196   196  

UABSA-SyMux

Codes for the IJCAI2022 paper: Inheriting the Wisdom of Predecessors:...

2   196   196  

displacy-ent

:boom: displaCy-ent.js: An open-source named entity visualiser for the...

43   196   196  

dkpro-core

Collection of software components for natural language processing (NLP...

71   196   196  

THUCTC

An Efficient Chinese Text Classifier

64   196   196  

ernie

Simple State-of-the-Art BERT-Based Sentence Classification with Keras...

27   195   195  

cedille-ai

✒️ Cedille is a large French language model (6B), released under an op...

10   195   195  

neuro

🔮 Neuro.js is machine learning library for building AI assistants and...

33   194   194  

paper-reading

比做算法的懂工程落地,比做工程的懂算法模型。

34   194   194  

ROUGE-2.0

ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N...

39   194   194  

denspi

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Inde...

29   194   194  

financial-news-dataset

Reuters and Bloomberg

92   193   193  

Getting-Started-with-Google-BERT

Build and train state-of-the-art natural language processing models us...

74   193   193  

KoBigBird

🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)

19   193   193  

pdfanno

Linguistic Annotation and Visualization Tool for PDF Documents

58   192   192  

konoha

🌿 An easy-to-use Japanese Text Processing tool, which makes it possib...

19   192   192  

SyferText

A privacy preserving NLP framework

50   192   192  

tmtoolkit

Text Mining and Topic Modeling Toolkit for Python with parallel proces...

28   191   191  

Java-Deep-Learning-Cookbook

Code for Java Deep Learning Cookbook

43   190   190  

jupyterlab-prodigy

🧬 A JupyterLab extension for annotating data with Prodigy

23   189   189  

awesome-community-curated-nlp

Community Curated NLP List

35   189   189  

practical-torchtext

A set of tutorials for torchtext

57   188   188  

guri-vr

https://gurivr.com

42   188   188  

textvec

Text vectorization tool to outperform TFIDF for classification tasks

26   188   188  

edenai-apis

Eden AI: simplify the use and deployment of AI technologies by providi...

21   188   188  

embedding-as-service

One-Stop Solution to encode sentence to fixed length vectors from vari...

29   187   187  

Turkish-Word2Vec

Pre-trained Word2Vec Model for Turkish

29   187   187  

slovnet

Deep Learning based NLP modeling for Russian language

19   187   187  

machine-learning-exams

This repository contains links to machine learning exams, homework ass...

45   187   187  

syntok

Text tokenization and sentence segmentation (segtok v2)

34   186   186  

hawking

A Natural Language Date Time Parser that Extract date and time from te...

13   186   186  

varbook

适合中文程序员的变量命名助手,NLP+翻译,规范变量命名,定制化变量命名规...

38   186   186  

acl-papers

paper summary of Association for Computational Linguistics

10   186   186  

NLPre

Python library for Natural Language Preprocessing (NLPre)

32   186   186  

tokenizers

Fast, Consistent Tokenization of Natural Language Text

25   185   185  

sandbox-topically

Topic modeling helpers using managed language models from Cohere. Name...

16   185   185  

GPT2

PyTorch Implementation of OpenAI GPT-2

44   185   185  

Kevinpro-NLP-demo

All NLP you Need Here. 目前包含15个NLP demo的pytorch实现(大量代码借...

37   184   184  

uniem

unified embedding model

11   184   184  

Recurrent-Convolutional-Neural-Network-Text-Classifier

My (slightly modified) Keras implementation of the Recurrent Convoluti...

70   184   184  

hntitlenator

Test your HN title against a neural network

13   183   183  

Microsoft-Student-Partner-Workshop-Learning-Materials-AI-NLP

This repository contains all codes and materials of the current sessio...

230   183   183  

Guyu

Chinese GPT2: pre-training and fine-tuning framework for text generati...

42   183   183  

coreference-resolution

Efficient and clean PyTorch reimplementation of "End-to-end Neural Cor...

61   183   183  

dna2vec

dna2vec: Consistent vector representations of variable-length k-mers

60   183   183  

twitterDataMining

Twitter数据挖掘及其可视化

67   182   182  

awesome-hungarian-nlp

A curated list of NLP resources for Hungarian

17   182   182  

gpt4-playground

Clone of OpenAI's ChatGPT and Playground environments to enable experi...

74   182   182