Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

word2word

Easy-to-use word-to-word translations for 3,564 language pairs.

52   340   340  

language_tool_python

a free python grammar checker 📝✅

41   340   340  

node-word2vec

Node.js interface to the Google word2vec tool.

54   340   340  

program-y

Python 3.x based AIML 2.0 Chatbot interpreter, framework, related prog...

134   339   339  

contextualSpellCheck

✔️Contextual word checker for better suggestions

48   339   339  

nagisa

A Japanese tokenizer based on recurrent neural networks

19   339   339  

simple-effective-text-matching

Source code of the ACL2019 paper "Simple and Effective Text Matching w...

67   338   338  

MedCAT

Medical Concept Annotation Tool

90   338   338  

Awesome-Simultaneous-Translation

Paper list of simultaneous translation / streaming translation, includ...

3   337   337  

COVID-QA

API & Webapp to answer questions about COVID-19. Using NLP (Question A...

121   335   335  

airy

💬 Open Source App Framework to build streaming apps with real-time da...

46   335   335  

German-NLP

Curated list of open-access/open-source/off-the-shelf resources and to...

51   334   334  

jiebaR

Chinese text segmentation with R. R语言中文分词 (文档已更新 🎉 :htt...

110   333   333  

neo4j-nlp

NLP Capabilities in Neo4j

80   333   333  

SentimentAnalysis

Sentiment analysis neural network trained by fine-tuning BERT, ALBERT,...

43   333   333  

troll

Language sentiment analysis and neural networks... for trolls.

19   332   332  

low-resource-languages

Resources for conservation, development, and documentation of low reso...

58   332   332  

tflite-android-transformers

DistilBERT / GPT-2 for on-device inference thanks to TensorFlow Lite w...

66   332   332  

camel_tools

A suite of Arabic natural language processing tools developed by the C...

68   330   330  

deep_srl

Code and pre-trained model for: Deep Semantic Role Labeling: What Work...

76   329   329  

paraphrase-id-tensorflow

Various models and code (Manhattan LSTM, Siamese LSTM + Matching Layer...

72   328   328  

Entity-Linking-Recent-Trends

Recent trends of Entity Linking, Disambiguation, and Representation.

20   327   327  

clifs

Contrastive Language-Image Forensic Search allows free text searching...

36   327   327  

spanish-word-embeddings

Spanish word embeddings computed with different methods and from diffe...

77   326   326  

Seq2seq-Chatbot-for-Keras

This repository contains a new generative model of chatbot based on se...

98   325   325  

qa_match

A simple effective ToolKit for short text matching

83   324   324  

tldrstory

📊 Semantic search for headlines and story text

26   324   324  

gsdmm

GSDMM: Short text clustering

92   324   324  

textaugment

TextAugment: Text Augmentation Library

56   324   324  

Issue-Label-Bot

Code For The Issue Label Bot, an App that automatically labels issues...

103   323   323  

PyKoSpacing

Automatic Korean word spacing with Python

106   322   322  

chatbot_ner

chatbot_ner: Named Entity Recognition for chatbots.

133   321   321  

KR-WordRank

비지도학습 방법으로 한국어 텍스트에서 단어/키워드를 자동으로 추출하는...

57   321   321  

MLDemo

This repo is all the machine learning related project codes and their...

136   319   319  

NLPGNN

1. Use BERT, ALBERT and GPT2 as tensorflow2.0's layer. 2. Implement...

62   318   318  

dliss-tutorial

Tutorial for International Summer School on Deep Learning, 2019

60   317   317  

pytextclassifier

pytextclassifier is a toolkit for text classification. 文本分类,LR,X...

54   317   317  

alpaca_eval

A automatic evaluator for instruction-following language models. Human...

43   317   317  

tner

Language model fine-tuning on NER with an easy interface and cross-dom...

33   316   316  

electra_pytorch

Pretrain and finetune ELECTRA with fastai and huggingface. (Results of...

41   314   314  

NLP_Datasets

My NLP datasets for Russian language

50   313   313  

kg-baseline-pytorch

2019百度的关系抽取比赛,使用Pytorch实现苏神的模型,F1在dev集可达到0.75...

56   313   313  

OpenAI-CLIP

Simple implementation of OpenAI CLIP model in PyTorch.

50   313   313  

BMList

A List of Big Models

10   312   312  

StruMatchDL

Codes for ICML 2022 paper: Matching Structure for Dual Learning

2   311   311  

mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

60   308   308  

megabots

🤖 State-of-the-art, production ready LLM apps made mega-easy, so you d...

29   307   307  

Transformers_for_Text_Classification

基于Transformers的文本分类

66   305   305  

fugashi

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and mo...

26   305   305  

PERT

PERT: Pre-training BERT with Permuted Language Model

22   305   305