Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

phrasal

A large-scale statistical machine translation system written in Java.

89   201   201  

paraphrase_identification

Examine two sentences and determine whether they have the same meaning...

78   201   201  

vntk

Vietnamese NLP Toolkit for Node

59   200   200  

markup

A web-based document annotation tool, powered by GPT-4 :rocket:

32   199   199  

nl2sql

阿里天池首届中文NL2SQL挑战赛top6

49   198   198  

arXivNotes

IssuesにNLP(自然言語処理)に関連するの論文を読んだまとめを書いていま...

8   197   197  

Black-Box-Tuning

ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'20...

20   197   197  

notebooks

Jupyter Notebooks with Deep Learning Tutorials

123   196   196  

displacy-ent

:boom: displaCy-ent.js: An open-source named entity visualiser for the...

43   196   196  

dkpro-core

Collection of software components for natural language processing (NLP...

71   196   196  

cleanNLP

R package providing annotators and a normalized data model for natural...

36   196   196  

googleLanguageR

R client for the Google Translation API, Google Cloud Natural Language...

42   196   196  

ernie

Simple State-of-the-Art BERT-Based Sentence Classification with Keras...

27   195   195  

CRF-Layer-on-the-Top-of-BiLSTM

The CRF Layer was implemented by using Chainer 2.0. Please see more de...

53   195   195  

Deep-Survey-Text-Classification

The project surveys 16+ Natural Language Processing (NLP) research pap...

58   194   194  

sentence-similarity

This repository contains various ways to calculate sentence vector sim...

36   194   194  

Deep-math-machine-learning.ai

A blog which talks about machine learning, deep learning algorithms an...

180   194   194  

DeepToxic

top 1% solution to toxic comment classification challenge on Kaggle.

70   194   194  

neuro

🔮 Neuro.js is machine learning library for building AI assistants and...

33   194   194  

Tree-Transformer

Implementation of the paper Tree Transformer

31   193   193  

SyferText

A privacy preserving NLP framework

50   192   192  

NewsRecommender

A news recommendation system tailored for user communities

86   192   192  

konoha

🌿 An easy-to-use Japanese Text Processing tool, which makes it possib...

19   192   192  

Automated-Fact-Checking-Resources

Links to conference/journal publications in automated fact-checking (r...

22   192   192  

natml-unity

High performance, cross-platform machine learning for Unity Engine. Re...

20   191   191  

PyContinual

PyContinual (An Easy and Extendible Framework for Continual Learning)

48   190   190  

jupyterlab-prodigy

🧬 A JupyterLab extension for annotating data with Prodigy

23   189   189  

textvec

Text vectorization tool to outperform TFIDF for classification tasks

26   188   188  

glad

Global-Locally Self-Attentive Dialogue State Tracker

48   186   186  

NLPre

Python library for Natural Language Preprocessing (NLPre)

32   186   186  

ngx-dynamic-dashboard-framework

This is a JSON driven angular x based dashboard framework that is insp...

95   186   186  

blueprints-text

Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis...

109   186   186  

GPT2

PyTorch Implementation of OpenAI GPT-2

44   185   185  

Recurrent-Convolutional-Neural-Network-Text-Classifier

My (slightly modified) Keras implementation of the Recurrent Convoluti...

70   184   184  

KB-InfoBot

A dialogue bot for information access

66   184   184  

naturalcc

NaturalCC: An Open-Source Toolkit for Code Intelligence

28   184   184  

Kevinpro-NLP-demo

All NLP you Need Here. 目前包含15个NLP demo的pytorch实现(大量代码借...

37   184   184  

hntitlenator

Test your HN title against a neural network

13   183   183  

OPUS-MT-train

Training open neural machine translation models

33   183   183  

SwiftyChrono

A natural language date parser in Swift (ported from chrono.js)

46   182   182  

awesome-hungarian-nlp

A curated list of NLP resources for Hungarian

17   182   182  

nel

Entity linking framework

40   181   181  

files2rouge

Calculating ROUGE score between two files (line-by-line)

51   181   181  

goodreads

code samples for the goodreads datasets

47   181   181  

lineflow

:zap:A Lightweight NLP Data Loader for All Deep Learning Frameworks in...

9   178   178  

multihead-siamese-nets

Implementation of Siamese Neural Networks built upon multihead attenti...

42   177   177  

covid-papers-browser

Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠...

26   176   176  

swagaf

Repository for paper "SWAG: A Large-Scale Adversarial Dataset for Grou...

38   175   175  

LiveActionMap

An attempt to map the areas with active conflict in Ukraine using open...

17   175   175  

spacymoji

💙 Emoji handling and meta data for spaCy with custom extension attrib...

17   174   174