Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through language. In 1950, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have advanced language modeling, parsing, and other natural-language tasks.

CS224n-2019-solutions

Complete solutions for Stanford CS224n, winter 2019

232   525   525  

Book-SocialMediaMiningPython

Companion code for the book "Mastering Social Media Mining with Python...

265   523   523  

Mengzi

Mengzi Pretrained Models

60   518   518  

Goopt

🔍 Search Engine for a Procedural Simulation of the Web with GPT-3.

37   517   517  

BERT4doc-Classification

Code and source for paper "How to Fine-Tune BERT for Text Classificat...

81   513   513  

embedbase

A dead-simple API to build LLM-powered apps

53   506   506  

pyswip

PySwip is a Python-Prolog interface that enables querying SWI-Prolog i...

99   503   503  

nextjs-chatgpt-app

💬 Responsive chat application powered by OpenAI's GPT-4, with respons...

59   498   498  

VnCoreNLP

A Vietnamese natural language processing toolkit (NAACL 2018)

134   497   497  

MatchZoo-py

Facilitating the design, comparison and sharing of deep text matching...

106   496   496  

DataScience_ArtificialIntelligence_Utils

Examples of Data Science projects and Artificial Intelligence use-case...

296   494   494  

Best-Data-Science-Resources

This repository contains the best Data Science free hand-picked resour...

141   494   494  

RNNLG

RNNLG is an open source benchmark toolkit for Natural Language Generat...

125   490   490  

CPM-Live

Live Training for Open-source Big Models

34   490   490  

prodigy-recipes

🍳 Recipes for Prodigy, our fully scriptable annotation tool

117   489   489  

awesome-text-generation

A curated list of recent models of text generation and application

75   487   487  

neural-vqa

❔ Visual Question Answering in Torch

95   487   487  

MLInterview

A curated awesome list of AI Startups in India & Machine Lea...

148   484   484  

code_search

Code For Medium Article: "How To Create Natural Language Semantic Sear...

138   480   480  

Sherlock

Natural-language event parser for Javascript

34   479   479  

CNSurvey

A list of Chinese-language survey articles (natural language processing & machine learning)

87   478   478  

BioSentVec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words a...

80   476   476  

bluebert

BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-II...

73   472   472  

LMaaS-Papers

Awesome papers on Language-Model-as-a-Service (LMaaS)

30   470   470  

PyShortTextCategorization

Various Algorithms for Short Text Mining

72   469   469  

indic_nlp_library

Resources and tools for Indian language Natural Language Processing

148   469   469  

kaggle-HomeDepot

3rd Place Solution for HomeDepot Product Search Results Relevance Comp...

210   467   467  

oie-resources

A curated list of Open Information Extraction (OIE) resources: papers,...

55   465   465  

pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Lan...

64   464   464  

ln2sql

A tool to query a database in natural language

189   463   463  

small-text

Active Learning for Text Classification in Python

48   463   463  

cogcomp-nlp

CogComp's Natural Language Processing Libraries and Demos: Modules inc...

147   461   461  

dont-stop-pretraining

Code associated with the Don't Stop Pretraining ACL 2020 paper

59   460   460  

bert-embedding

🔡 Token level embeddings from BERT model on mxnet and gluonnlp

68   450   450  

PromptKG

PromptKG Family: a Gallery of Prompt Learning & KG-related research wo...

52   448   448  

natural-language-processing

Programming Assignments and Lectures for Stanford's CS 224: Natural La...

273   446   446  

Reductio

Automatic text summarizer in Swift

38   446   446  

CS224n-Reading-Notes

CS224n Reading Notes in Chinese

112   441   441  

NLP-conference-compendium

Compendium of the resources available from top NLP conferences.

50   440   440  

Transformers.jl

Julia Implementation of Transformer models

54   437   437  

awesome-arabic

A curated list of awesome projects and dev/design resources for suppor...

93   437   437  

pytorch-bert-crf-ner

Korean named entity recognizer built with KoBERT and CRF (BERT+CRF based Named Entity R...

104   437   437  

NeuroNLP2

Deep neural models for core NLP tasks (Pytorch version)

89   435   435  

Deep-Learning-NLP

📡 Organized Resources for Deep Learning in Natural Language...

125   433   433  

Aspect-Based-Sentiment-Analysis

A paper list for aspect based sentiment analysis.

84   431   431  

Multimodal-Toolkit

Multimodal model for text and tabular data with HuggingFace transforme...

69   427   427  

DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transf...

124   427   427  

nlp-papers-with-arxiv

Statistics and accepted paper list of NLP conferences with arXiv link

53   426   426  

TextFooler

A Model for Natural Language Attack on Text Classification and Inferen...

71   423   423  

ChineseBLUE

Chinese Biomedical Language Understanding Evaluation benchmark (Chines...

82   423   423