Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of machine intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

ner-lstm

Named Entity Recognition using multilayered bidirectional LSTM

181   538   538  

NLP_bahasa_resources

A Curated List of Dataset and Usable Library Resources for NLP in Baha...

141   538   538  

weixin_public_corpus

WeChat public account corpus

167   536   536  

nlprule

A fast, low-resource Natural Language Processing and Text Correction l...

39   531   531  

happy-transformer

Happy Transformer makes it easy to fine-tune and perform inference wit...

68   529   529  

Deep-Semantic-Similarity-Model

My Keras implementation of the Deep Semantic Similarity Model (DSSM)/C...

189   526   526  

CS224n-2019-solutions

Complete solutions for Stanford CS224n, Winter 2019

232   525   525  

Book-SocialMediaMiningPython

Companion code for the book "Mastering Social Media Mining with Python...

265   523   523  

Mengzi

Mengzi Pretrained Models

60   518   518  

Goopt

🔍 Search Engine for a Procedural Simulation of the Web with GPT-3.

37   517   517  

ln2sql

A tool to query a database in natural language

200   513   513  

BERT4doc-Classification

Code and source for paper "How to Fine-Tune BERT for Text Classificat...

81   513   513  

embedbase

A dead-simple API to build LLM-powered apps

53   506   506  

pyswip

PySwip is a Python-Prolog interface that enables querying SWI-Prolog i...

99   503   503  

malaya

Natural Language Toolkit for Malaysian language, https://malaya.readt...

131   500   500  

VnCoreNLP

A Vietnamese natural language processing toolkit (NAACL 2018)

134   497   497  

MatchZoo-py

Facilitating the design, comparison and sharing of deep text matching...

106   496   496  

DataScience_ArtificialIntelligence_Utils

Examples of Data Science projects and Artificial Intelligence use-case...

296   494   494  

Best-Data-Science-Resources

This repository contains the best Data Science free hand-picked resour...

141   494   494  

RNNLG

RNNLG is an open source benchmark toolkit for Natural Language Generat...

125   490   490  

CPM-Live

Live Training for Open-source Big Models

34   490   490  

code_search

Code For Medium Article: "How To Create Natural Language Semantic Sear...

137   489   489  

prodigy-recipes

🍳 Recipes for Prodigy, our fully scriptable annotation tool

117   489   489  

awesome-text-generation

A curated list of recent models of text generation and application

75   487   487  

neural-vqa

Visual Question Answering in Torch

95   487   487  

MLInterview

A curated awesome list of AI Startups in India & Machine Lea...

148   484   484  

Sherlock

Natural-language event parser for JavaScript

34   479   479  

CNSurvey

A list of Chinese-language survey articles (natural language processing & machine learning)

87   478   478  

pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Lan...

68   476   476  

BioSentVec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words a...

80   476   476  

bluebert

BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-II...

73   472   472  

PyShortTextCategorization

Various Algorithms for Short Text Mining

75   471   471  

LMaaS-Papers

Awesome papers on Language-Model-as-a-Service (LMaaS)

30   470   470  

Data-Science-and-Machine-Learning-Projects-Dojo

collections of data science, machine learning and data visualization p...

96   470   470  

indic_nlp_library

Resources and tools for Indian language Natural Language Processing

148   469   469  

kaggle-HomeDepot

3rd Place Solution for HomeDepot Product Search Results Relevance Comp...

210   467   467  

oie-resources

A curated list of Open Information Extraction (OIE) resources: papers,...

55   465   465  

Coursera-Deep-Learning

My notes / works on deep learning from Coursera

365   465   465  

small-text

Active Learning for Text Classification in Python

48   463   463  

cogcomp-nlp

CogComp's Natural Language Processing Libraries and Demos: Modules inc...

147   461   461  

dont-stop-pretraining

Code associated with the Don't Stop Pretraining ACL 2020 paper

59   460   460  

bert-embedding

🔡 Token level embeddings from BERT model on mxnet and gluonnlp

68   450   450  

PromptKG

PromptKG Family: a Gallery of Prompt Learning & KG-related research wo...

52   448   448  

Reductio

Automatic text summarizer in Swift

38   446   446  

natural-language-processing

Programming Assignments and Lectures for Stanford's CS 224: Natural La...

273   446   446  

edenai-apis

Eden AI: simplify the use and deployment of AI technologies by providi...

67   442   442  

CS224n-Reading-Notes

CS224n Reading Notes in Chinese

112   441   441  

NLP-conference-compendium

Compendium of the resources available from top NLP conferences.

50   440   440  

awesome-arabic

A curated list of awesome projects and dev/design resources for suppor...

93   437   437  

Transformers.jl

Julia Implementation of Transformer models

54   437   437