Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

hunspell-dict-ko

Korean spellchecking dictionary for Hunspell

44   266   266  

squirrel-core

A Python library that enables ML teams to share, load, and transform d...

6   266   266  

KeyphraseVectorizers

Set of vectorizers that extract keyphrases with part-of-speech pattern...

36   265   265  

AI-Job-Info

互联网大厂面试经验

39   265   265  

rnn_lstm_from_scratch

How to build RNNs and LSTMs from scratch with NumPy.

70   264   264  

hmni

📛 Fuzzy Name Matching with Machine Learning

50   264   264  

nlp-labelling

Labelling platform for text using weak supervision.

18   264   264  

nlvr

Cornell NLVR and NLVR2 are natural language grounding datasets. Each e...

60   263   263  

MAMS-for-ABSA

A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment anal...

63   262   262  

ua-gec

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrain...

22   262   262  

markup

A web-based document annotation tool, powered by GPT-4 :rocket:

31   262   262  

picollm

On-device LLM Inference Powered by X-Bit Quantization

14   262   262  

blueprints-text

Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis...

146   261   261  

EMPaper

This is a repository for sharing papers in the field of empathetic con...

28   261   261  

character-based-cnn

Implementation of character based convolutional neural network

54   261   261  

scientific-paper-summarisation

Machine learning models to automatically summarise scientific papers

65   261   261  

SpeechTransProgress

Tracking the progress in end-to-end speech translation

25   260   260  

jack

Jack the Reader

79   259   259  

chatbot

Русскоязычный генеративный чатбот с профилем и фактами

64   259   259  

spaczz

Fuzzy matching and more functionality for spaCy.

28   256   256  

ChatGPT-Bot

ChatGPT Bot - AI-powered conversation tool

208   256   256  

I-BERT

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

41   255   255  

kairon

Agentic AI platform that harnesses Visual LLM Chaining to build proact...

82   255   255  

google-bard-api

This project provides a FastAPI wrapper for interacting with Google Ba...

59   254   254  

practical-1

Oxford Deep NLP 2017 course - Practical 1: word2vec

142   254   254  

awesome-hungarian-nlp

A curated list of NLP resources for Hungarian

19   254   254  

konoha

🌿 An easy-to-use Japanese Text Processing tool, which makes it possib...

28   253   253  

cs224n-win2223

Code and written solutions of the assignments of the Stanford CS224N:...

68   253   253  

neat-vision

Neat (Neural Attention) Vision, is a visualization tool for the attent...

24   251   251  

OpenUnivCourses

FREE ML Courses from Top Universities in CS

49   250   250  

Awesome-Swiss-German

Multi-language Analyze text in 26 Cantonal Swiss German, Italian, Germ...

26   249   249  

relevanceai

Home of the AI workforce - Multi-agent system, AI agents & tools

41   249   249  

nuspell

🖋️ Fast and safe spellchecking C++ library

26   249   249  

forte

Forte is a flexible and powerful ML workflow builder. This is part of...

59   248   248  

docprompting

Data and code for "DocPrompting: Generating Code by Retrieving the Doc...

20   248   248  

HugNLP

HugNLP is a unified and comprehensive NLP library based on HuggingFace...

13   247   247  

RESIDE

EMNLP 2018: RESIDE: Improving Distantly-Supervised Neural Relation Ext...

48   247   247  

Speech_Signal_Processing_and_Classification

Front-end speech processing aims at extracting proper features from sh...

66   247   247  

dilated-cnn-ner

Dilated CNNs for NER in TensorFlow

58   244   244  

concise-concepts

This repository contains an easy and intuitive approach to few-shot NE...

14   244   244  

Awesome_Mamba

Computation-Efficient Era: A Comprehensive Survey of State Space Model...

19   243   243  

prosody

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominenc...

41   243   243  

spacy-lookup

Named Entity Recognition based on dictionaries

38   242   242  

nlp_profiler

A simple NLP library allows profiling datasets with one or more text c...

37   242   242  

backprop

Backprop makes it simple to use, finetune, and deploy state-of-the-art...

11   242   242  

text2text

Text2Text: Crosslingual NLP/G toolkit

31   242   242  

chazutsu

The tool to make NLP datasets ready to use

32   241   241  

AIND-NLP

Coding exercises for the Natural Language Processing concentration, pa...

383   241   241  

awesome-ml-blogs

Curated list of technical blogs on machine learning · AI/ML/DL/CV/NLP/...

28   241   241  

spacy-services

💫 REST microservices for various spaCy-related tasks

75   240   240