Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1431)

dialoglue
dialoglue alexa Python

DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue

287
InsTag
InsTag OFA-Sys

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

285
Multi-Type-TD-TSR
Multi-Type-TD-TSR Psarpei Jupyter Notebook

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition

285
languagecrunch
languagecrunch artpar Python

LanguageCrunch NLP server docker image

285
Good-Papers
Good-Papers hoangcuong2011

I try my best to keep updated cutting-edge knowledge in Machine Learning/Deep Learning and Natural Language Processing. These are my notes on some go...

284
rnn_lstm_from_scratch
rnn_lstm_from_scratch nicklashansen Jupyter Notebook

How to build RNNs and LSTMs from scratch with NumPy.

282
hunspell-dict-ko
hunspell-dict-ko spellcheck-ko Python

Korean spellchecking dictionary for Hunspell

280
relevanceai
relevanceai RelevanceAI Python

Home of the AI workforce - Multi-agent system, AI agents & tools

279
pytorch_graph-rel
pytorch_graph-rel tsujuifu Python

A PyTorch implementation of GraphRel

278
awesome-emotion-recognition-in-conversations
awesome-emotion-recognition-in-conversations declare-lab

A comprehensive reading list for Emotion Recognition in Conversations

277
presidio-research
presidio-research microsoft Jupyter Notebook

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as w...

276
cs224n-win2223
cs224n-win2223 floriankark Python

Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2022/2023

275
native-sparse-attention-triton
native-sparse-attention-triton XunhaoLai Python

Efficient triton implementation of Native Sparse Attention.

275
awesome-hungarian-nlp
awesome-hungarian-nlp oroszgy

A curated list of NLP resources for Hungarian

275
bist-parser
bist-parser elikip Python

Graph-based and Transition-based dependency parsers based on BiLSTMs

275
kairon
kairon digiteinfotech Python

Agentic AI platform that harnesses Visual LLM Chaining to build proactive digital assistants

275
nlp-tutorial
nlp-tutorial bonzanini Jupyter Notebook

Tutorial: Natural Language Processing in Python

274
KagNet
KagNet INK-USC Python

Knowledge-Aware Graph Networks for Commonsense Reasoning (EMNLP-IJCNLP 19)

273
recurrent-entity-networks
recurrent-entity-networks jimfleming Python

TensorFlow implementation of "Tracking the World State with Recurrent Entity Networks".

273
Black-Box-Tuning
Black-Box-Tuning txsun1997 Python

ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Models

271
karukan
karukan togatoga Rust

Japanese Input Method System for Linux, Neural Kana-Kanji Conversion Engine + fcitx5 IME

271
AI-Job-Info
AI-Job-Info Sophia-11

互联网大厂面试经验

270
ua-gec
ua-gec grammarly Macaulay2

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

270
blueprints-text
blueprints-text blueprints-for-text-analytics-python Jupyter Notebook

Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis Using Python"

270
awesome-tensorlayer
awesome-tensorlayer tensorlayer

A curated list of dedicated resources and applications

270
pytorch-question-answering
pytorch-question-answering kushalj001 Jupyter Notebook

Important paper implementations for Question Answering using PyTorch

269
GLiREL
GLiREL jackboyla Python

Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)

269
EMPaper
EMPaper Sahandfer

This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.

269
spacy-api-docker
spacy-api-docker jgontrum Python

spaCy REST API, wrapped in a Docker container.

268
extreme-bert
extreme-bert extreme-bert Python

ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A...

268
markup
markup samueldobbie TypeScript

A web-based document annotation tool, powered by GPT-4 :rocket:

268
nlvr
nlvr lil-lab HTML

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with...

267
I-BERT
I-BERT kssteven418 Python

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

267
awesome-llm-courses
awesome-llm-courses wikit-ai

A curated list of awesome online courses about Large Langage Models (LLMs)

267
MAMS-for-ABSA
MAMS-for-ABSA siat-nlp Python

A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.

267
hmni
hmni Christopher-Thornton Python

📛 Fuzzy Name Matching with Machine Learning

267
KeyphraseVectorizers
KeyphraseVectorizers TimSchopf Python

Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase...

267
squirrel-core
squirrel-core merantix-momentum Python

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:

266
Awesome_Mamba
Awesome_Mamba xmindflow

Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

265
nuspell
nuspell nuspell C++

🖋️ Fast and safe spellchecking C++ library

264
character-based-cnn
character-based-cnn ahmedbesbes Python

Implementation of character based convolutional neural network

262
konoha
konoha himkt Python

🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.

261
nlp-labelling
nlp-labelling dataqa JavaScript

Labelling platform for text using weak supervision.

261
SpeechTransProgress
SpeechTransProgress kahne

Tracking the progress in end-to-end speech translation

261
chatbot
chatbot Koziev Python

Русскоязычный генеративный чатбот с профилем и фактами

260
AI-Text-Humanizer-App
AI-Text-Humanizer-App DadaNanjesha Python

Transform AI-generated text into formal, human-like, and academic writing with ease, avoids AI detector!

260
jack
jack uclnlp Python

Jack the Reader

260
scientific-paper-summarisation
scientific-paper-summarisation EdCo95 Python

Machine learning models to automatically summarise scientific papers

259
spaczz
spaczz gandersen101 Python

Fuzzy matching and more functionality for spaCy.

258
KnowAgent
KnowAgent zjunlp Python

[NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents

257