Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1431)

BLINK_Benchmark
BLINK_Benchmark zeyofu Python

This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.org/abs/2404.12...

165
words_counted
words_counted abitdodgy Ruby

A Ruby natural language processor.

164
prenlp
prenlp lyeoni Python

Preprocessing Library for Natural Language Processing

164
RBERT
RBERT jonathanbratt R

Implementation of BERT in R

164
nlp-startups
nlp-startups Huffon

국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록

164
mtdata
mtdata thammegowda Python

A tool that locates, downloads, and extracts machine translation corpora

163
OSWorld-G
OSWorld-G xlang-ai TypeScript

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

163
postagga
postagga turbopape Clojure

A Library to parse natural language in pure Clojure and ClojureScript

162
pythonrouge
pythonrouge tagucci Perl

Python wrapper for evaluating summarization quality by ROUGE package

162
KoSentenceBERT-ETRI
KoSentenceBERT-ETRI BM-K Python

Sentence Embeddings using Siamese ETRI KoBERT

162
awesome-AI-tutorials-surveys
awesome-AI-tutorials-surveys qingsongedu

A professional list of Tutorials and Surveys on DL, ML, DM, CV, NLP, Speech in top AI conferences and journals.

162
TreebankPreprocessing
TreebankPreprocessing hankcs Python

Python scripts preprocessing Penn Treebank and Chinese Treebank

161
ETO
ETO Yifan-Song793 Python

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

161
awesome-ai-services
awesome-ai-services sekwiatkowski Java

An overview of the AI-as-a-service landscape

160
python-tutorial-notebooks
python-tutorial-notebooks dcavar Jupyter Notebook

Python tutorials as Jupyter Notebooks for NLP, ML, AI

160
byt5-geotagging
byt5-geotagging Yachay-AI Python

Confidence and Byt5 - based geotagging model predicting coordinates from text alone.

160
DeepLearning_NLP
DeepLearning_NLP supercoderhawk Python

基于深度学习的自然语言处理库

159
lingo
lingo chewxy Go

package lingo provides the data structures and algorithms required for natural language processing

158
MarathiNLP
MarathiNLP l3cube-pune Jupyter Notebook

Marathi NLP - is a repository dedicated to development of tools and resources for Marathi language.

158
awesome-llm-os
awesome-llm-os bilalonur

A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).

158
augmenty
augmenty KennethEnevoldsen Python

Augmenty is an augmentation library based on spaCy for augmenting texts.

156
paper-survey
paper-survey shunk031 HTML

📚 Survey of previous research and related works on machine learning (especially Deep Learning) in Japanese

155
sluice-networks
sluice-networks sebastianruder Python

Code for Sluice networks: Learning what to share between loosely related tasks

155
long-doc-summarization
long-doc-summarization huankoh

Long Document Summarization Papers

155
AttrPrompt
AttrPrompt yueyu1030 Python

[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.

155
anchoring-ai
anchoring-ai AnchoringAI JavaScript

An open-source no-code tool for teams to collaborate on building, evaluating, and hosting applications leveraging GPT and other large language models....

155
CheXbert
CheXbert stanfordmlgroup Python

Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT

155
Knowledge-Conflicts-Survey
Knowledge-Conflicts-Survey pillowsofwind

[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"

154
Deep-Lyrics
Deep-Lyrics tonybeltramelli Python

Lyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network

153
asari
asari Hironsan Python

Japanese sentiment analyzer implemented in Python.

153
LongRoPE
LongRoPE jshuadvd Python

Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper

153
LightThinker
LightThinker zjunlp Python

[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression

153
Ask2Transformers
Ask2Transformers osainz59 Python

A Framework for Textual Entailment based Zero Shot text classification

152
negapoji
negapoji liaoziyang Python

Japanese negative positive classification.日本語文書のネガポジを判定。

151
clicr
clicr clips Python

Machine reading comprehension on clinical case reports

151
lftk
lftk brucewlee Python

[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate s...

151
Spider2-V
Spider2-V xlang-ai Jupyter Notebook

[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

151
NeuSum
NeuSum magic282 Python

Code for the ACL 2018 paper "Neural Document Summarization by Jointly Learning to Score and Select Sentences"

150
DeezyMatch
DeezyMatch Living-with-machines Jupyter Notebook

A Flexible Deep Learning Approach to Fuzzy String Matching

150
greek-bert
greek-bert nlpaueb Python

A Greek edition of BERT pre-trained language model

149
gpt-paper-title-generator
gpt-paper-title-generator csinva Jupyter Notebook

Generating paper titles (and more!) with GPT trained on data scraped from arXiv.

149
ake-datasets
ake-datasets boudinfl Shell

Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.

148
awesome-speech-translation
awesome-speech-translation dqqcasia
148
htmldate
htmldate adbar Python

Fast and robust date extraction from web pages, with Python or on the command-line

148
ChatSQL
ChatSQL ademakdogan Python

Convert the given plain text to MySQL query by ChatGPT

148
WorfBench
WorfBench zjunlp Python

[ICLR 2025] Benchmarking Agentic Workflow Generation

148
quantulum3
quantulum3 nielstron Python

Library for unit extraction - fork of quantulum for python3

147
OneGen
OneGen zjunlp Python

[EMNLP 2024] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.

147
detecting-scientific-claim
detecting-scientific-claim titipata Python

Extracting scientific claims from biomedical abstracts (powered by AllenNLP)

146
GAIN
GAIN DreamInvoker Python

Source code for EMNLP 2020 paper: Double Graph Based Reasoning for Document-level Relation Extraction

146