Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers process, understand, and generate human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

Repositories (1431)

BLINK_Benchmark zeyofu Python

This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.org/abs/2404.12...

165 8 165

words_counted abitdodgy Ruby

A Ruby natural language processor.

164 28 164

prenlp lyeoni Python

Preprocessing Library for Natural Language Processing

164 12 164

RBERT jonathanbratt R

Implementation of BERT in R

164 19 164

nlp-startups Huffon

A list of startups in South Korea that research and develop natural language processing technology

164 17 164

mtdata thammegowda Python

A tool that locates, downloads, and extracts machine translation corpora

163 23 163

OSWorld-G xlang-ai TypeScript

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

163 7 163

postagga turbopape Clojure

A Library to parse natural language in pure Clojure and ClojureScript

162 16 162

pythonrouge tagucci Perl

Python wrapper for evaluating summarization quality with the ROUGE package

162 34 162

KoSentenceBERT-ETRI BM-K Python

Sentence Embeddings using Siamese ETRI KoBERT

162 24 162

awesome-AI-tutorials-surveys qingsongedu

A professional list of Tutorials and Surveys on DL, ML, DM, CV, NLP, Speech in top AI conferences and journals.

162 19 162

TreebankPreprocessing hankcs Python

Python scripts preprocessing Penn Treebank and Chinese Treebank

161 42 161

ETO Yifan-Song793 Python

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

161 15 161

awesome-ai-services sekwiatkowski Java

An overview of the AI-as-a-service landscape

160 23 160

python-tutorial-notebooks dcavar Jupyter Notebook

Python tutorials as Jupyter Notebooks for NLP, ML, AI

160 99 160

byt5-geotagging Yachay-AI Python

A confidence- and ByT5-based geotagging model that predicts coordinates from text alone.

160 22 160

DeepLearning_NLP supercoderhawk Python

A natural language processing library based on deep learning

159 40 159

lingo chewxy Go

package lingo provides the data structures and algorithms required for natural language processing

158 15 158

MarathiNLP l3cube-pune Jupyter Notebook

MarathiNLP is a repository dedicated to the development of tools and resources for the Marathi language.

158 19 158

awesome-llm-os bilalonur

A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).

158 11 158

augmenty KennethEnevoldsen Python

Augmenty is an augmentation library based on spaCy for augmenting texts.

156 9 156

paper-survey shunk031 HTML

📚 Survey of previous research and related works on machine learning (especially Deep Learning) in Japanese

155 12 155

sluice-networks sebastianruder Python

Code for Sluice networks: Learning what to share between loosely related tasks

155 35 155

long-doc-summarization huankoh

Long Document Summarization Papers

155 12 155

AttrPrompt yueyu1030 Python

[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.

155 14 155

anchoring-ai AnchoringAI JavaScript

An open-source no-code tool for teams to collaborate on building, evaluating, and hosting applications leveraging GPT and other large language models....

155 28 155

CheXbert stanfordmlgroup Python

Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT

155 32 155

Knowledge-Conflicts-Survey pillowsofwind

[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"

154 8 154

Deep-Lyrics tonybeltramelli Python

Lyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network

153 27 153

asari Hironsan Python

Japanese sentiment analyzer implemented in Python.

153 21 153

LongRoPE jshuadvd Python

Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"

153 13 153

LightThinker zjunlp Python

[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression

153 5 153

Ask2Transformers osainz59 Python

A Framework for Textual Entailment based Zero Shot text classification

152 13 152

negapoji liaoziyang Python

Japanese negative/positive classification. Determines whether Japanese documents are negative or positive.

151 33 151

clicr clips Python

Machine reading comprehension on clinical case reports

151 39 151

lftk brucewlee Python

[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate s...

151 27 151

Spider2-V xlang-ai Jupyter Notebook

[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

151 16 151