Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article proposing a measure of machine intelligence, now called the Turing test. More recent techniques, such as deep learning, have advanced language modeling, parsing, and other natural-language tasks.

Repositories (1402)

spacy-services
spacy-services explosion Python

💫 REST microservices for various spaCy-related tasks

240
GermanWordEmbeddings
GermanWordEmbeddings devmount Jupyter Notebook

Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated test sets. Built with Gensim and TensorFlow.

239
tensorflow_qrnn
tensorflow_qrnn icoxfog417 Python

QRNN implementation for TensorFlow

236
open-sesame
open-sesame swabhs Python

A frame-semantic parsing system based on a softmax-margin SegRNN.

236
word2vec-pytorch
word2vec-pytorch OlgaChernytska Python

Implementation of the first paper on word2vec

236
KnowAgent
KnowAgent zjunlp Python

[NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents

235
neuralqa
neuralqa victordibia JavaScript

NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT

233
GLiREL
GLiREL jackboyla Python

Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)

232
natml-unity
natml-unity natmlx C#

High performance, cross-platform machine learning for Unity Engine.

231
TEXTOIR
TEXTOIR thuiar Python

TEXTOIR is the first open-source toolkit for text open intent recognition. (ACL 2021)

231
encodechka
encodechka avidale Python

The tiniest sentence encoder for Russian language

231
bert-vocab-builder
bert-vocab-builder kwonmha Python

Builds a WordPiece (subword) vocabulary compatible with Google Research's BERT

230
PIE
PIE awasthiabhijeet Macaulay2

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Seq...

230
visdial
visdial batra-mlp-lab Lua

[CVPR 2017] Torch code for Visual Dialog

230
presidio-research
presidio-research microsoft Jupyter Notebook

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as w...

230
AutoAct
AutoAct zjunlp Python

[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning

229
NLP4Rec-Papers
NLP4Rec-Papers THUDM

Paper list of NLP for recommender systems

229
TabularSemanticParsing
TabularSemanticParsing salesforce Jupyter Notebook

Translating natural language questions to structured query language (SQL)

229
AIDL_KB
AIDL_KB arthchan2003 HTML

A Knowledge Base for the FB Group Artificial Intelligence and Deep Learning (AIDL)

228
turkish-stemmer-python
turkish-stemmer-python otuncelli Python

🐍 Turkish Language Stemmer for Python

228
vec4ir
vec4ir lgalke Python

Word Embeddings for Information Retrieval

225
paraphrase_identification
paraphrase_identification wasiahmad HTML

Examine two sentences and determine whether they have the same meaning.

224
FedNLP
FedNLP FedML-AI

FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Researc...

223
Awesome-Biomolecule-Language-Cross-Modeling
Awesome-Biomolecule-Language-Cross-Modeling QizhiPei

Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through Multi-Modal Le...

220
llama-2-jax
llama-2-jax ayaka14732 Python

JAX implementation of the Llama 2 model

219
data-science-toolkit
data-science-toolkit pmaji HTML

Collection of stats, modeling, and data science tools in Python and R.

219
claf
claf naver Python

CLaF: Open-Source Clova Language Framework

218
classy-classification
classy-classification davidberenstein1957 Python

This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-shot classific...

218
vntk
vntk vunb JavaScript

Vietnamese NLP Toolkit for Node

217
nl2sql
nl2sql eguilg Python

Top 6 in the first Alibaba Tianchi Chinese NL2SQL Challenge

217
awesome-NLP-resources
awesome-NLP-resources HanXinzi-AI

A collection of NLP projects and tools.

216
AGI-Papers
AGI-Papers gyunggyung

Papers and books to look at when starting out in AGI 📚

215
Awesome-NLP-Resources
Awesome-NLP-Resources Robofied

This repository contains landmark research papers in Natural Language Processing that came out in this century.

215
udpipe
udpipe bnosac C++

R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit

214
Tree-Transformer
Tree-Transformer yaushian Python

Implementation of the paper Tree Transformer

214
awesome-llm-courses
awesome-llm-courses wikit-ai

A curated list of awesome online courses about Large Language Models (LLMs)

213
fixy
fixy Fixy-TR Jupyter Notebook

Our goal: solving many different problems in the Turkish NLP literature together, proposing unique approaches, and the shortcomings of existing work in the literature...

213
delbot
delbot shaildeliwala Python

It understands your voice commands, searches news and knowledge sources, and summarizes and reads out content to you.

212
SimplyRetrieve
SimplyRetrieve RCGAI Python

Lightweight chat AI platform featuring custom knowledge, open-source LLMs, prompt-engineering, retrieval analysis. Highly customizable. For Retrieval-...

212
graph-convolution-nlp
graph-convolution-nlp icoxfog417 Jupyter Notebook

Graph Convolution Network for NLP

212
phrasal
phrasal stanfordnlp Java

A large-scale statistical machine translation system written in Java.

211
deeplearning.ai
deeplearning.ai limberc HTML

211
PersianQA
PersianQA sajjjadayobi Jupyter Notebook

Persian (Farsi) Question Answering Dataset (+ Models)

210
LAMDA-SSL
LAMDA-SSL YGZWQZD Python

30 Semi-Supervised Learning Algorithms

208
XFUND
XFUND doc-analysis

XFUND: A Multilingual Form Understanding Benchmark

208
DeepLearning.AI-TensorFlow-Developer-Professional-Certificate
DeepLearning.AI-TensorFlow-Developer-Professional-Certificate williamcwi Jupyter Notebook

DeepLearning.AI TensorFlow Developer Professional Certificate

208
CRF-Layer-on-the-Top-of-BiLSTM
CRF-Layer-on-the-Top-of-BiLSTM createmomo Python

The CRF Layer was implemented by using Chainer 2.0. Please see more details here: https://createmomo.github.io/2017/09/12/CRF_Layer_on_the_Top_of_BiLS...

207
awesome-ukrainian-nlp
awesome-ukrainian-nlp osyvokon

Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libraries, etc.)

207
PaperScraper
PaperScraper NLPatVCU Python

A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university-accessible journals.

207
NewsRecommender
NewsRecommender huangy22 Jupyter Notebook

A news recommendation system tailored for user communities

206