Most popular natural-language-processing repositories and open source projects

Turkish-Bert-NLP-Pipeline savasy Jupyter Notebook

Bert-base NLP pipeline for Turkish, Ner, Sentiment Analysis, Question Answering etc.

180 21 180

swagaf rowanz Python

Repository for paper "SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference"

179 39 179

pytorch-pos-tagging bentrevett Jupyter Notebook

A tutorial on how to implement models for part-of-speech tagging using PyTorch and TorchText.

179 27 179

easy-bert robrua Java

A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)

178 45 178

metaknowledge UWNETLAB Python

A Python library for doing bibliometric and network analysis in science and health policy research

178 35 178

LiveActionMap kinshukdua Python

An attempt to map the areas with active conflict in Ukraine using twitter data and NLP.

178 15 178

character-mining emorynlp Python

Mining individual characters in multiparty dialogue

176 26 176

sling ringgaard C++

SLING - A natural language frame semantics parser

176 11 176

gpt2-dialogue-generation-pytorch devjwsong Python

The PyTorch implementation of fine-tuning the GPT-2(Generative Pre-trained Transformer 2) for dialogue generation.

176 25 176

char-cnn-text-classification-pytorch srviest Python

Character-level Convolutional Neural Networks for text classification in PyTorch

175 47 175

deep-learning-for-nlp-lectures dl4nlp-tuda TeX

Deep Learning for Natural Language Processing - Lectures 2023

175 33 175

imodelsX csinva Python

Interpret text data with LLMs (sklearn compatible).

175 27 175

learn-to-select-data sebastianruder Python

Code for Learning to select data for transfer learning with Bayesian Optimization

174 43 174

chars2vec IntuitionEngineeringTeam Python

Character-based word embeddings model based on RNN for handling real world texts

174 38 174

AI-NLP-Paper-Readings zhongpeixiang

This is my reading list for my PhD in AI, NLP, Deep Learning and more.

174 25 174

pythorch-text-classification Lan-ce-lot Python

对豆瓣影评进行文本分类情感分析，利用爬虫豆瓣爬取评论，进行数据清洗，分词，采用BERT、CNN、LSTM等模型进行训练，采用tensorboardX可视化训练过程，自然语言...

174 10 174

pymetamap AnthonyMRios Python

Python wraper for MetaMap

173 62 173

cep cortictechnology JavaScript

CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.

173 25 173

qb Pinafore Python

QANTA Quiz Bowl AI

172 48 172

OneKE zjunlp HTML

[WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.

172 19 172

Movie_Review_Analysis WLXie-Tony Jupyter Notebook

Official replication package for IJFE (2026). Asynchronous ETL pipeline using GPT-4o to quantify investor distraction shocks from unstructured movie r...

172 0 172

NLP-pretrained-model balavenkatesh3322

A collection of Natural language processing pre-trained models.

171 30 171

Pre-modern_Chinese_corpus_dataset JiangYanting HTML

近代汉语语料库数据集自然语言处理语料库古代汉语古汉语文言文数字人文计算语言

171 18 171

PersonaPaper Sahandfer

This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked if available...

171 11 171

awesome-ai4lam AI4LAM SCSS

A list of awesome AI in libraries, archives, and museum collections from around the world 🕶️

171 14 171

question_generation deeppavlov Python

It is a question-generator model. It takes text and an answer as input and outputs a question.

170 57 170

nl4dv nl4dv Python

A python toolkit to create Visualizations (Vis) using natural language (NL) or add an NL interface to existing Vis.

170 27 170

spacyfishing Lucaterre Python

A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata

170 8 170

gitagpt vxsahu TypeScript

Gita GPT A personal productivity assistant (RAG), a platform of AI chatbots, Ask Krishna GPT that uses Bhagavad Gita references to answer your questio...

170 41 170

visdial-rl batra-mlp-lab Python

PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning

169 39 169

monkeylearn-python monkeylearn Python

Official Python client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Python apps.

169 44 169

zamia-ai gooofy Prolog

Free and open source A.I. system based on Python, TensorFlow and Prolog.

169 26 169

fake-news mihail911 Jupyter Notebook

Building a fake news detector from initial ideation to model deployment

168 64 168

parsinlu persiannlp Python

A comprehensive suite of high-level NLP tasks for Persian language

168 23 168

TwitterScraper MatthewWolff Python

Scrape a User's Twitter data! Bypass the 3,200 tweet API limit for a User!

168 17 168

transformer-abstractive-summarization rojagtap Jupyter Notebook

Abstractive Text Summarization using Transformer

168 47 168

Dual-Contrastive-Learning hiyouga Python

Code for our paper "Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation"

168 31 168

Awesome-Mixup Westlake-AI

[Survey] Awesome List of Mixup Augmentation and Beyond (https://arxiv.org/abs/2409.05202)

168 11 168

LLM-Minutes-of-Meeting inboxpraveen HTML

A tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings...

168 18 168