Most popular nlp repositories and open source projects

nlpcda 425776024 Python

一键中文数据增强包； NLP数据增强、bert数据增强、EDA：pip install nlpcda

1.9k 171 7

awesome-knowledge-graph totogo

A curated list of Knowledge Graph related learning materials, databases, tools and other resources

1.9k 174 41

LLMCompiler SqueezeAILab Python

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

1.9k 134 23

contextgem shcherbak-ai Python

ContextGem: Effortless LLM extraction from documents

1.9k 158 12

spago nlpodyssey Go

Self-contained Machine Learning and Natural Language Processing library in Go

1.9k 88 37

DeepLearn GauravBh1010tt Python

Implementation of research papers on Deep Learning+ NLP+ CV in Python using Keras, Tensorflow and Scikit Learn.

1.8k 349 105

openai-kotlin aallam Kotlin

OpenAI API client for Kotlin with multiplatform and coroutines capabilities.

1.8k 236 29

awesome-bert Jiakui

bert nlp papers, applications and github resources, including the newst xlnet ， BERT、XLNet 相关论文和 github 项目

1.8k 346 66

nltk_data nltk Python

NLTK Data

1.8k 1.1k 40

QA-Survey-CN BDBC-KG-NLP

北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答（KBQA），基于文本的问答系统（TextQA），基于...

1.8k 260 40

Keras-TextClassification yongzhuo Python

中文长文本分类、短句子分类、多标签分类、两句子相似度（Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or sh...

1.8k 397 32

ChineseNLP didi HTML

Datasets, SOTA results of every fields of Chinese NLP

1.8k 260 59

Awesome-pytorch-list-CNVersion xavier-zy Jupyter Notebook

Awesome-pytorch-list 翻译工作进行中......

1.8k 402 65

underthesea undertheseanlp Python

Underthesea - AI Assistant

1.8k 308 75

Recognizers-Text microsoft C#

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, T...

1.8k 435 62

fastRAG IntelLabs Python

Efficient Retrieval Augmentation and Generation Framework

1.8k 167 16

ChineseGLUE ChineseGLUE Python

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

1.8k 245 63

NLP-Models-Tensorflow mesolitica Jupyter Notebook

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

1.8k 712 1

kaggle-CrowdFlower ChenglongChen C++

1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.

1.8k 649 99

paperai neuml Python

📄 🤖 AI for medical and scientific papers

1.8k 146 29

extractous yobix-ai Rust

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

1.8k 97 18

lingua-py pemistahl Python

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

1.8k 62 10

NLP-Knowledge-Graph lihanghang

自然语言处理、知识图谱、对话系统，大模型等技术研究与应用。

1.8k 368 59

Research PaddlePaddle Python

novel deep learning research works with PaddlePaddle

1.8k 768 43

transfer-learning-conv-ai huggingface Python

🦄 State-of-the-Art Conversational AI with Transfer Learning

1.8k 431 78

FARM deepset-ai Python

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

1.8k 246 3

budoux google Python

1.7k 42 8

TextInfoExp Roshanson Python

自然语言处理实验（sougou数据集），TF-IDF，文本分类、聚类、词向量、情感识别、关系抽取等

1.7k 756 88

lightning-flash Lightning-Universe Python

Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains

1.7k 211 7

NeuroNER Franck-Dernoncourt Python

Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.

1.7k 472 77

cakechat lukalabs Python

CakeChat: Emotional Generative Dialog System

1.7k 913 0

scibert allenai Python

A BERT model for scientific text.

1.7k 231 49

TextBrewer airaria Python

A PyTorch-based knowledge distillation toolkit for natural language processing

1.7k 244 24

ModernBERT AnswerDotAI Python

Bringing BERT into modernity via both architecture changes and scaling

1.7k 145 28

gpt2-ml imcaspar Python

GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型

1.7k 326 36

awesome-ai-ml-dl neomatrix369 Jupyter Notebook

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics...

1.7k 377 77

transformer-deploy ELS-RD Python

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

1.7k 152 25

graph4nlp graph4ai Python

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html...

1.7k 207 28

sense2vec explosion Python

🦆 Contextually-keyed word vectors

1.7k 236 44

jiant nyu-mll Python

jiant is an nlp toolkit

1.7k 296 40

magnitude plasticityai Python

A fast, efficient universal vector embedding utility package.

1.7k 122 34

tika-python chrismattmann Python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

1.7k 250 39

eda_nlp jasonwei20 Python

Data augmentation for NLP, presented at EMNLP 2019

1.7k 311 34

Chinese-XLNet ymcui Python

Pre-Trained Chinese XLNet（中文XLNet预训练模型）

1.6k 279 30

Transformers-Recipe dair-ai

🧠 A study guide to learn about Transformers

1.6k 163 29

usaddress datamade Python

:us: a python library for parsing unstructured United States address strings into address components

1.6k 308 38

pet timoschick Python

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

1.6k 282 42

torchdistill yoshitomo-matsubara Python

A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at T...

1.6k 145 17

WikiChat stanford-oval Python

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

1.6k 146 17

delta Delta-ML Python

DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/

1.6k 283 10

nlp

Repositories (1478)