Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1462)

awesome-knowledge-graph
awesome-knowledge-graph totogo

A curated list of Knowledge Graph related learning materials, databases, tools and other resources

1.8k
DeepLearn
DeepLearn GauravBh1010tt Python

Implementation of research papers on Deep Learning+ NLP+ CV in Python using Keras, Tensorflow and Scikit Learn.

1.8k
openai-kotlin
openai-kotlin aallam Kotlin

OpenAI API client for Kotlin with multiplatform and coroutines capabilities.

1.8k
contextgem
contextgem shcherbak-ai Python

ContextGem: Effortless LLM extraction from documents

1.8k
ChineseNLP
ChineseNLP didi HTML

Datasets, SOTA results of every fields of Chinese NLP

1.8k
QA-Survey-CN
QA-Survey-CN BDBC-KG-NLP

北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答(KBQA),基于文本的问答系统(TextQA),基于...

1.8k
Keras-TextClassification
Keras-TextClassification yongzhuo Python

中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or sh...

1.8k
nltk_data
nltk_data nltk Python

NLTK Data

1.8k
ChineseGLUE
ChineseGLUE ChineseGLUE Python

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

1.8k
Awesome-pytorch-list-CNVersion
Awesome-pytorch-list-CNVersion xavier-zy Jupyter Notebook

Awesome-pytorch-list 翻译工作进行中......

1.8k
NLP-Models-Tensorflow
NLP-Models-Tensorflow mesolitica Jupyter Notebook

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

1.8k
fastRAG
fastRAG IntelLabs Python

Efficient Retrieval Augmentation and Generation Framework

1.8k
Recognizers-Text
Recognizers-Text microsoft C#

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, T...

1.8k
kaggle-CrowdFlower
kaggle-CrowdFlower ChenglongChen C++

1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.

1.8k
transfer-learning-conv-ai
transfer-learning-conv-ai huggingface Python

🦄 State-of-the-Art Conversational AI with Transfer Learning

1.8k
NLP-Knowledge-Graph
NLP-Knowledge-Graph lihanghang

自然语言处理、知识图谱、对话系统,大模型等技术研究与应用。

1.8k
Research
Research PaddlePaddle Python

novel deep learning research works with PaddlePaddle

1.8k
FARM
FARM deepset-ai Python

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

1.8k
paperai
paperai neuml Python

📄 🤖 AI for medical and scientific papers

1.8k
extractous
extractous yobix-ai Rust

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

1.7k
TextInfoExp
TextInfoExp Roshanson Python

自然语言处理实验(sougou数据集),TF-IDF,文本分类、聚类、词向量、情感识别、关系抽取等

1.7k
lightning-flash
lightning-flash Lightning-Universe Python

Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains

1.7k
underthesea
underthesea undertheseanlp Python

Underthesea - Agentic AI Toolkit

1.7k
NeuroNER
NeuroNER Franck-Dernoncourt Python

Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.

1.7k
RAGHub
RAGHub Andrew-Jang

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG eco...

1.7k
cakechat
cakechat lukalabs Python

CakeChat: Emotional Generative Dialog System

1.7k
lingua-py
lingua-py pemistahl Python

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

1.7k
gpt2-ml
gpt2-ml imcaspar Python

GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型

1.7k
TextBrewer
TextBrewer airaria Python

A PyTorch-based knowledge distillation toolkit for natural language processing

1.7k
scibert
scibert allenai Python

A BERT model for scientific text.

1.7k
graph4nlp
graph4nlp graph4ai Python

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html...

1.7k
transformer-deploy
transformer-deploy ELS-RD Python

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

1.7k
jiant
jiant nyu-mll Python

jiant is an nlp toolkit

1.7k
sense2vec
sense2vec explosion Python

🦆 Contextually-keyed word vectors

1.7k
ModernBERT
ModernBERT AnswerDotAI Python

Bringing BERT into modernity via both architecture changes and scaling

1.7k
awesome-ai-ml-dl
awesome-ai-ml-dl neomatrix369 Jupyter Notebook

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics...

1.7k
magnitude
magnitude plasticityai Python

A fast, efficient universal vector embedding utility package.

1.7k
tika-python
tika-python chrismattmann Python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

1.7k
eda_nlp
eda_nlp jasonwei20 Python

Data augmentation for NLP, presented at EMNLP 2019

1.7k
Chinese-XLNet
Chinese-XLNet ymcui Python

Pre-Trained Chinese XLNet(中文XLNet预训练模型)

1.6k
Transformers-Recipe
Transformers-Recipe dair-ai

🧠 A study guide to learn about Transformers

1.6k
pet
pet timoschick Python

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

1.6k
usaddress
usaddress datamade Python

:us: a python library for parsing unstructured United States address strings into address components

1.6k
torchdistill
torchdistill yoshitomo-matsubara Python

A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at T...

1.6k
budoux
budoux google Python
1.6k
delta
delta Delta-ML Python

DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/

1.6k
opennlp
opennlp apache Java

Apache OpenNLP

1.6k
WikiChat
WikiChat stanford-oval Python

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

1.6k
similarity
similarity shibing624 Java

similarity: Text similarity calculation Toolkit for Java. 文本相似度计算工具包,java编写,可用于文本相似度计算、情感分析等任务,开箱即用。

1.6k
TAADpapers
TAADpapers thunlp Python

Must-read Papers on Textual Adversarial Attack and Defense

1.6k