Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1431)

AliceMind
AliceMind alibaba Python

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

2k
The-NLP-Pandect
The-NLP-Pandect ivan-bilan Python

A comprehensive reference for all topics related to Natural Language Processing

2k
ABigSurvey
ABigSurvey NiuTrans

A collection of 1000+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML).

2k
Senta
Senta baidu Python

Baidu's open-source Sentiment Analysis System.

2k
Awesome-FL
Awesome-FL youngfish42 Python

Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)

2k
tensorflow-1.4-billion-password-analysis
tensorflow-1.4-billion-password-analysis philipperemy Python

Deep Learning model to analyze a large corpus of clear text passwords.

2k
sling
sling google C++

SLING - A natural language frame semantics parser

1.9k
holiday-cn
holiday-cn NateScarlet Python

📅🇨🇳中国法定节假日数据 自动每日抓取国务院公告

1.9k
NCRFpp
NCRFpp jiesutd Python

NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, wor...

1.9k
bert_score
bert_score Tiiiger Jupyter Notebook

BERT score for text generation

1.9k
How-to-use-Transformers
How-to-use-Transformers jsksxs360 Python

Transformers 库快速入门教程

1.9k
spacy-models
spacy-models explosion Python

💫 Models for the spaCy Natural Language Processing (NLP) library

1.9k
awesome-semi-supervised-learning
awesome-semi-supervised-learning yassouali

😎 An up-to-date & curated list of awesome semi-supervised learning papers, methods & resources.

1.9k
spago
spago nlpodyssey Go

Self-contained Machine Learning and Natural Language Processing library in Go

1.8k
LLMCompiler
LLMCompiler SqueezeAILab Python

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

1.8k
awesome-embedding-models
awesome-embedding-models Hironsan Jupyter Notebook

A curated list of awesome embedding models tutorials, projects and communities.

1.8k
nltk_data
nltk_data nltk Python

NLTK Data

1.8k
WikiSQL
WikiSQL salesforce HTML

A large annotated semantic parsing corpus for developing natural language interfaces.

1.8k
knowledge-graphs
knowledge-graphs shaoxiongji JavaScript

A collection of research on knowledge graphs

1.8k
kaggle-CrowdFlower
kaggle-CrowdFlower ChenglongChen C++

1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.

1.8k
language
language google-research Python

Shared repository for open-sourced projects from the Google AI Language team.

1.8k
lightning-bolts
lightning-bolts Lightning-Universe Python

Toolbox of models, callbacks, and datasets for AI/ML researchers.

1.8k
extractous
extractous yobix-ai Rust

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

1.7k
RAGHub
RAGHub Andrew-Jang

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG eco...

1.7k
underthesea
underthesea undertheseanlp Python

Underthesea - Agentic AI Toolkit

1.7k
lingua-py
lingua-py pemistahl Python

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

1.7k
text-analytics-with-python
text-analytics-with-python dipanjanS Jupyter Notebook

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository...

1.7k
kor
kor eyurtsev Python

LLM(😽)

1.7k
graph4nlp
graph4nlp graph4ai Python

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html...

1.7k
transformer-deploy
transformer-deploy ELS-RD Python

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

1.7k
sense2vec
sense2vec explosion Python

🦆 Contextually-keyed word vectors

1.7k
curator
curator bespokelabsai Python

Synthetic data curation for post-training and structured data extraction

1.7k
magnitude
magnitude plasticityai Python

A fast, efficient universal vector embedding utility package.

1.7k
awesome-ai-ml-dl
awesome-ai-ml-dl neomatrix369 Jupyter Notebook

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics...

1.7k
Chinese-XLNet
Chinese-XLNet ymcui Python

Pre-Trained Chinese XLNet(中文XLNet预训练模型)

1.6k
Transformers-Recipe
Transformers-Recipe dair-ai

🧠 A study guide to learn about Transformers

1.6k
Style-Transfer-in-Text
Style-Transfer-in-Text fuzhenxin

Paper List for Style Transfer in Text

1.6k
DAT8
DAT8 justmarkham Jupyter Notebook

General Assembly's 2015 Data Science course in Washington, DC

1.6k
usaddress
usaddress datamade Python

:us: a python library for parsing unstructured United States address strings into address components

1.6k
torchdistill
torchdistill yoshitomo-matsubara Python

A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at T...

1.6k
Macaw-LLM
Macaw-LLM lyuchenyang Python

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

1.6k
pke
pke boudinfl Python

Python Keyphrase Extraction module

1.6k
WikiChat
WikiChat stanford-oval Python

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

1.6k
TAADpapers
TAADpapers thunlp Python

Must-read Papers on Textual Adversarial Attack and Defense

1.6k
entity-recognition-datasets
entity-recognition-datasets juand-r Python

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domain...

1.6k
Semi-supervised-learning
Semi-supervised-learning microsoft Python

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

1.6k
DeepMoji
DeepMoji bfelbo Python

State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.

1.6k
DAMO-ConvAI
DAMO-ConvAI AlibabaResearch Python

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

1.5k
chatarena
chatarena Farama-Foundation Python

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of...

1.5k
awesome-search
awesome-search frutik Shell

Awesome Search - this is all about the (e-commerce, but not only) search and its awesomeness

1.5k