Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
A comprehensive reference for all topics related to Natural Language Processing
A collection of 1000+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML).
Baidu's open-source Sentiment Analysis System.
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
Deep Learning model to analyze a large corpus of clear text passwords.
SLING - A natural language frame semantics parser
📅🇨🇳中国法定节假日数据 自动每日抓取国务院公告
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, wor...
BERT score for text generation
Transformers 库快速入门教程
💫 Models for the spaCy Natural Language Processing (NLP) library
😎 An up-to-date & curated list of awesome semi-supervised learning papers, methods & resources.
Self-contained Machine Learning and Natural Language Processing library in Go
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
A curated list of awesome embedding models tutorials, projects and communities.
NLTK Data
A large annotated semantic parsing corpus for developing natural language interfaces.
A collection of research on knowledge graphs
1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.
Shared repository for open-sourced projects from the Google AI Language team.
Toolbox of models, callbacks, and datasets for AI/ML researchers.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG eco...
Underthesea - Agentic AI Toolkit
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository...
LLM(😽)
Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html...
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
🦆 Contextually-keyed word vectors
Synthetic data curation for post-training and structured data extraction
A fast, efficient universal vector embedding utility package.
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics...
Pre-Trained Chinese XLNet(中文XLNet预训练模型)
🧠 A study guide to learn about Transformers
Paper List for Style Transfer in Text
General Assembly's 2015 Data Science course in Washington, DC
:us: a python library for parsing unstructured United States address strings into address components
A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at T...
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
Python Keyphrase Extraction module
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
Must-read Papers on Textual Adversarial Attack and Defense
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domain...
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of...
Awesome Search - this is all about the (e-commerce, but not only) search and its awesomeness