Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in language modeling, parsing, and other natural-language tasks.
A curated list of Knowledge Graph related learning materials, databases, tools and other resources
Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow and Scikit Learn.
OpenAI API client for Kotlin with multiplatform and coroutines capabilities.
ContextGem: Effortless LLM extraction from documents
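Datasets and SOTA results for every field of Chinese NLP
A summary of research on and applications of intelligent question answering by the natural language processing research team at the Beihang University Big Data Advanced Innovation Center, including knowledge-graph-based question answering (KBQA), text-based question answering systems (TextQA), ...
Chinese long-text classification, short-sentence classification, multi-label classification, and two-sentence similarity (Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or sh...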
NLTK Data
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus, and leaderboard
Awesome-pytorch-list: translation in progress...
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
Efficient Retrieval Augmentation and Generation Framework
Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, T...
1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.
🦄 State-of-the-Art Conversational AI with Transfer Learning
Research on and applications of natural language processing, knowledge graphs, dialogue systems, large models, and related technologies.
Novel deep learning research work with PaddlePaddle
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
📄 🤖 AI for medical and scientific papers
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
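Natural language processing experiments (sougou dataset): TF-IDF, text classification, clustering, word vectors, sentiment recognition, relation extraction, and more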
Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains
Underthesea - Agentic AI Toolkit
Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG eco...
CakeChat: Emotional Generative Dialog System
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
GPT2 for Multiple Languages, including pretrained models. Multilingual GPT2 support, with a 1.5-billion-parameter Chinese pretrained model
A PyTorch-based knowledge distillation toolkit for natural language processing
A BERT model for scientific text.
Graph4nlp is a library for easy use of Graph Neural Networks for NLP. Visit our DLG4NLP website (https://dlg4nlp.github.io/index.html...
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
jiant is an NLP toolkit
🦆 Contextually-keyed word vectors
Bringing BERT into modernity via both architecture changes and scaling
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics...
A fast, efficient universal vector embedding utility package.
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Data augmentation for NLP, presented at EMNLP 2019
Pre-trained Chinese XLNet model
🧠 A study guide to learn about Transformers
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
🇺🇸 A Python library for parsing unstructured United States address strings into address components
A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at T...
DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/
Apache OpenNLP
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
similarity: Text similarity calculation toolkit written in Java. Works out of the box for tasks such as text similarity calculation and sentiment analysis.
Must-read Papers on Textual Adversarial Attack and Defense