Natural language processing (NLP) is a field of computer science that studies how computers and human language interact. In 1950, Alan Turing published an article, "Computing Machinery and Intelligence," that proposed a measure of machine intelligence now called the Turing test. More recent techniques, such as deep learning, have produced state-of-the-art results in language modeling, parsing, and many other natural-language tasks.
🦄 State-of-the-Art Conversational AI with Transfer Learning
OpenAI API client for Kotlin with multiplatform and coroutines capabilities.
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Novel deep learning research work with PaddlePaddle
Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains
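Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus, and leaderboard
Datasets and SOTA results for every field of Chinese NLP
A collection of NLP competition strategy implementations, baselines for various tasks, competition experience posts (current, past, and practice competitions), NLP conference dates, popular media accounts, GPU recommendations, and more; continuously updated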
NLTK Data
Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
GPT2 for Multiple Languages, including pretrained models. Multilingual GPT2 support, with a 1.5-billion-parameter Chinese pretrained model
CakeChat: Emotional Generative Dialog System
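Research and applications of three core technologies: natural language processing, knowledge graphs, and dialogue systems.
Chinese NLP solutions (large models, data, models, training, inference)
Graph4nlp is a library for the easy use of Graph Neural Networks for NLP. Visit our DLG4NLP website (https://dlg4nlp.github.io/index.html...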
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
🦆 Contextually-keyed word vectors
A fast, efficient universal vector embedding utility package.
Pre-Trained Chinese XLNet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
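Natural language processing experiments (sougou dataset): TF-IDF, text classification, clustering, word vectors, sentiment recognition, relation extraction, and more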
🧠 A study guide to learn about Transformers
DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/
🇺🇸 A Python library for parsing unstructured United States address strings into address components
Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, T...
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics...
Efficient Retrieval Augmentation and Generation Framework
Underthesea - Vietnamese NLP Toolkit
Must-read Papers on Textual Adversarial Attack and Defense
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domain...
TigerBot: A multi-language multi-task LLM
Chinese long-text classification, short-sentence classification, multi-label classification, and two-sentence similarity (Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or sh...
A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at C...
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and use...
jiant is an NLP toolkit
Study e-books (Computer Vision, Deep Learning, Machine Learning, Math, NLP, Python, Reinforcement Learning)
What's in your data? Extract schema, statistics and entities from datasets
This project is a basic package that wraps the tools commonly used in most NLP projects
NLP and Text Generation Experiments in TensorFlow 2.x / 1.x
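similarity: Text similarity calculation toolkit for Java. Written in Java, it can be used for tasks such as text similarity calculation and sentiment analysis, and works out of the box.
One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda
Natural language processing (NLP): Xiaojiang robot (a retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNET sentence vectors and similarity (text xlnet embedding), text classification (...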
Efficient few-shot learning with Sentence Transformers
The Open Source Chatbot Framework in .NET
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Data augmentation for NLP, presented at EMNLP 2019
✍️ A carefully curated list of NLP paper summaries
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
A curated list of resources on the Document Understanding (DU) topic
A full spaCy pipeline and models for scientific/biomedical documents.