Natural language processing (NLP) is a field of computer science that studies how computers process, understand, and generate human language. In 1950, Alan Turing published "Computing Machinery and Intelligence," an article that proposed a measure of machine intelligence now called the Turing test. More modern techniques, such as deep learning, have produced state-of-the-art results in language modeling, parsing, and many other natural-language tasks.
A Vietnamese natural language processing toolkit (NAACL 2018)
Find dates inside text using Python and get back datetime objects
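Jiayan (甲言), an NLP toolkit focused on processing Classical Chinese (ancient Chinese / literary Chinese), supporting lexicon construction, word segmentation, part-of-speech tagging, sentence segmentation, and punctuation. Jiayan, the 1st NLP toolkit designed for C...
Macropodus, a natural language processing toolkit based on an Albert+BiLSTM+CRF deep learning architecture: Chinese word segmentation, part-of-speech tagging, named entity recognition, new word discovery, keyword extraction, text summarization, text similarity, scientific computing...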
A fast, low-resource Natural Language Processing and Text Correction library written in Rust.
Quickly format your notes with ChatGPT in Obsidian
Modern spell checking library - accurate, fast, multi-language
A curated list of resources for NLP (Natural Language Processing) for Korean
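Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial large language models, together with an efficient, lightweight training framework for vertical-domain LLMs (Pretraining, SFT, RLHF, Quantize, etc.)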
Automatically split your PyTorch models on multiple GPUs for training & inference
Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)
A Pretrained BERT Model for Financial Communications. https://arxiv.org/abs/2006.08097
A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks
Python framework for AI workflows and pipelines with chain of thought reasoning, external tools, and memory.
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
TextCNN PyTorch implementation for Chinese text classification and sentiment analysis
Library for clinical NLP with spaCy.
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models,...
AutoPrompt: Automatic Prompt Construction for Masked Language Models.
A foundational library for Semantic Hypergraphs
Active Learning for Text Classification in Python
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Core Data of HowNet and OpenHowNet Python API
Homer, a text analyser in Python, can help make your text more clear, simple and useful for your readers.
Examples and libraries for "Natural Language Processing in Action" book
Transformers for Longer Sequences
An open platform for artificial intelligence, chat bots, virtual agents, social media automation, and live chat automation.
Pretrained ELECTRA Model for Korean
Accurately generate all possible forms of an English word, e.g. "election" --> "elect", "electoral", "electorate", etc.
Deep NLP Course
Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs...
ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational...
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
Focused on explainable NLP techniques: An NLP Toolset With A Focus on Explainable Inference
HTML to Markdown converter and crawler.
⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
Build chatbots and conversational experiences using React
[ICLR 2020] Lite Transformer with Long-Short Range Attention
This repository collects an extensive list of awesome papers about Story Generation / Storytelling, exclusively focusing on the era of Large Language...
Chinese Mixtral mixture-of-experts large language models (Chinese Mixtral MoE LLMs)
A list of tools for annotating data, managing annotations, etc.
A content-based recommender system that recommends movies similar to the movie the user likes and analyses the sentiments of the reviews given by the...
Hierarchical Attention Networks for Document Classification in PyTorch
A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.org/abs/2012.1...
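Event extraction from legal judgment documents and its applications, including word segmentation, part-of-speech tagging, named entity recognition, event element extraction, and judgment outcome prediction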
Tock, the open source conversational AI toolkit.
PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper
NLP in Python with Deep Learning