Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
cntext 是一个专为社会科学实证研究设计的中文文本分析 Python 库。它不仅提供传统的词频统计和情感分析,还支持词嵌入训练、语义投影计算等高级功能,帮助研究...
🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.
A relation-aware semantic parsing model from English to SQL
chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科...
🌟 A curated collection of free, high quality AI tools 🤖, APIs 🔗, datasets 📊, and learning resources 📚 covering machine learning 🧠, deep learning...
First class Sublime Text AI assistant with gpt-5, Opus 4.6, Gemini 3 and ollama support!
A list of online news & info sources in the AI/ML/Data Science space
A curated list of awesome Distributed Deep Learning resources.
Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP
Resources for conservation, development, and documentation of low resource (human) languages.
Natural Language Engine on WikiData
中文文本摘要/关键词提取
TextAugment: Text Augmentation Library
Attention Guided Graph Convolutional Networks for Relation Extraction (authors' PyTorch implementation for the ACL19 paper)
天池 疫情相似句对判定大赛 线上第一名方案
Dialogflow Web Integration. Supports rich components
A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard.
Tutel MoE: An Optimized Mixture-of-Experts Implementation
RAG LLM Ops App for easy deployment and testing
Intelligo is powerful chatbot builder that enables anyone to create and deploy chatbots anywhere.
Projects and useful articles / links
🧫 A curated list of resources relevant to doing Biomedical Information Extraction (including BioNLP)
Statistics and accepted paper list of NLP conferences with arXiv link
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data sc...
Automatic Korean word spacing with Python
Code for NeurIPS 2019 - Glyce: Glyph-vectors for Chinese Character Representations
AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验帖)
🤖 Deep Reinforcement Learning Chatbot
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community w...
DistilBERT / GPT-2 for on-device inference thanks to TensorFlow Lite with Android demo apps
Collaborative Training of Large Language Models in an Efficient Way
✔️Contextual word checker for better suggestions (not actively maintained)
An open-source tool for sequence learning in NLP built on TensorFlow.
Automatic Web Article Summarizer
Natural Language Processing Papers
Learn how to use PyTorch to solve some common NLP problems with deep learning.
The Python toolkit for computing with string diagrams.
超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新
A Japanese tokenizer based on recurrent neural networks
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模型,只用現代白話中文語法規則,...
a Deep Learning Framework for Text https://delft.readthedocs.io/
:page_facing_up: A PyTorch implementation of Paragraph Vectors (doc2vec).
The CMU Link Grammar natural language parser
Chat2Graph: Graph Native Agentic System.
A dataset of millions of news articles scraped from a curated list of data sources.
🤗 ParsBERT: Transformer-based Model for Persian Language Understanding
Abstractive summarisation using Bert as encoder and Transformer Decoder
Juman++ (a Morphological Analyzer Toolkit)
📝 Automatically annotate papers using LLMs