Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1462)

cntext
cntext hiDaDeng Python

cntext 是一个专为社会科学实证研究设计的中文文本分析 Python 库。它不仅提供传统的词频统计和情感分析,还支持词嵌入训练、语义投影计算等高级功能,帮助研究...

450
rag
rag neuml Python

🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.

448
rat-sql
rat-sql microsoft Python

A relation-aware semantic parsing model from English to SQL

447
4675-scifi
4675-scifi guhhhhaa

chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科...

447
free-ai-resources-x
free-ai-resources-x CelaDaniel

🌟 A curated collection of free, high quality AI tools 🤖, APIs 🔗, datasets 📊, and learning resources 📚 covering machine learning 🧠, deep learning...

446
OpenAI-sublime-text
OpenAI-sublime-text yaroslavyaroslav Python

First class Sublime Text AI assistant with gpt-5, Opus 4.6, Gemini 3 and ollama support!

444
allainews_sources
allainews_sources foorilla

A list of online news & info sources in the AI/ML/Data Science space

443
Awesome-Distributed-Deep-Learning
Awesome-Distributed-Deep-Learning bharathgs

A curated list of awesome Distributed Deep Learning resources.

442
deep_learning_NLP
deep_learning_NLP Tixierae Jupyter Notebook

Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP

440
low-resource-languages
low-resource-languages RichardLitt TeX

Resources for conservation, development, and documentation of low resource (human) languages.

438
nlquery
nlquery ayoungprogrammer Python

Natural Language Engine on WikiData

436
FastTextRank
FastTextRank ArtistScript Python

中文文本摘要/关键词提取

436
textaugment
textaugment dsfsi Python

TextAugment: Text Augmentation Library

436
AGGCN
AGGCN Cartus Python

Attention Guided Graph Convolutional Networks for Relation Extraction (authors' PyTorch implementation for the ACL19 paper)

435
epidemic-sentence-pair
epidemic-sentence-pair zzy99 Python

天池 疫情相似句对判定大赛 线上第一名方案

435
dialogflow-web-v2
dialogflow-web-v2 mishushakov Vue

Dialogflow Web Integration. Supports rich components

432
interpret-text
interpret-text interpretml Python

A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard.

432
tutel
tutel microsoft Python

Tutel MoE: An Optimized Mixture-of-Experts Implementation

432
dialog
dialog talkdai Python

RAG LLM Ops App for easy deployment and testing

430
intelligo
intelligo intelligo-mn TypeScript

Intelligo is powerful chatbot builder that enables anyone to create and deploy chatbots anywhere.

429
NLP-Natural-Language-Processing
NLP-Natural-Language-Processing ElizaLo Jupyter Notebook

Projects and useful articles / links

429
awesome-bioie
awesome-bioie caufieldjh

🧫 A curated list of resources relevant to doing Biomedical Information Extraction (including BioNLP)

428
nlp-papers-with-arxiv
nlp-papers-with-arxiv roomylee Jupyter Notebook

Statistics and accepted paper list of NLP conferences with arXiv link

426
Data-Science-Hacks
Data-Science-Hacks kunalj101 Jupyter Notebook

Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data sc...

425
PyKoSpacing
PyKoSpacing haven-jeon Python

Automatic Korean word spacing with Python

425
glyce
glyce ShannonAI Python

Code for NeurIPS 2019 - Glyce: Glyph-vectors for Chinese Character Representations

424
AI-Competition-Collections
AI-Competition-Collections SWHL HTML

AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验帖)

423
RL-Chatbot
RL-Chatbot pochih Python

🤖 Deep Reinforcement Learning Chatbot

422
aravec
aravec bakrianoo Jupyter Notebook

AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community w...

421
tflite-android-transformers
tflite-android-transformers huggingface Java

DistilBERT / GPT-2 for on-device inference thanks to TensorFlow Lite with Android demo apps

421
CoLLiE
CoLLiE OpenMOSS Python

Collaborative Training of Large Language Models in an Efficient Way

420
contextualSpellCheck
contextualSpellCheck R1j1t Python

✔️Contextual word checker for better suggestions (not actively maintained)

419
neuralmonkey
neuralmonkey ufal Python

An open-source tool for sequence learning in NLP built on TensorFlow.

418
ExplainToMe
ExplainToMe jjangsangy Python

Automatic Web Article Summarizer

417
NLP-Papers
NLP-Papers llhthinker

Natural Language Processing Papers

417
pytorch-nlp-notebooks
pytorch-nlp-notebooks scoutbee Jupyter Notebook

Learn how to use PyTorch to solve some common NLP problems with deep learning.

417
discopy
discopy discopy Python

The Python toolkit for computing with string diagrams.

417
bert4pytorch
bert4pytorch MuQiuJun-AI Python

超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新

417
nagisa
nagisa taishi-i Python

A Japanese tokenizer based on recurrent neural networks

417
Selective_Context
Selective_Context liyucheng09 Python

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

416
ArticutAPI
ArticutAPI Droidtown Python

API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模型,只用現代白話中文語法規則,...

415
delft
delft kermitt2 Python

a Deep Learning Framework for Text https://delft.readthedocs.io/

415
paragraph-vectors
paragraph-vectors inejc Python

:page_facing_up: A PyTorch implementation of Paragraph Vectors (doc2vec).

414
link-grammar
link-grammar opencog C

The CMU Link Grammar natural language parser

414
chat2graph
chat2graph TuGraph-family Python

Chat2Graph: Graph Native Agentic System.

414
FakeNewsCorpus
FakeNewsCorpus several27

A dataset of millions of news articles scraped from a curated list of data sources.

413
parsbert
parsbert hooshvare Jupyter Notebook

🤗 ParsBERT: Transformer-based Model for Persian Language Understanding

413
Abstractive-Summarization-With-Transfer-Learning
Abstractive-Summarization-With-Transfer-Learning santhoshkolloju Python

Abstractive summarisation using Bert as encoder and Transformer Decoder

412
jumanpp
jumanpp ku-nlp C++

Juman++ (a Morphological Analyzer Toolkit)

412
annotateai
annotateai neuml Python

📝 Automatically annotate papers using LLMs

412