Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1462)

VnCoreNLP
VnCoreNLP vncorenlp Java

A Vietnamese natural language processing toolkit (NAACL 2018)

665
datefinder
datefinder akoumjian Python

Find dates inside text using Python and get back datetime objects

664
Jiayan
Jiayan jiaeyan Python

甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for C...

663
Macropodus
Macropodus yongzhuo Python

自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中文分词,词性标注,命名实体识别,新词发现,关键词,文本摘要,文本相似度,科学计算...

662
nlprule
nlprule bminixhofer Rust

A fast, low-resource Natural Language Processing and Text Correction library written in Rust.

661
obsidian-ava
obsidian-ava different-ai TypeScript

Quickly format your notes with ChatGPT in Obsidian

661
JamSpell
JamSpell bakwc C++

Modern spell checking library - accurate, fast, multi-language

660
Awesome-Korean-NLP
Awesome-Korean-NLP datanada

A curated list of resources for NLP (Natural Language Processing) for Korean

660
Cornucopia-LLaMA-Fin-Chinese
Cornucopia-LLaMA-Fin-Chinese jerry1993-tech Python

聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)

658
tensor_parallel
tensor_parallel BlackSamorez Python

Automatically split your PyTorch models on multiple GPUs for training & inference

655
Chinese-Mixtral-8x7B
Chinese-Mixtral-8x7B HIT-SCIR Python

中文Mixtral-8x7B(Chinese-Mixtral-8x7B)

653
FinBERT
FinBERT yya518 Jupyter Notebook

A Pretrained BERT Model for Financial Communications. https://arxiv.org/abs/2006.08097

651
pysentimiento
pysentimiento pysentimiento Jupyter Notebook

A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks

650
griptape
griptape griptape-ai Python

Python framework for AI workflows and pipelines with chain of thought reasoning, external tools, and memory.

649
seqGAN
seqGAN suragnair Python

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

647
chinese_text_cnn
chinese_text_cnn practicingman Python

TextCNN Pytorch实现 中文文本分类 情感分析

646
medspacy
medspacy medspacy Jupyter Notebook

Library for clinical NLP with spaCy.

646
LLM-Shearing
LLM-Shearing princeton-nlp Python

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

643
indonlu
indonlu IndoNLP Jupyter Notebook

The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models,...

641
autoprompt
autoprompt ucinlp Python

AutoPrompt: Automatic Prompt Construction for Masked Language Models.

640
hyperbase
hyperbase hyperquest-hq Python

A foundational library for Semantic Hypergraphs

639
small-text
small-text webis-de Python

Active Learning for Text Classification in Python

638
awesome-foundation-and-multimodal-models
awesome-foundation-and-multimodal-models SkalskiP Python

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

637
OpenHowNet
OpenHowNet thunlp Python

Core Data of HowNet and OpenHowNet Python API

637
homer
homer wyounas Python

Homer, a text analyser in Python, can help make your text more clear, simple and useful for your readers.

635
nlpia
nlpia totalgood HTML

Examples and libraries for "Natural Language Processing in Action" book

635
bigbird
bigbird google-research Python

Transformers for Longer Sequences

633
BotLibre
BotLibre BotLibre Java

An open platform for artificial intelligence, chat bots, virtual agents, social media automation, and live chat automation.

633
KoELECTRA
KoELECTRA monologg Python

Pretrained ELECTRA Model for Korean

632
word_forms
word_forms gutfeeling Python

Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.

632
DeepNLP-Course
DeepNLP-Course DanAnastasyev Jupyter Notebook

Deep NLP Course

632
Awesome-LLM-Eval
Awesome-LLM-Eval onejune2018

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具...

631
ConvoKit
ConvoKit CornellNLP Jupyter Notebook

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational...

628
TrustLLM
TrustLLM HowieHwong Python

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

622
SmoothNLP
SmoothNLP smoothnlp Java

专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference

622
clipper.js
clipper.js philschmid TypeScript

HTML to Markdown converter and crawler.

618
cdQA
cdQA cdqa-suite Python

⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.

617
botonic
botonic hubtype TypeScript

Build chatbots and conversational experiences using React

614
lite-transformer
lite-transformer mit-han-lab Python

[ICLR 2020] Lite Transformer with Long-Short Range Attention

611
Awesome-Story-Generation
Awesome-Story-Generation yingpengma Python

This repository collects an extensive list of awesome papers about Story Generation / Storytelling, exclusively focusing on the era of Large Language...

610
Chinese-Mixtral
Chinese-Mixtral ymcui Python

中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)

610
awesome-data-annotation
awesome-data-annotation taivop

A list of tools for annotating data, managing annotations, etc.

609
AJAX-Movie-Recommendation-System-with-Sentiment-Analysis
AJAX-Movie-Recommendation-System-with-Sentiment-Analysis kishan0725 Jupyter Notebook

A content-based recommender system that recommends movies similar to the movie the user likes and analyses the sentiments of the reviews given by the...

609
attention-networks-for-classification
attention-networks-for-classification EdGENetworks Jupyter Notebook

Hierarchical Attention Networks for Document Classification in PyTorch

609
semchunk
semchunk isaacus-dev Python

A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.

608
DensePhrases
DensePhrases princeton-nlp Python

[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.org/abs/2012.1...

607
Event-Extraction
Event-Extraction zhang17173 Python

基于法律裁判文书的事件抽取及其应用,包括数据的分词、词性标注、命名实体识别、事件要素抽取和判决结果预测等内容

606
tock
tock theopenconversationkit Kotlin

Tock, the open source conversational AI toolkit.

605
BERT-Relation-Extraction
BERT-Relation-Extraction plkmo Python

PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper

603
NLP_Quickbook
NLP_Quickbook NirantK Jupyter Notebook

NLP in Python with Deep Learning

602