Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1431)

TencentPretrain
TencentPretrain Tencent Python

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

1.1k
lingua-rs
lingua-rs pemistahl Rust

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

1.1k
nlp-with-ruby
nlp-with-ruby arbox Ruby

Curated List: Practical Natural Language Processing done in Ruby

1.1k
insuranceqa-corpus-zh
insuranceqa-corpus-zh chatopera Python

:helicopter: 保险行业语料库,聊天机器人

1.1k
Paper-Reading-ConvAI
Paper-Reading-ConvAI iwangjian

📖 Paper reading list in conversational AI.

1k
conformal-prediction
conformal-prediction aangelopoulos Jupyter Notebook

Lightweight, useful implementation of conformal prediction on real data.

1k
SoulverCore
SoulverCore soulverteam Swift

A powerful Swift framework for evaluating natural language math expressions

1k
this-word-does-not-exist
this-word-does-not-exist turtlesoupy Python

This Word Does Not Exist

1k
nlp-notebooks
nlp-notebooks nlptown Jupyter Notebook

A collection of notebooks for Natural Language Processing from NLP Town

1k
gpt-2-Pytorch
gpt-2-Pytorch graykode Python

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

1k
ThoughtSource
ThoughtSource OpenBioLink Jupyter Notebook

A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https:...

1k
Summarization-Papers
Summarization-Papers xcfcode TeX

Summarization Papers

1k
clean-text
clean-text jfilter Python

🧹 Python package for text cleaning

1k
awesome-llm-role-playing-with-persona
awesome-llm-role-playing-with-persona Neph0s

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

1k
Prompt4ReasoningPapers
Prompt4ReasoningPapers zjunlp

[ACL 2023] Reasoning with Language Model Prompting: A Survey

1k
YouTokenToMe
YouTokenToMe VKCOM C++

Unsupervised text tokenizer focused on computational efficiency

977
awesome-ai-awesomeness
awesome-ai-awesomeness amusi

A curated list of awesome awesomeness about artificial intelligence

977
Llama-2-Open-Source-LLM-CPU-Inference
Llama-2-Open-Source-LLM-CPU-Inference kennethleungty Python

Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A

976
keras-hub
keras-hub keras-team Python

Pretrained model hub for Keras 3.

976
LLM-Blender
LLM-Blender yuchenlin Python

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths...

976
wikipedia2vec
wikipedia2vec wikipedia2vec Python

A tool for learning vector representations of words and entities from Wikipedia

964
gector
gector grammarly Python

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

962
Transformers-for-NLP-2nd-Edition
Transformers-for-NLP-2nd-Edition Denis2054 Jupyter Notebook

Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus secti...

959
pyresparser
pyresparser OmkarPathak Python

A simple resume parser used for extracting information from resumes

955
awesome-japanese-nlp-resources
awesome-japanese-nlp-resources taishi-i

A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

947
multiwoz
multiwoz budzianowski Python

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

944
P-tuning
P-tuning THUDM Python

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

938
ML-University
ML-University d0r1h

Machine Learning Open Source University

937
notes
notes brylevkirill

Learn about Machine Learning and Artificial Intelligence

927
Coursera
Coursera shenweichen Jupyter Notebook

Quiz & Assignment of Coursera

927
skweak
skweak NorskRegnesentral Python

skweak: A software toolkit for weak supervision applied to NLP tasks

926
factool
factool GAIR-NLP Python

FacTool: Factuality Detection in Generative AI

925
jcseg
jcseg lionsoul2014 Java

Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extractio...

922
torchMoji
torchMoji huggingface Python

😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc

921
mindnlp
mindnlp mindspore-lab Python

MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless compatibility and acceleration.

917
imbalanced-regression
imbalanced-regression YyzHarry Python

[ICML 2021, Long Talk] Delving into Deep Imbalanced Regression

914
paperlists
paperlists papercopilot Python

Processed / Cleaned Data for Paper Copilot

912
booknlp
booknlp booknlp Python

BookNLP, a natural language processing pipeline for books

910
iowncode
iowncode anupamchugh Swift

A curated collection of iOS, ML, AR resources sprinkled with some UI additions

909
TextGAN-PyTorch
TextGAN-PyTorch williamSYSU Python

TextGAN is a PyTorch framework for Generative Adversarial Networks (GANs) based text generation models.

909
self-attentive-parser
self-attentive-parser nikitakit Python

High-accuracy NLP parser with models for 11 languages.

907
GNN4NLP-Papers
GNN4NLP-Papers IndexFziQ

A list of recent papers about Graph Neural Network methods applied in NLP areas.

905
bertsearch
bertsearch Hironsan Python

Elasticsearch with BERT for advanced document search.

898
my-cs-degree
my-cs-degree logancyang

A CS degree with a focus on full-stack ML engineering, 2020

889
calvin
calvin mees Python

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

887
datacamp-python-data-science-track
datacamp-python-data-science-track AmoDinho Python

All the slides, accompanying code and exercises all stored in this repo. 🎈

887
spacy-layout
spacy-layout explosion Python

📚 Process PDFs, Word documents and more with spaCy

885
quanteda
quanteda quanteda R

An R package for the Quantitative Analysis of Textual Data

881
asreview
asreview asreview Python

Active learning for systematic reviews

875
text2vec
text2vec dselivanov R

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

871