Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1462)

ESIM
ESIM coetaur0 Python

Implementation of the ESIM model for natural language inference with PyTorch

375
kiwipiepy
kiwipiepy bab2min Python

Python API for Kiwi

374
multi-task-NLP
multi-task-NLP hellohaptik Python

multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.

373
NLP-Vietnamese-progress
NLP-Vietnamese-progress undertheseanlp

Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most commo...

373
gcn-over-pruned-trees
gcn-over-pruned-trees qipeng Python

Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)

372
nlp_fundamentals
nlp_fundamentals dair-ai Jupyter Notebook

๐Ÿ“˜ Contains a series of hands-on notebooks for learning the fundamentals of NLP

372
dodrio
dodrio poloclub Svelte

Exploring attention weights in transformer-based models with linguistic knowledge.

372
coursera-natural-language-processing-specialization
coursera-natural-language-processing-specialization amanchadha Jupyter Notebook

Programming assignments from all courses in the Coursera Natural Language Processing Specialization offered by deeplearning.ai.

371
lm-question-generation
lm-question-generation asahi417 Python

Multilingual/multidomain question generation datasets, models, and python library for question generation.

371
word2word
word2word kakaobrain Python

Easy-to-use word-to-word translations for 3,564 language pairs.

370
PERT
PERT ymcui

PERT: Pre-training BERT with Permuted Language Model

370
Awesome_Multimodel_LLM
Awesome_Multimodel_LLM Atomic-man007

Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). I...

369
demo-chinese-text-binary-classification-with-bert
demo-chinese-text-binary-classification-with-bert wshuyi Jupyter Notebook
368
nlp_highlights
nlp_highlights omarsar

The most important NLP highlights of 2018 (PDF Report)

366
jiebaR
jiebaR qinwf C++

Chinese text segmentation with R. R่ฏญ่จ€ไธญๆ–‡ๅˆ†่ฏ ๏ผˆๆ–‡ๆกฃๅทฒๆ›ดๆ–ฐ ๐ŸŽ‰ ๏ผšhttps://qinwenfeng.com/jiebaR/ )

366
stock_prediction
stock_prediction KittenCN Python

ๅŸบไบŽ็ฅž็ป็ฝ‘็ปœ็š„้€š็”จ่‚ก็ฅจ้ข„ๆต‹ๆจกๅž‹ A general stock prediction model based on neural networks

366
a-PyTorch-Tutorial-to-Sequence-Labeling
a-PyTorch-Tutorial-to-Sequence-Labeling sgrvinod Python

Empower Sequence Labeling with Task-Aware Neural Language Model | a PyTorch Tutorial to Sequence Labeling

365
spanish-word-embeddings
spanish-word-embeddings dccuchile

Spanish word embeddings computed with different methods and from different corpora

365
wikipron
wikipron CUNY-CL Python

Massively multilingual pronunciation mining

365
studies
studies imhuay Python

Notes of Develop/NLP/DeepLearning/Algorithms/LeetCodes

365
awesome-semantic-search
awesome-semantic-search Agrover112

A curated list of awesome resources related to Semantic Search๐Ÿ”Ž and Semantic Similarity tasks.

363
TextDescriptives
TextDescriptives HLasse Python

A Python library for calculating a large variety of metrics from text

363
LiLT
LiLT jpWang Python

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 202...

363
melusine
melusine MAIF Python

๐Ÿ“ง Melusine: Use python to automatize your email processing workflow

363
awesome-nlprojects
awesome-nlprojects costezki

List of projects related to Natural Language Processing (NLP) that make a geek smile for they exist

362
lemmatization-lists
lemmatization-lists michmech

Machine-readable lists of lemma-token pairs in 23 languages.

362
OpenGPT
OpenGPT CogStack Jupyter Notebook

A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).

362
tacred-relation
tacred-relation yuhaozhang Python

PyTorch implementation of the position-aware attention model for relation extraction

361
haystack-tutorials
haystack-tutorials deepset-ai Jupyter Notebook

Here you can find all the Tutorials for Haystack ๐Ÿ““

360
COVID-QA
COVID-QA deepset-ai Jupyter Notebook

API & Webapp to answer questions about COVID-19. Using NLP (Question Answering) and trusted data sources.

359
tldrstory
tldrstory neuml Python

๐Ÿ“Š Semantic search for headlines and story text

359
cargo-spellcheck
cargo-spellcheck drahnr Rust

Checks all your documentation for spelling and grammar mistakes with hunspell and a nlprule based checker for grammar

359
open-muse
open-muse huggingface Python

Open reproduction of MUSE for fast text2image generation.

358
dynasaur
dynasaur adobe-research Python

Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"

358
pytorch_RVAE
pytorch_RVAE kefirski Python

Recurrent Variational Autoencoder that generates sequential data implemented with pytorch

357
gsdmm
gsdmm rwalk Python

GSDMM: Short text clustering

357
GPT2
GPT2 affjljoo3581 Python

PyTorch Implementation of OpenAI GPT-2

357
obsidian-companion
obsidian-companion rizerphe TypeScript

Autocomplete your obsidian notes with AI, including ChatGPT, through a copilot-like interface.

355
news-emotion
news-emotion dongyuanxin Python

๐Ÿ“‰ ้‡‘่žๆ–‡ๆœฌๆƒ…ๆ„Ÿๅˆ†ๆžๆจกๅž‹

354
KR-WordRank
KR-WordRank lovit Python

๋น„์ง€๋„ํ•™์Šต ๋ฐฉ๋ฒ•์œผ๋กœ ํ•œ๊ตญ์–ด ํ…์ŠคํŠธ์—์„œ ๋‹จ์–ด/ํ‚ค์›Œ๋“œ๋ฅผ ์ž๋™์œผ๋กœ ์ถ”์ถœํ•˜๋Š” ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์ž…๋‹ˆ๋‹ค

354
ChemDataExtractor
ChemDataExtractor mcs07 Python

Automatically extract chemical information from scientific documents

353
node-word2vec
node-word2vec Planeshifter C

Node.js interface to the Google word2vec tool.

353
PromptAgent
PromptAgent maitrix-org Python

This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel...

353
100-Days-of-NLP
100-Days-of-NLP graviraja Jupyter Notebook
352
doremi
doremi sangmichaelxie HTML

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

352
chatgpt-infinity
chatgpt-infinity adamlui JavaScript

โˆž Generate endless answers from all-knowing ChatGPT (on any topic!)

351
megabots
megabots momegas Python

๐Ÿค– State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch ๐Ÿคฏ Create a bot, now ๐Ÿซต

350
transformer-tensorflow
transformer-tensorflow DongjunLee Python

TensorFlow implementation of 'Attention Is All You Need (2017. 6)'

349
program-y
program-y keiffster Python

Python 3.x based AIML 2.0 Chatbot interpreter, framework, related programs and knowledge files

349
pyss3
pyss3 sergioburdisso Python

A Python library for Interpretable Machine Learning in Text Classification using the SS3 model, with easy-to-use visualization tools for Explainable A...

349