Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1431)

OpenUE
OpenUE zjunlp Python

[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text

329
parallax
parallax uber-research Python

Tool for interactive embeddings visualization

328
cybertron
cybertron nlpodyssey Go

Cybertron: the home planet of the Transformers in Go

326
PyContinual
PyContinual ZixuanKe Python

PyContinual (An Easy and Extendible Framework for Continual Learning)

325
NLPython
NLPython jalajthanaki Jupyter Notebook

This repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitle...

324
Binder
Binder xlang-ai Python

[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"

324
rasa-chatbot-templates
rasa-chatbot-templates cedextech Python

RASA chatbot use case boilerplate

322
voltaserve
voltaserve kouprlabs Go

⚡️ Reality OS for Creators

322
dostoevsky
dostoevsky bureaucratic-labs Python

Sentiment analysis library for russian language

322
MINERVA
MINERVA shehzaadzd Python

Meandering In Networks of Entities to Reach Verisimilar Answers

321
conllu
conllu EmilStenstrom Python

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.

320
insight
insight abhimishra91 Python

Repository for Project Insight: NLP as a Service

320
stringi
stringi gagolews C++

Fast and Portable Character String Processing in R (with the Unicode ICU)

318
byteNet-tensorflow
byteNet-tensorflow paarthneekhara Python

ByteNet for character-level language modelling

318
naturalcc
naturalcc CGCL-codes Python

NaturalCC: An Open-Source Toolkit for Code Intelligence

318
book-nlp
book-nlp dbamman Java

Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/boo...

316
picollm
picollm Picovoice Python

On-device LLM Inference Powered by X-Bit Quantization

312
pytorch-transformers-classification
pytorch-transformers-classification ThilinaRajapakse Jupyter Notebook

Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks...

312
mishkal
mishkal linuxscout Python

Mishkal is an arabic text vocalization software

312
stopwords
stopwords igorbrigadir Python

Default English stopword lists from many different sources

311
MentalLLaMA
MentalLLaMA SteveKGYang Python

This repository introduces MentaLLaMA, the first open-source instruction following large language model for interpretable mental health analysis.

310
awesome-ml-blogs
awesome-ml-blogs antoinebrl

Curated list of technical blogs on machine learning · AI/ML/DL/CV/NLP/MLOps

309
Contrastive_Learning_Papers
Contrastive_Learning_Papers ContrastiveSR

A list of contrastive Learning papers

309
text2text
text2text artitw Python

Text2Text Language Modeling Toolkit

304
goodreads
goodreads MengtingWan Jupyter Notebook

code samples for the goodreads datasets

304
transfomers-silicon-research
transfomers-silicon-research aliemo Jupyter Notebook

Research and Materials on Hardware implementation of Transformer Model

304
NonAutoregGenProgress
NonAutoregGenProgress kahne

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

302
bert-sklearn
bert-sklearn charles9n Jupyter Notebook

a sklearn wrapper for Google's BERT model

301
ineuron-full-stack-data-science-assignments
ineuron-full-stack-data-science-assignments amanovishnu Jupyter Notebook

this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhanc...

300
revery
revery thesephist JavaScript

A personal semantic search engine capable of surfacing relevant bookmarks, journal entries, notes, blogs, contacts, and more, built on an efficient do...

300
ai-system-design-guide
ai-system-design-guide ombharatiya

AI system design guide for engineers building production AI systems and evals.

300
ScanRefer
ScanRefer daveredrum Python

[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

299
question_generator
question_generator AMontgomerie Python

An NLP system for generating reading comprehension questions

298
deep-learning-nlp-rl-papers
deep-learning-nlp-rl-papers madrugado Python

Recent Deep Learning papers in NLU and RL

297
Kevinpro-NLP-demo
Kevinpro-NLP-demo Ricardokevins Python

All NLP you Need Here. 目前包含15个NLP demo的pytorch实现(大量代码借鉴于其他开源项目,原先是自己玩的,后来干脆也开源出来)

295
TeachingDataScience
TeachingDataScience yogeshhk Jupyter Notebook

Course notes for Data Science related topics, prepared in LaTeX

295
ToD-BERT
ToD-BERT jasonwu0731 Python

Pre-Trained Models for ToD-BERT

295
lda
lda primaryobjects JavaScript

LDA topic modeling for node.js

294
AI_ChatBot_Python
AI_ChatBot_Python FreeBirdsCrew Jupyter Notebook

AI ChatBot using Python Tensorflow and Natural Language Processing (NLP) along side TFLearn

294
Mol-Instructions
Mol-Instructions zjunlp Python

[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models

293
retvec
retvec google-research Jupyter Notebook

RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.

293
ContinualLM
ContinualLM UIC-Liu-Lab Python

An Extensible Continual Learning Framework Focused on Language Models (LMs)

292
BOND
BOND cliang1453 Python

BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision

291
WordGCN
WordGCN malllabiisc Python

ACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks

291
id-nlp-resource
id-nlp-resource kmkurn

A list of Indonesian NLP resources.

290
knowledge-gpt
knowledge-gpt geeks-of-data Python

Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.

290
diffusion-nlp-paper-arxiv
diffusion-nlp-paper-arxiv bansky-cl Python

Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".

289
SWEM
SWEM dinghanshen Python

The Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms"

288
shifterator
shifterator ryanjgallagher Python

Interpretable data visualizations for understanding how texts differ at the word level

287
Web-Database-Analytics
Web-Database-Analytics tirthajyoti Jupyter Notebook

Web scrapping and related analytics using Python tools

287