Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1402)

InsTag
InsTag OFA-Sys

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

267
hunspell-dict-ko
hunspell-dict-ko spellcheck-ko Python

Korean spellchecking dictionary for Hunspell

266
squirrel-core
squirrel-core merantix-momentum Python

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:

266
AI-Job-Info
AI-Job-Info Sophia-11

互联网大厂面试经验

265
KeyphraseVectorizers
KeyphraseVectorizers TimSchopf Python

Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase...

265
rnn_lstm_from_scratch
rnn_lstm_from_scratch nicklashansen Jupyter Notebook

How to build RNNs and LSTMs from scratch with NumPy.

264
nlp-labelling
nlp-labelling dataqa JavaScript

Labelling platform for text using weak supervision.

264
hmni
hmni Christopher-Thornton Python

📛 Fuzzy Name Matching with Machine Learning

264
nlvr
nlvr lil-lab HTML

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with...

263
MAMS-for-ABSA
MAMS-for-ABSA siat-nlp Python

A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.

262
ua-gec
ua-gec grammarly Macaulay2

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

262
picollm
picollm Picovoice Python

On-device LLM Inference Powered by X-Bit Quantization

262
markup
markup samueldobbie TypeScript

A web-based document annotation tool, powered by GPT-4 :rocket:

262
character-based-cnn
character-based-cnn ahmedbesbes Python

Implementation of character based convolutional neural network

261
scientific-paper-summarisation
scientific-paper-summarisation EdCo95 Python

Machine learning models to automatically summarise scientific papers

261
blueprints-text
blueprints-text blueprints-for-text-analytics-python Jupyter Notebook

Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis Using Python"

261
EMPaper
EMPaper Sahandfer

This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.

261
SpeechTransProgress
SpeechTransProgress kahne

Tracking the progress in end-to-end speech translation

260
jack
jack uclnlp Python

Jack the Reader

259
chatbot
chatbot Koziev Python

Русскоязычный генеративный чатбот с профилем и фактами

259
spaczz
spaczz gandersen101 Python

Fuzzy matching and more functionality for spaCy.

256
ChatGPT-Bot
ChatGPT-Bot Dravine1vDf7

ChatGPT Bot - AI-powered conversation tool

256
kairon
kairon digiteinfotech Python

Agentic AI platform that harnesses Visual LLM Chaining to build proactive digital assistants

255
I-BERT
I-BERT kssteven418 Python

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

255
practical-1
practical-1 oxford-cs-deepnlp-2017 Jupyter Notebook

Oxford Deep NLP 2017 course - Practical 1: word2vec

254
awesome-hungarian-nlp
awesome-hungarian-nlp oroszgy

A curated list of NLP resources for Hungarian

254
google-bard-api
google-bard-api ra83205 Python

This project provides a FastAPI wrapper for interacting with Google Bard, a conversational AI by Google. It allows users to send messages to Google Ba...

254
cs224n-win2223
cs224n-win2223 floriankark Python

Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2022/2023

253
konoha
konoha himkt Python

🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.

253
neat-vision
neat-vision cbaziotis Vue

Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tas...

251
OpenUnivCourses
OpenUnivCourses kabartay

FREE ML Courses from Top Universities in CS

250
Awesome-Swiss-German
Awesome-Swiss-German esthicodes Jupyter Notebook

Multi-language Analyze text in 26 Cantonal Swiss German, Italian, German, Chinese (simplified), French, Italian. pply natural language understanding (...

249
nuspell
nuspell nuspell C++

🖋️ Fast and safe spellchecking C++ library

249
relevanceai
relevanceai RelevanceAI Python

Home of the AI workforce - Multi-agent system, AI agents & tools

249
docprompting
docprompting shuyanzhou Python

Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023

248
forte
forte asyml Python

Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/

248
RESIDE
RESIDE malllabiisc CSS

EMNLP 2018: RESIDE: Improving Distantly-Supervised Neural Relation Extraction using Side Information

247
HugNLP
HugNLP wjn1996 Python

HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊 HugNLP will released to @HugAILab

247
Speech_Signal_Processing_and_Classification
Speech_Signal_Processing_and_Classification gionanide Python

Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite...

247
dilated-cnn-ner
dilated-cnn-ner iesl Python

Dilated CNNs for NER in TensorFlow

244
concise-concepts
concise-concepts davidberenstein1957 Python

This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

244
Awesome_Mamba
Awesome_Mamba xmindflow

Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

243
prosody
prosody Helsinki-NLP Python

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

243
spacy-lookup
spacy-lookup mpuig Python

Named Entity Recognition based on dictionaries

242
nlp_profiler
nlp_profiler neomatrix369 Python

A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profile...

242
backprop
backprop backprop-ai Python

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

242
text2text
text2text artitw Python

Text2Text: Crosslingual NLP/G toolkit

242
chazutsu
chazutsu chakki-works Python

The tool to make NLP datasets ready to use

241
AIND-NLP
AIND-NLP udacity Jupyter Notebook

Coding exercises for the Natural Language Processing concentration, part of Udacity's AIND program.

241
awesome-ml-blogs
awesome-ml-blogs antoinebrl

Curated list of technical blogs on machine learning · AI/ML/DL/CV/NLP/MLOps

241