Natural language processing (NLP) is a field of computer science that studies the interactions between computers and human (natural) languages. In 1950, Alan Turing published an article, "Computing Machinery and Intelligence", that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced state-of-the-art results in language modeling, parsing, and many other natural-language tasks.
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition
LanguageCrunch NLP server docker image
I try my best to keep up to date with cutting-edge knowledge in Machine Learning/Deep Learning and Natural Language Processing. These are my notes on some go...
How to build RNNs and LSTMs from scratch with NumPy.
Korean spellchecking dictionary for Hunspell
Home of the AI workforce - Multi-agent system, AI agents & tools
A PyTorch implementation of GraphRel
A comprehensive reading list for Emotion Recognition in Conversations
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as w...
Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2022/2023
Efficient Triton implementation of Native Sparse Attention.
A curated list of NLP resources for Hungarian
Graph-based and Transition-based dependency parsers based on BiLSTMs
Agentic AI platform that harnesses Visual LLM Chaining to build proactive digital assistants
Tutorial: Natural Language Processing in Python
Knowledge-Aware Graph Networks for Commonsense Reasoning (EMNLP-IJCNLP 19)
TensorFlow implementation of "Tracking the World State with Recurrent Entity Networks".
ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Models
Japanese Input Method System for Linux, Neural Kana-Kanji Conversion Engine + fcitx5 IME
Interview experiences at major internet companies
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis Using Python"
A curated list of dedicated resources and applications
Important paper implementations for Question Answering using PyTorch
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.
spaCy REST API, wrapped in a Docker container.
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A...
A web-based document annotation tool, powered by GPT-4 :rocket:
Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with...
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
A curated list of awesome online courses about Large Language Models (LLMs)
A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
📛 Fuzzy Name Matching with Machine Learning
Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase...
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
🖋️ Fast and safe spellchecking C++ library
Implementation of a character-based convolutional neural network
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Labelling platform for text using weak supervision.
Tracking the progress in end-to-end speech translation
Russian-language generative chatbot with a profile and facts
Transform AI-generated text into formal, human-like, academic writing with ease while avoiding AI detectors!
Jack the Reader
Machine learning models to automatically summarise scientific papers
Fuzzy matching and more functionality for spaCy.
[NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents