Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
Korean spellchecking dictionary for Hunspell
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:
互联网大厂面试经验
Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase...
How to build RNNs and LSTMs from scratch with NumPy.
Labelling platform for text using weak supervision.
📛 Fuzzy Name Matching with Machine Learning
Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with...
A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
On-device LLM Inference Powered by X-Bit Quantization
A web-based document annotation tool, powered by GPT-4 :rocket:
Implementation of character based convolutional neural network
Machine learning models to automatically summarise scientific papers
Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis Using Python"
This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.
Tracking the progress in end-to-end speech translation
Jack the Reader
Русскоязычный генеративный чатбот с профилем и фактами
Fuzzy matching and more functionality for spaCy.
ChatGPT Bot - AI-powered conversation tool
Agentic AI platform that harnesses Visual LLM Chaining to build proactive digital assistants
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
Oxford Deep NLP 2017 course - Practical 1: word2vec
A curated list of NLP resources for Hungarian
This project provides a FastAPI wrapper for interacting with Google Bard, a conversational AI by Google. It allows users to send messages to Google Ba...
Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2022/2023
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tas...
FREE ML Courses from Top Universities in CS
Multi-language Analyze text in 26 Cantonal Swiss German, Italian, German, Chinese (simplified), French, Italian. pply natural language understanding (...
🖋️ Fast and safe spellchecking C++ library
Home of the AI workforce - Multi-agent system, AI agents & tools
Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023
Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/
EMNLP 2018: RESIDE: Improving Distantly-Supervised Neural Relation Extraction using Side Information
HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊 HugNLP will released to @HugAILab
Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite...
Dilated CNNs for NER in TensorFlow
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Named Entity Recognition based on dictionaries
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profile...
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
Text2Text: Crosslingual NLP/G toolkit
The tool to make NLP datasets ready to use
Coding exercises for the Natural Language Processing concentration, part of Udacity's AIND program.
Curated list of technical blogs on machine learning · AI/ML/DL/CV/NLP/MLOps