Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-bas...
TextRank implementation for Python 3.
GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network.
基于医药知识图谱的智能问答系统
DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. P...
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
Keras implementation of "One pixel attack for fooling deep neural networks" using differential evolution on Cifar10 and ImageNet
Portfolio of data science projects completed by me for academic, self learning, and hobby purposes.
📖 A curated list of awesome resources dedicated to Relation Extraction, one of the most important tasks in Natural Language Processing (NLP).
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For ac...
Multilingual word vectors in 78 languages
Explore large language models in 512MB of RAM
🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP
My first Python repo with codes in Machine Learning, NLP and Deep Learning with Keras and Theano
Videos, notes and experiments to understand deep learning
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word...
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
Neural Network Compression Framework for enhanced OpenVINO™ inference
Neural question generation using transformers
A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks....
整理一些书籍 ,包含 C&C++ 、git 、Java、Keras 、Linux 、NLP 、Python 、Scala 、TensorFlow 、大数据 、推荐系统、数据库、数据挖掘 、机器学习 、深度学习...
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.
Chinese NewsTitle Generation Project by GPT2.带有超级详细注释的中文GPT2新闻标题生成项目。
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100...
llms 大模型 笔记50篇 此仓库包含关于机器学习、深度学习、计算机视觉、自然语言处理、大模型 爬虫等领域 项目实战
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applicati...
:memo: This repository recorded my NLP journey.
Z80-μLM is a 2-bit quantized language model small enough to run on an 8-bit Z80 processor. Train conversational models in Python, export them as CP/M...
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
Curated List: Practical Natural Language Processing done in Ruby
curated collection of papers for the nlp practitioner 📖👩🔬
List of papers on hallucination detection in LLMs.
Natural language detection library for Rust. Try demo online: https://whatlang.org/
🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
1st Place Solution for Zhihu Machine Learning Challenge . Implementation of various text-classification models.(知乎看山杯第一名解决方案)
THUOCL(THU Open Chinese Lexicon)中文词库
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Repo for Chinese Medical ChatGLM 基于中文医学知识的ChatGLM指令微调
📖 Paper reading list in conversational AI.
AI-related tutorials. Access any of them for free → https://towardsai.net/editorial
A lightweight multilingual LLM
A collection of notebooks for Natural Language Processing from NLP Town
KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)
Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation
Summarization Papers