Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
This repository contains my full work and notes on Coursera's NLP Specialization (Natural Language Processing) taught by the instructor Younes Bensoud...
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
Python Implementations of Word Sense Disambiguation (WSD) Technologies.
A Neural Framework for MT Evaluation
The prime repository for state-of-the-art Multilingual Question Answering research and development.
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
PromptKG Family: a Gallery of Prompt Learning & KG-related research works, toolkits, and paper-list.
Curated list of open source tooling for data-centric AI on unstructured data.
TensorFlow and Deep Learning Tutorials
:clipboard: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
An AI-powered Personal Identifiable Information (PII) scanner.
Salesforce open-source LLMs with 8k sequence length.
地球上最全的华语现代诗歌语料库,3k+诗人,80K+诗歌,15M+字
Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA
Simple implementation of OpenAI CLIP model in PyTorch.
《Natural Language Processing with PyTorch》中文翻译
A list of selected resources, methods, and tools dedicated to Legal Text Analytics.
A Lite Bert For Self-Supervised Learning Language Representations
A Modern C++ Data Sciences Toolkit
NeuSpell: A Neural Spelling Correction Toolkit
Kiwi(지능형 한국어 형태소 분석기)
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
The BiLSTM-CRF model implementation in Tensorflow, for sequence labeling tasks.
基于自然语言理解与机器学习的聊天机器人,支持多用户并发及自定义多轮对话
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
从 NLP 到 LLM 的算法全栈教程,在线阅读地址:https://datawhalechina.github.io/base-llm/
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine lear...
Original Implementation of Prompt Tuning from Lester, et al, 2021
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
“AI-Compass”将为社区指引在 AI 技术海洋中航行的方向,无论你是初学者还是进阶开发者,都能在这里找到通往 AI 各大方向的路径。旨在帮助开发者系统性地了解 AI...
:book: 收集NLP领域相关的数据集、论文、开源实现,尤其是情感分析、情绪原因识别、评价对象和评价词抽取方面。
一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典)
:heavy_check_mark: 微信上的定时提醒 - Cron on WeChat
Deep neural network framework for multi-label text classification
Natural language detection library for Go
An opensource text-to-speech (TTS) voice building tool
:black_circle: A spaCy pipeline and model for NLP on unstructured legal text.
机器学习、深度学习、自然语言处理、计算机视觉、各种算法等AI领域相关技术的路线、教程、干货分享。笔记有:机器学习实战、剑指Offer、cs231n、cs131、吴恩达机...
Library for faster pinned CPU <-> GPU transfer in Pytorch
Stanford Open Information Extraction made simple!
🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved b...
Build LLM-powered Dart/Flutter applications.
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word norm...
NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms...
SpaCy 中文模型 | Models for SpaCy that support Chinese
人工智能学习资料超全整理,包含机器学习基础ML、深度学习基础DL、计算机视觉CV、自然语言处理NLP、推荐系统、语音识别、图神经网路、算法工程师面试题