Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article, "Computing Machinery and Intelligence," that proposed a measure of intelligence now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.
This repository contains my full work and notes on Coursera's NLP Specialization (Natural Language Processing) taught by the instructor Younes Bensoud...
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
Python Implementations of Word Sense Disambiguation (WSD) Technologies.
The prime repository for state-of-the-art Multilingual Question Answering research and development.
A Neural Framework for MT Evaluation
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
PromptKG Family: a Gallery of Prompt Learning & KG-related research works, toolkits, and paper-list.
Curated list of open source tooling for data-centric AI on unstructured data.
TensorFlow and Deep Learning Tutorials
📋 A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
An AI-powered Personally Identifiable Information (PII) scanner.
Salesforce open-source LLMs with 8k sequence length.
Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA
The most complete corpus of modern Chinese-language poetry on Earth: 3k+ poets, 80K+ poems, 15M+ characters
Chinese translation of "Natural Language Processing with PyTorch"
Simple implementation of OpenAI CLIP model in PyTorch.
A list of selected resources, methods, and tools dedicated to Legal Text Analytics.
A Lite BERT for Self-Supervised Learning of Language Representations
A Modern C++ Data Sciences Toolkit
NeuSpell: A Neural Spelling Correction Toolkit
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
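A collection of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
Kiwi (an intelligent Korean morphological analyzer)
The BiLSTM-CRF model implementation in TensorFlow, for sequence labeling tasks.
A chatbot based on natural language understanding and machine learning, supporting concurrent multi-user sessions and customizable multi-turn dialogue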
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
Original Implementation of Prompt Tuning from Lester et al., 2021
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine lear...
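📖 A collection of NLP datasets, papers, and open-source implementations, especially for sentiment analysis, emotion cause identification, and extraction of opinion targets and opinion words.
A production-grade, high-performance, modular, extensible Chinese NLP toolkit (Chinese word segmentation, averaged perceptron, fastText, pinyin, new-word discovery, segmentation error correction, BM25, person-name recognition, named entities, custom dictionaries).
✔️ Scheduled reminders on WeChat - Cron on WeChat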
Deep neural network framework for multi-label text classification
Natural language detection library for Go
An open-source text-to-speech (TTS) voice building tool
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
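A full-stack algorithm tutorial from NLP to LLM. Read online: https://datawhalechina.github.io/base-llm/
⚫ A spaCy pipeline and model for NLP on unstructured legal text.
Roadmaps, tutorials, and practical material for AI-related technologies, including machine learning, deep learning, natural language processing, computer vision, and various algorithms. Notes include: Machine Learning in Action, 剑指Offer, cs231n, cs131, Andrew Ng's...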
Stanford Open Information Extraction made simple!
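Library for faster pinned CPU <-> GPU transfer in PyTorch
"AI-Compass" guides the community in navigating the ocean of AI technology. Whether you are a beginner or an advanced developer, you can find paths here to every major area of AI. It aims to help developers systematically understand AI...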
🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved b...
Build LLM-powered Dart/Flutter applications.
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word norm...
NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms...
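Chinese models for spaCy | Models for spaCy that support Chinese
A comprehensive collection of AI learning materials, covering machine learning (ML) fundamentals, deep learning (DL) fundamentals, computer vision (CV), natural language processing (NLP), recommender systems, speech recognition, graph neural networks, and interview questions for algorithm engineers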