Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
An open-source, low-code machine learning library in Python
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
ModelScope: bring the notion of Model-as-a-Service to life.
The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
BertViz: Visualize Attention in Transformer Models
all kinds of text classification models and more with deep learning
A curated list of resources for Chinese NLP 中文自然语言处理相关资料
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NL...
Chinese version of GPT2 training code, using BERT tokenizer.
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
NeMo: a toolkit for conversational AI
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
An open-source online reverse dictionary.
An open source library for deep learning end-to-end dialog systems and chatbots.
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
A log of things I'm learning
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Mycroft Core, the Mycroft Artificial Intelligence platform.
An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
Google AI 2018 BERT pytorch implementation
Natural Language Processing Best Practices & Examples
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Statistical Machine Intelligence & Learning Engine
TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN, GAN, Auto-Encoders, FasterRCNN, GPT, BERT examples, etc. TF 2.0版入门实例代码,...
Free RPA tool by AI Singapore
Code for Tensorflow Machine Learning Cookbook
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Transforms PDF, Documents and Images into Enriched Structured Data
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
Rediscover your social memories with local, AI-powered analysis. 本地化的聊天记录分析工具,通过 AI Agent 回顾你的社交记忆。
Open-source offline translation library written in Python
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understan...
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
👮♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基于 DFA 算法实现的高性能 java 敏感词过滤工具框架。内置支持单词标签分类分级。请勿发...
Extract Keywords from sentence or Replace keywords in sentences.
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
:pencil: A markup-aware linter for prose built with speed and extensibility in mind.
Language Technology Platform
The unofficial python package that returns response of Google Bard through cookie value.