Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1462)

stanford-tensorflow-tutorials
stanford-tensorflow-tutorials chiphuyen Python

This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.

10.4k
Chinese-BERT-wwm
Chinese-BERT-wwm ymcui Python

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

10.2k
LLMsPracticalGuide
LLMsPracticalGuide Mooler0410

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

10.2k
openvino
openvino openvinotoolkit C++

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

10.1k
petals
petals bigscience-workshop Python

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

10.1k
CoreNLP
CoreNLP stanfordnlp Java

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

10.1k
nlp_chinese_corpus
nlp_chinese_corpus brightmart

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9.9k
pycaret
pycaret pycaret Jupyter Notebook

An open-source, low-code machine learning library in Python

9.7k
attention-is-all-you-need-pytorch
attention-is-all-you-need-pytorch jadore801120 Python

A PyTorch implementation of the Transformer model in "Attention is All You Need".

9.7k
TextBlob
TextBlob sloria Python

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

9.5k
modelscope
modelscope modelscope Python

ModelScope: bring the notion of Model-as-a-Service to life.

8.9k
LLM-Agent-Paper-List
LLM-Agent-Paper-List WooooDyy

The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

8.1k
bertviz
bertviz jessevig Python

BertViz: Visualize Attention in Transformer Models

8k
text_classification
text_classification brightmart Python

all kinds of text classification models and more with deep learning

7.9k
Awesome-Chinese-NLP
Awesome-Chinese-NLP crownpku

A curated list of resources for Chinese NLP 中文自然语言处理相关资料

7.9k
stanza
stanza stanfordnlp Python

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

7.8k
presidio
presidio microsoft Python

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NL...

7.8k
GPT2-Chinese
GPT2-Chinese Morizeyao Python

Chinese version of GPT2 training code, using BERT tokenizer.

7.6k
BERTopic
BERTopic MaartenGr Python

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

7.6k
NeMo
NeMo NVIDIA Python

NeMo: a toolkit for conversational AI

7.2k
Chinese-LLaMA-Alpaca-2
Chinese-LLaMA-Alpaca-2 ymcui Python

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

7.1k
WantWords
WantWords thunlp JavaScript

An open-source online reverse dictionary.

7.1k
DeepPavlov
DeepPavlov deeppavlov Python

An open source library for deep learning end-to-end dialog systems and chatbots.

7k
models
models PaddlePaddle Python

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

6.9k
learning
learning amitness

A log of things I'm learning

6.9k
donut
donut clovaai Python

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

6.8k
mycroft-core
mycroft-core MycroftAI Python

Mycroft Core, the Mycroft Artificial Intelligence platform.

6.6k
nlp.js
nlp.js axa-group JavaScript

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more

6.6k
ansj_seg
ansj_seg NLPchina Java

ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典

6.5k
BERT-pytorch
BERT-pytorch codertimo Python

Google AI 2018 BERT pytorch implementation

6.5k
nlp-recipes
nlp-recipes microsoft Python

Natural Language Processing Best Practices & Examples

6.4k
courses
courses SkalskiP Python

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

6.4k
smile
smile haifengl Java

Statistical Machine Intelligence & Learning Engine

6.4k
TensorFlow-2.x-Tutorials
TensorFlow-2.x-Tutorials dragen1860 Jupyter Notebook

TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN, GAN, Auto-Encoders, FasterRCNN, GPT, BERT examples, etc. TF 2.0版入门实例代码,...

6.4k
TagUI
TagUI aisingapore JavaScript

Free RPA tool by AI Singapore

6.3k
tensorflow_cookbook
tensorflow_cookbook nfmcclure Jupyter Notebook

Code for Tensorflow Machine Learning Cookbook

6.2k
xlnet
xlnet zihangdai Python

XLNet: Generalized Autoregressive Pretraining for Language Understanding

6.2k
Parsr
Parsr axa-group JavaScript

Transforms PDF, Documents and Images into Enriched Structured Data

6.2k
aim
aim aimhubio Python

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

6.1k
ChatLab
ChatLab ChatLab TypeScript

Rediscover your social memories with local, AI-powered analysis. 本地化的聊天记录分析工具,通过 AI Agent 回顾你的社交记忆。

6.1k
argos-translate
argos-translate argosopentech Python

Open-source offline translation library written in Python

5.9k
ERNIE
ERNIE PaddlePaddle Python

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understan...

5.9k
Chinese-CLIP
Chinese-CLIP OFA-Sys Jupyter Notebook

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

5.9k
trafilatura
trafilatura adbar Python

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

5.8k
sensitive-word
sensitive-word houbb Java

👮‍♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基于 DFA 算法实现的高性能 java 敏感词过滤工具框架。内置支持单词标签分类分级。请勿发...

5.8k
flashtext
flashtext vi3k6i5 Python

Extract Keywords from sentence or Replace keywords in sentences.

5.7k
awesome-pretrained-chinese-nlp-models
awesome-pretrained-chinese-nlp-models lonePatient Python

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

5.6k
vale
vale vale-cli Go

:pencil: A markup-aware linter for prose built with speed and extensibility in mind.

5.4k
ltp
ltp HIT-SCIR Python

Language Technology Platform

5.2k
Bard-API
Bard-API dsdanielpark Python

The unofficial python package that returns response of Google Bard through cookie value.

5.2k