Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article that proposed a measure of machine intelligence, now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

Repositories (1402)

transformers
transformers huggingface Python

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both i...

148.2k
d2l-zh
d2l-zh d2l-ai Python

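Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.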

71.6k
Made-With-ML
Made-With-ML GokuMohandas Jupyter Notebook

Learn how to design, develop, deploy and iterate on production-grade ML applications.

41.8k
bert
bert google-research Python

TensorFlow code and pre-trained models for BERT

39.4k
HanLP
HanLP hankcs Python

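Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyphrase extraction, automatic summarization, text...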

35.5k
spaCy
spaCy explosion Python

💫 Industrial-strength Natural Language Processing (NLP) in Python

32.2k
applied-ml
applied-ml eugeneyan

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

28.2k
d2l-en
d2l-en d2l-ai Python

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT...

26.5k
NLP-progress
NLP-progress sebastianruder Python

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP...

22.9k
Resume-Matcher
Resume-Matcher srbhr Python

Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.

21.9k
haystack
haystack deepset-ai Python

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipel...

21.2k
rasa
rasa RasaHQ Python

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and...

20.5k
datasets
datasets huggingface Python

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

20.5k
Ciphey
Ciphey bee-san Python

⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡

19.8k
Qwen
Qwen QwenLM Python

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

19k
Dive-into-DL-PyTorch
Dive-into-DL-PyTorch ShusenTang Jupyter Notebook

This project ports the MXNet implementations in the original book Dive into Deep Learning (《动手学深度学习》) to PyTorch.

19k
awesome-nlp
awesome-nlp keon

📖 A curated list of resources dedicated to Natural Language Processing (NLP)

17.4k
DocsGPT
DocsGPT arc53 TypeScript

DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge sources while avoiding hallucinations. It enables private an...

16.9k
ML-YouTube-Courses
ML-YouTube-Courses dair-ai

📺 Discover the latest machine learning / AI courses on YouTube.

16.8k
gensim
gensim piskvorky Python

Topic Modelling for Humans

16.1k
Awesome-pytorch-list
Awesome-pytorch-list bharathgs

A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.

16k
lectures
lectures oxford-cs-deepnlp-2017

Oxford Deep NLP 2017 course

15.9k
ml-visuals
ml-visuals dair-ai

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

15.6k
nlp-tutorial
nlp-tutorial graykode Jupyter Notebook

Natural Language Processing Tutorial for Deep Learning Researchers

14.7k
flair
flair flairNLP Python

A very simple framework for state-of-the-art Natural Language Processing (NLP)

14.3k
nltk
nltk nltk Python

NLTK Source

14.2k
camel
camel camel-ai Python

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

13.8k
languagetool
languagetool languagetool-org Java

Style and Grammar Checker for 25+ Languages

13.4k
clip-as-service
clip-as-service jina-ai Python

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

12.7k
deep-learning-drizzle
deep-learning-drizzle kmario23 HTML

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

12.7k
CV
CV AccumulateMore Jupyter Notebook

✔ (Completed) The most comprehensive deep learning notes [Tudui, PyTorch] [Mu Li, Dive into Deep Learning] [Andrew Ng, Deep Learning]

12.4k
unstructured
unstructured Unstructured-IO HTML

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured...

12.3k
MOSS
MOSS OpenMOSS Python

An open-source tool-augmented conversational language model from Fudan University

12.1k
allennlp
allennlp allenai Python

An open-source NLP research library, built on PyTorch.

11.9k
LLMSurvey
LLMSurvey RUCAIBox Python

The official GitHub page for the survey paper "A Survey of Large Language Models".

11.7k
ludwig
ludwig ludwig-ai Python

Low-code framework for building custom LLMs, neural networks, and other AI models

11.6k
sentencepiece
sentencepiece google C++

Unsupervised text tokenizer for Neural Network-based text generation.

11.2k
stanford-tensorflow-tutorials
stanford-tensorflow-tutorials chiphuyen Python

This repository contains code examples for Stanford's course TensorFlow for Deep Learning Research.

10.4k
doccano
doccano doccano Python

Open source annotation tool for machine learning practitioners.

10.3k
LLMsPracticalGuide
LLMsPracticalGuide Mooler0410

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

10k
tokenizers
tokenizers huggingface Rust

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

10k
CoreNLP
CoreNLP stanfordnlp Java

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

9.9k
TextBlob
TextBlob sloria Python

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

9.4k
attention-is-all-you-need-pytorch
attention-is-all-you-need-pytorch jadore801120 Python

A PyTorch implementation of the Transformer model in "Attention is All You Need".

9.3k
autogluon
autogluon autogluon Python

Fast and Accurate ML in 3 Lines of Code

9.3k
pattern
pattern clips Python

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

8.8k
openvino
openvino openvinotoolkit C++

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

8.7k
machine_learning_examples
machine_learning_examples lazyprogrammer Python

A collection of machine learning examples and tutorials.

8.7k
Deep-Learning-Interview-Book
Deep-Learning-Interview-Book amusi

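Deep learning interview handbook (covering mathematics, machine learning, deep learning, computer vision, natural language processing, SLAM, and related areas)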

8.4k
bertviz
bertviz jessevig Python

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

7.6k