Most popular machine-learning repositories and open source projects

Machine learning is the practice of teaching a computer to learn from data rather than from explicitly programmed rules. It uses pattern recognition and other predictive algorithms to make judgments about incoming data. The field is closely related to artificial intelligence and computational statistics.

fastai

The fastai deep learning library

7640   27298   27298  

awesome-datascience

📝 An awesome Data Science repository to learn and apply for real...

6147   27054   27054  

ML-From-Scratch

Machine Learning From Scratch. Bare bones NumPy implementations of mac...

4769   26942   26942  

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GB...

8761   26786   26786  

d2l-en

Interactive deep learning book with multi-framework code, math, and di...

4712   26549   26549  

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-t...

4071   26544   26544  

awesome-deep-learning-papers

The most cited deep learning papers

4468   25928   25928  

500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code

500 AI Machine learning Deep learning Computer vision NLP Projects wit...

5982   25925   25925  

awesome-deep-learning

A curated list of awesome Deep Learning tutorials, projects and commun...

6160   25922   25922  

handson-ml

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

12883   25536   25536  

pumpkin-book

Detailed explanations of the formulas in the book "Machine Learning" (the "Watermelon Book")

4784   25059   25059  

modular

The Modular Platform (includes MAX & Mojo)

2679   24621   24621  

shap

A game theoretic approach to explain the output of any machine learnin...

3406   24221   24221  

WaveFunctionCollapse

Bitmap & tilemap generation from a single example with the help of ide...

1294   24129   24129  

homemade-machine-learning

🤖 Python examples of popular machine learning algorithms with interac...

4090   23575   23575  

fastbook

The fastai book, published as Jupyter Notebooks

9048   23504   23504  

Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from In...

5775   23094   23094  

NLP-progress

Repository to track the progress in Natural Language Processing (NLP),...

3618   22931   22931  

llm-app

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise sea...

405   22901   22901  

qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Se...

1568   22851   22851  

learnopencv

Learn OpenCV : C++ and Python Examples

11721   22114   22114  

100-Days-Of-ML-Code

Chinese edition of 100-Days-Of-ML-Code

5555   21887   21887  

Resume-Matcher

Improve your resumes with Resume Matcher. Get insights, keyword sugges...

4288   21877   21877  

serve

☁️ Build multimodal AI applications with cloud-native stack

2233   21683   21683  

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated...

2906   21663   21663  

haystack

AI orchestration framework to build customizable, production-ready LLM...

2219   21169   21169  

Perplexica

Perplexica is an AI-powered search engine. It is an Open source altern...

2134   21128   21128  

pytorch-handbook

pytorch handbook is an open source book that aims to help those who want to use, or already use, PyTorch for...

5429   21117   21117  

CVPR2025-Papers-with-Code

A collection of CVPR 2025 papers and open source projects

2718   20637   20637  

recommenders

Best Practices on Recommendation Systems

3233   20548   20548  

rasa

💬 Open source machine learning framework to automate text- and voic...

4833   20513   20513  

datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, e...

2898   20487   20487  

C

Collection of various algorithms in mathematics, machine learning, com...

4514   20254   20254  

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gen...

2709   20054   20054  

mlflow

Open source platform for the machine learning lifecycle

4438   20033   20033  

onnx

Open standard for machine learning interoperability

3777   19372   19372  

HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools....

2110   19080   19080  

Scrapegraph-ai

Python scraper based on AI

1608   19020   19020  

tfjs

A WebGL accelerated JavaScript library for training and deploying ML m...

1985   18911   18911  

awesome-production-machine-learning

A curated list of awesome open source libraries to deploy, monitor, ve...

2395   18906   18906  

ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-sourc...

4325   18477   18477  

gun

An open source cybersecurity protocol for syncing decentralized graph...

1191   18457   18457  

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

634   18389   18389  

stanford-cs-229-machine-learning

VIP cheatsheets for Stanford's CS 229 Machine Learning

4038   18347   18347  

CNTK

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolk...

4266   17592   17592  

awesome-nlp

📖 A curated list of resources dedicated to Natural Language Proce...

2622   17396   17396  

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and trai...

3375   17389   17389  

incubator-mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with...

6149   17334   17334  

mindsdb

MindsDB is a Server for Artificial Intelligence Logic. Enabling develo...

2195   17178   17178  

LightGBM

A fast, distributed, high performance gradient boosting (GBT, GBDT, GB...

3873   17092   17092