Most popular distributed repositories and open source projects

tensorflow

An Open Source Machine Learning Framework for Everyone

74880   191787   191787  

ClickHouse

ClickHouse® is a real-time analytics database management system

7669   43044   43044  

ray

Ray is an AI compute engine. Ray consists of a core distributed runtim...

6827   39080   39080  

milvus

Milvus is a high-performance, cloud-native vector database built for s...

3439   37625   37625  

LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others...

2781   35469   35469  

server

☁️ Nextcloud server, a safe home for all your data

4427   30864   30864  

surrealdb

A scalable, distributed, collaborative, document-graph database, for t...

1049   30088   30088  

xxl-job

A distributed task scheduling framework.(分布式任务调度平台XXL-JOB)

11319   29351   29351  

handson-ml

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

12881   25672   25672  

TDengine

High-performance, scalable time-series database designed for Industria...

4958   24348   24348  

redisson

Redisson - Valkey & Redis Java client. Real-Time Data Platform. Sync/A...

5476   24049   24049  

phoenix

Peace of mind from prototype to production

3004   22473   22473  

dgraph

high-performance graph database for real-time use cases

1551   21259   21259  

cat

CAT 作为服务端项目基础组件,提供了 Java, C/C++, Node.js, Python, Go 等...

5440   18939   18939  

bit

AI-powered development workspaces with reusable components, architectu...

945   18232   18232  

LightGBM

A fast, distributed, high performance gradient boosting (GBT, GBDT, GB...

3939   17672   17672  

CNTK

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolk...

4267   17590   17590  

Qix

Machine Learning、Deep Learning、PostgreSQL、Distributed System、Node....

4643   14930   14930  

nni

An open source AutoML toolkit for automate machine learning lifecycle,...

1828   14275   14275  

diaspora

A privacy-aware, distributed, open source social network.

2916   13695   13695  

optuna

A hyperparameter optimization framework

1167   12742   12742  

micro

An API development platform

1057   12218   12218  

nebula

A distributed, fast open-source graph database featuring horizontal...

1264   11710   11710  

dtm

A distributed transaction framework, supports workflow, saga, tcc, xa,...

997   10677   10677  

quickwit

Cloud-native search engine for observability. An open-source alternati...

482   10316   10316  

modin

Modin: Scale your Pandas workflows by changing a single line of code

663   10284   10284  

starrocks

The world's fastest open query engine for sub-second analytics both on...

2022   10129   10129  

oceanbase

The Fastest Distributed Database for Transactional, Analytical, and A...

1792   9550   9550  

oneflow

OneFlow is a deep learning framework designed to be user-friendly, sca...

1009   9368   9368  

orbitdb

Peer-to-Peer Databases for the Decentralized Web

582   8620   8620  

catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees...

1237   8583   8583  

shardingsphere-elasticjob

Distributed scheduled job

3279   8192   8192  

transmittable-thread-local

📌 a missing Java std lib(simple & 0-dependency) for framework/middlew...

1717   8115   8115  

PowerJob

Enterprise job scheduling middleware with distributed computing abilit...

1335   7594   7594  

js-ipfs

IPFS implementation in JavaScript

1230   7421   7421  

h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning P...

2028   7293   7293  

toydb

Distributed SQL database in Rust, written as an educational project

619   7077   7077  

storm

Apache Storm

4059   6654   6654  

hertzbeat

Real-time observability system with agentless, performance cluster, pr...

1176   6616   6616  

hazelcast

Hazelcast is a unified real-time data platform combining stream proces...

1866   6394   6394  

flink-cdc

Flink CDC is a streaming data integration tool

2073   6230   6230  

hatchet

🪓 Run Background Tasks at Scale

263   6058   6058  

spicedb

Open Source, Google Zanzibar-inspired database for scalably storing an...

339   6041   6041  

scrapy-redis

Redis-based components for Scrapy.

1583   5637   5637  

zoneminder

ZoneMinder is a free, open source Closed-circuit television software a...

1258   5620   5620  

permify

An open-source authorization as a service inspired by Google Zanzibar,...

259   5584   5584  

greptimedb

Open-source, cloud-native, unified observability database for metrics,...

417   5547   5547  

haipproxy

:sparkling_heart: High available distributed ip proxy pool, powerd by...

909   5491   5491  

qTox

qTox is a chat, voice, video, and file transfer IM client using the en...

1097   4881   4881  

FluidFramework

Library for building distributed, real-time collaborative web applica...

557   4863   4863