petastorm

uber

The Petastorm library enables single-machine or distributed training and evaluation of deep learning models directly from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can also be used from pure Python code.
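As a rough illustration of the "pure Python" use case, the sketch below reads a few batches from an existing Parquet directory with petastorm's `make_batch_reader`. The helper names `to_dataset_url` and `read_first_batches` and the example path are hypothetical, chosen only for this illustration; the sketch assumes petastorm is installed and that the directory already holds Parquet files.

```python
from itertools import islice
from pathlib import Path


def to_dataset_url(local_path):
    """Turn a local filesystem path into a file:// URL (hypothetical helper)."""
    return Path(local_path).absolute().as_uri()


def read_first_batches(dataset_url, limit=2):
    """Return up to `limit` batches from a plain Parquet store (hypothetical helper)."""
    # Imported lazily so to_dataset_url() works even without petastorm installed.
    from petastorm import make_batch_reader

    # The reader is a context manager and an iterator over row batches.
    with make_batch_reader(dataset_url) as reader:
        return list(islice(reader, limit))
```

A call might look like `read_first_batches(to_dataset_url("/tmp/my_parquet_dir"))`, where the path is a placeholder. Datasets written with Petastorm's own schema are typically read with `make_reader` instead.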

1.8k Stars
284 Forks
1.8k Watchers
Python Language
apache-2.0 License
Cost to Build: $167.9K
Market Value: $700.6K

Growth over time

[Chart: stars, forks, and watchers over time; 10 data points, 2021-08-01 → 2025-08-01]

How to clone petastorm

Clone via HTTPS

git clone https://github.com/uber/petastorm.git

Clone via SSH

git clone git@github.com:uber/petastorm.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the petastorm issue tracker:

Open GitHub Issues