269 Forks
1593 Stars
1593 Watchers

petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

How to download and setup petastorm

Open terminal and run command
git clone https://github.com/uber/petastorm.git
git clone is used to create a copy or clone of petastorm repositories. You pass git clone a repository URL.
it supports a few different network protocols and corresponding URL formats.

Also you may download zip file with petastorm https://github.com/uber/petastorm/archive/master.zip

Or simply clone petastorm with SSH
[email protected]:uber/petastorm.git

If you have some problems with petastorm

You may open issue on petastorm support forum (system) here: https://github.com/uber/petastorm/issues