petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
How to download and setup petastorm
Open terminal and run command
git clone https://github.com/uber/petastorm.git
git clone is used to create a copy or clone of petastorm repositories.
You pass git clone a repository URL. it supports a few different network protocols and corresponding URL formats.
Also you may download zip file with petastorm https://github.com/uber/petastorm/archive/master.zip
Or simply clone petastorm with SSH
[email protected]:uber/petastorm.git
If you have some problems with petastorm
You may open issue on petastorm support forum (system) here: https://github.com/uber/petastorm/issuesSimilar to petastorm repositories
Here you may see petastorm alternatives and analogs
gold-miner tensorflow keras scikit-learn TensorFlow-Examples pytorch face_recognition CNTK data-science-ipython-notebooks Qix handong1587.github.io telegram-list mlcourse.ai stats Winds machine-learning-curriculum caffe tesseract machine-learning-for-software-engineers awesome-deep-learning-papers incubator-mxnet lectures cs-video-courses julia Screenshot-to-code spaCy cheatsheets-ai awesome-deep-learning python-machine-learning-book WaveFunctionCollapse