Inference app for a FP8-quantized flux1-dev model. This runs on graphic cards with 16 GB of VRAM.
What is the Neurone/flux.1-dev-fp8 GitHub project? Description: "Inference app for a FP8-quantized flux1-dev model. This runs on graphic cards with 16 GB of VRAM.". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.
Question is copied to clipboard — paste it after the AI opens.
Clone via HTTPS
Clone via SSH
Download ZIP
Download master.zipReport bugs or request features on the flux.1-dev-fp8 issue tracker:
Open GitHub Issues