Even though I'm running the quantized version of diffusers, shouldn't it be a bit faster? I'm literally getting 6 minutes for a single Img2Img task on an NVIDIA H100 80GB.
The slow speed is intended.
Why?
Money and fame :D