Slow

#3
by phaser10 - opened

Hello. Generation is so slow that it seems to be running on a CPU, yet it still drains my ZeroGPU quota.

Any ideas on why the generation is so slow?

Huggingface Projects org

Thanks for reporting!
It's slow, but it is actually running on the GPU. You can verify this by running this Space locally and monitoring GPU usage. A colleague who tried the transformers implementation also reported that it was slow. @ArthurZ suggested that torch.compile would speed it up, but this Space doesn't use torch.compile because it isn't compatible with ZeroGPU.
@ArthurZ Do you have any other ideas why this Space is slow?
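
For anyone who wants to do the local check mentioned above, here is a rough sketch of one way to watch GPU usage while the Space is generating. It is not code from this Space: it assumes an NVIDIA GPU with `nvidia-smi` on the PATH, and the helper names (`parse_gpu_stats`, `sample_gpu`, `watch`) are made up for illustration.

```python
import shutil
import subprocess
import time

# Ask nvidia-smi for per-GPU utilization (%) and used memory (MiB) as bare CSV.
QUERY = [
    "nvidia-smi",
    "--query-gpu=utilization.gpu,memory.used",
    "--format=csv,noheader,nounits",
]


def _to_int(field):
    """Convert a CSV field to int; return None for values like '[N/A]'."""
    try:
        return int(field.strip())
    except ValueError:
        return None


def parse_gpu_stats(output):
    """Parse nvidia-smi CSV output into (utilization, memory_used) tuples."""
    stats = []
    for line in output.strip().splitlines():
        util, mem = line.split(",")[:2]
        stats.append((_to_int(util), _to_int(mem)))
    return stats


def sample_gpu():
    """Return one reading per GPU, or None if nvidia-smi is not installed."""
    if shutil.which("nvidia-smi") is None:
        return None
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_stats(result.stdout)


def watch(seconds=10, interval=1.0):
    """Print a reading every `interval` seconds for roughly `seconds` seconds."""
    end = time.monotonic() + seconds
    while time.monotonic() < end:
        print(sample_gpu())
        time.sleep(interval)
```

Call `watch()` in a second terminal while a generation is running. If utilization and used memory stay near zero the whole time, the model is probably not on the GPU; if they rise during generation, the GPU is in use and the slowness comes from somewhere else.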
