SDXL 1.0 finetunes on vucinatim/spectrogram-captions for 89 epochs(800 steps). It generates spectrograms for simple sounds. It currently does not produce very good sound effects, but I will train the model for longer in the future.

Downloads last month: 3

Inference Providers NEW

Text-to-Image

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

sr5434
/

SDXL-v1.0-sfx-step-800

Dataset used to train sr5434/SDXL-v1.0-sfx-step-800

Space using sr5434/SDXL-v1.0-sfx-step-800 1