--- license: mit tags: - not-for-all-audiences --- A finetune of lodestones/fluffyrock-1088-minsnr-zsnr-vpred-ema-pytorch with the following changes: - Caption dropout of 10% to improve classifier-free guidance - Timesteps weighted by 1/(snr+1) to improve output quality - A higher virtual batch size of 64 - Includes RedRocket/furception_vae for convenience (licensed separately, please refer to its repository for more information) Like its predecessor, this is a v-prediction, zero terminal SNR model. Please use the provided .yaml file (or do whatever your preferred frontend requires you to do to load a model as v-prediction and zero terminal SNR) so the model can load correctly, or you will have a lot of trouble generating correct outputs! A refiner model is also planned. Stay tuned for more information. The VAE decoder is licensed under Furception's license (as of writing CC-BY-NC-SA 4.0 though due to change). All other model components are licensed under the MIT license. Special thanks to @RedHotTensors for general assistance.