Image-to-Image
Diffusers
Safetensors
drhead commited on
Commit
775ce4c
1 Parent(s): 5458b76

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -5,7 +5,7 @@ library_name: diffusers
5
 
6
  This is a VAE decoder finetune, resumed from stabilityai/sd-vae-ft-mse using images from e621. It is trained with a mixture of MAE and MSE loss to maintain an acceptable balance between sharpness and smooth outputs, and loss is calculated in Oklab color space in order to prioritize image reconstruction based on which color channels are more perceptually significant.
7
 
8
- Our testing has shown that the VAE is good at eliminating unwanted high-frequency noise when used on models trained on similar data. Results are far more apparent on flat-colored images than they are on realistic or painterly images, but we have not noticed any obvious loss of performance on any type of image. It may have some generalizability to a broader range of art styles due to the variety of different styles in the dataset.
9
 
10
  Default VAE (kl-f8):
11
  ![Default VAE](crop3[1].png)
 
5
 
6
  This is a VAE decoder finetune, resumed from stabilityai/sd-vae-ft-mse using images from e621. It is trained with a mixture of MAE and MSE loss to maintain an acceptable balance between sharpness and smooth outputs, and loss is calculated in Oklab color space in order to prioritize image reconstruction based on which color channels are more perceptually significant.
7
 
8
+ Our testing has shown that the VAE is good at eliminating unwanted high-frequency noise when used on models trained on similar data. Results are far more apparent on flat-colored images than they are on realistic or painterly images, but we have not noticed any obvious loss of performance on any type of image. The effects are also more noticeable on lower-resolution generated images, but there are improvements at all resolutions. It may have some generalizability to a broader range of art styles due to the variety of different styles in the dataset.
9
 
10
  Default VAE (kl-f8):
11
  ![Default VAE](crop3[1].png)