4xFFHQDAT / README.md
Phips's picture
Update README.md
f73a004 verified
|
raw
history blame
3.07 kB
---
license: cc-by-4.0
pipeline_tag: image-to-image
tags:
- pytorch
- super-resolution
---
[Link to Github Release](https://github.com/Phhofm/models/releases/tag/4xFFHQDAT)
# 4xFFHQDAT
Name: 4xFFHQDAT
Author: Philip Hofmann
Release Date: 25.08.2023
License: CC BY 4.0
Network: DAT
Scale: 4
Purpose: 4x upscaling model for faces
Iterations: 122000
epoch: 2
batch_size: 4
HR_size: 128
Dataset: FFHQ - full dataset till 50k, then first 10k img multiscaled (resulted in ~260k imgs, 126GB)
Number of train images: 259990
OTF Training: Yes
Pretrained_Model_G: DAT_x4.pth
Description: 4x photo upscaler for faces with otf jpg compression, blur and resize, trained on FFHQ dataset. This has been trained on and for faces, but i guess can also be used for other photos, might be able to retain skin detail. This is not face restoration, but simply a 4x upscaler trained on faces, therefore input images need to be of good quality if good output quality is desired.
Examples 4xFFHQDAT:
[Imgsli1](https://imgsli.com/MjAwNjUz)
[Imgsli2](https://imgsli.com/MjAwNjU0)
[Imgsli3](https://imgsli.com/MjAwNjU2)
[Imgsli4](https://imgsli.com/MjAwNjU3)
[Imgsli5](https://imgsli.com/MjAwNjU4)
[Imgsli6](https://imgsli.com/MjAwNjU5)
[Imgsli7](https://imgsli.com/MjAwNzk0)
![Example1](https://github.com/Phhofm/models/assets/14755670/3b69c1cb-3c94-4f26-8547-d8745a7165af)
![Example2](https://github.com/Phhofm/models/assets/14755670/57d92f97-0b62-44bc-ae6a-a15891d0d8a8)
![Example3](https://github.com/Phhofm/models/assets/14755670/968460e4-f94d-4c67-a657-4c634e1b03ff)
![Example4](https://github.com/Phhofm/models/assets/14755670/25261c31-a13c-43b3-96be-1116b4b12319)
![Example5](https://github.com/Phhofm/models/assets/14755670/b87b3226-24a5-4d17-a550-2c6894037e95)
---
Since the above 4xFFHQDAT model is not able to handle the noise present in low quality input images, i made a small variant/finetune of this, the 4xFFHQLDAT model. This model might come in handy if your input image is of bad quality/not suited for above model. I basically made this model in a response to an input image posted in upscaling-results channel as a request to this upscale model (since 4xFFHQDAT would not be able to handle noise), see Imgsli1 example below for result.
Name: 4xFFHQLDAT
Author: Philip Hofmann
Release Date: 25.08.2023
License: CC BY 4.0
Network: DAT
Scale: 4
Purpose: 4x upscaling model for low quality input photos of faces
Iterations: 44000
epoch: 0
batch_size: 4
HR_size: 128
Dataset: FFHQ - full dataset till 50k, then first 10k img multiscaled (resulted in ~260k imgs, 126GB)
Number of train images: 259990
OTF Training: Yes
Pretrained_Model_G: 4xFFHQDAT
Examples 4xFFHQLDAT:
[Imgsli1](https://imgsli.com/MjAwNjYx)
[Imgsli2](https://imgsli.com/MjAwNjYy)
[Imgsli3](https://imgsli.com/MjAwNjYz)
![Example6](https://github.com/Phhofm/models/assets/14755670/61b3cff7-117b-4510-bdcf-cd49a1494227)
![Example7](https://github.com/Phhofm/models/assets/14755670/de8e63a4-3b7b-4583-b638-720bb6423b2d)