KBlueLeaf/Kohaku-XL-Zeta


KBlueLeaf committed on Aug 25, 2024

Commit cae9940 (verified) · Parent: 4d56c28

Update README.md

Files changed (1)

README.md +26 -22


@@ -74,28 +74,32 @@ As same as kxl eps rev2, I add realbooru and pvc figure images for more flexibil
 ## Training
 - Hardware: Quad RTX 3090s
-- Num Train Images: 8,468,798
-- Total Epoch: 1
-- Total Steps: 16548
-- Training Time: 430 hours (wall time)
-- Batch Size: 4
-- Grad Accumulation Step: 32
-- Equivalent Batch Size: 512
-- Optimizer: Lion8bit
-- Learning Rate: 1e-5 for UNet / TE training disabled
-- LR Scheduler: Constant (with warmup)
-- Warmup Steps: 100
-- Weight Decay: 0.1
-- Betas: 0.9, 0.95
-- Min SNR Gamma: 5
-- Debiased Estimation Loss: Enabled
-- IP Noise Gamma: 0.05
-- Resolution: 1024x1024
-- Min Bucket Resolution: 256
-- Max Bucket Resolution: 4096
-- Mixed Precision: FP16
-- Caption Tag Dropout: 0.2
-- Caption Group Dropout: 0.2 (for dropping tag/nl caption entirely)
+- Training
+  - Num Train Images: 8,468,798
+  - Total Epoch: 1
+  - Total Steps: 16548
+  - Training Time: 430 hours (wall time)
+  - Batch Size: 4
+  - Grad Accumulation Step: 32
+  - Equivalent Batch Size: 512
+  - Mixed Precision: FP16
+- Optimizer
+  - Optimizer: Lion8bit
+  - Learning Rate: 1e-5 for UNet / TE training disabled
+  - LR Scheduler: Constant (with warmup)
+  - Warmup Steps: 100
+  - Weight Decay: 0.1
+  - Betas: 0.9, 0.95
+- Diffusion
+  - Min SNR Gamma: 5
+  - Debiased Estimation Loss: Enabled
+  - IP Noise Gamma: 0.05
+  - Resolution: 1024x1024
+  - Min Bucket Resolution: 256
+  - Max Bucket Resolution: 4096
+- Other
+  - Caption Tag Dropout: 0.2
+  - Caption Group Dropout: 0.2 (for dropping tag/nl caption entirely)
 ## Why do you still use SDXL but not any Brand New DiT-Based Models?
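
The batch-size figures in the Training list can be cross-checked with a little arithmetic. The sketch below assumes that "Batch Size: 4" is a per-GPU value and that all four GPUs train in data parallel; the README does not say so explicitly, but 4 GPUs x 4 x 32 reproduces the listed equivalent batch size of 512. The function names are illustrative and do not come from any training script.

```python
import math

# Values copied from the Training list in the diff above.
NUM_GPUS = 4                 # "Hardware: Quad RTX 3090s"
PER_DEVICE_BATCH = 4         # "Batch Size: 4" (assumed to be per GPU)
GRAD_ACCUM_STEPS = 32        # "Grad Accumulation Step: 32"
NUM_TRAIN_IMAGES = 8_468_798
EPOCHS = 1


def effective_batch_size(gpus: int, per_device: int, accum: int) -> int:
    """Images consumed per optimizer step under data parallelism."""
    return gpus * per_device * accum


def estimated_steps(num_images: int, batch: int, epochs: int = 1) -> int:
    """Optimizer steps needed to see every image `epochs` times."""
    return math.ceil(num_images * epochs / batch)


equivalent_batch = effective_batch_size(NUM_GPUS, PER_DEVICE_BATCH, GRAD_ACCUM_STEPS)
approx_steps = estimated_steps(NUM_TRAIN_IMAGES, equivalent_batch, EPOCHS)

print(equivalent_batch)  # 512, matching "Equivalent Batch Size: 512"
print(approx_steps)      # 16541, close to the listed "Total Steps: 16548"
```

The estimate lands 7 steps below the listed total. One plausible cause is aspect-ratio bucketing, where each bucket can end with a partial batch, but the README does not state the reason.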