kabachuha
/

maxwell-the-cat-diffusion

StableDiffusionPipeline

stable-diffusion

stable-diffusion-diffusers

Inference Endpoints

Model card Files Files and versions Community

kabachuha commited on Jan 19, 2023

Commit

455be6d

•

1 Parent(s): f62730b

Update README.md

Files changed (1) hide show

README.md +37 -0

README.md CHANGED Viewed

@@ -1,3 +1,40 @@
 ---
 license: openrail++
 ---

 ---
 license: openrail++
 ---
+## Description
+Maxwell the Cat Diffusion is a latent text-to-image diffusion model based on the original CompVis Stable Diffusion v1.5 and then fine-tuned on 5 images based on the 'Maxwell the Cat' meme originating from the modded Half Life 2 video.
+To use this gorgeous object in your generations, add `maxwell the cat` to the prompts.
+## Dreambooth hyperparameters
+```sh
+export MODEL_NAME="runwayml/stable-diffusion-v1-5"
+export INSTANCE_DIR="/home/kabachuha/kml/datasets/objects/maxwell"
+export CLASS_DIR="/home/kabachuha/kml/datasets/objects/maxwell_class"
+export OUTPUT_DIR="/home/kabachuha/kml/maxwell/"
+accelerate launch train_dreambooth.py \
+  --pretrained_model_name_or_path=$MODEL_NAME  \
+  --instance_data_dir=$INSTANCE_DIR \
+  --class_data_dir=$CLASS_DIR \
+  --output_dir=$OUTPUT_DIR \
+  --with_prior_preservation --prior_loss_weight=1.0 \
+  --instance_prompt="maxwell the cat" \
+  --class_prompt="3d model of a black cat, lowpoly" \
+  --resolution=512 \
+  --train_batch_size=1 \
+  --gradient_accumulation_steps=1 \
+  --learning_rate=1e-6 \
+  --lr_scheduler="constant" \
+  --lr_warmup_steps=0 \
+  --num_class_images=200 \
+  --max_train_steps=800 \
+  --mixed_precision 'no' \
+  --train_text_encoder \
+  --checkpointing_steps 1200
+```
+The dataset link https://drive.google.com/drive/folders/1nd8NHrwuu_VKHaU8iBFw95w1iqfmhNTo?usp=share_link