|
--- |
|
license: creativeml-openrail-m |
|
--- |
|
# Arcane Diffusion |
|
This is the fine-tuned Stable Diffusion model trained on images from the TV Show Arcane. |
|
Use the tokens **_arcane style_** in your prompts for the effect. |
|
|
|
If you enjoy this model, please check out my other models on [Huggingface](https://huggingface.co./nitrosocke) |
|
|
|
### 🧨 Diffusers |
|
|
|
This model can be used just like any other Stable Diffusion model. For more information, |
|
please have a look at the [Stable Diffusion](https://huggingface.co./docs/diffusers/api/pipelines/stable_diffusion). |
|
|
|
You can also export the model to [ONNX](https://huggingface.co./docs/diffusers/optimization/onnx), [MPS](https://huggingface.co./docs/diffusers/optimization/mps) and/or [FLAX/JAX](). |
|
|
|
```python |
|
from diffusers import StableDiffusionPipeline |
|
import torch |
|
|
|
model_id = "nitrosocke/Arcane-Diffusion" |
|
pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16) |
|
pipe = pipe.to("cuda") |
|
|
|
prompt = "arcane style, a magical princess with golden hair" |
|
image = pipe(prompt).images[0] |
|
|
|
image.save("./magical_princess.png") |
|
``` |
|
|
|
![img](https://huggingface.co./nitrosocke/Arcane-Diffusion/resolve/main/magical_princess.png) |
|
|
|
### Sample images from v3: |
|
![output Samples v3](https://huggingface.co./nitrosocke/Arcane-Diffusion/resolve/main/arcane-v3-samples-01.jpg) |
|
![output Samples v3](https://huggingface.co./nitrosocke/Arcane-Diffusion/resolve/main/arcane-v3-samples-02.jpg) |
|
### Sample images from the model: |
|
![output Samples](https://huggingface.co./nitrosocke/Arcane-Diffusion/resolve/main/arcane-diffusion-output-images.jpg) |
|
### Sample images used for training: |
|
![Training Samples](https://huggingface.co./nitrosocke/Arcane-Diffusion/resolve/main/arcane-diffusion-training-images.jpg) |
|
|
|
**Version 3** (arcane-diffusion-v3): This version uses the new _train-text-encoder_ setting and improves the quality and edibility of the model immensely. Trained on 95 images from the show in 8000 steps. |
|
|
|
**Version 2** (arcane-diffusion-v2): This uses the diffusers based dreambooth training and prior-preservation loss is way more effective. The diffusers where then converted with a script to a ckpt file in order to work with automatics repo. |
|
Training was done with 5k steps for a direct comparison to v1 and results show that it needs more steps for a more prominent result. Version 3 will be tested with 11k steps. |
|
|
|
**Version 1** (arcane-diffusion-5k): This model was trained using _Unfrozen Model Textual Inversion_ utilizing the _Training with prior-preservation loss_ methods. There is still a slight shift towards the style, while not using the arcane token. |
|
|