|
--- |
|
tags: |
|
- text-to-image |
|
- lora |
|
- diffusers |
|
- template:diffusion-lora |
|
- collage |
|
- wallpaper |
|
- vibrant |
|
- multiple-panels |
|
widget: |
|
- text: >- |
|
A vibrant red spiderman is depicted in a collage of multiple panels. The |
|
background is a vibrant orange, adding a pop of color to the scene. The |
|
spidermans suit is a deep red, with a white stripe running down the center |
|
of his chest. His eyes are large, white, and his mouth is slightly open. His |
|
hands are positioned in a way that he is facing towards the right side of |
|
the frame. The entire frame is black, creating a striking contrast with the |
|
red and orange background. |
|
output: |
|
url: images/C1.webp |
|
- text: >- |
|
A vibrant, black-suited Batman is depicted in a collage of multiple panels. |
|
The background is a deep, intense blue, adding a dramatic tone to the scene. |
|
Batmans suit is a rich black with a dark gray bat symbol on his chest. His |
|
eyes are narrow and white, and his mouth is set in a serious expression. His |
|
cape flows around him, and he is facing toward the right side of the frame, |
|
giving a sense of movement. The entire frame is outlined in dark gray, |
|
creating a bold contrast with the dark suit and blue background. |
|
output: |
|
url: images/C2.webp |
|
- text: >- |
|
A vibrant, red-suited Deadpool is depicted in a collage of multiple panels. |
|
The background is a bold, electric yellow, adding an energetic vibe to the |
|
scene. Deadpool’s suit is a striking red with black patches around his eyes |
|
and across his torso, creating a dynamic look. His eyes are expressive and |
|
white, and his mouth is curved into a mischievous smirk. His hands are |
|
positioned confidently, holding one of his katanas, and he is facing toward |
|
the right side of the frame, exuding a playful, yet intense energy. The |
|
entire frame is outlined in black, creating a sharp contrast with the red |
|
suit and vibrant yellow background. |
|
output: |
|
url: images/L3.webp |
|
base_model: black-forest-labs/FLUX.1-dev |
|
instance_prompt: collage |
|
license: creativeml-openrail-m |
|
--- |
|
# Castor-Collage-Dim-Flux-LoRA |
|
|
|
<Gallery /> |
|
|
|
**The model is still in the training phase. This is not the final version and may contain artifacts and perform poorly in some cases.** |
|
|
|
## Model description |
|
|
|
**prithivMLmods/Castor-Collage-Dim-Flux-LoRA** |
|
|
|
Image Processing Parameters |
|
|
|
| Parameter | Value | Parameter | Value | |
|
|---------------------------|--------|---------------------------|--------| |
|
| LR Scheduler | constant | Noise Offset | 0.03 | |
|
| Optimizer | AdamW | Multires Noise Discount | 0.1 | |
|
| Network Dim | 64 | Multires Noise Iterations | 10 | |
|
| Network Alpha | 32 | Repeat & Steps | 28 & 3.2k | |
|
| Epoch | 12 | Save Every N Epochs | 1 | |
|
|
|
Labeling: florence2-en(natural language & English) |
|
|
|
Total Images Used for Training : 27 |
|
|
|
## Setting Up |
|
``` |
|
import torch |
|
from pipelines import DiffusionPipeline |
|
|
|
base_model = "black-forest-labs/FLUX.1-dev" |
|
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16) |
|
|
|
lora_repo = "prithivMLmods/Castor-Collage-Dim-Flux-LoRA" |
|
trigger_word = "collage" # Leave trigger_word blank if not used. |
|
pipe.load_lora_weights(lora_repo) |
|
|
|
device = torch.device("cuda") |
|
pipe.to(device) |
|
``` |
|
## Sample Image + Prompt |
|
|
|
![Castor-Collage-Dim-Flux-LoRA](images/gta-saga.webp) |
|
|
|
| **Prompt** | |
|
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| |
|
| A dynamic scene from GTA (Grand Theft Auto) is depicted in a collage of multiple panels. The background is a gritty, urban cityscape with neon lights casting vibrant shades of purple, blue, and orange, capturing the chaotic energy of the city at night. A character in casual streetwear, holding a bag of cash in one hand and a weapon in the other, stands confidently, slightly facing the right side of the frame. The character has a focused expression, exuding a sense of rebellion and risk. The entire frame is outlined in dark gray, creating a bold contrast with the vibrant city lights and shadowed streets, evoking the intense, action-packed vibe of GTA. | |
|
|
|
## App File Structure |
|
|
|
/project-root/ |
|
|
|
├── .gitattributes |
|
├── README.md |
|
├── app.py |
|
├── pythonproject.py |
|
|
|
# Best Dimensions |
|
|
|
- 512 X 512 |
|
- 1024 X 1024 |
|
- 768 X 1024 |
|
|
|
## Trigger words 🧨 |
|
|
|
> [!WARNING] |
|
> **Trigger words:** You should use `collage` to trigger the image generation. |
|
|
|
## Download model |
|
|
|
Weights for this model are available in Safetensors format. |
|
|
|
[Download](/prithivMLmods/Castor-Collage-Dim-Flux-LoRA/tree/main) them in the Files & versions tab. |