---
tags:
- text-to-image
- lora
- diffusers
- template:diffusion-lora
- collage
- wallpaper
- vibrant
- multiple-panels
widget:
- text: >-
    A vibrant red spiderman is depicted in a collage of multiple panels. The
    background is a vibrant orange, adding a pop of color to the scene. The
    spidermans suit is a deep red, with a white stripe running down the center
    of his chest. His eyes are large, white, and his mouth is slightly open. His
    hands are positioned in a way that he is facing towards the right side of
    the frame. The entire frame is black, creating a striking contrast with the
    red and orange background.
  output:
    url: images/C1.webp
- text: >-
    A vibrant, black-suited Batman is depicted in a collage of multiple panels.
    The background is a deep, intense blue, adding a dramatic tone to the scene.
    Batmans suit is a rich black with a dark gray bat symbol on his chest. His
    eyes are narrow and white, and his mouth is set in a serious expression. His
    cape flows around him, and he is facing toward the right side of the frame,
    giving a sense of movement. The entire frame is outlined in dark gray,
    creating a bold contrast with the dark suit and blue background.
  output:
    url: images/C2.webp
- text: >-
    A vibrant, red-suited Deadpool is depicted in a collage of multiple panels.
    The background is a bold, electric yellow, adding an energetic vibe to the
    scene. Deadpool’s suit is a striking red with black patches around his eyes
    and across his torso, creating a dynamic look. His eyes are expressive and
    white, and his mouth is curved into a mischievous smirk. His hands are
    positioned confidently, holding one of his katanas, and he is facing toward
    the right side of the frame, exuding a playful, yet intense energy. The
    entire frame is outlined in black, creating a sharp contrast with the red
    suit and vibrant yellow background.
  output:
    url: images/L3.webp
base_model: black-forest-labs/FLUX.1-dev
instance_prompt: collage
license: creativeml-openrail-m
---
# Castor-Collage-Dim-Flux-LoRA

<Gallery />

**The model is still in the training phase. This is not the final version and may contain artifacts and perform poorly in some cases.**

## Model description 

**prithivMLmods/Castor-Collage-Dim-Flux-LoRA**

Training Parameters

| Parameter                 | Value  | Parameter                 | Value  |
|---------------------------|--------|---------------------------|--------|
| LR Scheduler              | constant | Noise Offset              | 0.03   |
| Optimizer                 | AdamW  | Multires Noise Discount   | 0.1    |
| Network Dim               | 64     | Multires Noise Iterations | 10     |
| Network Alpha             | 32     | Repeat & Steps           | 28 & 3.2k |
| Epoch                     | 12     | Save Every N Epochs       | 1      |

    Labeling: florence2-en (natural language & English)

    Total Images Used for Training: 27

## Setting Up
```python
import torch
from diffusers import DiffusionPipeline

# Load the FLUX.1-dev base model in bfloat16.
base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

# Apply the LoRA weights on top of the base model.
lora_repo = "prithivMLmods/Castor-Collage-Dim-Flux-LoRA"
trigger_word = "collage"  # Leave trigger_word blank if not used.
pipe.load_lora_weights(lora_repo)

# Move the pipeline to the GPU.
device = torch.device("cuda")
pipe.to(device)
```
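
Once the pipeline is loaded, generating an image is a single call. The helper below is a minimal sketch, not part of this repository: `generate_image` is a hypothetical name, it expects the `pipe` object from the setup code, and the step count and guidance scale are illustrative assumptions rather than values tuned for this LoRA.

```python
def generate_image(pipe, prompt, width=1024, height=1024,
                   num_inference_steps=28, guidance_scale=3.5, generator=None):
    """Run a loaded text-to-image pipeline once and return the first image.

    `pipe` is the pipeline object created in the setup code.
    The default step count and guidance scale are assumptions for illustration.
    """
    result = pipe(
        prompt=prompt,
        width=width,
        height=height,
        num_inference_steps=num_inference_steps,
        guidance_scale=guidance_scale,
        generator=generator,  # optional torch.Generator for reproducible output
    )
    return result.images[0]


# Example usage (requires the loaded pipeline and a GPU):
# image = generate_image(pipe, "A robot is depicted in a collage of multiple panels.")
# image.save("collage.png")
```
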
## Sample Image + Prompt

![Castor-Collage-Dim-Flux-LoRA](images/gta-saga.webp)

| **Prompt** |
|------------|
| A dynamic scene from GTA (Grand Theft Auto) is depicted in a collage of multiple panels. The background is a gritty, urban cityscape with neon lights casting vibrant shades of purple, blue, and orange, capturing the chaotic energy of the city at night. A character in casual streetwear, holding a bag of cash in one hand and a weapon in the other, stands confidently, slightly facing the right side of the frame. The character has a focused expression, exuding a sense of rebellion and risk. The entire frame is outlined in dark gray, creating a bold contrast with the vibrant city lights and shadowed streets, evoking the intense, action-packed vibe of GTA. |

## App File Structure

    /project-root/
    ├── .gitattributes
    ├── README.md
    ├── app.py
    └── pythonproject.py

## Best Dimensions

- 512 X 512
- 1024 X 1024
- 768 X 1024
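
If you want to choose among these sizes programmatically, something like the following may help. It is a sketch under two assumptions: the sizes above are read as width x height, and `pick_dimensions` is a hypothetical helper name, not part of this repository. It returns the listed size whose aspect ratio is closest to the one requested, preferring the larger size on a tie.

```python
# Recommended sizes from this model card, read as (width, height) by assumption.
BEST_DIMENSIONS = [(512, 512), (1024, 1024), (768, 1024)]


def pick_dimensions(aspect_ratio, max_side=1024):
    """Return the recommended (width, height) closest to `aspect_ratio`.

    Sizes whose longest side exceeds `max_side` are skipped. When two sizes
    match the aspect ratio equally well, the one with the larger area wins.
    """
    candidates = [(w, h) for w, h in BEST_DIMENSIONS if max(w, h) <= max_side]
    if not candidates:
        raise ValueError(f"no recommended size fits within max_side={max_side}")
    return min(
        candidates,
        key=lambda wh: (abs(wh[0] / wh[1] - aspect_ratio), -wh[0] * wh[1]),
    )


print(pick_dimensions(1.0))       # (1024, 1024)
print(pick_dimensions(0.75))      # (768, 1024)
print(pick_dimensions(1.0, 512))  # (512, 512)
```
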

## Trigger words 🧨

> [!WARNING]
> **Trigger word:** Include `collage` in your prompt to trigger the image generation.
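
To avoid forgetting the trigger word, a prompt can be checked before it is sent to the pipeline. The helper below is a minimal sketch: `with_trigger` is a hypothetical name, and prepending the word as a comma-separated prefix is an assumption. The sample prompts in this card instead place it inside a sentence ("in a collage of multiple panels"), which may work better.

```python
import re

TRIGGER_WORD = "collage"


def with_trigger(prompt, trigger_word=TRIGGER_WORD):
    """Return `prompt` with the trigger word present.

    If the trigger already appears as a whole word (case-insensitive), the
    prompt is returned unchanged; otherwise the trigger is prepended.
    """
    pattern = rf"\b{re.escape(trigger_word)}\b"
    if re.search(pattern, prompt, flags=re.IGNORECASE):
        return prompt
    return f"{trigger_word}, {prompt}"


print(with_trigger("a red robot on an orange background"))
# collage, a red robot on an orange background
print(with_trigger("A robot is depicted in a collage of multiple panels."))
# A robot is depicted in a collage of multiple panels.
```
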

## Download model

Weights for this model are available in Safetensors format.

[Download](/prithivMLmods/Castor-Collage-Dim-Flux-LoRA/tree/main) them in the Files & versions tab.