File size: 10,794 Bytes

a40cc0f
 
 
 
 
 
 
 
 
 
 
4ae73b4
a40cc0f
 
 
4ae73b4
a40cc0f
 
 
4ae73b4
a40cc0f
 
 
4ae73b4
a40cc0f
 
 
4ae73b4
a40cc0f
 
 
4ae73b4
a40cc0f
 
 
4ae73b4
a40cc0f
 
 
4ae73b4
a40cc0f
 
 
4ae73b4
a40cc0f
 
 
4ae73b4
a40cc0f

---

tags:
- text-to-image
- flux
- lora
- diffusers
- template:sd-lora
widget:
- text: 'A photorealistic business headshot of [token], a caucasian man in his 40s

        exuding confidence in a modern office setting.'
  output:
    url: samples/1739040705910__000004000_0.jpg
- text: 'A professional portrait of [token], a caucasian man in his 40s and full head

        of hair, dressed sharply in a navy suit, captured with soft natural light.'
  output:
    url: samples/1739040893254__000004000_1.jpg
- text: 'A high-quality headshot of [token], a caucasian man in his 40s and full head

        of hair, taken with an 85mm lens, emphasizing realism and authority.'
  output:
    url: samples/1739041020061__000004000_2.jpg
- text: 'A corporate headshot of [token] , a caucasian man in his 40s, standing before

        a sleek office backdrop with a confident expression.'
  output:
    url: samples/1739041198924__000004000_3.jpg
- text: 'An executive portrait of [token], a caucasian man in his 40s and full head

        of hair, softly lit, showcasing subtle facial details and approachability.'
  output:
    url: samples/1739041328899__000004000_4.jpg
- text: 'A detailed business headshot of [token], a caucasian man in his 40s and full

        head of hair, framed with a blurred modern office environment.'
  output:
    url: samples/1739041546886__000004000_5.jpg
- text: 'A professional close-up of [token], a caucasian man in his 40s and full head

        of hair, using a shallow depth of field to highlight facial authenticity.'
  output:
    url: samples/1739041677344__000004000_6.jpg
- text: 'A crisp and polished portrait of [token], a caucasian man in his 40s and full

        head of hair, captured in a well-lit professional setting.'
  output:
    url: samples/1739041859611__000004000_7.jpg
- text: 'A corporate-style image of [token], a caucasian man in his 40s and full head

        of hair, with a slight smile, reinforcing trust and professionalism'
  output:
    url: samples/1739041990114__000004000_8.jpg
- text: 'A photorealistic shot of [token], a caucasian man in his 40s and full head

        of hair, taken with a Nikon D850, emphasizing clarity and natural texture.'
  output:
    url: samples/1739042177504__000004000_9.jpg
base_model: black-forest-labs/FLUX.1-dev
instance_prompt: steve
license: other
license_name: flux-1-dev-non-commercial-license
license_link: https://huggingface.co./black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md
---


# steve_lora_flux_1_dev_v1.2



A LoRA-based Stable Diffusion model trained to generate images of a man named “Steve” in a wide variety of scenarios. This model is fine-tuned from **[black-forest-labs/FLUX.1-dev](https://huggingface.co./black-forest-labs/FLUX.1-dev)** using a Flow Matching–based noise scheduler.



<Gallery />



## Trigger Words



Use `steve` in your prompt to activate the specific style and character details for this LoRA.



## Model Information



- **LoRA Rank / Alpha**: 16 / 16  

- **Number of Steps**: 4000  

- **Batch Size**: 1  

- **Learning Rate**: 0.0001  

- **Noise Scheduler**: `flowmatch`  

- **Optimizer**: `adamw8bit`  

- **Precision**: `bf16`  

- **Gradient Checkpointing**: true  

- **EMA**: true (decay = 0.99)  

- **Quantization**: enabled  



## How to Use



This LoRA can be merged or applied to the **FLUX.1-dev** base model through [Diffusers](https://github.com/huggingface/diffusers) or a compatible UI/tool.



Example pseudocode:



```python

from diffusers import StableDiffusionPipeline

import torch



base_model = "black-forest-labs/FLUX.1-dev"
lora_model = "YOUR_USERNAME/steve_lora_flux_1_dev_v1.2"



pipe = StableDiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.float16).to("cuda")
# Load your LoRA weights (implementation depends on the UI or method)
# pipe.load_lora_weights(lora_model)  # Example call



prompt = "steve, man lounging in fitted athletic wear on crisp white linens, strong and confident"

image = pipe(prompt).images[0]

image.save("steve_example.jpg")
```



## Download Model

Weights for this LoRA are available in Safetensors format.

Download them from the Files & versions tab.



## License

This model is provided under a flux-1-dev-non-commercial-license. Please review the license file for details on acceptable use.



## Acknowledgements

Trained with AI Toolkit by Ostris

Based on the FLUX.1-dev base model



## Disclaimer:

Use responsibly. This model is intended for artistic, non-commercial purposes. The creators are not responsible for any misuse, generation of disallowed content, or potential harm caused by outputs. Always review and curate model outputs before sharing.





# steveant/steve-lora-v1.2



This is a [LoRA](https://arxiv.org/abs/2106.09685)-based Stable Diffusion model fine-tuned on a custom image dataset to generate images featuring a man named “Steve” in various settings and scenarios. It has been trained using the [FLUX.1-dev](https://huggingface.co./black-forest-labs/FLUX.1-dev) base model, leveraging a Flow Matching–based noise scheduler and LoRA network adapters.



> **Note:** This model is in version `v1.2` and is currently considered experimental.



---



## Model Details



- **Model type**: LoRA adapter for Stable Diffusion (`sd_trainer`)

- **Trigger word**: `steve`

- **Base model**: [black-forest-labs/FLUX.1-dev](https://huggingface.co./black-forest-labs/FLUX.1-dev)

- **Network**: LoRA (rank: 16, alpha: 16)

- **Quantization**: Enabled

- **Datasets**: Private dataset containing images and associated textual captions.



### Model Architecture and Training



This LoRA was trained using the following key parameters:



- **Training steps**: `4000`

- **Batch size**: `1`  

- **Gradient accumulation steps**: `1`  

- **Learning rate**: `0.0001`  

- **Noise Scheduler**: `flowmatch`  

- **Optimizer**: `adamw8bit`  

- **Precision**: `bf16`  

- **LoRA settings**:

  - Linear rank: `16`

  - Linear alpha: `16`

- **Sampling configuration** (for sample images):

  - Sampler: `flowmatch`

  - Resolution: `1200 x 1600`

  - Guidance scale: `4.1`

  - Sample steps: `29`



During training, image captions were drawn from `.txt` files. Some techniques applied include:

- **Caption dropout**: `0.00`

- **Token shuffling**: `true`

- **Gradient checkpointing**: `true`

- **Exponential moving average**: `use_ema = true` with `ema_decay = 0.99`



---



## Intended Use



This model is intended to generate images of a “Steve” character in various poses, outfits, and scenarios. Possible use cases include:



- Creative media and content generation  

- Character concepting for artistic projects  

- Test and experimentation with Flow Matching–based schedulers in Stable Diffusion  



> **Important**: This model is not intended to generate explicit or harmful content. Users are advised to comply with local regulations and handle outputs responsibly.



---



## How to Use



1. **Installation**  

   Make sure you have the [Diffusers library](https://github.com/huggingface/diffusers) or another Stable Diffusion–compatible framework installed.



2. **Loading the Model**  

   ```python

   from diffusers import StableDiffusionPipeline

   import torch



   # Example: Pseudocode for loading the base model + LoRA

   base_model_id = "black-forest-labs/FLUX.1-dev"  

   lora_model_id = "steveant/steve-lora-v1.2"  # hypothetical path on HF hub



   pipeline = StableDiffusionPipeline.from_pretrained(base_model_id, torch_dtype=torch.float16).to("cuda")

   # Load LoRA weights

   # Typically, you would merge or apply the LoRA as per your chosen library's method.

   ```

3. **Prompting**  
   Use the **trigger word** `steve` in your prompt to invoke the specific style or character details. For instance:
   ```python

   prompt = (

       "steve, man lounging in fitted athletic wear on crisp white linens, "

       "strong and confident expression, warm ambient lighting, full-body shot, "

       "textured fabric details"

   )

   result = pipeline(prompt).images[0]

   result.save("steve_lounging.png")

   ```

4. **Negative Prompting (Optional)**  
   Provide a `neg` (negative) prompt parameter to omit or reduce undesired elements.  
   ```python

   neg_prompt = "low resolution, bad quality"

   result = pipeline(prompt=prompt, negative_prompt=neg_prompt).images[0]

   ```

---

## Sample Prompts

Below are some sample prompts used during training:

- `A photorealistic business headshot of steve, a Caucasian man in his 40s exuding confidence in a modern office setting.`
- `A professional portrait of steve, a Caucasian man in his 40s and full head of hair, dressed sharply in a navy suit, captured with soft natural light.`
- `A high-quality headshot of steve, a Caucasian man in his 40s and full head of hair, taken with an 85mm lens, emphasizing realism and authority.`
- `A corporate headshot of steve, a Caucasian man in his 40s, standing before a sleek office backdrop with a confident expression.`
---

## Limitations and Biases

- The model’s outputs depend on the style and content of the dataset.  
- Since the training data is limited to “Steve” images in specific scenarios, the model may not generalize well to drastically different contexts.  
- **Bias**: Any biases in the original dataset might be reflected in the generated images.

---

## Training Data

- **Private dataset** of images featuring “Steve,” labeled with text captions.  
- **Resolution** used for latent caching: `720`, `960`, and `1440`.  
- **Data augmentation**: Slight caption dropout, token shuffling, etc.

---

## Citation

If you use this model or find it helpful for your research/projects, please cite:

```

@misc{steve_lora_flux_1_dev_v1.2,

  author = {steveant},

  title = {steve_lora_flux_1_dev_v1.2 (LoRA model)},

  year = {2024},

  howpublished = {\url{https://huggingface.co./steveant/steve-lora-v1.2}},

}

```

---

## License

This model and code are available under **CreativeML Open RAIL-M** or your chosen license. Please refer to the [repository’s license](./LICENSE) or contact the author for more details.

---

## Contributing

Contributions are welcome! If you wish to improve this model card or have new use cases and improvements to propose:

1. Open an issue on the [GitHub/Spaces project](#) (if available).  
2. Submit pull requests or suggestions.  
3. Respect the usage and license guidelines.

---

**Disclaimer**:  
This model is for research and educational purposes. Always validate and review images generated to ensure they align with your intended use and do not violate any regulations or ethical standards.