metadata
license: openrail++
base_model: stabilityai/stable-diffusion-xl-base-1.0
language:
- en
tags:
- stable-diffusion
- stable-diffusion-xl
- tensorrt
- text-to-image
Stable Diffusion XL 1.0 TensorRT
Introduction
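This repository hosts the TensorRT-optimized versions of Stable Diffusion XL 1.0, created in collaboration with NVIDIA. In the benchmarks below, the optimized versions reduce 30-step latency at 1024x1024 by roughly 13% to 41% and raise image throughput by roughly 20% to 70%, depending on the accelerator.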
Performance Comparison
Latency for 30 steps at 1024x1024 (lower is better)
Accelerator | Baseline (non-optimized) | NVIDIA TensorRT (optimized) | Percentage improvement |
---|---|---|---|
A10 | 9399 ms | 8160 ms | ~13% |
A100 | 3704 ms | 2742 ms | ~26% |
H100 | 2496 ms | 1471 ms | ~41% |
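
The percentage column above can be reproduced from the two latency columns. The following is a minimal sketch of that arithmetic, not the benchmark script itself; the function names `latency_reduction_pct` and `speedup` are illustrative and do not come from this repository. It assumes "Percentage improvement" means the share of baseline latency that was removed.

```python
# Latencies in milliseconds, copied from the table above (30 steps, 1024x1024).
LATENCY_MS = {
    "A10": (9399, 8160),
    "A100": (3704, 2742),
    "H100": (2496, 1471),
}


def latency_reduction_pct(baseline_ms: float, optimized_ms: float) -> float:
    """Share of the baseline latency removed by the optimized run, in percent."""
    return (baseline_ms - optimized_ms) / baseline_ms * 100.0


def speedup(baseline_ms: float, optimized_ms: float) -> float:
    """How many times faster the optimized run is than the baseline."""
    return baseline_ms / optimized_ms


for gpu, (base, opt) in LATENCY_MS.items():
    print(
        f"{gpu}: {latency_reduction_pct(base, opt):.1f}% lower latency, "
        f"{speedup(base, opt):.2f}x speedup"
    )
```

Under that assumption, the rounded results match the ~13%, ~26%, and ~41% figures in the table. The speedup factor is a second way to read the same numbers: a 41% latency reduction corresponds to roughly a 1.7x speedup, not 1.41x.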
Image throughput for 30 steps (higher is better)
Accelerator | Baseline (non-optimized) | NVIDIA TensorRT (optimized) | Percentage improvement |
---|---|---|---|
A10 | 0.10 images/sec | 0.12 images/sec | ~20% |
A100 | 0.27 images/sec | 0.36 images/sec | ~33% |
H100 | 0.40 images/sec | 0.68 images/sec | ~70% |
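
The throughput gains above follow from the two throughput columns in the same way. The sketch below recomputes them and also shows how throughput relates to per-image latency. The helper names are illustrative, and `images_per_sec` assumes one image per run (batch size 1), which this page does not state.

```python
# Throughput in images/sec, copied from the table above (30 steps).
THROUGHPUT = {
    "A10": (0.10, 0.12),
    "A100": (0.27, 0.36),
    "H100": (0.40, 0.68),
}


def throughput_gain_pct(baseline: float, optimized: float) -> float:
    """Relative increase in images/sec over the baseline, in percent."""
    return (optimized - baseline) / baseline * 100.0


def images_per_sec(latency_ms: float) -> float:
    """Throughput implied by a per-image latency.

    Assumption: one image is generated per run, so throughput is simply
    the reciprocal of latency.
    """
    return 1000.0 / latency_ms


for gpu, (base, opt) in THROUGHPUT.items():
    print(f"{gpu}: {throughput_gain_pct(base, opt):.0f}% more images/sec")
```

Because the published throughput values carry only two decimals, gains computed from them are coarse. For the A10, for example, the latency figures (9399 ms and 8160 ms) imply a gain of roughly 15% under the one-image-per-run assumption, while the rounded throughput values give ~20%.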
Model Description
- Developed by: Stability AI
- Model type: Diffusion-based text-to-image generative model
- License: CreativeML Open RAIL++-M License
- Model Description: This is a conversion of the SDXL base 1.0 and SDXL refiner 1.0 models for NVIDIA TensorRT-optimized inference.