PAIR
/

PAIR-diffusion-sdv15-coco-finetune

segmentation-to-image

stable-diffusion

Model card Files Files and versions Community

PAIR-diffusion-sdv15-coco-finetune / README.md

vidit98's picture

Update README.md

4ac1cdc almost 2 years ago

|

2.35 kB

	---
	license: mit
	tags:
	- image-to-image
	- text-to-image
	- segmentation-to-image
	- controlnet
	- stable-diffusion
	language:
	- en
	---

	# PAIR-Diffusion Model Card - Stable Diffusion 1.5 finetuned with COCO-Stuff
	[PAIR Diffusion](https://arxiv.org/abs/2303.17546) models an image as composition of multiple objects and aim to control structural and appearance properties of these objects. It allows reference image-guided appearance manipulation and structure editing of an image at an object level. Describing object appearances using text can be challenging and ambiguous, PAIR Diffusion enables a user to control the appearance of an object using images. Having fine-grained control over appearance and structure at object level can be beneficial for future works in video and 3D beside image editing,
	where we need to have consistent appearance across time in case of video or across various viewing positions in case of 3D.

	For more information please refer to [GitHub](https://github.com/Picsart-AI-Research/PAIR-Diffusion/), [arXiv](https://arxiv.org/abs/2303.17546)

	![Main Diagram](Main_Diag.png)


	This model card applies the method to Stable Diffusion 1.5 (SDv1.5). We used COCO-Stuff dataset to finetune SDv1.5 using ControlNet due to its efficiency. The model can be tested using the publicly available demo here [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co./spaces/PAIR/PAIR-Diffusion)



	## Model Details

	- Developed by: Vidit Goel, Elia Peruzzo, Yifan Jiang, Dejia Xu, Nicu Sebe, Trevor, Darrell, Atlas Wang, Humphrey Shi
	- Model type: Object Level Editing using Diffusion Models
	- Language(s): English
	- License: MIT
	- Resources for more information: [GitHub Repository](https://github.com/Picsart-AI-Research/PAIR-Diffusion/), [Paper](https://arxiv.org/abs/2303.17546).
	- Cite as:
	```
	@article{goel2023pair,
	title={PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models},
	author={Goel, Vidit and Peruzzo, Elia and Jiang, Yifan and Xu, Dejia and Sebe, Nicu and Darrell, Trevor and
	Wang, Zhangyang and Shi, Humphrey},
	journal={arXiv preprint arXiv:2303.17546},
	year={2023}
	}
	```