nielsr committed
Commit 4c87ea8
1 Parent(s): 2d2b34c

Add metadata tags, link to paper


This PR ensures the model can be viewed at https://huggingface.co/papers/2410.04932
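
For reference, the metadata block this commit adds at the top of README.md is the YAML front matter below, copied from the diff; `pipeline_tag: image-to-image` declares the model's task so the Hub can index it, and `license: mit` moves the license into metadata in place of the plain-text note that previously sat at the bottom of the README.

```yaml
---
pipeline_tag: image-to-image  # task tag used for Hub indexing
license: mit                  # replaces the license note formerly at the end of the README
---
```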

Files changed (1)
  1. README.md +7 -25
README.md CHANGED
@@ -1,3 +1,8 @@
+---
+pipeline_tag: image-to-image
+license: mit
+---
+
 # OmniBooth

 > OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction <br>
@@ -5,7 +10,7 @@

 OmniBooth is a project focused on synthesizing image data following multi-modal instruction. Users can use text or image to control instance generation. This repository provides tools and scripts to process, train, and generate synthetic image data using COCO dataset, or self-designed data.

-#### [Project Page](https://len-li.github.io/omnibooth-web) | [Paper](https://arxiv.org/) | [Video](https://len-li.github.io/omnibooth-web/videos/teaser-user-draw.mp4) | [Checkpoint](https://huggingface.co/lilelife/Omnibooth)
+#### [Project Page](https://len-li.github.io/omnibooth-web) | [Paper](https://huggingface.co/papers/2410.04932) | [Video](https://len-li.github.io/omnibooth-web/videos/teaser-user-draw.mp4) | [Checkpoint](https://huggingface.co/lilelife/Omnibooth)

 code: https://github.com/Len-Li/OmniBooth

@@ -18,12 +23,6 @@ code: https://github.com/Len-Li/OmniBooth
 - [Inference](#inference)
 - [Behavior analysis](#behavior-analysis)
 - [Data sturture](#instance-data-structure)
-
-
-
-
-
-

 ## Installation

@@ -45,9 +44,6 @@ To get started with OmniBooth, follow these steps:
   pip install git+https://github.com/cocodataset/panopticapi.git
   ```

-
-
-
 ## Prepare Dataset

 You can skip this step if you just want to run a demo generation. I've prepared demo mask in `data/instance_dataset` for generation. Please see [Inference](#inference).
@@ -59,7 +55,6 @@ To train OmniBooth, follow the steps below:
   We use COCONut-S split.
   Please download the COCONut-S file and relabeled-COCO-val from [here](https://github.com/bytedance/coconut_cvpr2024?tab=readme-ov-file#dataset-splits) and put it in `data/coconut_dataset` folder. I recommend to use [Kaggle](https://www.kaggle.com/datasets/xueqingdeng/coconut) link.

-
 2. **Download the COCO dataset:**
   ```
   cd data/coconut_dataset
@@ -73,9 +68,6 @@ To train OmniBooth, follow the steps below:
   unzip annotations_trainval2017.zip
   ```

-
-
-
 After preparation, you will be able to see the following directory structure:

 ```
@@ -183,7 +175,6 @@ The mask file is a binary mask that indicate the instance location. The image fi
 }
 ```

-
 ## Acknowledgment
 Additionally, we express our gratitude to the authors of the following opensource projects:

@@ -192,7 +183,6 @@ Additionally, we express our gratitude to the authors of the following opensourc
 - [SyntheOcc](https://len-li.github.io/syntheocc-web/) (Network structure)


-
 ## BibTeX

 ```bibtex
@@ -204,12 +194,4 @@ Additionally, we express our gratitude to the authors of the following opensourc
 }
 ```

-This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
-
-
-
-
-
----
-license: mit
----
+This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.