JeffrinJohn commited on
Commit
afdd083
β€’
1 Parent(s): 2c7410d

Update README. md

Browse files
Files changed (1) hide show
  1. README.md +88 -9
README.md CHANGED
@@ -1,13 +1,92 @@
1
  ---
2
- title: PhotoMaker
3
- emoji: πŸ“·
4
- colorFrom: pink
5
- colorTo: blue
 
 
 
 
 
6
  sdk: gradio
7
- sdk_version: 4.37.2
8
- app_file: app.py
9
- pinned: false
10
- license: apache-2.0
11
  ---
12
 
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
+ library_name: diffusers
5
+ pipeline_tag: image-to-3d
6
+ datasets:
7
+ - HuggingFaceTB/cosmopedia
8
+ metrics:
9
+ - character
10
+ title: LMJ
11
  sdk: gradio
12
+ emoji: πŸš€
13
+ colorTo: yellow
 
 
14
  ---
15
 
16
+ # PhotoMaker Model Card
17
+
18
+ <div align="center">
19
+
20
+ [**Project Page**](https://photo-maker.github.io/) **|** [**Paper (ArXiv)**](https://arxiv.org/abs/2312.04461) **|** [**Code**](https://github.com/TencentARC/PhotoMaker)
21
+
22
+ [πŸ€— **Gradio demo (Realistic)**](https://huggingface.co/spaces/TencentARC/PhotoMaker) **|** [πŸ€— **Gradio demo (Stylization)**](https://huggingface.co/spaces/TencentARC/PhotoMaker-Style)
23
+
24
+ </div>
25
+
26
+ ## Introduction
27
+
28
+ <!-- Provide a quick summary of what the model is/does. -->
29
+ Users can input one or a few face photos, along with a text prompt, to receive a customized photo or painting within seconds (no training required!). Additionally, this model can be adapted to any base model based on SDXL or used in conjunction with other LoRA modules.
30
+
31
+ ### Realistic results
32
+
33
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6285a9133ab6642179158944/BYBZNyfmN4jBKBxxt4uxz.jpeg)
34
+
35
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6285a9133ab6642179158944/9KYqoDxfbNVLzVKZzSzwo.jpeg)
36
+
37
+ ### Stylization results
38
+
39
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6285a9133ab6642179158944/du884lcjpqqjnJIxpATM2.jpeg)
40
+
41
+
42
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6285a9133ab6642179158944/-AC7Hr5YL4yW1zXGe_Izl.jpeg)
43
+
44
+ More results can be found in our [project page](https://photo-maker.github.io/)
45
+
46
+ ## Model Details
47
+
48
+ It mainly contains two parts corresponding to two keys in loaded state dict:
49
+
50
+ 1. `id_encoder` includes finetuned OpenCLIP-ViT-H-14 and a few fuse layers.
51
+
52
+ 2. `lora_weights` applies to all attention layers in the UNet, and the rank is set to 64.
53
+
54
+
55
+ ## Usage
56
+
57
+ You can directly download the model in this repository.
58
+ You also can download the model in python script:
59
+
60
+ ```python
61
+ from huggingface_hub import hf_hub_download
62
+ photomaker_ckpt = hf_hub_download(repo_id="TencentARC/PhotoMaker", filename="photomaker-v1.bin", repo_type="model")
63
+ ```
64
+
65
+ Then, please follow the instructions in our [GitHub repository](https://github.com/TencentARC/PhotoMaker).
66
+
67
+
68
+ ## Limitations
69
+
70
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
71
+
72
+ - The model's customization performance degrades on Asian male faces.
73
+ - The model still struggles with accurately rendering human hands.
74
+
75
+ ## Bias
76
+
77
+ While the capabilities of image generation models are impressive, they can also reinforce or exacerbate social biases.
78
+
79
+ ## Citation
80
+
81
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
82
+
83
+ **BibTeX:**
84
+
85
+ ```bibtex
86
+ @article{li2023photomaker,
87
+ title={PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding},
88
+ author={Li, Zhen and Cao, Mingdeng and Wang, Xintao and Qi, Zhongang and Cheng, Ming-Ming and Shan, Ying},
89
+ booktitle={arXiv preprint arxiv:2312.04461},
90
+ year={2023}
91
+ }
92
+ ```