Commit 927d798 (1 parent: 9fdbbd4): Update README.md

README.md (changed), hunk @@ -15,14 +15,12 @@ stable-diffusion-webui\embeddings and add "boring_e621_v4" to your negative prom
## Model Description

The motivation for boring_e621 is that negative embeddings like [Bad Prompt](https://huggingface.co/datasets/Nerfgun3/bad_prompt),
whose training is described [here](https://www.reddit.com/r/StableDiffusion/comments/yy2i5a/i_created_a_negative_embedding_textual_inversion/),
depend on manually curated lists of tags describing features people do not want their images to have, such as "deformed hands". Some problems with this approach are:

* Manually compiled lists will inevitably be incomplete.
* Models might not always understand the tags well due to a dearth of training images labeled with them.
* The approach can only capture named concepts. If there exist unnamed yet visually unappealing concepts that make an image look wrong, but for reasons that cannot be succinctly explained, a list of tags will not capture them.

To address these problems, boring_e621 employs textual inversion on a set of images automatically extracted from the art site
e621.net, a rich resource of millions of artworks, each of which is both hand-labeled topically and rated
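The textual-inversion idea mentioned above can be sketched in miniature: a single new token embedding is optimized by gradient descent while everything around it stays frozen. The toy below is a plain-Python stand-in, not the actual training code; a fixed random linear map plays the role of the frozen diffusion model, and a random target vector stands in for the denoising signal that the real training derives from the curated images.

```python
# Toy sketch of textual inversion: optimize one embedding vector against a
# frozen model. All names here (frozen_model, target, DIM) are illustrative.
import random

random.seed(0)
DIM = 4

# Frozen "model": a fixed random linear map standing in for the frozen
# diffusion model's conditioning pathway.
A = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(DIM)]

def frozen_model(e):
    return [sum(A[i][j] * e[j] for j in range(DIM)) for i in range(DIM)]

# Target output the new token should reproduce; in real textual inversion
# this signal comes from the denoising loss on the training images.
target = [random.gauss(0, 1) for _ in range(DIM)]

def mse(y, t):
    return sum((yi - ti) ** 2 for yi, ti in zip(y, t)) / len(y)

# The only trainable parameter: the new token's embedding vector.
e = [0.0] * DIM
initial_loss = mse(frozen_model(e), target)

lr = 0.05
for _ in range(500):
    residual = [yi - ti for yi, ti in zip(frozen_model(e), target)]
    # Analytic gradient of the MSE w.r.t. the embedding: (2/DIM) * A^T residual
    grad = [2.0 / DIM * sum(A[i][j] * residual[i] for i in range(DIM))
            for j in range(DIM)]
    e = [ej - lr * gj for ej, gj in zip(e, grad)]

final_loss = mse(frozen_model(e), target)
print(f"loss: {initial_loss:.4f} -> {final_loss:.4f}")
```

The learned vector is what gets saved as the embedding file; dropping its token into the negative prompt then steers generation away from what it encodes.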
@@ -45,7 +43,7 @@ To qualitatively evaluate how well boring_e621 has learned to improve image qual

![boring_e621 and boring_e621_v4 Performance on Simple Prompts](tmpoqs1d_vv.png)

As we can see, putting these embeddings in the negative prompt yields a more delicious burger, a more vibrant and detailed landscape, a prettier pharaoh, and a more 3-D-looking aquarium.
<br>

## Other Models