upload models

Files changed (7)

FlexWaifu1.3.1.safetensors +3 -0
LoRA/IR_1girl1boy_1.safetensors +3 -0
LoRA/IR_1girl1boy_2.safetensors +3 -0
LoRA/IR_1girl1boy_3.safetensors +3 -0
README.md +54 -10
images/grid-0050-1443377636.png +3 -0
images/grid-0051-3282638012.png +3 -0

FlexWaifu1.3.1.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7b8c719863591c5441e93a7d8f5263b81d1deaaa289713df953bd42d751be0a4
+size 4265145965

LoRA/IR_1girl1boy_1.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2bb59f1af5145f8f96a969966821afbd5a670c86dbbc592b4834528c27aee01e
+size 604095017

LoRA/IR_1girl1boy_2.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4a2bfd63a4a58fef4b42ffeac6e7a0a993b8f6b0ec2fdee82f3633a032a9cf9e
+size 604095017

LoRA/IR_1girl1boy_3.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:84fb199ed4a3c461b97c9e7194cddce0371a34c83566f23035c46420e876b725
+size 604095017
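
Each `.safetensors` entry above is a Git LFS pointer (spec v1): a `version` line, an `oid` holding the SHA-256 of the real file, and its `size` in bytes. As a minimal sketch, a downloaded file could be checked against its pointer roughly as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and stripping a leading `+` is an assumption made only so that lines copied from this diff parse cleanly.

```python
import hashlib
from pathlib import Path
from typing import Dict


def parse_lfs_pointer(text: str) -> Dict[str, str]:
    """Parse 'key value' lines of a Git LFS pointer into a dict."""
    fields: Dict[str, str] = {}
    for line in text.splitlines():
        # Assumption: tolerate the '+' prefix of lines copied from a diff.
        line = line.strip().lstrip("+")
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer: Dict[str, str]) -> bool:
    """Return True if the file at `path` has the size and SHA-256 in `pointer`."""
    algo, _, expected = pointer["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    # Cheap check first: the byte size must match before hashing.
    if path.stat().st_size != int(pointer["size"]):
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Hash in 1 MiB chunks so multi-gigabyte checkpoints fit in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected
```

For example, `matches_pointer(Path("FlexWaifu1.3.1.safetensors"), parse_lfs_pointer(pointer_text))` would compare a local download with the first pointer listed in this commit, where `pointer_text` is assumed to hold its three lines.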

README.md CHANGED

@@ -15,26 +15,47 @@ FlexWaifu is a fine-tuned model from Waifu Diffusion 1.3 for producing high reso
 ## Model Description
-This model was created by merging two original LoRAs of [testLoRAs](https://huggingface.co/Ai-tensa/testLoRAs) into WD1.3.
-| Model Name | Recipe                                     |
-| ---------- | ------------------------------------------ |
-| FlexWaifu  | WD1.3 + 2.0 * hires_test_a + smooth_test_a |
 It is just a merged model.
 While this model is likely to produce good generation at medium resolution, consider using LoRAs of [testLoRAs](https://huggingface.co/Ai-tensa/testLoRAs) if it does not produce well.
-## Flex Waifu Rainbow
 This model is further fine-tuned from FlexWaifu with ~17k nijijourneyv5 tagged images of various authors published on the Internet.
 It is merged from six dim 8 LoRAs made in various settings, and FWRLoRA is the merged LoRA (dim 48).
 Most LoRAs were fine-tuned with Aspect Ratio Bucketing with a maximum resolution of 1152x768 images, but some are up to 768x768 or 512x768.
 Image captions are made by BLIP and ~12k images also used WD1.4-tagger.
-| Model Name       | Recipe              |
-| ---------------- | ------------------- |
-| FlexWaifuRainbow | FlexWaifu + FWRLoRA |
 ### Usage
 The format of the caption suggests that a short natural language sentence followed by comma-separated tags is the most natural way to describe the prompt.
@@ -42,6 +63,8 @@ Using more tags that are well-estimated by the tagger in the trained images may
 "tags.json" lists the tags estimated for over 200 of the 12k images.
 Tag semantics may be inappropriate for automatic tagging, so please emphasize appropriately.
 ## License
 This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.
@@ -56,7 +79,28 @@ The CreativeML OpenRAIL License specifies:
 These Models build on the two excellent works: SD1.4, developed by [CompVis Researchers](https://ommer-lab.com/), and WD1.3, developed by [Anthony Mercurio](https://github.com/harubaru), [Salt](https://github.com/sALTaccount/), and [Cafe](https://twitter.com/cafeai_labs).
-## Examples (Flex Waifu Rainbow)
 **Prompt 1**

 ## Model Description
+| Model Name       | Recipe                                                                            |
+| ---------------- | --------------------------------------------------------------------------------- |
+| FlexWaifu        | WD1.3 + 2.0 * hires_test_a + smooth_test_a                                        |
+| FlexWaifu v1.3.1 | FlexWaifu + 20.0 * IR_1girl1boy_1 + 16.0 * IR_1girl1boy_2 + 16.0 * IR_1girl1boy_3 |
+| FlexWaifuRainbow | FlexWaifu + FWRLoRA                                                               |
+### FlexWaifu
+This model was created by merging two original LoRAs of [testLoRAs](https://huggingface.co/Ai-tensa/testLoRAs) into WD1.3.
 It is just a merged model.
 While this model is likely to produce good generation at medium resolution, consider using LoRAs of [testLoRAs](https://huggingface.co/Ai-tensa/testLoRAs) if it does not produce well.
+#### v1.3.1
+This model is fine-tuned on self-generated images made with the single-word prompt "1girl" or "1boy", and it generates well without much prompting.
+Twin LoRA has reduced the percentage of bad output without changing the style much.
+The images for the three Twin LoRAs were generated by FlexWaifu or a model merging Twin LoRA into it.
+Each Twin LoRA uses 3-5k images, with no duplicates.
+The reward value of ImageReward was used to classify the images.
+##### Twin LoRA
+When fine-tuning, we create two LoRAs, a good LoRA and a bad LoRA, and take the difference between them in order to suppress adverse effects and achieve the desired effect.
+**method**
+1. Create a set of images with the same prompt.
+2. Divide the image set into two equal parts, good and bad, according to certain evaluation criteria.
+3. Create a LoRA for each data set.
+4. Subtract the bad LoRA from the good LoRA. (Note: the resulting LoRA has twice the dimension of each original LoRA.)
+5. Apply the differential LoRA at any weight.
+### Flex Waifu Rainbow
 This model is further fine-tuned from FlexWaifu with ~17k nijijourneyv5 tagged images of various authors published on the Internet.
 It is merged from six dim 8 LoRAs made in various settings, and FWRLoRA is the merged LoRA (dim 48).
 Most LoRAs were fine-tuned with Aspect Ratio Bucketing with a maximum resolution of 1152x768 images, but some are up to 768x768 or 512x768.
 Image captions are made by BLIP and ~12k images also used WD1.4-tagger.
 ### Usage
 The format of the caption suggests that a short natural language sentence followed by comma-separated tags is the most natural way to describe the prompt.
 "tags.json" lists the tags estimated for over 200 of the 12k images.
 Tag semantics may be inappropriate for automatic tagging, so please emphasize appropriately.
+CLIP Skip 1 is recommended.
 ## License
 This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.
 These Models build on the two excellent works: SD1.4, developed by [CompVis Researchers](https://ommer-lab.com/), and WD1.3, developed by [Anthony Mercurio](https://github.com/harubaru), [Salt](https://github.com/sALTaccount/), and [Cafe](https://twitter.com/cafeai_labs).
+## Examples
+**CLIP Skip 1 is recommended.**
+### Flex Waifu v1.3.1
+**Prompt 1**
+- with Negative Prompt
+![](images/grid-0050-1443377636.png)
+- without Negative Prompt
+![](images/grid-0051-3282638012.png)
+```
+solo, 1girl, white_background, full_body, twintails, braid, white_background, bangs, frills, closed_mouth, brown_hair, jewelry, blush, standing, dress, food, strawberry, (blueberry:1.1), (cake:1.1), sweets, brown_eyes, hair_ornament, skirt, bow
+Steps: 30, Sampler: UniPC, CFG scale: 7, Seed: 1443377636, Size: 768x768, Model hash: 7b8c719863, Model: FlexWaifu_FlexWaifu1.3.1, Denoising strength: 0.6, Version: v1.2.1, Hires upscale: 1.5, Hires steps: 18, Hires upscaler: Latent
+```
+### Flex Waifu Rainbow
 **Prompt 1**
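
The Twin LoRA steps added to the README describe subtracting a bad LoRA from a good one and note that the result has twice the dimension. One way to see why: if each LoRA contributes a weight delta `up @ down` of rank `r`, the difference can be written exactly as a single LoRA of rank `2r` by concatenating the factors. The sketch below illustrates that identity with plain Python lists. It is not the author's merge script; the function names are hypothetical, and it assumes any `alpha / rank` scaling has already been folded into the matrices.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of a (m x k) and b (k x n)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def subtract_lora(
    up_good: Matrix, down_good: Matrix, up_bad: Matrix, down_bad: Matrix
) -> Tuple[Matrix, Matrix]:
    """Build (up, down) such that up @ down == good delta - bad delta.

    up is [up_good | -up_bad] (columns concatenated, out x 2r) and
    down is [down_good ; down_bad] (rows stacked, 2r x in).
    """
    up = [g_row + [-v for v in b_row] for g_row, b_row in zip(up_good, up_bad)]
    down = down_good + down_bad
    return up, down


def apply_lora(weight: Matrix, up: Matrix, down: Matrix, scale: float) -> Matrix:
    """Merge a LoRA into a base weight: weight + scale * (up @ down)."""
    delta = matmul(up, down)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


if __name__ == "__main__":
    # Toy rank-1 LoRAs on a 2x2 weight; values chosen only for illustration.
    up, down = subtract_lora([[1.0], [2.0]], [[3.0, 4.0]], [[1.0], [0.0]], [[1.0, 1.0]])
    print(len(down))  # rank of the differential LoRA
    print(apply_lora([[0.0, 0.0], [0.0, 0.0]], up, down, scale=2.0))
```

Here `scale` plays the role of the merge weights in the recipe table (for example the `20.0` and `16.0` multipliers), under the same assumptions.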

images/grid-0050-1443377636.png ADDED

Git LFS Details

SHA256: 381536cb97bdf1ba8a16fd09477bb1cedf83671ebd749ab6f71fc3da5f67ee62
Pointer size: 133 Bytes
Size of remote file: 34.8 MB

images/grid-0051-3282638012.png ADDED

Git LFS Details

SHA256: afa51a31ffcd87bdaf55cf30306c38e283f55422340804203ba8b5dea9dc39f5
Pointer size: 133 Bytes
Size of remote file: 33.5 MB