FredZhang7
/

anime-anything-promptgen-v2

Text Generation

stable-diffusion

arxiv:2210.14140

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

FredZhang7 commited on Feb 10, 2023

Commit

3da8a05

•

1 Parent(s): 26f1219

Update README.md

Files changed (1) hide show

README.md +5 -3

README.md CHANGED Viewed

@@ -21,10 +21,12 @@ datasets:
 ## Fast Anime PromptGen
-Trained on the 80K Safebooru prompts
 Todo:
-- complete data preprocessing and training description
 - upload Danbooru model
 ## Text-to-image Examples
@@ -48,7 +50,7 @@ tokenizer = GPT2Tokenizer.from_pretrained('distilgpt2')
 tokenizer.add_special_tokens({'pad_token': '[PAD]'})
 model = GPT2LMHeadModel.from_pretrained('FredZhang7/anime-anything-promptgen')
-prompt = r'1girl, genshin impact'
 # generate text using fine-tuned model
 nlp = pipeline('text-generation', model=model, tokenizer=tokenizer)

 ## Fast Anime PromptGen
+`pytorch_model` is trained on 80K anime tags, all with `up_score` greater than 8 and without "greyscale","girls","boys", and "others", fetched from the [Safebooru API](https://safebooru.donmai.us/posts/random.json).
+I didn't release the V1 model because it only generated gibberish prompts. After trying all means to correct that behavior, I eventually figured that the cause of the gibberish prompts is not from the model or training duration, but rather from the random usernames present in the training data.
+Here's the complete [prompt preprocessing](./preprocess.py).
 Todo:
 - upload Danbooru model
 ## Text-to-image Examples
 tokenizer.add_special_tokens({'pad_token': '[PAD]'})
 model = GPT2LMHeadModel.from_pretrained('FredZhang7/anime-anything-promptgen')
+prompt = r'1girl, genshin'
 # generate text using fine-tuned model
 nlp = pipeline('text-generation', model=model, tokenizer=tokenizer)