Jokers
Collection
Models for jokes generation
•
4 items
•
Updated
This model is a fine-tuned version of Dmitriy007/rugpt2_gen_news on the baneks dataset for 1 epoch. It achieved 2.0760
loss during training.
Model evaluation has not been performed.
The model is a fine-tuned variant of the Dmitriy007/rugpt2_gen_news architecture with causal language modeling head.
The model should be used for studying abilities of natural language models to generate jokes.
The model is trained on a list of anecdotes pulled from a few vk communities (see baneks dataset for more details).
The following hyperparameters were used during training:
Train Loss | Epoch |
---|---|
2.0760 | 10 |
Base model
Dmitriy007/rugpt2_gen_news