Uploaded model
- Developed by: SousiOmine
- Finetuned from model : weblab-GENIAC/Tanuki-8B-dpo-v1.0
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 14
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
Model tree for SousiOmine/minoshiro-v0.3
Base model
weblab-GENIAC/Tanuki-8B-dpo-v1.0