minoshiro-v0.3-7B / README.md

Trained with Unsloth

06a23c0 verified about 2 months ago

236 Bytes

metadata

language:
  - ja
base_model:
  - weblab-GENIAC/Tanuki-8B-dpo-v1.0
pipeline_tag: text-generation
tags:
  - unsloth
  - trl
  - sft

weblab-GENIAC/Tanuki-8B-dpo-v1.0をファインチューニングして作成した長考モデルです。