YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co./docs/hub/model-cards#model-card-metadata)
Quantization made by Richard Erkhov.
Qwen2.5-3B-DPO-TinyStories - AWQ
- Model creator: https://huggingface.co./tanquangduong/
- Original model: https://huggingface.co./tanquangduong/Qwen2.5-3B-DPO-TinyStories/
Original model description:
base_model: unsloth/Qwen2.5-3B language: - en license: apache-2.0 tags: - text-generation-inference - transformers - unsloth - qwen2 - trl - dpo
Uploaded model
- Developed by: tanquangduong
- License: apache-2.0
- Finetuned from model : unsloth/Qwen2.5-3B
This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 6