RichardErkhov/tanquangduong_-_Qwen2.5-3B-DPO-TinyStories-awq

4-bit precision

Quantization made by Richard Erkhov.

Qwen2.5-3B-DPO-TinyStories - AWQ

Model creator: https://huggingface.co./tanquangduong/
Original model: https://huggingface.co./tanquangduong/Qwen2.5-3B-DPO-TinyStories/

Original model description:

base_model: unsloth/Qwen2.5-3B
language:
- en
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- qwen2
- trl
- dpo

Uploaded model

Developed by: tanquangduong
License: apache-2.0
Finetuned from model: unsloth/Qwen2.5-3B

This qwen2 model was trained 2x faster with Unsloth and Hugging Face's TRL library.
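
The card does not include usage instructions, so the following is only a minimal sketch of how an AWQ checkpoint like this one is typically loaded with the transformers library. It assumes the repository ships tokenizer files and an AWQ quantization config, that `transformers`, `accelerate`, and an AWQ backend such as `autoawq` are installed, and that a CUDA GPU is available. The helper name `generate_story` and the default `max_new_tokens` value are illustrative and do not come from the card.

```python
# Repository id as shown at the top of this page.
REPO_ID = "RichardErkhov/tanquangduong_-_Qwen2.5-3B-DPO-TinyStories-awq"


def generate_story(prompt: str, max_new_tokens: int = 128) -> str:
    """Load the quantized checkpoint and continue `prompt` (hypothetical helper)."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the repo contains tokenizer files next to the weights.
    tokenizer = AutoTokenizer.from_pretrained(REPO_ID)
    # Assumption: the AWQ settings are read from the repo's config;
    # device_map="auto" requires the accelerate package.
    model = AutoModelForCausalLM.from_pretrained(REPO_ID, device_map="auto")

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_story("Once upon a time")` would download the weights on first use and return the prompt followed by the model's continuation, provided the assumptions above hold.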

Safetensors model size: 683M params
Tensor type: I32 · FP16