Finetune of amd135m using Rchatml format form reasoning-base-20k dataset from KingNish. Trying to see if i can get this small model to reason. Improvements, suggestions welcome. Will upload training script and dataset script soon (yell at me if I dont)

Downloads last month: 35

Safetensors

Model size

134M params

Tensor type

BF16

Inference API

Unable to determine this model's library. Check the docs .

Model tree for skdrx/amd135m_reasoning_finetune

Base model

amd/AMD-Llama-135m

Quantized

(15)

this model

skdrx
/

amd135m_reasoning_finetune

Model tree for skdrx/amd135m_reasoning_finetune

Datasets used to train skdrx/amd135m_reasoning_finetune