CaptainNemo-ChatML-12B

Fine-tuned with ORPO using QLoRA on a single RTX A6000 for 2 epochs, with a rank-64 LoRA adapter and a learning rate of 2e-5.
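For readers unfamiliar with ORPO, the sketch below illustrates the odds-ratio preference term that ORPO adds to the usual supervised (NLL) loss. It is a minimal, dependency-free illustration of the general technique, not the training code used for this model. The function names and the weight `lam=0.1` are hypothetical choices made for the example. The inputs are assumed to be length-averaged log-probabilities of the chosen and rejected responses, strictly below zero.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp).

    Assumes avg_logp < 0, so that p < 1 and the odds are finite.
    """
    # log p - log(1 - p), with log1p used for numerical stability
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_odds_ratio_loss(chosen_logp: float, rejected_logp: float) -> float:
    """Odds-ratio term: -log sigmoid(log odds(chosen) - log odds(rejected))."""
    log_or = log_odds(chosen_logp) - log_odds(rejected_logp)
    # -log(sigmoid(z)) equals log(1 + exp(-z))
    return math.log1p(math.exp(-log_or))


def orpo_loss(nll: float, chosen_logp: float, rejected_logp: float,
              lam: float = 0.1) -> float:
    """Combine the supervised NLL on the chosen response with the weighted
    odds-ratio term. lam is an illustrative value, not taken from this card."""
    return nll + lam * orpo_odds_ratio_loss(chosen_logp, rejected_logp)


if __name__ == "__main__":
    # The penalty shrinks as the chosen response becomes more likely
    # than the rejected one.
    print(orpo_odds_ratio_loss(-0.5, -2.0))
    print(orpo_odds_ratio_loss(-2.0, -0.5))
```

When both responses are equally likely, the odds-ratio term equals log 2. It falls toward zero as the model favors the chosen response, which is what lets ORPO fold preference tuning into a single fine-tuning pass without a separate reference model.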

Format: Safetensors
Model size: 12.2B params
Tensor type: BF16

Repository: nbeerbower/CaptainNemo-ChatML-12B