DeepSeek-R1-Distill-Qwen-1.5B-Abliterated-dpo

dpo trained- to recoupe some of abliteration loss

Safetensors

Model size

1.78B params

Tensor type

FP16

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

Model tree for stepenZEN/DeepSeek-R1-Distill-Qwen-1.5B-Abliterated-dpo

Base model

Finetuned

(125)

this model

Quantizations