library_name: transformers tags: - Distill base_model: - deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
dpo trained- to recoupe some of abliteration loss