---
library_name: transformers
tags:
- Distill
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
---
# DeepSeek-R1-Distill-Qwen-1.5B-Abliterated-dpo

DPO-trained to recoup some of the loss caused by abliteration.
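
For readers unfamiliar with DPO (Direct Preference Optimization), the following is a minimal sketch of the standard per-example DPO objective. It is not the training code used for this model: the card does not state the dataset, hyperparameters, or framework, so the function name `dpo_loss`, the `beta=0.1` default, and the example log-probabilities are all assumptions made for illustration.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss from summed sequence log-probabilities.

    beta=0.1 is an assumed value; the model card does not state one.
    """
    # How much more (or less) likely each response is under the policy
    # being trained than under the frozen reference model.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin: positive when the policy favors the
    # chosen response more strongly than the reference does.
    margin = beta * (chosen_logratio - rejected_logratio)

    # loss = -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Hypothetical log-probabilities, for illustration only.
    print(dpo_loss(-4.0, -6.0, -5.0, -5.0))
```

When the policy and the reference assign identical log-probabilities, the margin is zero and the loss equals `log(2)`. The loss falls below that value as the policy shifts probability toward the chosen response relative to the reference, and rises above it when the shift goes toward the rejected response.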