DeepSeek-R1-Distill-HumanLikeDPO-FineTuned-16bit / pytorch_model.bin.index.json

Commit History

Trained with Unsloth
c312753
verified

krishanwalia30 commited on