tmpmodelsave/group_server_fixed_no_sft_type3_7k_7ktype2_step250 Text Generation • Updated 8 days ago • 4
tmpmodelsave/group_server_fixed_no_sft_type3_7k_7ktype2_step200 Text Generation • Updated 8 days ago • 1
tmpmodelsave/group_server_fixed_no_sft_type3_7k_7ktype2_step150 Text Generation • Updated 8 days ago • 30
tmpmodelsave/nosft_llama3_morecorr_sft_4dpo_type12_4k_type3_8k_type4_step100 Text Generation • Updated 10 days ago • 3
tmpmodelsave/no_sft_llama3_morecorr_sft_4dpo_type12_6k_type3_8k_type4_step100 Text Generation • Updated 10 days ago • 1
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_100tmp07_vllmexp3 Viewer • Updated 2 days ago • 15k • 12
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_550tmp10_vllmexp3 Viewer • Updated 2 days ago • 15k • 13
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_500tmp10_vllmexp3 Viewer • Updated 2 days ago • 15k • 12
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_450tmp10_vllmexp3 Viewer • Updated 2 days ago • 15k • 12
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_400tmp10_vllmexp3 Viewer • Updated 2 days ago • 15k • 13
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_350tmp10_vllmexp3 Viewer • Updated 2 days ago • 15k • 11
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_300tmp10_vllmexp3 Viewer • Updated 2 days ago • 15k • 12
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_250tmp10_vllmexp3 Viewer • Updated 2 days ago • 15k • 11
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_200tmp10_vllmexp3 Viewer • Updated 2 days ago • 15k • 12
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_100tmp10_vllmexp3 Viewer • Updated 2 days ago • 15k • 11