dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_2_Skywork-Reward-Gemma-2-27B-v0.2_weighted_sft • Updated about 12 hours ago
dogtooth/uf_Llama-3.1-8B-Instruct_2_Skywork-Reward-Gemma-2-27B-v0.2_weighted_sft • Updated about 13 hours ago • 7
dogtooth/uf_Llama-3.1-8B-Instruct_2_Skywork-Reward-Gemma-2-27B-v0.2 • Viewer • Updated 2 days ago • 61.1k • 5
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_15_Skywork-Reward-Gemma-2-27B-v0.2 • Viewer • Updated 3 days ago • 61.1k • 38
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations • Paper • 2412.13171 • Published 8 days ago • 30
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_6_Skywork-Reward-Gemma-2-27B-v0.2_weighted_sft • Viewer • Updated 7 days ago • 122k • 6
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_5_Skywork-Reward-Gemma-2-27B-v0.2 • Viewer • Updated 7 days ago • 122k • 71
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_6_Skywork-Reward-Gemma-2-27B-v0.2 • Viewer • Updated 8 days ago • 61.1k • 4