arxiv:2310.00840
Tianjian Li
dogtooth
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 1 hour ago
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_2_Skywork-Reward-Gemma-2-27B-v0.2_weighted_sft
updated
a dataset
about 1 hour ago
dogtooth/ultrafeedback_binarized_weighted_sft
updated
a dataset
about 2 hours ago
dogtooth/uf_Llama-3.1-8B-Instruct_2_Skywork-Reward-Gemma-2-27B-v0.2_weighted_sft
Organizations
Papers
1
models
None public yet
datasets
147
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_2_Skywork-Reward-Gemma-2-27B-v0.2_weighted_sft
Updated
dogtooth/ultrafeedback_binarized_weighted_sft
Updated
dogtooth/uf_Llama-3.1-8B-Instruct_2_Skywork-Reward-Gemma-2-27B-v0.2_weighted_sft
Updated
dogtooth/uf_Llama-3.1-8B-Instruct_2_Skywork-Reward-Gemma-2-27B-v0.2
Viewer
•
Updated
•
61.1k
•
2
dogtooth/uf_Llama-3.1-8B-Instruct_2
Viewer
•
Updated
•
122k
•
2
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_15_Skywork-Reward-Gemma-2-27B-v0.2
Viewer
•
Updated
•
61.1k
•
38
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_15
Viewer
•
Updated
•
917k
•
6
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_6_Skywork-Reward-Gemma-2-27B-v0.2_weighted_sft
Viewer
•
Updated
•
122k
•
5
dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_5_Skywork-Reward-Gemma-2-27B-v0.2
Viewer
•
Updated
•
122k
•
70
dogtooth/ultrafeedback_weighted_sft
Viewer
•
Updated
•
122k
•
11