arxiv:2410.10563
Dongfu Jiang
DongfuJiang
AI & ML interests
NLP, common sense reasoning
Papers: 10
Models: 13
DongfuJiang/PairRM-V2-phi-3-4k-mini-all • 3
DongfuJiang/vapo_lora_all_data_iter_2 • 6
DongfuJiang/vapo_lora_all_data_iter_1 • 11
DongfuJiang/PairRM-V2-phi3-3-mini-unified-feedback • 7
DongfuJiang/PairRM-V2-phi3-3-mini-ultra-feedback-binarized-lora • 6
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-1600 • Text Generation • 10
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-1200 • Text Generation • 18
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2000 • Text Generation • 17
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2400 • Text Generation • 17
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2882 • Text Generation • 11
Datasets: 7
DongfuJiang/zeroeval • 7.16k • 38
DongfuJiang/MATH-500 • 500 • 7
DongfuJiang/simpo_v2_ultrafeedback • 59.9k • 35
DongfuJiang/VAPO • 72.5k • 34
DongfuJiang/PairRM-data • 586k • 33
DongfuJiang/WildFeedback • 26.5k • 36
DongfuJiang/FeTaQA • 10.3k • 166 • 7