arxiv:2409.12181
Wenting Zhao
wentingzhao
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 2 months ago
commit0/mbpp
commented on
a paper
about 2 months ago
Challenges in Trustworthy Human Evaluation of Chatbots
updated
a dataset
2 months ago
commit0/openai_humaneval
Organizations
models
2
datasets
36
wentingzhao/mbpp_predictions_1
Viewer
•
Updated
•
500
•
43
wentingzhao/SWE-bench_Verified
Viewer
•
Updated
•
500
•
57
wentingzhao/commit0_combined
Viewer
•
Updated
•
54
•
840
wentingzhao/SWE-bench_Verified_commit0
Viewer
•
Updated
•
2
•
43
wentingzhao/stack-v2-cpp-2011-windows-blamed
Viewer
•
Updated
•
41
wentingzhao/stack-v2-cpp-2011-windows
Viewer
•
Updated
•
224k
•
41
wentingzhao/stack-v2-cpp-2011
Viewer
•
Updated
•
947k
•
40
wentingzhao/humanevalplus_predictions_16
Viewer
•
Updated
•
163
•
40
wentingzhao/lmsys-arena-pairs
Viewer
•
Updated
•
52
•
46
wentingzhao/WildHallucinations
Viewer
•
Updated
•
7.92k
•
166
•
3