4 2 2

Wenting Zhao

wentingzhao

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

commit0/mbpp

commented on a paper about 2 months ago

Challenges in Trustworthy Human Evaluation of Chatbots

updated a dataset 2 months ago

commit0/openai_humaneval

View all activity

Organizations

wentingzhao's activity

updated a dataset about 2 months ago

commit0/mbpp

Viewer • Updated Dec 8, 2024 • 974 • 62

commented a paper about 2 months ago

Challenges in Trustworthy Human Evaluation of Chatbots

Paper • 2412.04363 • Published Dec 5, 2024 • 2 •

updated 3 datasets 2 months ago

updated a dataset 3 months ago

wentingzhao/SWE-bench_Verified

Viewer • Updated Nov 11, 2024 • 500 • 57

New activity in wentingzhao/WildHallucinations 3 months ago

What exactly is the pipeline to use this data?

#2 opened 6 months ago by

Ouz-G

Arxiv link to WildHallucinations instead of WildChat (maybe have both)

#3 opened 5 months ago by

monsoon-nlp

updated 5 datasets 3 months ago

wentingzhao/commit0_combined

Viewer • Updated Oct 28, 2024 • 54 • 840

wentingzhao/SWE-bench_Verified_commit0

Viewer • Updated Oct 28, 2024 • 2 • 43

wentingzhao/stack-v2-cpp-2011-windows-blamed

Viewer • Updated Oct 22, 2024 • 41

wentingzhao/stack-v2-cpp-2011-windows

Viewer • Updated Oct 21, 2024 • 224k • 41

wentingzhao/stack-v2-cpp-2011

Viewer • Updated Oct 21, 2024 • 947k • 40

updated 3 datasets 4 months ago

wentingzhao/humanevalplus_predictions_16

Viewer • Updated Oct 19, 2024 • 163 • 40

wentingzhao/lmsys-arena-pairs

Viewer • Updated Oct 14, 2024 • 52 • 46

wentingzhao/WildHallucinations

Viewer • Updated Sep 22, 2024 • 7.92k • 166 • 3

authored a paper 5 months ago

A Controlled Study on Long Context Extension and Generalization in LLMs

Paper • 2409.12181 • Published Sep 18, 2024 • 44

upvoted a paper 5 months ago

A Controlled Study on Long Context Extension and Generalization in LLMs

Paper • 2409.12181 • Published Sep 18, 2024 • 44

authored 2 papers 5 months ago

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Paper • 2311.08469 • Published Nov 14, 2023 • 11

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published May 2, 2024 • 62