arxiv:2412.17847
Seungone Kim
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
liked a model 4 days ago: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
updated a dataset 5 days ago: prometheus-eval/outcome_meta_evaluation
updated a dataset 6 days ago: prometheus-eval/outcome_meta_evaluation_heuristic
Papers: 24
Datasets: 5
seungone/ablation1_math_gpt4o_mini • 5.56k • 61
seungone/ablation3_math_llama3.1_8b_instruct • 24.8k • 53
seungone/ablation2_math_llama3.1_8b_instruct • 5.99k • 48
seungone/ablation1_code_gpt4o_mini • 10k • 45
seungone/final-math-claude3.5_sonnet-10000 • 10k • 37