arxiv:2412.03679
Seungone Kim
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
liked
a Space
3 days ago
Qwen/QwQ-32B-preview
updated
a model
14 days ago
seungone/skywork-reward-replicate
upvoted
a
paper
16 days ago
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at
Scale
Organizations
Papers
22
datasets
5
seungone/ablation1_math_gpt4o_mini
Viewer
•
Updated
•
5.56k
•
67
seungone/ablation3_math_llama3.1_8b_instruct
Viewer
•
Updated
•
24.8k
•
70
seungone/ablation2_math_llama3.1_8b_instruct
Viewer
•
Updated
•
5.99k
•
32
seungone/ablation1_code_gpt4o_mini
Viewer
•
Updated
•
10k
•
78
seungone/final-math-claude3.5_sonnet-10000
Viewer
•
Updated
•
10k
•
34