arxiv:2501.03124
Mingyang Song
hitsmy
AI & ML interests
LVLMs
Recent Activity
upvoted
a
paper
about 19 hours ago
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative
Textual Feedback
commented on
a paper
16 days ago
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level
Reward Models
authored
a paper
17 days ago
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level
Reward Models
Organizations
None yet
Papers
1
models
None public yet