huzican
huzican0419
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 17 hours ago
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative
Textual Feedback
upvoted
a
paper
29 days ago
Diving into Self-Evolving Training for Multimodal Reasoning
liked
a dataset
2 months ago
agent-eto/eto-sft-trajectory
Organizations
None yet
models
None public yet
datasets
None public yet