arxiv:2501.05452
Xingyu Fu
Fiaa
AI & ML interests
NLP, multimodal
Recent Activity
liked
a dataset
1 day ago
deepcs233/Visual-CoT
liked
a model
7 days ago
stabilityai/stable-video-diffusion-img2vid-xt
authored
a paper
11 days ago
ReFocus: Visual Editing as a Chain of Thought for Structured Image
Understanding