arxiv:2501.05179
Siteng Huang
huangsiteng
AI & ML interests
vision-language models
Recent Activity
authored a paper about 2 months ago
Accelerating Diffusion Transformers with Token-wise Feature Caching
authored a paper about 2 months ago
Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration
authored a paper about 2 months ago
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
Organizations
None yet
Papers
10
models
None public yet
datasets
None public yet