arxiv:2412.06782
Siteng Huang
huangsiteng
AI & ML interests
vision-language models
Recent Activity
authored a paper 3 days ago
Accelerating Diffusion Transformers with Token-wise Feature Caching
authored a paper 3 days ago
Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration
authored a paper 3 days ago
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
Organizations
None yet
Models
None public yet
Datasets
None public yet