Siteng Huang

huangsiteng

https://kyonhuang.top/

AI & ML interests

vision-language models

Recent Activity

authored a paper 16 days ago

Accelerating Diffusion Transformers with Token-wise Feature Caching

authored a paper 16 days ago

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

authored a paper 17 days ago

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Organizations

None yet

huangsiteng's activity

commented a paper 17 days ago

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Paper • 2412.06782 • Published 17 days ago • 6

commented a paper 30 days ago

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Paper • 2411.17686 • Published about 1 month ago • 18

commented a paper 3 months ago

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Paper • 2409.07239 • Published Sep 11 • 11