24 An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models · 7 authors 1
15 VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models · 2 authors 4
3 FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation · 6 authors 1