Identity-Preserving Text-to-Video Generation by Frequency Decomposition Paper • 2411.17440 • Published 1 day ago • 4
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient Paper • 2411.17787 • Published 1 day ago • 8
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding Paper • 2411.18462 • Published about 16 hours ago • 3
MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation Paper • 2411.17945 • Published 1 day ago • 7
Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment Paper • 2411.17188 • Published 2 days ago • 13
ROICtrl: Boosting Instance Control for Visual Generation Paper • 2411.17949 • Published 1 day ago • 48
VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models Paper • 2411.17451 • Published 1 day ago • 8
TEXGen: a Generative Diffusion Model for Mesh Textures Paper • 2411.14740 • Published 6 days ago • 12
SketchAgent: Language-Driven Sequential Sketch Generation Paper • 2411.17673 • Published 1 day ago • 13
Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens Paper • 2411.17691 • Published 1 day ago • 7
SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis Paper • 2411.16173 • Published 3 days ago • 4
MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs Paper • 2411.15296 • Published 6 days ago • 15
Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration Paper • 2411.17686 • Published 1 day ago • 14
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper • 2411.17465 • Published 1 day ago • 53
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published 2 days ago • 32
Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published 3 days ago • 11
Best of Both Worlds: Advantages of Hybrid Graph Sequence Models Paper • 2411.15671 • Published 4 days ago • 7
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published 3 days ago • 27