Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment Paper ā¢ 2502.04328 ā¢ Published about 18 hours ago ā¢ 11
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper ā¢ 2501.14677 ā¢ Published 14 days ago ā¢ 28
Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos Paper ā¢ 2501.13826 ā¢ Published 15 days ago ā¢ 22
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper ā¢ 2501.13106 ā¢ Published 16 days ago ā¢ 79
CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities Paper ā¢ 2501.08983 ā¢ Published 23 days ago ā¢ 20
RepVideo: Rethinking Cross-Layer Representation for Video Generation Paper ā¢ 2501.08994 ā¢ Published 23 days ago ā¢ 15
RepVideo: Rethinking Cross-Layer Representation for Video Generation Paper ā¢ 2501.08994 ā¢ Published 23 days ago ā¢ 15
RepVideo: Rethinking Cross-Layer Representation for Video Generation Paper ā¢ 2501.08994 ā¢ Published 23 days ago ā¢ 15
RepVideo: Rethinking Cross-Layer Representation for Video Generation Paper ā¢ 2501.08994 ā¢ Published 23 days ago ā¢ 15
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper ā¢ 2501.04003 ā¢ Published about 1 month ago ā¢ 25
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Paper ā¢ 2501.03847 ā¢ Published about 1 month ago ā¢ 23
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Paper ā¢ 2501.03847 ā¢ Published about 1 month ago ā¢ 23
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Paper ā¢ 2412.09645 ā¢ Published Dec 10, 2024 ā¢ 35
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Paper ā¢ 2412.09645 ā¢ Published Dec 10, 2024 ā¢ 35
VBench: Comprehensive Benchmark Suite for Video Generative Models Paper ā¢ 2311.17982 ā¢ Published Nov 29, 2023 ā¢ 7
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper ā¢ 2411.13503 ā¢ Published Nov 20, 2024 ā¢ 30
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Paper ā¢ 2412.09645 ā¢ Published Dec 10, 2024 ā¢ 35
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Paper ā¢ 2412.09626 ā¢ Published Dec 12, 2024 ā¢ 20