RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy Paper โข 2503.24388 โข Published 10 days ago โข 29
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM Paper โข 2503.14478 โข Published 23 days ago โข 44
Running 105 105 Open VLM Video Leaderboard ๐ VLMEvalKit Eval Results in video understanding benchmark
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds Paper โข 2407.01494 โข Published Jul 1, 2024 โข 15
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models Paper โข 2312.13964 โข Published Dec 21, 2023 โข 20