MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Paper • 2406.08407 • Published Jun 12 • 24
Discriminative Diffusion Models as Few-shot Vision and Language Learners Paper • 2305.10722 • Published May 18, 2023 • 3