MotionClone: Training-Free Motion Cloning for Controllable Video Generation Paper • 2406.05338 • Published 7 days ago • 35
NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing Paper • 2406.06523 • Published 4 days ago • 36
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published 21 days ago • 42
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Paper • 2405.15574 • Published 21 days ago • 51
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper • 2405.09818 • Published 30 days ago • 101
Gemma release Collection • Groups the Gemma models released by the Google team. • 40 items • Updated May 14 • 313