MotionClone: Training-Free Motion Cloning for Controllable Video Generation • Paper • 2406.05338 • Published 10 days ago • 38
NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing • Paper • 2406.06523 • Published 8 days ago • 38
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models • Paper • 2405.15738 • Published 25 days ago • 43
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models • Paper • 2405.15574 • Published 25 days ago • 51
Chameleon: Mixed-Modal Early-Fusion Foundation Models • Paper • 2405.09818 • Published May 16 • 102
Gemma release • Collection • Groups the Gemma models released by the Google team. • 40 items • Updated May 14 • 316