VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper β’ 2502.02492 β’ Published 3 days ago β’ 43
Running on Zero 1.51k 1.51k Chat With Janus-Pro-7B π A unified multimodal understanding and generation model.
Negative Token Merging: Image-based Adversarial Feature Guidance Paper β’ 2412.01339 β’ Published Dec 2, 2024 β’ 22
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published Dec 4, 2024 β’ 126