SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper β’ 2502.14786 β’ Published 3 days ago β’ 96
Running on Zero 1.8k 1.8k Chat With Janus-Pro-7B π A unified multimodal understanding and generation model.
Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation Paper β’ 2501.04144 β’ Published Jan 7 β’ 19