REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers • Paper • 2504.10483 • Published 10 days ago • 20
Stanford-ILIAD/prism-qwen25-extra-dinosiglip-224px-0_5b • Image-Text-to-Text • Updated Dec 12, 2024 • 741 • 2
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data • Paper • 2502.08468 • Published Feb 12 • 13