stabilityai/stable-diffusion-xl-base-1.0 Text-to-Image • Updated Oct 30, 2023 • 2.72M • 6.48k
Article You could have designed state of the art positional encoding Nov 25, 2024 • 217
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models Paper • 2503.06749 • Published 29 days ago • 25
Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers? Paper • 2503.10632 • Published 25 days ago • 14