Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 3 items • Updated 4 days ago • 287
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models • Paper • 2501.13629 • Published 7 days ago • 40
SmolVLM 256M & 500M • Collection • Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 7 days ago • 61
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning • Paper • 2501.12948 • Published 8 days ago • 271
UI-TARS: Pioneering Automated GUI Interaction with Native Agents • Paper • 2501.12326 • Published 9 days ago • 47
PaSa: An LLM Agent for Comprehensive Academic Paper Search • Paper • 2501.10120 • Published 14 days ago • 40
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation • Paper • 2501.09755 • Published 14 days ago • 33