Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 10 items • Updated 3 days ago • 417
Physical AI Collection Collection of commercial-grade datasets for physical AI developers • 10 items • Updated about 17 hours ago • 25
Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills Paper • 2503.12533 • Published 11 days ago • 60
Personalize Anything for Free with Diffusion Transformer Paper • 2503.12590 • Published 11 days ago • 41
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning Paper • 2503.07608 • Published 17 days ago • 19
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 15 days ago • 348
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published 21 days ago • 66
Instella ✨ Collection Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. • 5 items • Updated 21 days ago • 7
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 23 days ago • 68
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19 • 68
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published 30 days ago • 71
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Paper • 2502.14768 • Published Feb 20 • 47
FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading Paper • 2502.11433 • Published Feb 17 • 32