SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 2 days ago • 129
view article Article NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets 23 days ago • 33
nGPT: Normalized Transformer with Representation Learning on the Hypersphere Paper • 2410.01131 • Published Oct 1, 2024 • 10
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 151
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 138
From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control Paper • 2405.04798 • Published May 8, 2024 • 1
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published Feb 11 • 48
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 71