SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 4 days ago • 145
One-Minute Video Generation with Test-Time Training Paper • 2504.05298 • Published 4 days ago • 85
Running 2.44k 2.44k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit Image-Text-to-Text • Updated 3 days ago • 41.4k • 69
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • Updated about 15 hours ago • 8.84k • 194
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • Updated 2 days ago • 228k • • 716
Vision-Speech Models: Teaching Speech Models to Converse about Images Paper • 2503.15633 • Published 23 days ago • 1