Running Featured 1.29k FineWeb: decanting the web for the finest text data at scale 🍷 1.29k Read about FineWeb, a large web‑text dataset for LLMs
Running on CPU Upgrade Featured 2.98k The Smol Training Playbook 📚 2.98k The secrets to building world-class LLMs
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • 253B • Updated Oct 15, 2025 • 1.21k • • 343
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22, 2025 • 205k • • 1.22k