nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • Updated about 16 hours ago • 28.8k • 293
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published Feb 10 • 152
Wavelets Are All You Need for Autoregressive Image Generation Paper • 2406.19997 • Published Jun 28, 2024 • 32
Running on Zero 758 Florence 2 📉 Analyze images to generate captions, detect objects, or perform OCR
Running on CPU Upgrade 203 MMLU-Pro Leaderboard 🥇 More advanced and challenging multi-task evaluation