Running on CPU Upgrade • 195 • MMLU-Pro Leaderboard • More advanced and challenging multi-task evaluation
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 6 days ago • 162
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 361
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 566
Running • 543 • Vision Arena (Testing VLMs side-by-side) • Analyze images to detect and label objects