Running 2.78k 2.78k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
meta-llama/Llama-3.3-70B-Instruct Text Generation β’ 71B β’ Updated Dec 21, 2024 β’ 586k β’ β’ 2.43k
deepseek-ai/DeepSeek-Prover-V2-671B Text Generation β’ 685B β’ Updated Apr 30 β’ 3.42k β’ β’ 804
meta-llama/Llama-3.2-3B-Instruct Text Generation β’ 3B β’ Updated Oct 24, 2024 β’ 1.37M β’ β’ 1.58k