Running • 2.4k • The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLM on large GPU Clusters
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 Text Generation • Updated Aug 7, 2024 • 5.62k • 23