Running 2.24k 2.24k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 Text Generation • Updated Aug 7, 2024 • 5.35k • 23