Running 2.51k 2.51k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
gradientai/Llama-3-8B-Instruct-Gradient-1048k Text Generation β’ Updated Oct 29, 2024 β’ 26.3k β’ 679