Running 2.47k 2.47k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
michaelbenayoun/llama-2-tiny-4kv-heads-4layers-random Text Generation • Updated Oct 14, 2024 • 7.37k
Running on Zero 647 647 Whisper Large V3 🤫 Transcribe audio from microphone, files, or YouTube videos
Distributed Training Collection Papers and resources related to distributed training. • 5 items • Updated Jun 3, 2024