BioAspire Collection Reconstruction of ASPIRE variants detailed in the original paper + fine-tuning experiments from different base models. • 6 items • Updated about 4 hours ago
BioAspire Collection Reconstruction of ASPIRE variants detailed in the original paper + fine-tuning experiments from different base models. • 6 items • Updated about 4 hours ago
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19 • 69
Running 2.46k 2.46k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters