Adaptive Orchestration for Large-Scale Inference on Heterogeneous Accelerator Systems Balancing Cost, Performance, and Resilience • Paper 2503.20074 • Published 22 days ago
Trained on AWS Trainium • Collection of models on Hugging Face that have been trained on AWS Trainium. Learn more here: https://huggingface.co/docs/optimum-neuron/index • 7 items • Updated May 7, 2024