mlfoundations-dev/hp_ablations_qwen_scheduler_linear_warmup0.10 Text Generation • Updated 16 days ago • 95
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10 Text Generation • Updated 16 days ago • 96
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05 Text Generation • Updated 16 days ago • 121
mlfoundations-dev/hp_ablations_qwen_scheduler_inverse_sqrt Text Generation • Updated 16 days ago • 95
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.15 Text Generation • Updated 16 days ago • 95
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05 Text Generation • Updated 15 days ago • 22
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10 Text Generation • Updated 15 days ago • 21