mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05 Text Generation • Updated 16 days ago • 148
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.10 Text Generation • Updated 16 days ago • 159
mlfoundations-dev/hp_ablations_mistral_scheduler_inverse_sqrt Text Generation • Updated 16 days ago • 158
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.15 Text Generation • Updated 16 days ago • 97
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10 Text Generation • Updated 16 days ago • 101
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.05 Text Generation • Updated 16 days ago • 162
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-7 Text Generation • Updated 15 days ago • 98
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-6 Text Generation • Updated 15 days ago • 168
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr5e-7 Text Generation • Updated 15 days ago • 95
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr5e-7 Text Generation • Updated 15 days ago • 95
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-7 Text Generation • Updated 15 days ago • 97
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-6 Text Generation • Updated 15 days ago • 101
DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_redistribution_hf Text Generation • Updated 17 days ago • 385