mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-7_dcftv1.2 Text Generation • Updated 13 days ago • 98
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.9_dcftv1.2 Text Generation • Updated 13 days ago • 95
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.95_dcftv1.2 Text Generation • Updated 13 days ago • 95
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.92_dcftv1.2 Text Generation • Updated 13 days ago • 95
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.95_dcftv1.2 Text Generation • Updated 13 days ago • 95
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.98_dcftv1.2 Text Generation • Updated 13 days ago • 95
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.85_dcftv1.2 Text Generation • Updated 13 days ago • 171
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.999_dcftv1.2 Text Generation • Updated 13 days ago • 99
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.995_dcftv1.2 Text Generation • Updated 13 days ago • 176
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.9995_dcftv1.2 Text Generation • Updated 13 days ago • 99
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.99_dcftv1.2 Text Generation • Updated 13 days ago • 95
mlfoundations-dev/hp_ablations_qwen_scheduler_constant_dcftv1.2 Text Generation • Updated 13 days ago • 97
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr1e-6_dcftv1.2 Text Generation • Updated 13 days ago • 95
mlfoundations-dev/hp_ablations_qwen_scheduler_inverse_sqrt_dcftv1.2 Text Generation • Updated 13 days ago • 97
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr5e-7_dcftv1.2 Text Generation • Updated 13 days ago • 95
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_dcftv1.2 Text Generation • Updated 13 days ago • 114
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.15_dcftv1.2 Text Generation • Updated 13 days ago • 118
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr1e-7_dcftv1.2 Text Generation • Updated 13 days ago • 165
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr1e-6_dcftv1.2 Text Generation • Updated 13 days ago • 144