mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.15 Text Generation • Updated 15 days ago • 23
mlfoundations-dev/hp_ablations_gemma_scheduler_inverse_sqrt Text Generation • Updated 15 days ago • 14
mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.05 Text Generation • Updated 15 days ago • 16
mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.10 Text Generation • Updated 15 days ago • 14
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr1e-6 Text Generation • Updated 15 days ago • 100
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr1e-6 Text Generation • Updated 15 days ago • 99
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr1e-7 Text Generation • Updated 15 days ago • 95
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr5e-7 Text Generation • Updated 15 days ago • 98
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr5e-7 Text Generation • Updated 15 days ago • 92
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr1e-7 Text Generation • Updated 15 days ago • 95
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-6 Text Generation • Updated 15 days ago • 108
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr5e-7 Text Generation • Updated 15 days ago • 95
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-7 Text Generation • Updated 15 days ago • 18
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-7 Text Generation • Updated 15 days ago • 97
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr5e-7 Text Generation • Updated 15 days ago • 19
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-6 Text Generation • Updated 15 days ago • 105
mlfoundations-dev/airoboros_none_resp_gpt-4o-mini_inst_gpt-4o_resp Text Generation • Updated 14 days ago • 138
mlfoundations-dev/hp_ablations_mistral_adambeta1_0.85_dcftv1.2 Text Generation • Updated 13 days ago • 96
mlfoundations-dev/hp_ablations_mistral_adambeta1_0.95_dcftv1.2 Text Generation • Updated 13 days ago • 95
mlfoundations-dev/hp_ablations_mistral_adambeta1_0.9_dcftv1.2 Text Generation • Updated 13 days ago • 94
mlfoundations-dev/hp_ablations_mistral_adambeta1_0.92_dcftv1.2 Text Generation • Updated 13 days ago • 148