GENIAC-Team-Ozaki/lora-dpo-finetuned-stage4-full-sft-0.1_1e-6_ep-1 Text Generation • Updated 23 days ago
EleutherAI/Mistral-7B-v0.1-multiplication-random-standardized-random-names Text Generation • Updated 22 days ago
EleutherAI/Mistral-7B-v0.1-modularaddition-random-standardized-random-names Text Generation • Updated 21 days ago
GENIAC-Team-Ozaki/lora-dpo-finetuned-stage4-full-sft-0.5_1e-6_ep-1 Text Generation • Updated 23 days ago • 1