vukien2301/qwen2.5-1.5b-sft-gpt54mini-math_cot-dora-adapter Text Generation • Updated 10 days ago • 15
vukien2301/llama-3.1-8b-ultrafeedback-dpo-from-epoch1 Text Generation • 8B • Updated 18 days ago • 135 •