martimfasantos/tinyllama-1.1b-mt-simpo_beta2.0_gamma1.0_LR5e-8_BS16_adamw_3epochs • Text Generation • Updated 16 days ago • 14 • 1
joshuasundance/phi3-mini-4k-qlora-python-code-20k-mypo-4k-rfc-pipe • Text Generation • Updated 14 days ago • 20 • 1
Magpie-Align/Llama-3.1-8B-Magpie-Mix-RC-UltraDPO-08 • Text Generation • Updated about 23 hours ago • 22 • 1