Ayush-1722/Mistral-7B-Instruct-v0.1-Summarize-64K-QLoRANET-Merged Text Generation • Updated 30 days ago • 16
Holarissun/REPROD_dpo_helpfulhelpful_human_subset-1_modelgemma2b_maxsteps10000_bz8_lr5e-06 Updated 26 days ago
Holarissun/REPROD_dpo_helpfulhelpful_gpt3_subset-1_modelgemma2b_maxsteps10000_bz8_lr5e-06 Updated 30 days ago
Holarissun/REPROD_dpo_harmlessharmless_human_subset-1_modelgemma2b_maxsteps6000_bz8_lr1e-05 Updated 30 days ago