Holarissun/dpo_harmlessharmless_gpt4_subset20000_modelgpt2_maxsteps5000_bz8_lr1e-05 Updated 7 days ago
Holarissun/dpo_harmlessharmless_gpt4_subset20000_modelgpt2_maxsteps5000_bz8_lr5e-06 Updated 7 days ago
Holarissun/dpo_helpful_gemmaneghelpful_gpt4_subset20000_modelgemma2b_maxsteps5000_bz8_lr5e-06 Updated 5 days ago
Holarissun/dpo_helpful_gemmaneghelpful_gpt4_subset20000_modelgemma2b_maxsteps5000_bz8_lr1e-06 Updated 5 days ago