Holarissun/REPROD_dpo_helpfulhelpful_human_subset-1_modelgemma7b_maxsteps10000_bz8_lr5e-06 Updated May 29
Mohamed-Ahmed161/llama-3-8b-Instruct-bnb-16bit-MedicalQnADataset Text Generation • Updated May 29 • 23 • 1
Holarissun/REPROD_dpo_harmlessharmless_gpt4_subset-1_modelgemma7b_maxsteps10000_bz8_lr5e-06 Updated May 29 • 1