Holarissun/REPROD_dpo_helpfulhelpful_human_subset-1_modelgemma2b_maxsteps10000_bz8_lr5e-06 Updated 25 days ago
Holarissun/REPROD_dpo_helpfulhelpful_gpt3_subset-1_modelgemma2b_maxsteps10000_bz8_lr5e-06 Updated 29 days ago
Holarissun/REPROD_dpo_harmlessharmless_human_subset-1_modelgemma2b_maxsteps6000_bz8_lr1e-05 Updated 29 days ago
Holarissun/REPROD_dpo_helpfulhelpful_gpt4_subset-1_modelgemma2b_maxsteps10000_bz8_lr5e-06 Updated 29 days ago
Holarissun/REPROD_dpo_helpfulhelpful_human_subset-1_modelgemma2b_maxsteps10000_bz8_lr1e-05 Updated 29 days ago
Holarissun/REPROD_dpo_helpfulhelpful_gpt3_subset-1_modelgemma2b_maxsteps10000_bz8_lr1e-05 Updated 29 days ago
Holarissun/REPROD_dpo_harmlessharmless_human_subset-1_modelgemma2b_maxsteps6000_bz8_lr5e-06 Updated 29 days ago
Holarissun/REPROD_dpo_helpfulhelpful_gpt4_subset-1_modelgemma2b_maxsteps10000_bz8_lr1e-05 Updated 29 days ago
bmehrba/Llama-2-7b-chat-hf-fine-tuned-adapters_Llama2_7b_contamination_8digits_Seed2024 Updated 29 days ago