helpful_gpt4_subset-1_modelgemma2b_maxsteps10000_bz8_lr1e-05 23fb827 verified Holarissun committed on May 24