helpful_gpt4_subset20000_modelgemma2b_maxsteps5000_bz8_lr1e-06 ed6f17e verified Holarissun commited on May 2