ppo_lunar_lander-v2 / lunar_landing_ppo_030723_02

Commit History

1M learn steps, learning_rate 5e-5, n_steps 800, ent_coef 5e-2
162146e

bobobert4 commited on