ppo-LunarLander-v2 / mlp_model_5Msteps /_stable_baselines3_version
fedorn's picture
5 million training steps
4528257
2.0.0a5