new-PPO-LunarLander-v2/PPO-LunarLander-v2/_stable_baselines3_version
PPO trained on 500,000 steps.
e2eaf0e
1.6.0