unit-1-PPO-LunarLander-v2 / mlp_ppo_lunarlander

Commit History

Upload PPO agent trained in LunarLander-v2 for Unit 1 Deep-RL Course. Epochs: 500k, Mean Reward: 192 +/- 75
64b3873

Vladimir Abramov commited on