unit-1-PPO-LunarLander-v2 / mlp_ppo_lunarlander / _stable_baselines3_version
Vladimir Abramov
Upload PPO agent trained in LunarLander-v2 for Unit 1 Deep-RL Course. Epochs: 500k, Mean Reward: 192 +/- 75
64b3873
5 Bytes
1.5.0
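
The file above records the Stable-Baselines3 release that saved the agent. A minimal sketch of how that value might be compared with a locally installed release before loading the model is given below. The helper names `major_minor` and `same_major_minor` are hypothetical, and treating a major/minor match as "close enough" is an assumption for illustration, not a rule stated by the library.

```python
import re
from importlib import metadata

RECORDED = "1.5.0"  # contents of _stable_baselines3_version shown above


def major_minor(version: str) -> tuple:
    """Extract (major, minor) as ints from a string such as '1.5.0'."""
    match = re.match(r"\s*(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def same_major_minor(recorded: str, installed: str) -> bool:
    """Assumed compatibility check: major and minor numbers must agree."""
    return major_minor(recorded) == major_minor(installed)


try:
    # Look up the installed distribution without importing the library.
    installed = metadata.version("stable-baselines3")
except metadata.PackageNotFoundError:
    installed = None  # Stable-Baselines3 is not installed here

if installed is not None:
    print(f"recorded={RECORDED} installed={installed} "
          f"match={same_major_minor(RECORDED, installed)}")
```

If the two versions differ, loading may still work; the check only flags a mismatch worth looking into.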