Commit History

Upload PPO agent trained in LunarLander-v2 for Unit 1 Deep-RL Course. Epochs: 500k, Mean Reward: 192 +/- 75
64b3873

Vladimir Abramov committed on

initial commit
54fd83b

Vladimir Abramov committed on