ppo-LunarLander-v2 / replay.mp4

Commit History

First LunarLander-v2 PPO model: mean_reward=251.58 +/- 13.59
7210aa6

amal94 committed on