ppo-LunarLander-v2 / replay.mp4

Commit History

Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
958737f
verified

crossroderick commited on