stochastic's picture
Upload my first PPO LunarLander-v2 trained agent
8df87a4
{"mean_reward": -253.2283852959168, "std_reward": 66.77114890222038, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-23T15:43:08.614209"}