ppo-LunarLander-v2 / results.json
dficenec's picture
Initial training run
7c60cb2
{"mean_reward": 280.14667746235875, "std_reward": 21.58929555250594, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-11T21:11:49.310195"}