ppo-LunarLander-v2 / results.json
Initial training run
56194ae
{"mean_reward": 285.30866639765424, "std_reward": 28.676550945046486, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-11T19:56:32.277240"}