ppo-LunarLander-v2 / results.json
dficenec
Initial training run
c1d227a
{"mean_reward": 278.91090946998986, "std_reward": 18.53603208652666, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-10T23:28:44.917252"}