ppo-LunarLander-v2 / results.json
{"mean_reward": 254.63723977288936, "std_reward": 22.64728587425626, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-13T23:12:08.677813"}