ppo-LunarLander-v2 / results.json
Initial training run
6da5dd0
{"mean_reward": 267.86335404534935, "std_reward": 17.226132874398076, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-10T03:09:14.802715"}