ppo-LunarLander-v2 / results.json
dhmeltzer's picture
RL_CourseV2, unit1, Lunarlander
cebb7eb
{"mean_reward": 255.08327430493523, "std_reward": 20.35802011668448, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-01-30T02:43:18.264031"}