ppo-LunarLander-v2 / results.json
gazuzur's picture
Upload PPO LunarLander-v2 trained agent 300.000 timesteps
2b64c04
{"mean_reward": -51.66490459996639, "std_reward": 23.720559339857928, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-02-16T14:37:40.634148"}