ppo-LunarLander_SB_1e6 / results.json
sw32-seo's picture
PPO training on LunarLander-v2
cde39dd
{"mean_reward": 271.79469734225995, "std_reward": 22.807300509658074, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-05-20T16:57:09.406713"}