ppo-LunarLander_SB_1e6 / results.json
sw32-seo's picture
PPO training on LunarLander-v2
ab5207a
raw
history blame
165 Bytes
{"mean_reward": 238.52892112531896, "std_reward": 45.047057629378486, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-04-23T22:49:22.569892"}