ppo-LunarLander-v2 / results.json
marcogfedozzi
Unit1 - LunarLander-v2 - PPO - 1M steps
cad9f95
156 Bytes
{"mean_reward": 186.5871873, "std_reward": 86.4228727038874, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-01-04T20:09:06.291183"}