ppo-LunarLander-v2 / results.json
RYOBEAR's picture
Second Upload PPO LunarLander-v2 trained agent
026dbb1
{"mean_reward": 232.99102170980396, "std_reward": 15.436556233754112, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-04-13T08:18:41.694951"}