ppo-LunarLander-v2 / results.json
ankandrew's picture
LunarLander-v2 PPO trained agent
2f55e14
{"mean_reward": 298.44275784065024, "std_reward": 14.61015396069865, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-03-08T12:56:14.028905"}