ppo-LunarLander-v2 / results.json
trained agent (commit 2290562)
{"mean_reward": 230.33757069999996, "std_reward": 28.295643686494913, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-02-03T18:41:37.458538"}