ppo-LunarLander-v2 / results.json
Lodeing's picture
Model trained
9cce2f1
{"mean_reward": 270.2772754, "std_reward": 17.77376789618108, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-12-02T20:37:45.131531"}