ppo-LunarLander-v2 / results.json
Guerosharp's picture
Model based on original code from unit 1 notebook
dbe3121
{"mean_reward": 259.1780016646884, "std_reward": 20.191664050254666, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-10T11:20:34.694509"}