ppo-LunarLander-v2 / results.json
matansol's picture
first model in the RL course. MS
3fa0f70
{"mean_reward": 246.18960080000002, "std_reward": 21.073665384476627, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-12-03T08:22:36.831059"}