ppo-LunarLander-v2 / results.json
mosterdslop's picture
First train of PPO on LunarLander for Unit 1 of RL course
8f1c328 verified
{"mean_reward": 262.2896671, "std_reward": 23.030810657635513, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-04-26T13:41:32.988708"}