ppo-LunarLander-v2 / results.json
gensym's picture
LunarLander PPO from Unit 1 of the RL course
cd48b5a
{"mean_reward": 280.5490798695877, "std_reward": 19.8749851463831, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-05-14T19:38:37.245240"}