ppo_2-LunarLander-v2 / results.json
daripaez's picture
First model trained for Deep RL Course Unit 1
ebf60f6
{"mean_reward": 285.44703732097275, "std_reward": 19.047294205779494, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-06T23:15:14.892997"}