ppo-LunarLander-v2 / results.json
daripaez's picture
First model trained for Deep RL Course Unit 1
2e7eb15
{"mean_reward": 254.9355858738565, "std_reward": 20.69034735531595, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-06T02:21:44.927512"}