ppo-LunarLander-v2 / results.json
Dae314's picture
Initial Deep RL course agent
fa1de28
{"mean_reward": 261.9487585363846, "std_reward": 29.478006521613988, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-05-02T21:57:15.733181"}