Rifky's picture
Initial PPO Model using MLPPolicy for LunarLander-v2
a2d1647
{"mean_reward": 288.84840311777685, "std_reward": 15.261479036604767, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-26T22:17:47.161878"}