LunarLander-v2-ppo-3 / results.json
trained model 2e+06 steps
c3d1abf
{"mean_reward": 283.2170367, "std_reward": 19.27402596766334, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-14T15:22:33.532592"}