PPO-LunarLander-v2 / results.json
Update of hyperparameters PPO
d1594ec
{"mean_reward": 279.2502646277958, "std_reward": 16.685586876041896, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-19T15:34:19.705503"}