LunarLander-v2-ppo-3 / results.json
arampacha's picture
trained model 2e+06 steps
e5006b4
raw
history blame
157 Bytes
{"mean_reward": 275.5021106, "std_reward": 18.52895528542475, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-14T11:56:49.534105"}