ppo-LunarLander-v2 / results.json
stochastic's picture
another bad model hehe
a69aaee
{"mean_reward": 78.32007933102525, "std_reward": 104.7663662090126, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-23T15:57:19.563140"}