PPO-LunarLander-v2 / results.json
phildav's picture
rl class unit 1 submission
d1a4826
{"mean_reward": 144.75730208818882, "std_reward": 139.02954962614268, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-11-06T13:51:29.591812"}