ppo-LunarLander-v2 / results.json
TUMxudashuai's picture
Upload PPO LunarLander-v2 trained agent
a927507
{"mean_reward": 155.33258250570648, "std_reward": 58.358773873004836, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-11-22T01:40:38.025300"}