ppo-LunarLander-v2 / results.json
marcogfedozzi's picture
Unit1 - LunarLander-v2 - PPO - 1M steps
3cf931d
raw
history blame
158 Bytes
{"mean_reward": 257.6925258, "std_reward": 49.397620111334994, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-12-30T10:09:36.759937"}