ppo-LunarLander-v2 / results.json
NielsV's picture
Upload of trained PPO policy on LunarLander-v2
8a1d0c5
raw
history blame
163 Bytes
{"mean_reward": -858.0881617259234, "std_reward": 623.782596274442, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-20T16:43:31.589267"}