deep-rl-week1-ppo / results.json
Brad Hayes
PPO Model v2
f435714
raw
history blame
164 Bytes
{"mean_reward": 286.05366386984394, "std_reward": 23.11047313481399, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-06T21:37:29.262261"}