ppo-LunarLander-v2-TEST / results.json
kzipa's picture
80mil train
5eef30d
{"mean_reward": 296.127958295472, "std_reward": 21.213548724500868, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-23T11:07:11.299292"}