Lunar-Landing-PPO / results.json
SuperSecureHuman's picture
1M trained
0e90687
raw
history blame
164 Bytes
{"mean_reward": 284.3023116646667, "std_reward": 14.062328295819668, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-05T05:56:30.625644"}