ppo-lunar-rl-course / results.json
fermaat's picture
testing the course
507b0ab
{"mean_reward": 232.33607343301597, "std_reward": 46.08023171010049, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-23T20:00:10.777665"}