unit1_ppo / results.json
dzegan's picture
first attempt at PPO
8e262b5
{"mean_reward": 248.63545800533592, "std_reward": 40.881723701473504, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-07T00:56:16.990904"}