ppo-CartPole-v1 / results.json
cheremushkin
2M iterations training
4f1eeb1
{"mean_reward": 500.0, "std_reward": 0.0, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-01-04T03:09:21.753715"}