ppo-LunarLander-v2 / results.json
BigTimeCoderSean's picture
This is my lunar lander agent from class 1 of Hugging Face's RL class
0db022f
{"mean_reward": 191.73667465857295, "std_reward": 31.062055610565075, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-07-05T19:29:22.575678"}