LunarLander_v2-PPO / results.json
misza222's picture
Second RL model
fd5b736
{"mean_reward": 256.4828945631626, "std_reward": 14.462702546463465, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-07T11:23:52.925299"}