PPO-LunarLander-v22 / results.json
Upload Lunar Lander reinforcement learning model
faa0a94 verified
{"mean_reward": 214.4859161, "std_reward": 81.86146489389608, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-01-23T03:13:11.013567"}
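
The record above can be read programmatically. The following is a minimal sketch, assuming the JSON is available as a string (the variable names `raw`, `results`, `lower_bound`, and `evaluated_at` are illustrative and not part of the file). It parses the fields and computes `mean_reward - std_reward`, a conservative one-number summary that is sometimes used to compare agents; whether any particular leaderboard ranks by this value is an assumption here.

```python
import json
from datetime import datetime

# The contents of results.json, copied verbatim from the file above.
raw = (
    '{"mean_reward": 214.4859161, "std_reward": 81.86146489389608, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2024-01-23T03:13:11.013567"}'
)

results = json.loads(raw)

# Conservative summary (assumed convention): mean reward minus one standard deviation.
lower_bound = results["mean_reward"] - results["std_reward"]

# The timestamp is ISO 8601 without a timezone, so this yields a naive datetime.
evaluated_at = datetime.fromisoformat(results["eval_datetime"])

print(f"episodes evaluated: {results['n_eval_episodes']}")
print(f"deterministic policy: {results['is_deterministic']}")
print(f"mean - std: {lower_bound:.2f}")
print(f"evaluated at: {evaluated_at.isoformat()}")
```

With only 10 evaluation episodes and a standard deviation of roughly 82, the mean alone says little about how consistently the agent lands, which is why the spread is worth reporting alongside it.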