ppo-LunarLander-v2 / results.json
WDong: Exercises for a deep reinforcement learning course (commit b5c7fcf, verified)
{"mean_reward": 254.21788540000003, "std_reward": 14.316245073083714, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-03-10T03:34:35.127569"}
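The record above can be read back programmatically. The following is a minimal sketch, not part of the repository: the `summarize` helper and the `lower_bound` field are hypothetical names introduced here, and `mean_reward - std_reward` is only assumed to be a useful conservative summary of the evaluation; the file itself does not define such a score.

```python
import json
from datetime import datetime

# Contents of results.json, copied verbatim from the file above.
RESULTS_JSON = (
    '{"mean_reward": 254.21788540000003, "std_reward": 14.316245073083714, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2024-03-10T03:34:35.127569"}'
)


def summarize(raw: str) -> dict:
    """Parse an evaluation record and derive a conservative reward estimate.

    `lower_bound` (mean minus one standard deviation) is an assumption made
    for illustration; it is not a field of the original file.
    """
    results = json.loads(raw)
    return {
        "mean_reward": results["mean_reward"],
        "std_reward": results["std_reward"],
        "lower_bound": results["mean_reward"] - results["std_reward"],
        "n_eval_episodes": results["n_eval_episodes"],
        "is_deterministic": results["is_deterministic"],
        # The timestamp carries no timezone, so this is a naive datetime.
        "evaluated_at": datetime.fromisoformat(results["eval_datetime"]),
    }


summary = summarize(RESULTS_JSON)
print(f"mean reward: {summary['mean_reward']:.2f} +/- {summary['std_reward']:.2f}")
print(f"mean - std:  {summary['lower_bound']:.2f}")
print(f"episodes:    {summary['n_eval_episodes']}")
```

With only 10 evaluation episodes, the standard deviation is a rough estimate, so the derived lower bound should be read as indicative rather than precise.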