ppo-LunarLander-v2 / results.json
Eugene-Bond
Using linear learning rate
3cd5346
164 Bytes
{"mean_reward": 282.8776428888155, "std_reward": 14.886537756471704, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-10T03:36:28.725423"}
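The fields above can be read back with a few lines of Python. The following is a minimal sketch, not the tooling that produced this file: the helper name `summarize` and the `score` field are hypothetical, and the `mean_reward - std_reward` lower bound is assumed here as a conservative single-number summary. The 200-point threshold is the value commonly quoted as "solved" for LunarLander and is likewise an assumption of this sketch.

```python
import json
from datetime import datetime

# The JSON payload from results.json, copied verbatim.
RESULTS_JSON = (
    '{"mean_reward": 282.8776428888155, "std_reward": 14.886537756471704, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2022-05-10T03:36:28.725423"}'
)

# Assumption: 200 points is the commonly quoted "solved" threshold for LunarLander.
SOLVED_THRESHOLD = 200.0


def summarize(raw: str) -> dict:
    """Parse an evaluation result and derive a conservative score (hypothetical helper)."""
    results = json.loads(raw)
    # Assumed summary metric: mean reward minus one standard deviation.
    score = results["mean_reward"] - results["std_reward"]
    return {
        "score": score,
        "solved": score >= SOLVED_THRESHOLD,
        "n_eval_episodes": results["n_eval_episodes"],
        "is_deterministic": results["is_deterministic"],
        # eval_datetime is an ISO 8601 timestamp without a timezone offset.
        "evaluated_at": datetime.fromisoformat(results["eval_datetime"]),
    }


summary = summarize(RESULTS_JSON)
print(f"score = {summary['score']:.2f} over {summary['n_eval_episodes']} episodes")
```

With only 10 evaluation episodes, the standard deviation is itself a rough estimate, so the derived score is best read as indicative rather than precise.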