Pushing agent that was trained with PPO in the LunarLander-v2 environment
{"mean_reward": 267.147389043737, "std_reward": 17.05484217280618, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-05-20T14:01:50.543033"}