a2c-PandaReachDense-v2 / results.json
NathanaelM's picture
new training
1829acd
{"mean_reward": -0.4884863195940852, "std_reward": 0.14316050242659178, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-01-27T09:23:56.744066"}