a2c-PandaReachDense-v2 / results.json
chavicoski's picture
Actor Critic model for PandaReachDense-v2 environment
95c2f3a
{"mean_reward": -1.5466729000000001, "std_reward": 0.24192117787512943, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-01-19T19:53:11.912908"}