Lendalf commited on
Commit
e2e6cdf
1 Parent(s): 3323eab

Upload PPO LunarLander-v2 with 2 million time steps

Browse files
README.md CHANGED
@@ -6,7 +6,7 @@ tags:
6
  - reinforcement-learning
7
  - stable-baselines3
8
  model-index:
9
- - name: ppo
10
  results:
11
  - task:
12
  type: reinforcement-learning
@@ -16,13 +16,13 @@ model-index:
16
  type: LunarLander-v2
17
  metrics:
18
  - type: mean_reward
19
- value: 242.22 +/- 49.88
20
  name: mean_reward
21
  verified: false
22
  ---
23
 
24
- # **ppo** Agent playing **LunarLander-v2**
25
- This is a trained model of a **ppo** agent playing **LunarLander-v2**
26
  using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
27
 
28
  ## Usage (with Stable-baselines3)
 
6
  - reinforcement-learning
7
  - stable-baselines3
8
  model-index:
9
+ - name: PPO
10
  results:
11
  - task:
12
  type: reinforcement-learning
 
16
  type: LunarLander-v2
17
  metrics:
18
  - type: mean_reward
19
+ value: 278.17 +/- 30.44
20
  name: mean_reward
21
  verified: false
22
  ---
23
 
24
+ # **PPO** Agent playing **LunarLander-v2**
25
+ This is a trained model of a **PPO** agent playing **LunarLander-v2**
26
  using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
27
 
28
  ## Usage (with Stable-baselines3)
config.json CHANGED
@@ -1 +1 @@
1
- {"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==", "__module__": "stable_baselines3.common.policies", "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ", "__init__": "<function ActorCriticPolicy.__init__ at 0x7f48b48d5750>", "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7f48b48d57e0>", "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7f48b48d5870>", "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7f48b48d5900>", "_build": "<function ActorCriticPolicy._build at 0x7f48b48d5990>", "forward": "<function ActorCriticPolicy.forward at 0x7f48b48d5a20>", "extract_features": "<function ActorCriticPolicy.extract_features at 0x7f48b48d5ab0>", "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7f48b48d5b40>", "_predict": "<function ActorCriticPolicy._predict at 0x7f48b48d5bd0>", "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7f48b48d5c60>", "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7f48b48d5cf0>", "predict_values": "<function ActorCriticPolicy.predict_values at 0x7f48b48d5d80>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x7f48b48e8340>"}, "verbose": 1, "policy_kwargs": {}, "num_timesteps": 1015808, "_total_timesteps": 1000000, "_num_timesteps_at_start": 0, "seed": null, "action_noise": null, "start_time": 1684093156695198054, "learning_rate": 0.0003, "tensorboard_log": null, "_last_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVdQIAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYAAgAAAAAAADPLeDspsEW6/HgtOpRYf7YFxcC6bTmBtQAAgD8AAIA/mssTPiri0j4qlK26OoKlvvmkfT0NjNw9AAAAAAAAAACAAuK9w0lYuoJFtrvGDFg4+jVlO9bb3jcAAIA/AAAAAG3sG7622DY/O/vAvYBux75dpZe9nf5LOwAAAAAAAAAAAJCMPI+yG7rrgp67MbZcOHFqMTt4zHQ4AACAPwAAgD9NMio9romculO8+Tn6Isu1eK2/ugLXD7kAAIA/AACAP2ahUz2kcBC5g/diu0RKz7b4C9I7MqCHOgAAgD8AAIA/M+SivApdEbs3XaY8A+OFPN07ujt9iGi9AACAPwAAgD8zxvo8H+WQuYUz0beaiVkwemHBuw1C+DYAAIA/AACAPwAUjjsUvJy6wtqNOvPOsTWArFs6w66iuQAAgD8AAIA/GoUyPUgzk7qKNya6RAQWteC7+Dj2i0A5AACAPwAAgD9TkYs+c2NbP5pk6z2+kq6+NyY6Ptu7vr0AAAAAAAAAAGYWnT2P3k+6tfzqvIJsb7XQgk67MyPiNAAAAAAAAAAAwASFPfb0R7qG+Dm8g1s6NsTzBrmBPau1AACAPwAAgD9zFKQ9KUBqup3m7Lp+VVi2tQ9eO5YRCToAAIA/AAAAAIDsjj32BCy6c6huOrWn5zW1v186DV2LuQAAgD8AAIA/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwiGlIwBQ5R0lFKULg=="}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="}, "_last_original_obs": null, "_episode_num": 0, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": -0.015808000000000044, "_stats_window_size": 100, "ep_info_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVPQwAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQF+mG+K0lZ6MAWyUTegDjAF0lEdAlG55yIYWL3V9lChoBkdAOYvOhTOxB2gHTQ4BaAhHQJR2ORr8BMl1fZQoaAZHQGY1+/pMYdhoB03oA2gIR0CUg8cN6PbPdX2UKGgGR0Bkt5Fb3XZoaAdN6ANoCEdAlIVrADaGpXV9lChoBkdAYzYZ5zHS4WgHTegDaAhHQJSGITufEn91fZQoaAZHQGPX2ZqmCRRoB03oA2gIR0CUhv63RXwLdX2UKGgGR0BgtgV45cTraAdN6ANoCEdAlIcLs0HhTHV9lChoBkdAZCwnGbTc7GgHTegDaAhHQJSHuxmkFfR1fZQoaAZHQGD9VtO2y9poB03oA2gIR0CUpLE7W/ahdX2UKGgGR0BoU8PatcOcaAdN6ANoCEdAlKa4sunMuHV9lChoBkdAZHo9ugpSaWgHTegDaAhHQJSyNR1oxpN1fZQoaAZHQGfUTVc2R7toB03oA2gIR0CUs3Bl+VkddX2UKGgGR0BiGRnSOR1YaAdN6ANoCEdAlLebf51vEXV9lChoBkdAX0d8stkFwGgHTegDaAhHQJS4ZQYUFjd1fZQoaAZHQFqBx8D0UXZoB03oA2gIR0CUuH1BMSK4dX2UKGgGR0Bh9VLDhtLtaAdN6ANoCEdAlLkd9ph4MXV9lChoBkdAZE4RUWEbpGgHTegDaAhHQJS8GLhrFfl1fZQoaAZHwDC0xqO938poB0v3aAhHQJS/uEi+tbN1fZQoaAZHQGKCy925hBtoB03oA2gIR0CUwhXgccU/dX2UKGgGR0A0los7MgU2aAdL4WgIR0CUw4JC0F8pdX2UKGgGR0BhFBIMBp6AaAdN6ANoCEdAlM3T/MnqmnV9lChoBkdAZYK5ggHNYGgHTegDaAhHQJTPbOKO1fF1fZQoaAZHQGOCPf8/D+BoB03oA2gIR0CU0BvfCQ9zdX2UKGgGR0BjwLTYukDZaAdN6ANoCEdAlND/QBxPwnV9lChoBkdAYF7bGFSKnGgHTegDaAhHQJTRDKifxtp1fZQoaAZHQF3kLRa5f+loB03oA2gIR0CU0dDUmUnpdX2UKGgGR0BldYsbvPToaAdN6ANoCEdAlPUoW1twaXV9lChoBkdAYoy99MK1HGgHTegDaAhHQJT291xKg7J1fZQoaAZHQGIrpy6tknVoB03oA2gIR0CVAL81Gb1AdX2UKGgGR0BnzBNEgGKRaAdN6ANoCEdAlQTMUAT7EnV9lChoBkdAY7o+L3sXzmgHTegDaAhHQJUFwfCAMDx1fZQoaAZHQGaymMwUQCloB03oA2gIR0CVBrKSgXdkdX2UKGgGR0A9wW56MR6GaAdNFAFoCEdAlQbhPwd8zHV9lChoBkdASVUtNBWxQmgHTRkBaAhHQJUInMOf/WF1fZQoaAZHQGDJS619fC1oB03oA2gIR0CVCjPEKmbcdX2UKGgGR0BjU72HtWuHaAdN6ANoCEdAlQ39NWU8m3V9lChoBkdAY5Zjebd8A2gHTegDaAhHQJUQStCAtnR1fZQoaAZHQF3QrRjSXt1oB03oA2gIR0CVEbO6d1+zdX2UKGgGR0BJtsrmQr+YaAdL8mgIR0CVEz/C66J7dX2UKGgGR0AGOetjkMkQaAdNBQFoCEdAlRYaIeo1k3V9lChoBkdAb1u1NxlxwWgHTSwBaAhHQJUWe5SWJJp1fZQoaAZHQGEIHn+yZ8doB03oA2gIR0CVHLmK64DtdX2UKGgGR0BiEdMwlByCaAdN6ANoCEdAlR6sMZxaPnV9lChoBkdAYYkjlgc94mgHTegDaAhHQJUfgIKMNtt1fZQoaAZHQFnOez2OAAhoB03oA2gIR0CVIJ6u4gA7dX2UKGgGR0BjpKA4GUwBaAdN6ANoCEdAlSCyXpnpS3V9lChoBkdAVsZ1jiGWU2gHTegDaAhHQJUhkYqG1x91fZQoaAZHQGSu6s6q815oB03oA2gIR0CVTHUornTzdX2UKGgGR0BlUBBomG/OaAdN6ANoCEdAlVBz101ZT3V9lChoBkdAYCfgx8D0UWgHTegDaAhHQJVRZaSs8xN1fZQoaAZHQGRpa0QbuMNoB03oA2gIR0CVV7j2SMcZdX2UKGgGR0BocVar3j+8aAdN6ANoCEdAlV1iYTj//HV9lChoBkdAZUK6J66as2gHTegDaAhHQJVg1jSXt0F1fZQoaAZHQGLCjjJdSl5oB03oA2gIR0CVYvPjn3cpdX2UKGgGR0BgrWT9sJpnaAdN6ANoCEdAlWSqUNayKXV9lChoBkdAY36bYsd1dWgHTegDaAhHQJVnWFdszl91fZQoaAZHQGWAh+nZTQ5oB03oA2gIR0CVZ7QiRnvldX2UKGgGR0BluzwlSjxkaAdN6ANoCEdAlWw2NrCWNXV9lChoBkdAZJx/5tWMj2gHTegDaAhHQJVtonRb8m91fZQoaAZHQGLKE4NqgyxoB03oA2gIR0CVbju7pV0cdX2UKGgGR0BfqhiTdLxqaAdN6ANoCEdAlW8GVu76HnV9lChoBkdAZEobEP1+RmgHTegDaAhHQJVvFSNwR5F1fZQoaAZHQGCHF7laKUFoB03oA2gIR0CVb775mAbydX2UKGgGR0BMb8pkPMB7aAdNAAFoCEdAlXOr0WdmQXV9lChoBkdAQs0yHmA9V2gHTRcBaAhHQJWL99Ujs2N1fZQoaAZHQGVP6P0Zm7JoB03oA2gIR0CVm+9vS+g2dX2UKGgGR0Bi8MBGQSzxaAdN6ANoCEdAlaBpLEk0JnV9lChoBkdAYAkhnJ1aGGgHTegDaAhHQJWhVYLb5/N1fZQoaAZHQGT7DM3ZPEdoB03oA2gIR0CVpjhnrY5DdX2UKGgGR0BkfsqvvBrOaAdN6ANoCEdAlarQdXDFZXV9lChoBkdAYTBDQZ4wAWgHTegDaAhHQJWtYUN8VpN1fZQoaAZHQGeMoH9m6GxoB03oA2gIR0CVrvlCkXUIdX2UKGgGR0BkZN7BwdbQaAdN6ANoCEdAlbCbytmthnV9lChoBkdAR+CBXjlxO2gHS+FoCEdAlbMECNjslnV9lChoBkdAY6WqR2bG3mgHTegDaAhHQJWzRWV/tpp1fZQoaAZHQGNUEGiYb85oB03oA2gIR0CVt/QQ+UyIdX2UKGgGR0BhD6Cz1K5DaAdN6ANoCEdAlblFwT/Q0HV9lChoBkdAZHBaaCtihGgHTegDaAhHQJW52+xnnMd1fZQoaAZHQGIdHJcPe55oB03oA2gIR0CVupxqfvnbdX2UKGgGR0Bg4pc/t6X0aAdN6ANoCEdAlbtP8IiTuHV9lChoBkdAZMbC2tuDSWgHTegDaAhHQJW/pNTLns91fZQoaAZHQCb2I2wV0tBoB0v+aAhHQJXEcCIUJv51fZQoaAZHQEThrHlwLmZoB0vtaAhHQJXE/8UEgW91fZQoaAZHQGQ7fdqL0jFoB03oA2gIR0CV3Z/VRUFTdX2UKGgGR0Bj++EmICU5aAdN6ANoCEdAlehS/9Hc13V9lChoBkdAQod2JSBK+WgHS+BoCEdAletFUhmoSHV9lChoBkdAYkVCSA6Mi2gHTegDaAhHQJXsWHvc8DB1fZQoaAZHQGNi5U1hsqJoB03oA2gIR0CV7TfQrtmddX2UKGgGR0Bn4qJTER8MaAdN6ANoCEdAlfX7fDUExXV9lChoBkdAYveJLM9r42gHTegDaAhHQJX4ih+OOsF1fZQoaAZHQF3wv8IiTt9oB03oA2gIR0CV+h+b3Gn5dX2UKGgGR0BbFnjlxOtXaAdN6ANoCEdAlfvVRxcVxnV9lChoBkdAZUIOZLIxQGgHTegDaAhHQJX+nzxwyZd1fZQoaAZHQGDqg/LTx5NoB03oA2gIR0CV/uis4ku6dX2UKGgGR0BeIM1O0svqaAdN6ANoCEdAlgnOx8lXzXV9lChoBkdAY4eelKsdUGgHTegDaAhHQJYLTaDf3vh1fZQoaAZHQGQf/t6X0GxoB03oA2gIR0CWDIw0fozOdX2UKGgGR0BhtSH9FWn1aAdN6ANoCEdAlhNHQID5kHV9lChoBkdAYD5Nr0rbxmgHTegDaAhHQJYYhIqbz9V1fZQoaAZHQGQld7OVxCJoB03oA2gIR0CWGR6guh9LdWUu"}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="}, "_n_updates": 248, "observation_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVcAIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lGgFk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoB4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoECiWCAAAAAAAAAABAQEBAQEBAZRoFEsIhZRoGHSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBAoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaApLCIWUaBh0lFKUjARoaWdolGgQKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgKSwiFlGgYdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=", "dtype": "float32", "bounded_below": "[ True True True True True True True True]", "bounded_above": "[ True True True True True True True True]", "_shape": [8], "low": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]", "low_repr": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high_repr": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]", "_np_random": null}, "action_space": {":type:": "<class 'gymnasium.spaces.discrete.Discrete'>", ":serialized:": "gAWV1QAAAAAAAACMGWd5bW5hc2l1bS5zcGFjZXMuZGlzY3JldGWUjAhEaXNjcmV0ZZSTlCmBlH2UKIwBbpSMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMBnNjYWxhcpSTlIwFbnVtcHmUjAVkdHlwZZSTlIwCaTiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYkMIBAAAAAAAAACUhpRSlIwFc3RhcnSUaAhoDkMIAAAAAAAAAACUhpRSlIwGX3NoYXBllCloCmgOjApfbnBfcmFuZG9tlE51Yi4=", "n": "4", "start": "0", "_shape": [], "dtype": "int64", "_np_random": null}, "n_envs": 16, "n_steps": 1024, "gamma": 0.999, "gae_lambda": 0.98, "ent_coef": 0.01, "vf_coef": 0.5, "max_grad_norm": 0.5, "batch_size": 64, "n_epochs": 4, "clip_range": {":type:": "<class 'function'>", ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/JmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "clip_range_vf": null, "normalize_advantage": true, "target_kl": null, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "system_info": {"OS": "Linux-5.15.107+-x86_64-with-glibc2.31 # 1 SMP Sat Apr 29 09:15:28 UTC 2023", "Python": "3.10.11", "Stable-Baselines3": "2.0.0a5", "PyTorch": "2.0.0+cu118", "GPU Enabled": "True", "Numpy": "1.22.4", "Cloudpickle": "2.2.1", "Gymnasium": "0.28.1", "OpenAI Gym": "0.25.2"}}
 
1
+ {"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==", "__module__": "stable_baselines3.common.policies", "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ", "__init__": "<function ActorCriticPolicy.__init__ at 0x7f5605504700>", "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7f5605504790>", "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7f5605504820>", "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7f56055048b0>", "_build": "<function ActorCriticPolicy._build at 0x7f5605504940>", "forward": "<function ActorCriticPolicy.forward at 0x7f56055049d0>", "extract_features": "<function ActorCriticPolicy.extract_features at 0x7f5605504a60>", "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7f5605504af0>", "_predict": "<function ActorCriticPolicy._predict at 0x7f5605504b80>", "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7f5605504c10>", "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7f5605504ca0>", "predict_values": "<function ActorCriticPolicy.predict_values at 0x7f5605504d30>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x7f56054efe40>"}, "verbose": 1, "policy_kwargs": {}, "num_timesteps": 2015232, "_total_timesteps": 2000000, "_num_timesteps_at_start": 0, "seed": null, "action_noise": null, "start_time": 1684177142044086768, "learning_rate": 0.0003, "tensorboard_log": null, "_last_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVdQIAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYAAgAAAAAAAE0GAL2OnKa8zlUDPD9zgbo6Tgm+qnOnvQAAgD8AAIA/s1EAvcWUcj5lRtc9g2OxvoH4ej2ilKu8AAAAAAAAAAAA+vq8w1k8unncLjMgBjUvCLnRuoXKwbMAAIA/AACAP02GZT09nZI/UK07PuCBHr9c14U9Ho7dPQAAAAAAAAAAZuuEvEiphrqAji4zc7V6rgAkEzsrWNOzAACAPwAAgD9myrO8yeBqPSABcD3s15q+qRrnPZI9nL0AAAAAAAAAABogEr1SKP27cDBtPRZRSL4gc8O8j/GEvwAAgD8AAIA/ZtrfPK5Eh7zz7jA+G64avhqKgL0GhA6+AACAPwAAgD8A+9I9VaKNP07AMT6Tsxu/yAb1PenxqLsAAAAAAAAAAKBKGD7lZF4+LqBpvh653r4lO7u9Rd/TvAAAAAAAAAAA5oJPPWHqp7xWwKe8OT51Ozs5ET6jLWO8AACAPwAAgD/NuRq9GayTPoU2Nz6iItK+3xSRPU3YHj4AAAAAAAAAAJq4Qz1oy7C84yvevVlmKD0Kma49CCjXvAAAgD8AAIA/MyvlPHuKsroRx6a7HjBoOP4ux7kWG/c3AACAPwAAAAAzf4s7yqa0PwjE3D4c3Jg8VGuhu+IGyL0AAAAAAAAAAJrxXL2th7M/StQ/vwcMCL6q7MQ8kl3+vAAAAAAAAAAAlIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwiGlIwBQ5R0lFKULg=="}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAQCUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="}, "_last_original_obs": null, "_episode_num": 0, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": -0.007616000000000067, "_stats_window_size": 100, "ep_info_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWV5AsAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQHHTOo99tuWMAWyUS/yMAXSUR0CfVjL5ylvZdX2UKGgGR0Bx+vIyTINmaAdL3mgIR0CfVlya/h2odX2UKGgGR0Bw/PtsvZh8aAdL92gIR0CfVtORDCxedX2UKGgGR0Byr4wvg3tKaAdL02gIR0CfV+jCHh0hdX2UKGgGR0By+G/QBxPwaAdLyWgIR0CfV+TYukDZdX2UKGgGR0BPmtJFspG4aAdLrGgIR0CfWIOfukULdX2UKGgGR0BxUXgjyFwlaAdLy2gIR0CfWR4Ia99MdX2UKGgGR0BwjxUrCm/GaAdL1GgIR0CfWW5Y5ksjdX2UKGgGR0ByDqNGViWnaAdL0mgIR0CfWeQiRnvldX2UKGgGR0BwSAZgogFHaAdL1GgIR0CfWkMS9M9KdX2UKGgGR0BzmbQu27WeaAdL0GgIR0CfWmhbnoxIdX2UKGgGR0ByctpAUtZnaAdL1GgIR0CfWud0q6OHdX2UKGgGR0B0K2HnEETyaAdL3WgIR0CfWv/hl18tdX2UKGgGR0BzWnE4vN/waAdLzmgIR0CfWxAtnPE9dX2UKGgGR0BzbuNDMNc4aAdL6WgIR0CfW1liBoVVdX2UKGgGR0BwXBQbdadMaAdL62gIR0CfW6TPSlWPdX2UKGgGR0BzATJq7AclaAdL1mgIR0CfW7WaMJhOdX2UKGgGR0ByHHZuhsZYaAdL4WgIR0CfW8v60pmVdX2UKGgGR0BzDDChvitJaAdL7mgIR0CfXMDuSfUXdX2UKGgGR0BygYK/mDDkaAdLw2gIR0CfXNvjOs1bdX2UKGgGR0BxNyPluFYdaAdL0WgIR0CfXTxj8UEgdX2UKGgGR0Bv2tev6j33aAdLx2gIR0CfXicynDR/dX2UKGgGR0Bw81HvttygaAdL6WgIR0CfXn9QoCuEdX2UKGgGR0Bw3FNXYDkmaAdL3GgIR0CfXwUlRgqmdX2UKGgGR0Bw29y6tknUaAdLyWgIR0CfX35Zr56/dX2UKGgGR0BAoPmHP/rCaAdLmmgIR0CfX4cCHRCydX2UKGgGR0BzbkrUb1h9aAdLz2gIR0CfX4RNATqTdX2UKGgGR0BwrXqX4TK1aAdL5mgIR0CfX8IXTEzgdX2UKGgGR0BwNd4LThHcaAdL12gIR0CfYE1Cw8nvdX2UKGgGR0BxjwcinpB5aAdL3GgIR0CfYJN4JNTMdX2UKGgGR0BwXXtUn5SFaAdL5GgIR0CfciqKxcFAdX2UKGgGR0BzJGz4UN8WaAdL4WgIR0Cfcm+10DEFdX2UKGgGR0BwNFm29crzaAdL3WgIR0CfcrUSqU/wdX2UKGgGR0BzTx5AyEcsaAdLxmgIR0Cfc1RtgrpadX2UKGgGR0BxdDZf2K2saAdL3WgIR0Cfc9RcNYr8dX2UKGgGR0BzjlcpsoDxaAdNCgFoCEdAn3WEAcT8HnV9lChoBkdAcdseii7Ci2gHS9BoCEdAn3YBSk0rLHV9lChoBkdAcb6SElE7XGgHS8JoCEdAn3YlzMibD3V9lChoBkdAcHEc580DU2gHS+loCEdAn3ZEbo8p1HV9lChoBkdAcceru6VdHGgHS/poCEdAn3ZtQwblzXV9lChoBkdAckb/Y8Md92gHS8ZoCEdAn3Zi/GlyinV9lChoBkdAb88R+SbH62gHS9NoCEdAn3bWbsniN3V9lChoBkdAclRe3QUpNWgHS95oCEdAn3eH+hoM8nV9lChoBkdAcUxMKTjebmgHS8ZoCEdAn3iKQmu1W3V9lChoBkdAcxiMTN+so2gHS+VoCEdAn3ihHf/FSHV9lChoBkdAcRznuiN83WgHS9toCEdAn3ixt+CsfnV9lChoBkdAc6qAoG6f8WgHS+toCEdAn3lt0Rvm5nV9lChoBkdAcJ5oOx0MgGgHS+NoCEdAn3n+qWC2+nV9lChoBkdAcgaj0th/iGgHS9BoCEdAn3o8UmD15HV9lChoBkdAcNkxagVXWGgHS+VoCEdAn3ukihWYGHV9lChoBkdAceuzJp35e2gHS8FoCEdAn3ybuhK15XV9lChoBkdAccqFa0QbuWgHS8NoCEdAn31ql+EytXV9lChoBkdAcYd46fapP2gHS9RoCEdAn33Azk6tDHV9lChoBkdAcwlO5J9RaWgHS9hoCEdAn34QSamXPnV9lChoBkdAcv/PEKmbb2gHS9loCEdAn34wbEP1+XV9lChoBkdAc5FAiml67mgHS+toCEdAn38HH3lCC3V9lChoBkdAb4rdWyTpxGgHS9BoCEdAn382Y8dPtXV9lChoBkdAcc6HO8kD6mgHS8hoCEdAn4EE8JUo8nV9lChoBkdAcCzSRKYiPmgHTRYBaAhHQJ+BTC66J691fZQoaAZHQHIPkJ4SpR5oB0vjaAhHQJ+BTJ/5Lyt1fZQoaAZHQHDsQEt/WlNoB0vsaAhHQJ+BlpXZGrl1fZQoaAZHQHHqGp++dsloB0vyaAhHQJ+Bus8xKxt1fZQoaAZHQHKmZlnRLK5oB0vPaAhHQJ+B6KziS7p1fZQoaAZHQHM1M81XNkhoB0vGaAhHQJ+DZQMx46h1fZQoaAZHQHHdVSwW30BoB00gAWgIR0CfhY2i+L3sdX2UKGgGR0Bybib1AZ88aAdL4mgIR0CfhckZ75VPdX2UKGgGR0By0zVrhzeXaAdLyWgIR0CfhfkqtozvdX2UKGgGR0BxhAKlYU35aAdL42gIR0Cfhrg4ffXPdX2UKGgGR0BzJGXfIjnnaAdLy2gIR0Cfh6obGWD6dX2UKGgGR0ByJpTR6WxAaAdL6GgIR0Cfh8F2V3UydX2UKGgGR0Byat5IH1OCaAdL9WgIR0CfiCAHmig1dX2UKGgGR0Bxm3bDdgv2aAdL6mgIR0CfiLR2bG3ndX2UKGgGR0Bxgc8jiXIEaAdLwWgIR0CfiT4M4LkTdX2UKGgGR0BxO/xRVIZqaAdL2WgIR0CfijTBZZB+dX2UKGgGR0ByD4KeCkGiaAdL0WgIR0Cfikki2UjcdX2UKGgGR0BxYh2MbWEsaAdL5GgIR0CfilsiB5HFdX2UKGgGR0BzsDaAWi1zaAdL5WgIR0CfirPNmlImdX2UKGgGR0ByHTyVfNRnaAdL4mgIR0CfitNWEK3NdX2UKGgGR0BvbSQq7ROUaAdLymgIR0CfizAWSEDhdX2UKGgGR0BDpDTKDCgsaAdLqmgIR0Cfi5pzcRDkdX2UKGgGR0Bz46hf0EowaAdL0GgIR0CfjODpC8e0dX2UKGgGR0BykLdcjZ+QaAdL3mgIR0CfjSENe+mFdX2UKGgGR0Bob9wcYIjXaAdN6ANoCEdAn44KeXiR4nV9lChoBkdAccY22G7Bf2gHS+xoCEdAn44isOoYN3V9lChoBkdAcnPhF3IMjWgHS85oCEdAn45AsK9f1HV9lChoBkdAcS++t8uzyGgHS9hoCEdAn44yr92ovXV9lChoBkdAb7LfHggow2gHS+NoCEdAn46McU/OdHV9lChoBkdAcJ7f0mMOw2gHS8doCEdAn49rUG3WnXV9lChoBkdAc6MG7Bfrr2gHS+FoCEdAn490QbuMM3V9lChoBkdAcGsvUjLSu2gHS9BoCEdAn4+4ClrM1XV9lChoBkdAcaWeo1k1/GgHS/poCEdAn4/M8La24XV9lChoBkdAcJa6KtPpIWgHS9xoCEdAn5AXfhuO0nV9lChoBkdAb1xn/1g6VGgHS85oCEdAn5A7sjVx0nV9lChoBkdAcezG4I8hcWgHS9VoCEdAn5BId6sySHV9lChoBkdAcsjsWweNk2gHS8poCEdAn5B0snRb8nV9lChoBkdAcQ3gflp48mgHS8xoCEdAn5DguVX3g3V9lChoBkdAcqYhbGFSKmgHS89oCEdAn5JSfg75mHV9lChoBkdAco2aXKKYRmgHS+JoCEdAn5KeMZP2wnV9lChoBkdAcrQviLl3hWgHS85oCEdAn5NQcT8HfXV9lChoBkdAcIXBw++ueWgHS9JoCEdAn5N/qC6H03VlLg=="}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="}, "_n_updates": 492, "observation_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVcAIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lGgFk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoB4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoECiWCAAAAAAAAAABAQEBAQEBAZRoFEsIhZRoGHSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBAoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaApLCIWUaBh0lFKUjARoaWdolGgQKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgKSwiFlGgYdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=", "dtype": "float32", "bounded_below": "[ True True True True True True True True]", "bounded_above": "[ True True True True True True True True]", "_shape": [8], "low": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]", "low_repr": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high_repr": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]", "_np_random": null}, "action_space": {":type:": "<class 'gymnasium.spaces.discrete.Discrete'>", ":serialized:": "gAWV1QAAAAAAAACMGWd5bW5hc2l1bS5zcGFjZXMuZGlzY3JldGWUjAhEaXNjcmV0ZZSTlCmBlH2UKIwBbpSMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMBnNjYWxhcpSTlIwFbnVtcHmUjAVkdHlwZZSTlIwCaTiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYkMIBAAAAAAAAACUhpRSlIwFc3RhcnSUaAhoDkMIAAAAAAAAAACUhpRSlIwGX3NoYXBllCloCmgOjApfbnBfcmFuZG9tlE51Yi4=", "n": "4", "start": "0", "_shape": [], "dtype": "int64", "_np_random": null}, "n_envs": 16, "n_steps": 1024, "gamma": 0.999, "gae_lambda": 0.98, "ent_coef": 0.01, "vf_coef": 0.5, "max_grad_norm": 0.5, "batch_size": 64, "n_epochs": 4, "clip_range": {":type:": "<class 'function'>", ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/JmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "clip_range_vf": null, "normalize_advantage": true, "target_kl": null, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "system_info": {"OS": "Linux-5.15.107+-x86_64-with-glibc2.31 # 1 SMP Sat Apr 29 09:15:28 UTC 2023", "Python": "3.10.11", "Stable-Baselines3": "2.0.0a5", "PyTorch": "2.0.0+cu118", "GPU Enabled": "True", "Numpy": "1.22.4", "Cloudpickle": "2.2.1", "Gymnasium": "0.28.1", "OpenAI Gym": "0.25.2"}}
ppo-LunarLander-v2.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8ef5dab3718b521dce6431840fee5669f3e424d192c47c08c9ac9a86c06f103f
3
- size 146747
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7464187a0f6ed8de815afea1ab21dbfcfaa153834090a11aa254be9cd5a6a390
3
+ size 146631
ppo-LunarLander-v2/data CHANGED
@@ -4,54 +4,54 @@
4
  ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==",
5
  "__module__": "stable_baselines3.common.policies",
6
  "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ",
7
- "__init__": "<function ActorCriticPolicy.__init__ at 0x7f48b48d5750>",
8
- "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7f48b48d57e0>",
9
- "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7f48b48d5870>",
10
- "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7f48b48d5900>",
11
- "_build": "<function ActorCriticPolicy._build at 0x7f48b48d5990>",
12
- "forward": "<function ActorCriticPolicy.forward at 0x7f48b48d5a20>",
13
- "extract_features": "<function ActorCriticPolicy.extract_features at 0x7f48b48d5ab0>",
14
- "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7f48b48d5b40>",
15
- "_predict": "<function ActorCriticPolicy._predict at 0x7f48b48d5bd0>",
16
- "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7f48b48d5c60>",
17
- "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7f48b48d5cf0>",
18
- "predict_values": "<function ActorCriticPolicy.predict_values at 0x7f48b48d5d80>",
19
  "__abstractmethods__": "frozenset()",
20
- "_abc_impl": "<_abc._abc_data object at 0x7f48b48e8340>"
21
  },
22
  "verbose": 1,
23
  "policy_kwargs": {},
24
- "num_timesteps": 1015808,
25
- "_total_timesteps": 1000000,
26
  "_num_timesteps_at_start": 0,
27
  "seed": null,
28
  "action_noise": null,
29
- "start_time": 1684093156695198054,
30
  "learning_rate": 0.0003,
31
  "tensorboard_log": null,
32
  "_last_obs": {
33
  ":type:": "<class 'numpy.ndarray'>",
34
- ":serialized:": "gAWVdQIAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYAAgAAAAAAADPLeDspsEW6/HgtOpRYf7YFxcC6bTmBtQAAgD8AAIA/mssTPiri0j4qlK26OoKlvvmkfT0NjNw9AAAAAAAAAACAAuK9w0lYuoJFtrvGDFg4+jVlO9bb3jcAAIA/AAAAAG3sG7622DY/O/vAvYBux75dpZe9nf5LOwAAAAAAAAAAAJCMPI+yG7rrgp67MbZcOHFqMTt4zHQ4AACAPwAAgD9NMio9romculO8+Tn6Isu1eK2/ugLXD7kAAIA/AACAP2ahUz2kcBC5g/diu0RKz7b4C9I7MqCHOgAAgD8AAIA/M+SivApdEbs3XaY8A+OFPN07ujt9iGi9AACAPwAAgD8zxvo8H+WQuYUz0beaiVkwemHBuw1C+DYAAIA/AACAPwAUjjsUvJy6wtqNOvPOsTWArFs6w66iuQAAgD8AAIA/GoUyPUgzk7qKNya6RAQWteC7+Dj2i0A5AACAPwAAgD9TkYs+c2NbP5pk6z2+kq6+NyY6Ptu7vr0AAAAAAAAAAGYWnT2P3k+6tfzqvIJsb7XQgk67MyPiNAAAAAAAAAAAwASFPfb0R7qG+Dm8g1s6NsTzBrmBPau1AACAPwAAgD9zFKQ9KUBqup3m7Lp+VVi2tQ9eO5YRCToAAIA/AAAAAIDsjj32BCy6c6huOrWn5zW1v186DV2LuQAAgD8AAIA/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwiGlIwBQ5R0lFKULg=="
35
  },
36
  "_last_episode_starts": {
37
  ":type:": "<class 'numpy.ndarray'>",
38
- ":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="
39
  },
40
  "_last_original_obs": null,
41
  "_episode_num": 0,
42
  "use_sde": false,
43
  "sde_sample_freq": -1,
44
- "_current_progress_remaining": -0.015808000000000044,
45
  "_stats_window_size": 100,
46
  "ep_info_buffer": {
47
  ":type:": "<class 'collections.deque'>",
48
- ":serialized:": "gAWVPQwAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQF+mG+K0lZ6MAWyUTegDjAF0lEdAlG55yIYWL3V9lChoBkdAOYvOhTOxB2gHTQ4BaAhHQJR2ORr8BMl1fZQoaAZHQGY1+/pMYdhoB03oA2gIR0CUg8cN6PbPdX2UKGgGR0Bkt5Fb3XZoaAdN6ANoCEdAlIVrADaGpXV9lChoBkdAYzYZ5zHS4WgHTegDaAhHQJSGITufEn91fZQoaAZHQGPX2ZqmCRRoB03oA2gIR0CUhv63RXwLdX2UKGgGR0BgtgV45cTraAdN6ANoCEdAlIcLs0HhTHV9lChoBkdAZCwnGbTc7GgHTegDaAhHQJSHuxmkFfR1fZQoaAZHQGD9VtO2y9poB03oA2gIR0CUpLE7W/ahdX2UKGgGR0BoU8PatcOcaAdN6ANoCEdAlKa4sunMuHV9lChoBkdAZHo9ugpSaWgHTegDaAhHQJSyNR1oxpN1fZQoaAZHQGfUTVc2R7toB03oA2gIR0CUs3Bl+VkddX2UKGgGR0BiGRnSOR1YaAdN6ANoCEdAlLebf51vEXV9lChoBkdAX0d8stkFwGgHTegDaAhHQJS4ZQYUFjd1fZQoaAZHQFqBx8D0UXZoB03oA2gIR0CUuH1BMSK4dX2UKGgGR0Bh9VLDhtLtaAdN6ANoCEdAlLkd9ph4MXV9lChoBkdAZE4RUWEbpGgHTegDaAhHQJS8GLhrFfl1fZQoaAZHwDC0xqO938poB0v3aAhHQJS/uEi+tbN1fZQoaAZHQGKCy925hBtoB03oA2gIR0CUwhXgccU/dX2UKGgGR0A0los7MgU2aAdL4WgIR0CUw4JC0F8pdX2UKGgGR0BhFBIMBp6AaAdN6ANoCEdAlM3T/MnqmnV9lChoBkdAZYK5ggHNYGgHTegDaAhHQJTPbOKO1fF1fZQoaAZHQGOCPf8/D+BoB03oA2gIR0CU0BvfCQ9zdX2UKGgGR0BjwLTYukDZaAdN6ANoCEdAlND/QBxPwnV9lChoBkdAYF7bGFSKnGgHTegDaAhHQJTRDKifxtp1fZQoaAZHQF3kLRa5f+loB03oA2gIR0CU0dDUmUnpdX2UKGgGR0BldYsbvPToaAdN6ANoCEdAlPUoW1twaXV9lChoBkdAYoy99MK1HGgHTegDaAhHQJT291xKg7J1fZQoaAZHQGIrpy6tknVoB03oA2gIR0CVAL81Gb1AdX2UKGgGR0BnzBNEgGKRaAdN6ANoCEdAlQTMUAT7EnV9lChoBkdAY7o+L3sXzmgHTegDaAhHQJUFwfCAMDx1fZQoaAZHQGaymMwUQCloB03oA2gIR0CVBrKSgXdkdX2UKGgGR0A9wW56MR6GaAdNFAFoCEdAlQbhPwd8zHV9lChoBkdASVUtNBWxQmgHTRkBaAhHQJUInMOf/WF1fZQoaAZHQGDJS619fC1oB03oA2gIR0CVCjPEKmbcdX2UKGgGR0BjU72HtWuHaAdN6ANoCEdAlQ39NWU8m3V9lChoBkdAY5Zjebd8A2gHTegDaAhHQJUQStCAtnR1fZQoaAZHQF3QrRjSXt1oB03oA2gIR0CVEbO6d1+zdX2UKGgGR0BJtsrmQr+YaAdL8mgIR0CVEz/C66J7dX2UKGgGR0AGOetjkMkQaAdNBQFoCEdAlRYaIeo1k3V9lChoBkdAb1u1NxlxwWgHTSwBaAhHQJUWe5SWJJp1fZQoaAZHQGEIHn+yZ8doB03oA2gIR0CVHLmK64DtdX2UKGgGR0BiEdMwlByCaAdN6ANoCEdAlR6sMZxaPnV9lChoBkdAYYkjlgc94mgHTegDaAhHQJUfgIKMNtt1fZQoaAZHQFnOez2OAAhoB03oA2gIR0CVIJ6u4gA7dX2UKGgGR0BjpKA4GUwBaAdN6ANoCEdAlSCyXpnpS3V9lChoBkdAVsZ1jiGWU2gHTegDaAhHQJUhkYqG1x91fZQoaAZHQGSu6s6q815oB03oA2gIR0CVTHUornTzdX2UKGgGR0BlUBBomG/OaAdN6ANoCEdAlVBz101ZT3V9lChoBkdAYCfgx8D0UWgHTegDaAhHQJVRZaSs8xN1fZQoaAZHQGRpa0QbuMNoB03oA2gIR0CVV7j2SMcZdX2UKGgGR0BocVar3j+8aAdN6ANoCEdAlV1iYTj//HV9lChoBkdAZUK6J66as2gHTegDaAhHQJVg1jSXt0F1fZQoaAZHQGLCjjJdSl5oB03oA2gIR0CVYvPjn3cpdX2UKGgGR0BgrWT9sJpnaAdN6ANoCEdAlWSqUNayKXV9lChoBkdAY36bYsd1dWgHTegDaAhHQJVnWFdszl91fZQoaAZHQGWAh+nZTQ5oB03oA2gIR0CVZ7QiRnvldX2UKGgGR0BluzwlSjxkaAdN6ANoCEdAlWw2NrCWNXV9lChoBkdAZJx/5tWMj2gHTegDaAhHQJVtonRb8m91fZQoaAZHQGLKE4NqgyxoB03oA2gIR0CVbju7pV0cdX2UKGgGR0BfqhiTdLxqaAdN6ANoCEdAlW8GVu76HnV9lChoBkdAZEobEP1+RmgHTegDaAhHQJVvFSNwR5F1fZQoaAZHQGCHF7laKUFoB03oA2gIR0CVb775mAbydX2UKGgGR0BMb8pkPMB7aAdNAAFoCEdAlXOr0WdmQXV9lChoBkdAQs0yHmA9V2gHTRcBaAhHQJWL99Ujs2N1fZQoaAZHQGVP6P0Zm7JoB03oA2gIR0CVm+9vS+g2dX2UKGgGR0Bi8MBGQSzxaAdN6ANoCEdAlaBpLEk0JnV9lChoBkdAYAkhnJ1aGGgHTegDaAhHQJWhVYLb5/N1fZQoaAZHQGT7DM3ZPEdoB03oA2gIR0CVpjhnrY5DdX2UKGgGR0BkfsqvvBrOaAdN6ANoCEdAlarQdXDFZXV9lChoBkdAYTBDQZ4wAWgHTegDaAhHQJWtYUN8VpN1fZQoaAZHQGeMoH9m6GxoB03oA2gIR0CVrvlCkXUIdX2UKGgGR0BkZN7BwdbQaAdN6ANoCEdAlbCbytmthnV9lChoBkdAR+CBXjlxO2gHS+FoCEdAlbMECNjslnV9lChoBkdAY6WqR2bG3mgHTegDaAhHQJWzRWV/tpp1fZQoaAZHQGNUEGiYb85oB03oA2gIR0CVt/QQ+UyIdX2UKGgGR0BhD6Cz1K5DaAdN6ANoCEdAlblFwT/Q0HV9lChoBkdAZHBaaCtihGgHTegDaAhHQJW52+xnnMd1fZQoaAZHQGIdHJcPe55oB03oA2gIR0CVupxqfvnbdX2UKGgGR0Bg4pc/t6X0aAdN6ANoCEdAlbtP8IiTuHV9lChoBkdAZMbC2tuDSWgHTegDaAhHQJW/pNTLns91fZQoaAZHQCb2I2wV0tBoB0v+aAhHQJXEcCIUJv51fZQoaAZHQEThrHlwLmZoB0vtaAhHQJXE/8UEgW91fZQoaAZHQGQ7fdqL0jFoB03oA2gIR0CV3Z/VRUFTdX2UKGgGR0Bj++EmICU5aAdN6ANoCEdAlehS/9Hc13V9lChoBkdAQod2JSBK+WgHS+BoCEdAletFUhmoSHV9lChoBkdAYkVCSA6Mi2gHTegDaAhHQJXsWHvc8DB1fZQoaAZHQGNi5U1hsqJoB03oA2gIR0CV7TfQrtmddX2UKGgGR0Bn4qJTER8MaAdN6ANoCEdAlfX7fDUExXV9lChoBkdAYveJLM9r42gHTegDaAhHQJX4ih+OOsF1fZQoaAZHQF3wv8IiTt9oB03oA2gIR0CV+h+b3Gn5dX2UKGgGR0BbFnjlxOtXaAdN6ANoCEdAlfvVRxcVxnV9lChoBkdAZUIOZLIxQGgHTegDaAhHQJX+nzxwyZd1fZQoaAZHQGDqg/LTx5NoB03oA2gIR0CV/uis4ku6dX2UKGgGR0BeIM1O0svqaAdN6ANoCEdAlgnOx8lXzXV9lChoBkdAY4eelKsdUGgHTegDaAhHQJYLTaDf3vh1fZQoaAZHQGQf/t6X0GxoB03oA2gIR0CWDIw0fozOdX2UKGgGR0BhtSH9FWn1aAdN6ANoCEdAlhNHQID5kHV9lChoBkdAYD5Nr0rbxmgHTegDaAhHQJYYhIqbz9V1fZQoaAZHQGQld7OVxCJoB03oA2gIR0CWGR6guh9LdWUu"
49
  },
50
  "ep_success_buffer": {
51
  ":type:": "<class 'collections.deque'>",
52
  ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="
53
  },
54
- "_n_updates": 248,
55
  "observation_space": {
56
  ":type:": "<class 'gymnasium.spaces.box.Box'>",
57
  ":serialized:": "gAWVcAIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lGgFk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoB4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoECiWCAAAAAAAAAABAQEBAQEBAZRoFEsIhZRoGHSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBAoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaApLCIWUaBh0lFKUjARoaWdolGgQKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgKSwiFlGgYdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=",
 
4
  ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==",
5
  "__module__": "stable_baselines3.common.policies",
6
  "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ",
7
+ "__init__": "<function ActorCriticPolicy.__init__ at 0x7f5605504700>",
8
+ "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7f5605504790>",
9
+ "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7f5605504820>",
10
+ "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7f56055048b0>",
11
+ "_build": "<function ActorCriticPolicy._build at 0x7f5605504940>",
12
+ "forward": "<function ActorCriticPolicy.forward at 0x7f56055049d0>",
13
+ "extract_features": "<function ActorCriticPolicy.extract_features at 0x7f5605504a60>",
14
+ "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7f5605504af0>",
15
+ "_predict": "<function ActorCriticPolicy._predict at 0x7f5605504b80>",
16
+ "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7f5605504c10>",
17
+ "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7f5605504ca0>",
18
+ "predict_values": "<function ActorCriticPolicy.predict_values at 0x7f5605504d30>",
19
  "__abstractmethods__": "frozenset()",
20
+ "_abc_impl": "<_abc._abc_data object at 0x7f56054efe40>"
21
  },
22
  "verbose": 1,
23
  "policy_kwargs": {},
24
+ "num_timesteps": 2015232,
25
+ "_total_timesteps": 2000000,
26
  "_num_timesteps_at_start": 0,
27
  "seed": null,
28
  "action_noise": null,
29
+ "start_time": 1684177142044086768,
30
  "learning_rate": 0.0003,
31
  "tensorboard_log": null,
32
  "_last_obs": {
33
  ":type:": "<class 'numpy.ndarray'>",
34
+ ":serialized:": "gAWVdQIAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYAAgAAAAAAAE0GAL2OnKa8zlUDPD9zgbo6Tgm+qnOnvQAAgD8AAIA/s1EAvcWUcj5lRtc9g2OxvoH4ej2ilKu8AAAAAAAAAAAA+vq8w1k8unncLjMgBjUvCLnRuoXKwbMAAIA/AACAP02GZT09nZI/UK07PuCBHr9c14U9Ho7dPQAAAAAAAAAAZuuEvEiphrqAji4zc7V6rgAkEzsrWNOzAACAPwAAgD9myrO8yeBqPSABcD3s15q+qRrnPZI9nL0AAAAAAAAAABogEr1SKP27cDBtPRZRSL4gc8O8j/GEvwAAgD8AAIA/ZtrfPK5Eh7zz7jA+G64avhqKgL0GhA6+AACAPwAAgD8A+9I9VaKNP07AMT6Tsxu/yAb1PenxqLsAAAAAAAAAAKBKGD7lZF4+LqBpvh653r4lO7u9Rd/TvAAAAAAAAAAA5oJPPWHqp7xWwKe8OT51Ozs5ET6jLWO8AACAPwAAgD/NuRq9GayTPoU2Nz6iItK+3xSRPU3YHj4AAAAAAAAAAJq4Qz1oy7C84yvevVlmKD0Kma49CCjXvAAAgD8AAIA/MyvlPHuKsroRx6a7HjBoOP4ux7kWG/c3AACAPwAAAAAzf4s7yqa0PwjE3D4c3Jg8VGuhu+IGyL0AAAAAAAAAAJrxXL2th7M/StQ/vwcMCL6q7MQ8kl3+vAAAAAAAAAAAlIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwiGlIwBQ5R0lFKULg=="
35
  },
36
  "_last_episode_starts": {
37
  ":type:": "<class 'numpy.ndarray'>",
38
+ ":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAQCUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="
39
  },
40
  "_last_original_obs": null,
41
  "_episode_num": 0,
42
  "use_sde": false,
43
  "sde_sample_freq": -1,
44
+ "_current_progress_remaining": -0.007616000000000067,
45
  "_stats_window_size": 100,
46
  "ep_info_buffer": {
47
  ":type:": "<class 'collections.deque'>",
48
+ ":serialized:": "gAWV5AsAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQHHTOo99tuWMAWyUS/yMAXSUR0CfVjL5ylvZdX2UKGgGR0Bx+vIyTINmaAdL3mgIR0CfVlya/h2odX2UKGgGR0Bw/PtsvZh8aAdL92gIR0CfVtORDCxedX2UKGgGR0Byr4wvg3tKaAdL02gIR0CfV+jCHh0hdX2UKGgGR0By+G/QBxPwaAdLyWgIR0CfV+TYukDZdX2UKGgGR0BPmtJFspG4aAdLrGgIR0CfWIOfukULdX2UKGgGR0BxUXgjyFwlaAdLy2gIR0CfWR4Ia99MdX2UKGgGR0BwjxUrCm/GaAdL1GgIR0CfWW5Y5ksjdX2UKGgGR0ByDqNGViWnaAdL0mgIR0CfWeQiRnvldX2UKGgGR0BwSAZgogFHaAdL1GgIR0CfWkMS9M9KdX2UKGgGR0BzmbQu27WeaAdL0GgIR0CfWmhbnoxIdX2UKGgGR0ByctpAUtZnaAdL1GgIR0CfWud0q6OHdX2UKGgGR0B0K2HnEETyaAdL3WgIR0CfWv/hl18tdX2UKGgGR0BzWnE4vN/waAdLzmgIR0CfWxAtnPE9dX2UKGgGR0BzbuNDMNc4aAdL6WgIR0CfW1liBoVVdX2UKGgGR0BwXBQbdadMaAdL62gIR0CfW6TPSlWPdX2UKGgGR0BzATJq7AclaAdL1mgIR0CfW7WaMJhOdX2UKGgGR0ByHHZuhsZYaAdL4WgIR0CfW8v60pmVdX2UKGgGR0BzDDChvitJaAdL7mgIR0CfXMDuSfUXdX2UKGgGR0BygYK/mDDkaAdLw2gIR0CfXNvjOs1bdX2UKGgGR0BxNyPluFYdaAdL0WgIR0CfXTxj8UEgdX2UKGgGR0Bv2tev6j33aAdLx2gIR0CfXicynDR/dX2UKGgGR0Bw81HvttygaAdL6WgIR0CfXn9QoCuEdX2UKGgGR0Bw3FNXYDkmaAdL3GgIR0CfXwUlRgqmdX2UKGgGR0Bw29y6tknUaAdLyWgIR0CfX35Zr56/dX2UKGgGR0BAoPmHP/rCaAdLmmgIR0CfX4cCHRCydX2UKGgGR0BzbkrUb1h9aAdLz2gIR0CfX4RNATqTdX2UKGgGR0BwrXqX4TK1aAdL5mgIR0CfX8IXTEzgdX2UKGgGR0BwNd4LThHcaAdL12gIR0CfYE1Cw8nvdX2UKGgGR0BxjwcinpB5aAdL3GgIR0CfYJN4JNTMdX2UKGgGR0BwXXtUn5SFaAdL5GgIR0CfciqKxcFAdX2UKGgGR0BzJGz4UN8WaAdL4WgIR0Cfcm+10DEFdX2UKGgGR0BwNFm29crzaAdL3WgIR0CfcrUSqU/wdX2UKGgGR0BzTx5AyEcsaAdLxmgIR0Cfc1RtgrpadX2UKGgGR0BxdDZf2K2saAdL3WgIR0Cfc9RcNYr8dX2UKGgGR0BzjlcpsoDxaAdNCgFoCEdAn3WEAcT8HnV9lChoBkdAcdseii7Ci2gHS9BoCEdAn3YBSk0rLHV9lChoBkdAcb6SElE7XGgHS8JoCEdAn3YlzMibD3V9lChoBkdAcHEc580DU2gHS+loCEdAn3ZEbo8p1HV9lChoBkdAcceru6VdHGgHS/poCEdAn3ZtQwblzXV9lChoBkdAckb/Y8Md92gHS8ZoCEdAn3Zi/GlyinV9lChoBkdAb88R+SbH62gHS9NoCEdAn3bWbsniN3V9lChoBkdAclRe3QUpNWgHS95oCEdAn3eH+hoM8nV9lChoBkdAcUxMKTjebmgHS8ZoCEdAn3iKQmu1W3V9lChoBkdAcxiMTN+so2gHS+VoCEdAn3ihHf/FSHV9lChoBkdAcRznuiN83WgHS9toCEdAn3ixt+CsfnV9lChoBkdAc6qAoG6f8WgHS+toCEdAn3lt0Rvm5nV9lChoBkdAcJ5oOx0MgGgHS+NoCEdAn3n+qWC2+nV9lChoBkdAcgaj0th/iGgHS9BoCEdAn3o8UmD15HV9lChoBkdAcNkxagVXWGgHS+VoCEdAn3ukihWYGHV9lChoBkdAceuzJp35e2gHS8FoCEdAn3ybuhK15XV9lChoBkdAccqFa0QbuWgHS8NoCEdAn31ql+EytXV9lChoBkdAcYd46fapP2gHS9RoCEdAn33Azk6tDHV9lChoBkdAcwlO5J9RaWgHS9hoCEdAn34QSamXPnV9lChoBkdAcv/PEKmbb2gHS9loCEdAn34wbEP1+XV9lChoBkdAc5FAiml67mgHS+toCEdAn38HH3lCC3V9lChoBkdAb4rdWyTpxGgHS9BoCEdAn382Y8dPtXV9lChoBkdAcc6HO8kD6mgHS8hoCEdAn4EE8JUo8nV9lChoBkdAcCzSRKYiPmgHTRYBaAhHQJ+BTC66J691fZQoaAZHQHIPkJ4SpR5oB0vjaAhHQJ+BTJ/5Lyt1fZQoaAZHQHDsQEt/WlNoB0vsaAhHQJ+BlpXZGrl1fZQoaAZHQHHqGp++dsloB0vyaAhHQJ+Bus8xKxt1fZQoaAZHQHKmZlnRLK5oB0vPaAhHQJ+B6KziS7p1fZQoaAZHQHM1M81XNkhoB0vGaAhHQJ+DZQMx46h1fZQoaAZHQHHdVSwW30BoB00gAWgIR0CfhY2i+L3sdX2UKGgGR0Bybib1AZ88aAdL4mgIR0CfhckZ75VPdX2UKGgGR0By0zVrhzeXaAdLyWgIR0CfhfkqtozvdX2UKGgGR0BxhAKlYU35aAdL42gIR0Cfhrg4ffXPdX2UKGgGR0BzJGXfIjnnaAdLy2gIR0Cfh6obGWD6dX2UKGgGR0ByJpTR6WxAaAdL6GgIR0Cfh8F2V3UydX2UKGgGR0Byat5IH1OCaAdL9WgIR0CfiCAHmig1dX2UKGgGR0Bxm3bDdgv2aAdL6mgIR0CfiLR2bG3ndX2UKGgGR0Bxgc8jiXIEaAdLwWgIR0CfiT4M4LkTdX2UKGgGR0BxO/xRVIZqaAdL2WgIR0CfijTBZZB+dX2UKGgGR0ByD4KeCkGiaAdL0WgIR0Cfikki2UjcdX2UKGgGR0BxYh2MbWEsaAdL5GgIR0CfilsiB5HFdX2UKGgGR0BzsDaAWi1zaAdL5WgIR0CfirPNmlImdX2UKGgGR0ByHTyVfNRnaAdL4mgIR0CfitNWEK3NdX2UKGgGR0BvbSQq7ROUaAdLymgIR0CfizAWSEDhdX2UKGgGR0BDpDTKDCgsaAdLqmgIR0Cfi5pzcRDkdX2UKGgGR0Bz46hf0EowaAdL0GgIR0CfjODpC8e0dX2UKGgGR0BykLdcjZ+QaAdL3mgIR0CfjSENe+mFdX2UKGgGR0Bob9wcYIjXaAdN6ANoCEdAn44KeXiR4nV9lChoBkdAccY22G7Bf2gHS+xoCEdAn44isOoYN3V9lChoBkdAcnPhF3IMjWgHS85oCEdAn45AsK9f1HV9lChoBkdAcS++t8uzyGgHS9hoCEdAn44yr92ovXV9lChoBkdAb7LfHggow2gHS+NoCEdAn46McU/OdHV9lChoBkdAcJ7f0mMOw2gHS8doCEdAn49rUG3WnXV9lChoBkdAc6MG7Bfrr2gHS+FoCEdAn490QbuMM3V9lChoBkdAcGsvUjLSu2gHS9BoCEdAn4+4ClrM1XV9lChoBkdAcaWeo1k1/GgHS/poCEdAn4/M8La24XV9lChoBkdAcJa6KtPpIWgHS9xoCEdAn5AXfhuO0nV9lChoBkdAb1xn/1g6VGgHS85oCEdAn5A7sjVx0nV9lChoBkdAcezG4I8hcWgHS9VoCEdAn5BId6sySHV9lChoBkdAcsjsWweNk2gHS8poCEdAn5B0snRb8nV9lChoBkdAcQ3gflp48mgHS8xoCEdAn5DguVX3g3V9lChoBkdAcqYhbGFSKmgHS89oCEdAn5JSfg75mHV9lChoBkdAco2aXKKYRmgHS+JoCEdAn5KeMZP2wnV9lChoBkdAcrQviLl3hWgHS85oCEdAn5NQcT8HfXV9lChoBkdAcIXBw++ueWgHS9JoCEdAn5N/qC6H03VlLg=="
49
  },
50
  "ep_success_buffer": {
51
  ":type:": "<class 'collections.deque'>",
52
  ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="
53
  },
54
+ "_n_updates": 492,
55
  "observation_space": {
56
  ":type:": "<class 'gymnasium.spaces.box.Box'>",
57
  ":serialized:": "gAWVcAIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lGgFk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoB4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoECiWCAAAAAAAAAABAQEBAQEBAZRoFEsIhZRoGHSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBAoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaApLCIWUaBh0lFKUjARoaWdolGgQKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgKSwiFlGgYdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=",
ppo-LunarLander-v2/policy.optimizer.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:71d13f4ee55a458ed88aff15245be8620477a1dad191a4cafb98d1fe3f5506a8
3
  size 87929
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0b7573c646a0ea90c980c7201859d846591a90e896d92e9c04111bfb3ab018a4
3
  size 87929
ppo-LunarLander-v2/policy.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2797300c416e1eebecbc09a81197b41b6a12fd295fb0fc2b3bcf4694e5185869
3
  size 43329
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc46d84c2498edbd8e82a272d41da70827e80644522dcafbbf0166f109aa2364
3
  size 43329
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
results.json CHANGED
@@ -1 +1 @@
1
- {"mean_reward": 242.22309187045298, "std_reward": 49.87629210517521, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-05-14T20:05:30.370016"}
 
1
+ {"mean_reward": 278.17022678970727, "std_reward": 30.439213284273055, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-05-15T19:37:08.784276"}