Álvaro Martínez
commited on
Commit
•
1d96795
1
Parent(s):
25eea72
What is a commit lolol
Browse files- README.md +36 -0
- config.json +1 -1
- replay.mp4 +3 -0
- results.json +1 -1
- {username}//{model_architecture}-{env_id}.zip +2 -2
- {username}//{model_architecture}-{env_id}/data +16 -16
- {username}//{model_architecture}-{env_id}/policy.optimizer.pth +1 -1
- {username}//{model_architecture}-{env_id}/policy.pth +1 -1
README.md
ADDED
@@ -0,0 +1,36 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
library_name: stable-baselines3
|
3 |
+
tags:
|
4 |
+
- LunarLander-v2
|
5 |
+
- deep-reinforcement-learning
|
6 |
+
- reinforcement-learning
|
7 |
+
- stable-baselines3
|
8 |
+
model-index:
|
9 |
+
- name: PPO
|
10 |
+
results:
|
11 |
+
- metrics:
|
12 |
+
- type: mean_reward
|
13 |
+
value: 283.11 +/- 19.04
|
14 |
+
name: mean_reward
|
15 |
+
task:
|
16 |
+
type: reinforcement-learning
|
17 |
+
name: reinforcement-learning
|
18 |
+
dataset:
|
19 |
+
name: LunarLander-v2
|
20 |
+
type: LunarLander-v2
|
21 |
+
---
|
22 |
+
|
23 |
+
# **PPO** Agent playing **LunarLander-v2**
|
24 |
+
This is a trained model of a **PPO** agent playing **LunarLander-v2**
|
25 |
+
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
|
26 |
+
|
27 |
+
## Usage (with Stable-baselines3)
|
28 |
+
TODO: Add your code
|
29 |
+
|
30 |
+
|
31 |
+
```python
|
32 |
+
from stable_baselines3 import ...
|
33 |
+
from huggingface_sb3 import load_from_hub
|
34 |
+
|
35 |
+
...
|
36 |
+
```
|
config.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gASVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==", "__module__": "stable_baselines3.common.policies", "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param sde_net_arch: Network architecture for extracting features\n when using gSDE. If None, the latent features from the policy will be used.\n Pass an empty list to use the states as features.\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ", "__init__": "<function ActorCriticPolicy.__init__ at 0x7f585c59d7a0>", "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7f585c59d830>", "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7f585c59d8c0>", "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7f585c59d950>", "_build": "<function ActorCriticPolicy._build at 0x7f585c59d9e0>", "forward": "<function ActorCriticPolicy.forward at 0x7f585c59da70>", "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7f585c59db00>", "_predict": "<function ActorCriticPolicy._predict at 0x7f585c59db90>", "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7f585c59dc20>", "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7f585c59dcb0>", "predict_values": "<function ActorCriticPolicy.predict_values at 0x7f585c59dd40>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc_data object at 0x7f585c571270>"}, "verbose": 0, "policy_kwargs": {}, "observation_space": {":type:": "<class 'gym.spaces.box.Box'>", ":serialized:": "gASVwwEAAAAAAACMDmd5bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lGgFk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMBl9zaGFwZZRLCIWUjANsb3eUjBVudW1weS5jb3JlLm11bHRpYXJyYXmUjAxfcmVjb25zdHJ1Y3SUk5RoBowHbmRhcnJheZSTlEsAhZRDAWKUh5RSlChLAUsIhZRoColDIAAAgP8AAID/AACA/wAAgP8AAID/AACA/wAAgP8AAID/lHSUYowEaGlnaJRoEmgUSwCFlGgWh5RSlChLAUsIhZRoColDIAAAgH8AAIB/AACAfwAAgH8AAIB/AACAfwAAgH8AAIB/lHSUYowNYm91bmRlZF9iZWxvd5RoEmgUSwCFlGgWh5RSlChLAUsIhZRoB4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYolDCAAAAAAAAAAAlHSUYowNYm91bmRlZF9hYm92ZZRoEmgUSwCFlGgWh5RSlChLAUsIhZRoKolDCAAAAAAAAAAAlHSUYowKX25wX3JhbmRvbZROdWIu", "dtype": "float32", "_shape": [8], "low": "[-inf -inf -inf -inf -inf -inf -inf -inf]", "high": "[inf inf inf inf inf inf inf inf]", "bounded_below": "[False False False False False False False False]", "bounded_above": "[False False False False False False False False]", "_np_random": null}, "action_space": {":type:": "<class 'gym.spaces.discrete.Discrete'>", ":serialized:": "gASVggAAAAAAAACME2d5bS5zcGFjZXMuZGlzY3JldGWUjAhEaXNjcmV0ZZSTlCmBlH2UKIwBbpRLBIwGX3NoYXBllCmMBWR0eXBllIwFbnVtcHmUaAeTlIwCaTiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYowKX25wX3JhbmRvbZROdWIu", "n": 4, "_shape": [], "dtype": "int64", "_np_random": null}, "n_envs": 16, "num_timesteps": 507904, "_total_timesteps": 500000, "_num_timesteps_at_start": 0, "seed": null, "action_noise": null, "start_time": 1657067379.6597142, "learning_rate": 0.0003, "tensorboard_log": null, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": "gASVvwIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwNX2J1aWx0aW5fdHlwZZSTlIwKTGFtYmRhVHlwZZSFlFKUKGgCjAhDb2RlVHlwZZSFlFKUKEsBSwBLAUsBSxNDBIgAUwCUToWUKYwBX5SFlIxIL3Vzci9sb2NhbC9saWIvcHl0aG9uMy43L2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lIwEZnVuY5RLgEMCAAGUjAN2YWyUhZQpdJRSlH2UKIwLX19wYWNrYWdlX1+UjBhzdGFibGVfYmFzZWxpbmVzMy5jb21tb26UjAhfX25hbWVfX5SMHnN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi51dGlsc5SMCF9fZmlsZV9flIxIL3Vzci9sb2NhbC9saWIvcHl0aG9uMy43L2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaCB9lH2UKGgXaA6MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgYjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "_last_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gASVjQIAAAAAAACMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMDF9yZWNvbnN0cnVjdJSTlIwFbnVtcHmUjAduZGFycmF5lJOUSwCFlEMBYpSHlFKUKEsBSxBLCIaUaAOMBWR0eXBllJOUjAJmNJSJiIeUUpQoSwOMATyUTk5OSv////9K/////0sAdJRiiUIAAgAA2ow4PhEPUj8+nxm+Tp7XvnF3Gz4dHNe9AAAAAAAAAACzOhm99qx6ul85pjcPoZsylPOzubaPwrYAAIA/AACAP7PDPT1vi2E95j4TP+Mlpr586eQ+ed+NPgAAAAAAAAAATWwiPdw+HD21t4W90hiMvq/sbrztFBO8AAAAAAAAAACARhS9rh2ourKrwrZcW8yx6eeMuorF4TUAAIA/AACAP7qhW75rcp4/8Cz5vkFOEL+jfNq+04J6vgAAAAAAAAAAZoxvPNf8GLuuONC7cvuLPNxXWjy7BnK9AACAPwAAgD+mkrS99thCukzbIztqLKg2adWaO0ogP7oAAAAAAACAPwAQgL2jm4c+wpYxPsnJqr7atJ88za51PAAAAAAAAAAAtkFivlxOoj4lls0+IR6nvoNEpb094bI+AAAAAAAAAABmVYA9NwVNP9dPiDqpTOO+u1QBPf4LCr0AAAAAAAAAAJrdJ7w9sRi7Vf7gOwaukzwG3i08qL9+vQAAgD8AAIA/AADROQp0NrvNZJK7SUuTPGaBwjzSkny9AACAPwAAgD8A4PM7tJCYPzrPirwLOei+g6qOPLoM47wAAAAAAAAAAJoxDbxFz608Yp07PSQ/h75Oxg09S0uFvQAAAAAAAAAAALiqvKCjtT+urlm9qXiyvqjCsLzfnce9AAAAAAAAAACUdJRiLg=="}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gASVmAAAAAAAAACMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMDF9yZWNvbnN0cnVjdJSTlIwFbnVtcHmUjAduZGFycmF5lJOUSwCFlEMBYpSHlFKUKEsBSxCFlGgDjAVkdHlwZZSTlIwCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYolDEAAAAAAAAAAAAAAAAAAAAACUdJRiLg=="}, "_last_original_obs": null, "_episode_num": 0, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": -0.015808000000000044, "ep_info_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gASVKhAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpSMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMBnNjYWxhcpSTlIwFbnVtcHmUjAVkdHlwZZSTlIwCZjiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYkMIXaW766z3c0CUhpRSlIwBbJRL14wBdJRHQHX6i1mapgl1fZQoaAZoCWgPQwjvjozVZrRvQJSGlFKUaBVL4mgWR0B1+8nSfDk3dX2UKGgGaAloD0MI8guvJHmCcUCUhpRSlGgVS9xoFkdAdf4kTYdyUHV9lChoBmgJaA9DCE58taO45nBAlIaUUpRoFUvjaBZHQHX/L+Haewt1fZQoaAZoCWgPQwjzA1d5Qn9yQJSGlFKUaBVL12gWR0B2ABKJ2t+1dX2UKGgGaAloD0MIbVhTWRTIcECUhpRSlGgVS9loFkdAdgGvCdjG1nV9lChoBmgJaA9DCPyKNVyksXBAlIaUUpRoFU0LAWgWR0B2A2CUX531dX2UKGgGaAloD0MIM8Lbg1BfcUCUhpRSlGgVS9hoFkdAdgYS9du50HV9lChoBmgJaA9DCLNfd7qz3nFAlIaUUpRoFU0SAWgWR0B2CEXJo0yhdX2UKGgGaAloD0MIHofB/NXycECUhpRSlGgVS+toFkdAdgmgF5fMOnV9lChoBmgJaA9DCDmaIys/m3NAlIaUUpRoFUvYaBZHQHYKqQmu1Wt1fZQoaAZoCWgPQwiqgeZzLp5xQJSGlFKUaBVNEQFoFkdAdgzOSGJvYXV9lChoBmgJaA9DCLn/yHQoiHNAlIaUUpRoFUvaaBZHQHYM5lBhQWN1fZQoaAZoCWgPQwi9NEWAUyFwQJSGlFKUaBVL9GgWR0B2DPor4FibdX2UKGgGaAloD0MIq6+uCpSKc0CUhpRSlGgVS+BoFkdAdg7rsjVx0nV9lChoBmgJaA9DCMLaGDvhOm9AlIaUUpRoFUvgaBZHQHYPfcer+5x1fZQoaAZoCWgPQwi0dAXbCNpwQJSGlFKUaBVL9GgWR0B2D91nuiN9dX2UKGgGaAloD0MIU+knnN0Fc0CUhpRSlGgVS8NoFkdAdhCBlMAWBXV9lChoBmgJaA9DCEEtBg9TNHFAlIaUUpRoFU0CAWgWR0B2E9SCOFQEdX2UKGgGaAloD0MIwDxkyofYbkCUhpRSlGgVS99oFkdAdhQBXCCSR3V9lChoBmgJaA9DCNB8zt2ub29AlIaUUpRoFUvWaBZHQHYVnq7iADt1fZQoaAZoCWgPQwg1fuGVJCtvQJSGlFKUaBVL8mgWR0B2FmvZAY51dX2UKGgGaAloD0MIQukLIWcpc0CUhpRSlGgVS9toFkdAdheebNKRMnV9lChoBmgJaA9DCNwr81ZdV3FAlIaUUpRoFUvYaBZHQHYZ38O09hZ1fZQoaAZoCWgPQwisyOiA5CtwQJSGlFKUaBVL7mgWR0B2HiqyWzF/dX2UKGgGaAloD0MIYLGGi9xWcUCUhpRSlGgVS9loFkdAdh5912aDw3V9lChoBmgJaA9DCP0tAfhnhHNAlIaUUpRoFUvTaBZHQHYgVmSQo1F1fZQoaAZoCWgPQwhGPxpOWZVzQJSGlFKUaBVNCwFoFkdAdiKRZU1hs3V9lChoBmgJaA9DCLoUV5X9k29AlIaUUpRoFUvraBZHQHYitAX2ugZ1fZQoaAZoCWgPQwipbFhT2eZxQJSGlFKUaBVL3WgWR0B2JFYNiH6/dX2UKGgGaAloD0MIgbIpV3j6bkCUhpRSlGgVS+FoFkdAdiRVUMoc73V9lChoBmgJaA9DCNtSB3n9S3BAlIaUUpRoFUv/aBZHQHYksGkep4t1fZQoaAZoCWgPQwjNID6wo51yQJSGlFKUaBVL9GgWR0B2JW4TbnHOdX2UKGgGaAloD0MIZ3v0hvvzckCUhpRSlGgVS/xoFkdAdnH1yvLX+XV9lChoBmgJaA9DCKdB0TwATm9AlIaUUpRoFUviaBZHQHZzBJRO1v51fZQoaAZoCWgPQwhPIOwUq5xzQJSGlFKUaBVL8mgWR0B2dhyhi9ZidX2UKGgGaAloD0MIt7WF5yXWcECUhpRSlGgVS/JoFkdAdncCKJl8PXV9lChoBmgJaA9DCOgWuhJBfnFAlIaUUpRoFU0QAWgWR0B2dwY/FBIGdX2UKGgGaAloD0MIs+4fC9FCbUCUhpRSlGgVS+doFkdAdndVxS5y2nV9lChoBmgJaA9DCJiFdk6zTm9AlIaUUpRoFUvlaBZHQHZ5R6By0a91fZQoaAZoCWgPQwjC+6pcaNtxQJSGlFKUaBVL0GgWR0B2e5GlQ/HHdX2UKGgGaAloD0MIaAbxgZ3WckCUhpRSlGgVS99oFkdAdnydmQKa5XV9lChoBmgJaA9DCLcnSGx320dAlIaUUpRoFUuzaBZHQHZ+K4hEBsB1fZQoaAZoCWgPQwj7QPLOYdlwQJSGlFKUaBVL1WgWR0B2ggsQNCqqdX2UKGgGaAloD0MIayxhbQwDcUCUhpRSlGgVTQQBaBZHQHaCRiobXH11fZQoaAZoCWgPQwigTnl0o+BuQJSGlFKUaBVL72gWR0B2gl4VymygdX2UKGgGaAloD0MITYHMziKfbkCUhpRSlGgVS+RoFkdAdoMKIznA7HV9lChoBmgJaA9DCCcuxyvQQXJAlIaUUpRoFU0JAWgWR0B2hJrAP/aQdX2UKGgGaAloD0MIcXZrmUxWckCUhpRSlGgVS+toFkdAdoSmQbMot3V9lChoBmgJaA9DCKAzaVN1x0xAlIaUUpRoFUujaBZHQHaFeg13t8h1fZQoaAZoCWgPQwjJIHcR5mpyQJSGlFKUaBVL8GgWR0B2hzaL4vexdX2UKGgGaAloD0MIwtzu5b6bcECUhpRSlGgVS/hoFkdAdoj/C66J7HV9lChoBmgJaA9DCHdM3ZWdQHJAlIaUUpRoFUvhaBZHQHaLMCcPOIJ1fZQoaAZoCWgPQwiAuRYtwGdyQJSGlFKUaBVL8WgWR0B2i3cDbJwLdX2UKGgGaAloD0MIp7OTwVGFcECUhpRSlGgVS9xoFkdAdozeTV2A5XV9lChoBmgJaA9DCLu1TIbjHnFAlIaUUpRoFUveaBZHQHaPY1UEPlN1fZQoaAZoCWgPQwhq+uyAa15zQJSGlFKUaBVNGwFoFkdAdo/xLkCFK3V9lChoBmgJaA9DCMLCSZo/D1BAlIaUUpRoFUuaaBZHQHaSPIOpbUx1fZQoaAZoCWgPQwiVJxB2CnBxQJSGlFKUaBVL+WgWR0B2kxM495hSdX2UKGgGaAloD0MIYkhOJm4acUCUhpRSlGgVS+hoFkdAdpMULlV94XV9lChoBmgJaA9DCGA7GLFPUnNAlIaUUpRoFUvTaBZHQHaVpNCZ4Od1fZQoaAZoCWgPQwjfisQEte9vQJSGlFKUaBVL4GgWR0B2lh+kP+XJdX2UKGgGaAloD0MIUHEceDVTc0CUhpRSlGgVS9BoFkdAdpcjhky1u3V9lChoBmgJaA9DCEzEW+df5HFAlIaUUpRoFUv1aBZHQHaXxj4Hoox1fZQoaAZoCWgPQwgZARWOoBNwQJSGlFKUaBVL/WgWR0B2mLVhCtzTdX2UKGgGaAloD0MI5pE/GLhQcECUhpRSlGgVS+FoFkdAdpnIyj59E3V9lChoBmgJaA9DCLzMsFEWZnBAlIaUUpRoFU0AAWgWR0B2nmq4pc5bdX2UKGgGaAloD0MI5zQLtPulc0CUhpRSlGgVS9JoFkdAdp6aEi+tbXV9lChoBmgJaA9DCPFL/bypd3BAlIaUUpRoFUv2aBZHQHafXwXqJMx1fZQoaAZoCWgPQwi+27xxEgRyQJSGlFKUaBVL8mgWR0B2oz9m6GxmdX2UKGgGaAloD0MIVp+rrZircUCUhpRSlGgVS9doFkdAdqNv/BFd9nV9lChoBmgJaA9DCDuOHyrNKHJAlIaUUpRoFU0KAWgWR0B2o7WI42jxdX2UKGgGaAloD0MI0jqqmiC/bkCUhpRSlGgVS+BoFkdAdqTCojv/i3V9lChoBmgJaA9DCNcxrrg4FHJAlIaUUpRoFUvSaBZHQHamdj0+TvB1fZQoaAZoCWgPQwgIkKFjB7pwQJSGlFKUaBVL4mgWR0B2pyFpPAO8dX2UKGgGaAloD0MIpBe1+5W4ckCUhpRSlGgVS+ZoFkdAdqhIacZtN3V9lChoBmgJaA9DCOUMxR1vNm9AlIaUUpRoFUvraBZHQHatGhdt2s91fZQoaAZoCWgPQwj1geSdAw9wQJSGlFKUaBVL22gWR0B2rTc8DB/JdX2UKGgGaAloD0MIvHZpw6FPcUCUhpRSlGgVS/doFkdAdq02Qnx8UnV9lChoBmgJaA9DCDqSy38ITHBAlIaUUpRoFUvmaBZHQHatUL2HtWx1fZQoaAZoCWgPQwhoPudul+1wQJSGlFKUaBVNDAFoFkdAdq6lxOtW/HV9lChoBmgJaA9DCNIcWfnl8nBAlIaUUpRoFUvmaBZHQHavLXg9/z91fZQoaAZoCWgPQwgJwhVQ6DdxQJSGlFKUaBVL4mgWR0B2tEXZXdTHdX2UKGgGaAloD0MIxFp8CkAXcUCUhpRSlGgVS/poFkdAdrWZflZHNHV9lChoBmgJaA9DCFGDaRh+Z3JAlIaUUpRoFUv/aBZHQHa2Qe7tiQV1fZQoaAZoCWgPQwjiP91AAV1uQJSGlFKUaBVL3WgWR0B2t4mF8G9pdX2UKGgGaAloD0MIj6hQ3ZyDckCUhpRSlGgVS+FoFkdAdrgK3uuzQnV9lChoBmgJaA9DCDeJQWBlqXFAlIaUUpRoFUvlaBZHQHa4nG0eEIx1fZQoaAZoCWgPQwgXYvVH2PJxQJSGlFKUaBVL32gWR0B2uRpYcNpedX2UKGgGaAloD0MIYye8BGc6cECUhpRSlGgVS9xoFkdAdrsFrVOKwnV9lChoBmgJaA9DCHcTfNM0onJAlIaUUpRoFUvuaBZHQHa79Z7ojfN1fZQoaAZoCWgPQwi0BYTWQyJyQJSGlFKUaBVL92gWR0B2vm58Sf16dX2UKGgGaAloD0MIwtoYO6HmcUCUhpRSlGgVS9RoFkdAdsF5o4+8oXV9lChoBmgJaA9DCFAcQL8vqXFAlIaUUpRoFUvlaBZHQHbBnpW3jMp1fZQoaAZoCWgPQwg2kC42bdFxQJSGlFKUaBVL/mgWR0B2xBVktmL+dX2UKGgGaAloD0MIEalpF9NGc0CUhpRSlGgVTQQBaBZHQHbEwFxGUfR1fZQoaAZoCWgPQwiSWFLufuBxQJSGlFKUaBVL/GgWR0B2xez1K5CodX2UKGgGaAloD0MIYhIu5BFVb0CUhpRSlGgVS+JoFkdAdsjqjJuEVXV9lChoBmgJaA9DCPhPN1DgcXFAlIaUUpRoFUvWaBZHQHbJwdCE6DJ1fZQoaAZoCWgPQwiWdmout7lvQJSGlFKUaBVL5WgWR0B2yo/IKc/ddWUu"}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gASVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="}, "_n_updates": 496, "n_steps": 1024, "gamma": 0.999, "gae_lambda": 0.98, "ent_coef": 0.01, "vf_coef": 0.5, "max_grad_norm": 0.5, "batch_size": 64, "n_epochs": 4, "clip_range": {":type:": "<class 'function'>", ":serialized:": "gASVvwIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwNX2J1aWx0aW5fdHlwZZSTlIwKTGFtYmRhVHlwZZSFlFKUKGgCjAhDb2RlVHlwZZSFlFKUKEsBSwBLAUsBSxNDBIgAUwCUToWUKYwBX5SFlIxIL3Vzci9sb2NhbC9saWIvcHl0aG9uMy43L2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lIwEZnVuY5RLgEMCAAGUjAN2YWyUhZQpdJRSlH2UKIwLX19wYWNrYWdlX1+UjBhzdGFibGVfYmFzZWxpbmVzMy5jb21tb26UjAhfX25hbWVfX5SMHnN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi51dGlsc5SMCF9fZmlsZV9flIxIL3Vzci9sb2NhbC9saWIvcHl0aG9uMy43L2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaCB9lH2UKGgXaA6MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgYjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/JmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "clip_range_vf": null, "normalize_advantage": true, "target_kl": null, "system_info": {"OS": "Linux-5.4.188+-x86_64-with-Ubuntu-18.04-bionic #1 SMP Sun Apr 24 10:03:06 PDT 2022", "Python": "3.7.13", "Stable-Baselines3": "1.5.0", "PyTorch": "1.11.0+cu113", "GPU Enabled": "True", "Numpy": "1.21.6", "Gym": "0.21.0"}}
|
|
|
1 |
+
{"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gASVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==", "__module__": "stable_baselines3.common.policies", "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param sde_net_arch: Network architecture for extracting features\n when using gSDE. If None, the latent features from the policy will be used.\n Pass an empty list to use the states as features.\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ", "__init__": "<function ActorCriticPolicy.__init__ at 0x7f170b6e0440>", "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7f170b6e04d0>", "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7f170b6e0560>", "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7f170b6e05f0>", "_build": "<function ActorCriticPolicy._build at 0x7f170b6e0680>", "forward": "<function ActorCriticPolicy.forward at 0x7f170b6e0710>", "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7f170b6e07a0>", "_predict": "<function ActorCriticPolicy._predict at 0x7f170b6e0830>", "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7f170b6e08c0>", "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7f170b6e0950>", "predict_values": "<function ActorCriticPolicy.predict_values at 0x7f170b6e09e0>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc_data object at 0x7f170b736270>"}, "verbose": 0, "policy_kwargs": {}, "observation_space": {":type:": "<class 'gym.spaces.box.Box'>", ":serialized:": "gASVwwEAAAAAAACMDmd5bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lGgFk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMBl9zaGFwZZRLCIWUjANsb3eUjBVudW1weS5jb3JlLm11bHRpYXJyYXmUjAxfcmVjb25zdHJ1Y3SUk5RoBowHbmRhcnJheZSTlEsAhZRDAWKUh5RSlChLAUsIhZRoColDIAAAgP8AAID/AACA/wAAgP8AAID/AACA/wAAgP8AAID/lHSUYowEaGlnaJRoEmgUSwCFlGgWh5RSlChLAUsIhZRoColDIAAAgH8AAIB/AACAfwAAgH8AAIB/AACAfwAAgH8AAIB/lHSUYowNYm91bmRlZF9iZWxvd5RoEmgUSwCFlGgWh5RSlChLAUsIhZRoB4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYolDCAAAAAAAAAAAlHSUYowNYm91bmRlZF9hYm92ZZRoEmgUSwCFlGgWh5RSlChLAUsIhZRoKolDCAAAAAAAAAAAlHSUYowKX25wX3JhbmRvbZROdWIu", "dtype": "float32", "_shape": [8], "low": "[-inf -inf -inf -inf -inf -inf -inf -inf]", "high": "[inf inf inf inf inf inf inf inf]", "bounded_below": "[False False False False False False False False]", "bounded_above": "[False False False False False False False False]", "_np_random": null}, "action_space": {":type:": "<class 'gym.spaces.discrete.Discrete'>", ":serialized:": "gASVggAAAAAAAACME2d5bS5zcGFjZXMuZGlzY3JldGWUjAhEaXNjcmV0ZZSTlCmBlH2UKIwBbpRLBIwGX3NoYXBllCmMBWR0eXBllIwFbnVtcHmUaAeTlIwCaTiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYowKX25wX3JhbmRvbZROdWIu", "n": 4, "_shape": [], "dtype": "int64", "_np_random": null}, "n_envs": 16, "num_timesteps": 507904, "_total_timesteps": 500000, "_num_timesteps_at_start": 0, "seed": null, "action_noise": null, "start_time": 1657068448.7295914, "learning_rate": 0.0003, "tensorboard_log": null, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": "gASVvwIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwNX2J1aWx0aW5fdHlwZZSTlIwKTGFtYmRhVHlwZZSFlFKUKGgCjAhDb2RlVHlwZZSFlFKUKEsBSwBLAUsBSxNDBIgAUwCUToWUKYwBX5SFlIxIL3Vzci9sb2NhbC9saWIvcHl0aG9uMy43L2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lIwEZnVuY5RLgEMCAAGUjAN2YWyUhZQpdJRSlH2UKIwLX19wYWNrYWdlX1+UjBhzdGFibGVfYmFzZWxpbmVzMy5jb21tb26UjAhfX25hbWVfX5SMHnN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi51dGlsc5SMCF9fZmlsZV9flIxIL3Vzci9sb2NhbC9saWIvcHl0aG9uMy43L2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaCB9lH2UKGgXaA6MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgYjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "_last_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gASVjQIAAAAAAACMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMDF9yZWNvbnN0cnVjdJSTlIwFbnVtcHmUjAduZGFycmF5lJOUSwCFlEMBYpSHlFKUKEsBSxBLCIaUaAOMBWR0eXBllJOUjAJmNJSJiIeUUpQoSwOMATyUTk5OSv////9K/////0sAdJRiiUIAAgAATUVPPjHDkD/Y0uY+LXjhvixYzj76/7c+AAAAAAAAAAAAx5s8hzeQP4wEEr2B4+m+2i6hPCZ5nrwAAAAAAAAAABq8or28i0U+wX4TPl+3wr5tGI89+lgGPAAAAAAAAAAAmmN1vfZEc7qG+La5lhOdNXXobLqm+9U4AAAAAAAAAAAa18+924TgPZItZD5sXsa+Tx6uPfsHlz0AAAAAAAAAAJr7vbyBW4O88zenuyPwjDvvU+89yu+AvAAAgD8AAIA/M14qPUuEqD4wCga9JdbLvrJdZzy+qaS7AAAAAAAAAABAboQ9GghhPll/wr5Ker2++vRHvpsKnLwAAAAAAAAAAGYE4r3MpWw+GtcwPp4+rL4zWYs8Jr+HuwAAAAAAAAAADdQWvoqUpz/yOeS+PtL5vmWyjr5C/GW+AAAAAAAAAACak7m9j6JBPlGgwT0DxKm+8h/lvLljvrwAAAAAAAAAAHOApb0YPJQ/Kzf0vYmc9L4NKxK+SLGjPAAAAAAAAAAADVMZPgJ4Wj94Ses9R5DgvnPYez5sBji9AAAAAAAAAABmiGY8ztOGvGILczxK6pW9I83MPQ/MtD4AAIA/AACAPzMYmjySEY080XUaveecnb5qp3W9Gg4ZPQAAAAAAAAAAABcDPRIlsT8sKRs+9sGRvpTleD1imVA+AAAAAAAAAACUdJRiLg=="}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gASVmAAAAAAAAACMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMDF9yZWNvbnN0cnVjdJSTlIwFbnVtcHmUjAduZGFycmF5lJOUSwCFlEMBYpSHlFKUKEsBSxCFlGgDjAVkdHlwZZSTlIwCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYolDEAAAAAAAAAAAAAAAAAAAAACUdJRiLg=="}, "_last_original_obs": null, "_episode_num": 0, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": -0.015808000000000044, "ep_info_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gASVIxAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpSMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMBnNjYWxhcpSTlIwFbnVtcHmUjAVkdHlwZZSTlIwCZjiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYkMIjBL0F/qGc0CUhpRSlIwBbJRL9owBdJRHQHOkn3L3bmF1fZQoaAZoCWgPQwhj1LX2Pt1xQJSGlFKUaBVL42gWR0BzpbBMzuWsdX2UKGgGaAloD0MI1SMNbquic0CUhpRSlGgVS9toFkdAc6XoYNy5qnV9lChoBmgJaA9DCFBQilZuxHJAlIaUUpRoFUveaBZHQHOnVx0dRzl1fZQoaAZoCWgPQwgw2uOFdPVvQJSGlFKUaBVL/mgWR0Bzp7atcObzdX2UKGgGaAloD0MIr2Ab8WTfcUCUhpRSlGgVS95oFkdAc6fY1He7+XV9lChoBmgJaA9DCAzIXu9+LW5AlIaUUpRoFUvaaBZHQHOrRtgrpaB1fZQoaAZoCWgPQwgzbmqg+bBUQJSGlFKUaBVLrmgWR0Bzq+5wwTM8dX2UKGgGaAloD0MIWoC21Swjb0CUhpRSlGgVS99oFkdAc6x3Hq/ucHV9lChoBmgJaA9DCJwU5j0OM3JAlIaUUpRoFUvqaBZHQHOv+JtSAH51fZQoaAZoCWgPQwj7BiY3io5wQJSGlFKUaBVL7mgWR0BzsU+5e7cxdX2UKGgGaAloD0MIR+f8FMc/ckCUhpRSlGgVS+doFkdAc7Gy0rsjV3V9lChoBmgJaA9DCLPw9bWu8mdAlIaUUpRoFU3oA2gWR0Bzsd3s5XEJdX2UKGgGaAloD0MIS+SCM/i4cECUhpRSlGgVS9NoFkdAc7HbrTpgTnV9lChoBmgJaA9DCPPoRliUOXJAlIaUUpRoFUvSaBZHQHOyUsvqTr51fZQoaAZoCWgPQwgf+BisOCVyQJSGlFKUaBVLz2gWR0Bzt+wt8NQTdX2UKGgGaAloD0MI5nYv98kCckCUhpRSlGgVS/loFkdAc7iC+De0onV9lChoBmgJaA9DCHhCrz8JhnFAlIaUUpRoFUvlaBZHQHO4xXCCSRt1fZQoaAZoCWgPQwjZXaCkAMtzQJSGlFKUaBVL3GgWR0BzuPnq3VkMdX2UKGgGaAloD0MI+l+uRUuJc0CUhpRSlGgVS8ZoFkdAc7kb9qDbrXV9lChoBmgJaA9DCGJp4Ef1GXJAlIaUUpRoFUvWaBZHQHO55jUd7v51fZQoaAZoCWgPQwgXR+Umak9vQJSGlFKUaBVL32gWR0BzuuIyj59FdX2UKGgGaAloD0MIL1G9NbB4cUCUhpRSlGgVS8doFkdAc7wocaOxS3V9lChoBmgJaA9DCILIIk18QXJAlIaUUpRoFUviaBZHQHO+5VOsT391fZQoaAZoCWgPQwgrhxbZDn5yQJSGlFKUaBVL4mgWR0Bzv2TV2A5JdX2UKGgGaAloD0MILZljeVc2cECUhpRSlGgVS9hoFkdAdAxfeDWbw3V9lChoBmgJaA9DCLuZ0Y+G4nJAlIaUUpRoFUvraBZHQHQM+1OTJQt1fZQoaAZoCWgPQwhTtHIvMNNuQJSGlFKUaBVL2WgWR0B0DPonrpqzdX2UKGgGaAloD0MIN1X3yGY1bUCUhpRSlGgVS+BoFkdAdA125QP7N3V9lChoBmgJaA9DCLn6sUm+63JAlIaUUpRoFUvfaBZHQHQNhjJ+2E11fZQoaAZoCWgPQwiVRszsc1BzQJSGlFKUaBVL82gWR0B0D5yFPBSDdX2UKGgGaAloD0MIg4qqX+lSRUCUhpRSlGgVS59oFkdAdBMFxXGOuXV9lChoBmgJaA9DCLkzEwxn2nBAlIaUUpRoFUvSaBZHQHQTUy+HrQh1fZQoaAZoCWgPQwgKhJ1i1bBwQJSGlFKUaBVL2GgWR0B0FE4uK4x2dX2UKGgGaAloD0MIHJqy0w/jcECUhpRSlGgVS8xoFkdAdBRPgNwzcnV9lChoBmgJaA9DCHuFBfeDKXFAlIaUUpRoFUvaaBZHQHQUTJEH+qB1fZQoaAZoCWgPQwg6ArhZPItyQJSGlFKUaBVL52gWR0B0FKhf0EowdX2UKGgGaAloD0MI3bQZp2FgcUCUhpRSlGgVS/JoFkdAdBZmig00nHV9lChoBmgJaA9DCOQPBp77SHBAlIaUUpRoFU0FAWgWR0B0GelwcYIjdX2UKGgGaAloD0MIqKj6lQ6Hc0CUhpRSlGgVS9ZoFkdAdBrcB2fTTnV9lChoBmgJaA9DCBo09E8w6HFAlIaUUpRoFUvwaBZHQHQciVGCqZN1fZQoaAZoCWgPQwg6z9iX7GBwQJSGlFKUaBVL2WgWR0B0H9L9MsYmdX2UKGgGaAloD0MI7Pma5bKjb0CUhpRSlGgVS+BoFkdAdB/2VE/jbXV9lChoBmgJaA9DCPj578GrO3NAlIaUUpRoFUvmaBZHQHQf8YZVGTd1fZQoaAZoCWgPQwhDVyJQPd5yQJSGlFKUaBVL42gWR0B0IMTmGM4tdX2UKGgGaAloD0MI6nqi60LhcUCUhpRSlGgVS/hoFkdAdCIJSzgMt3V9lChoBmgJaA9DCBuC4zIuqnFAlIaUUpRoFUvaaBZHQHQiLHdXT3J1fZQoaAZoCWgPQwgvi4nNB5xxQJSGlFKUaBVLwmgWR0B0I90q6OHWdX2UKGgGaAloD0MIaYoApzcOcECUhpRSlGgVS9doFkdAdCVRmbsniXV9lChoBmgJaA9DCOV+h6IApHNAlIaUUpRoFUvMaBZHQHQljlDF6zF1fZQoaAZoCWgPQwiWIvlKIOpyQJSGlFKUaBVLzGgWR0B0Je3I+4b0dX2UKGgGaAloD0MIbVM8LmqLcECUhpRSlGgVS9JoFkdAdCYV+7UXpHV9lChoBmgJaA9DCI8YPbdQfnFAlIaUUpRoFU0FAWgWR0B0Kk4ZMtbtdX2UKGgGaAloD0MI3ZTyWgnzc0CUhpRSlGgVS/VoFkdAdCtRCx/us3V9lChoBmgJaA9DCIRiK2hannNAlIaUUpRoFUvdaBZHQHQty1iONo91fZQoaAZoCWgPQwglk1M7Q1NvQJSGlFKUaBVL22gWR0B0L3bmEGqxdX2UKGgGaAloD0MI/+cwX962ckCUhpRSlGgVTQMBaBZHQHQwSLQ5WBB1fZQoaAZoCWgPQwiPjquRXZNKQJSGlFKUaBVLuGgWR0B0MJKmKqGUdX2UKGgGaAloD0MITyMtlfdSckCUhpRSlGgVS8hoFkdAdDEOz6ab4XV9lChoBmgJaA9DCNo391dPH3BAlIaUUpRoFUvlaBZHQHQz5GWldkd1fZQoaAZoCWgPQwhcrn5sEhxyQJSGlFKUaBVL6GgWR0B0NDqC6H0sdX2UKGgGaAloD0MIOZz51ZywcUCUhpRSlGgVS9hoFkdAdDUDiwSrYHV9lChoBmgJaA9DCIRhwJLrEHJAlIaUUpRoFUvRaBZHQHQ3r/XGwRp1fZQoaAZoCWgPQwimRuhnqoVyQJSGlFKUaBVL+GgWR0B0N/5LytmudX2UKGgGaAloD0MIuHaiJCTscECUhpRSlGgVS+NoFkdAdDnwxFiKBXV9lChoBmgJaA9DCCxhbYzdMHFAlIaUUpRoFUvhaBZHQHQ58oMKCxx1fZQoaAZoCWgPQwjJkc7AyItxQJSGlFKUaBVNBwFoFkdAdDraGYa5w3V9lChoBmgJaA9DCMQGCydpeHJAlIaUUpRoFUv2aBZHQHQ7JezD4xl1fZQoaAZoCWgPQwiuYvGbwj1vQJSGlFKUaBVLzWgWR0B0PGcbzbvgdX2UKGgGaAloD0MI5Zgs7j98cECUhpRSlGgVS/loFkdAdEFn6l+Ey3V9lChoBmgJaA9DCNmVlpF63W5AlIaUUpRoFUvNaBZHQHRC6S9ugpV1fZQoaAZoCWgPQwj2mEhp9oJyQJSGlFKUaBVL9mgWR0B0RAM3IdU9dX2UKGgGaAloD0MIg2qDE9Gzb0CUhpRSlGgVS+9oFkdAdEUa3Zwn6XV9lChoBmgJaA9DCHoYWp2ccXNAlIaUUpRoFUvoaBZHQHRGD5bhWHV1fZQoaAZoCWgPQwiCWDZzCBh0QJSGlFKUaBVL+2gWR0B0Rwb1h9b5dX2UKGgGaAloD0MIKZMa2oBLcUCUhpRSlGgVS9doFkdAdEeJVKf4AXV9lChoBmgJaA9DCIicvp7vBnNAlIaUUpRoFUvYaBZHQHRIQeFL39J1fZQoaAZoCWgPQwgCYhIu5HZwQJSGlFKUaBVL6WgWR0B0SK6WgOBldX2UKGgGaAloD0MIFm75SEobc0CUhpRSlGgVS+hoFkdAdEwCFbmlqXV9lChoBmgJaA9DCA37PbFOy3FAlIaUUpRoFUvKaBZHQHRM4qkM1CR1fZQoaAZoCWgPQwiIZMixdfhuQJSGlFKUaBVL22gWR0B0TRkAggX/dX2UKGgGaAloD0MIeXjPgeUrc0CUhpRSlGgVS+ZoFkdAdE4Npudf9nV9lChoBmgJaA9DCBfZzvfTAHJAlIaUUpRoFUv9aBZHQHROJzYEnst1fZQoaAZoCWgPQwgPJzCd1pJxQJSGlFKUaBVLz2gWR0B0TqrXDm8vdX2UKGgGaAloD0MIjbPpCGBJcUCUhpRSlGgVS+toFkdAdE9RZU1hs3V9lChoBmgJaA9DCJxSXishcXJAlIaUUpRoFUvYaBZHQHRULbQC0Wx1fZQoaAZoCWgPQwhQFymUhVtvQJSGlFKUaBVL0mgWR0B0VQrFwT/RdX2UKGgGaAloD0MINuZ1xCHvS0CUhpRSlGgVS6VoFkdAdFaqyWzF/HV9lChoBmgJaA9DCJQXmYDfBXJAlIaUUpRoFUv1aBZHQHRZHV9Wp611fZQoaAZoCWgPQwhB8s6hDIdxQJSGlFKUaBVL3WgWR0B0WeiKziS8dX2UKGgGaAloD0MIY3rCEo+AckCUhpRSlGgVS/ZoFkdAdFpG+sYEXHV9lChoBmgJaA9DCGnIeJRKr21AlIaUUpRoFUvvaBZHQHRanXqZ+hJ1fZQoaAZoCWgPQwhngAuyZepxQJSGlFKUaBVL3mgWR0B0WpMEidJ8dX2UKGgGaAloD0MIBwySPq0HdECUhpRSlGgVS99oFkdAdFteYlY2bXV9lChoBmgJaA9DCAwepn0zm3BAlIaUUpRoFUvpaBZHQHRgUZNwiq11fZQoaAZoCWgPQwjoTNpU3eNuQJSGlFKUaBVL0WgWR0B0YQc5sCT2dX2UKGgGaAloD0MIdopVg7CKbkCUhpRSlGgVS99oFkdAdGGbtJFspHV9lChoBmgJaA9DCAIR4srZQ3NAlIaUUpRoFUvxaBZHQHRiSDujRD11fZQoaAZoCWgPQwiE9BQ5RChwQJSGlFKUaBVL5WgWR0B0YkvTPSlWdX2UKGgGaAloD0MIOdOE7SfMcUCUhpRSlGgVTQABaBZHQHRjUQwsXi11ZS4="}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gASVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="}, "_n_updates": 620, "n_steps": 1024, "gamma": 0.999, "gae_lambda": 0.98, "ent_coef": 0.01, "vf_coef": 0.5, "max_grad_norm": 0.5, "batch_size": 64, "n_epochs": 4, "clip_range": {":type:": "<class 'function'>", ":serialized:": "gASVvwIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwNX2J1aWx0aW5fdHlwZZSTlIwKTGFtYmRhVHlwZZSFlFKUKGgCjAhDb2RlVHlwZZSFlFKUKEsBSwBLAUsBSxNDBIgAUwCUToWUKYwBX5SFlIxIL3Vzci9sb2NhbC9saWIvcHl0aG9uMy43L2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lIwEZnVuY5RLgEMCAAGUjAN2YWyUhZQpdJRSlH2UKIwLX19wYWNrYWdlX1+UjBhzdGFibGVfYmFzZWxpbmVzMy5jb21tb26UjAhfX25hbWVfX5SMHnN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi51dGlsc5SMCF9fZmlsZV9flIxIL3Vzci9sb2NhbC9saWIvcHl0aG9uMy43L2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaCB9lH2UKGgXaA6MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgYjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/JmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "clip_range_vf": null, "normalize_advantage": true, "target_kl": null, "system_info": {"OS": "Linux-5.4.188+-x86_64-with-Ubuntu-18.04-bionic #1 SMP Sun Apr 24 10:03:06 PDT 2022", "Python": "3.7.13", "Stable-Baselines3": "1.5.0", "PyTorch": "1.11.0+cu113", "GPU Enabled": "True", "Numpy": "1.21.6", "Gym": "0.21.0"}}
|
replay.mp4
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a39e3365089d7ec4ef08302d5baea1dce6e43deeae97c9ecccdff4f8dad477e7
|
3 |
+
size 186162
|
results.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"mean_reward":
|
|
|
1 |
+
{"mean_reward": 283.10897031647426, "std_reward": 19.04043042617934, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-07-06T00:53:30.123002"}
|
{username}//{model_architecture}-{env_id}.zip
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:80e81fb4cb0a021383a04df3bd762e01707c4510015b014e8319e1df2a2203b9
|
3 |
+
size 144096
|
{username}//{model_architecture}-{env_id}/data
CHANGED
@@ -4,19 +4,19 @@
|
|
4 |
":serialized:": "gASVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==",
|
5 |
"__module__": "stable_baselines3.common.policies",
|
6 |
"__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param sde_net_arch: Network architecture for extracting features\n when using gSDE. If None, the latent features from the policy will be used.\n Pass an empty list to use the states as features.\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ",
|
7 |
-
"__init__": "<function ActorCriticPolicy.__init__ at
|
8 |
-
"_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at
|
9 |
-
"reset_noise": "<function ActorCriticPolicy.reset_noise at
|
10 |
-
"_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at
|
11 |
-
"_build": "<function ActorCriticPolicy._build at
|
12 |
-
"forward": "<function ActorCriticPolicy.forward at
|
13 |
-
"_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at
|
14 |
-
"_predict": "<function ActorCriticPolicy._predict at
|
15 |
-
"evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at
|
16 |
-
"get_distribution": "<function ActorCriticPolicy.get_distribution at
|
17 |
-
"predict_values": "<function ActorCriticPolicy.predict_values at
|
18 |
"__abstractmethods__": "frozenset()",
|
19 |
-
"_abc_impl": "<_abc_data object at
|
20 |
},
|
21 |
"verbose": 0,
|
22 |
"policy_kwargs": {},
|
@@ -47,7 +47,7 @@
|
|
47 |
"_num_timesteps_at_start": 0,
|
48 |
"seed": null,
|
49 |
"action_noise": null,
|
50 |
-
"start_time":
|
51 |
"learning_rate": 0.0003,
|
52 |
"tensorboard_log": null,
|
53 |
"lr_schedule": {
|
@@ -56,7 +56,7 @@
|
|
56 |
},
|
57 |
"_last_obs": {
|
58 |
":type:": "<class 'numpy.ndarray'>",
|
59 |
-
":serialized:": "gASVjQIAAAAAAACMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMDF9yZWNvbnN0cnVjdJSTlIwFbnVtcHmUjAduZGFycmF5lJOUSwCFlEMBYpSHlFKUKEsBSxBLCIaUaAOMBWR0eXBllJOUjAJmNJSJiIeUUpQoSwOMATyUTk5OSv////9K/////
|
60 |
},
|
61 |
"_last_episode_starts": {
|
62 |
":type:": "<class 'numpy.ndarray'>",
|
@@ -69,13 +69,13 @@
|
|
69 |
"_current_progress_remaining": -0.015808000000000044,
|
70 |
"ep_info_buffer": {
|
71 |
":type:": "<class 'collections.deque'>",
|
72 |
-
":serialized:": "
|
73 |
},
|
74 |
"ep_success_buffer": {
|
75 |
":type:": "<class 'collections.deque'>",
|
76 |
":serialized:": "gASVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="
|
77 |
},
|
78 |
-
"_n_updates":
|
79 |
"n_steps": 1024,
|
80 |
"gamma": 0.999,
|
81 |
"gae_lambda": 0.98,
|
|
|
4 |
":serialized:": "gASVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==",
|
5 |
"__module__": "stable_baselines3.common.policies",
|
6 |
"__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param sde_net_arch: Network architecture for extracting features\n when using gSDE. If None, the latent features from the policy will be used.\n Pass an empty list to use the states as features.\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ",
|
7 |
+
"__init__": "<function ActorCriticPolicy.__init__ at 0x7f170b6e0440>",
|
8 |
+
"_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7f170b6e04d0>",
|
9 |
+
"reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7f170b6e0560>",
|
10 |
+
"_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7f170b6e05f0>",
|
11 |
+
"_build": "<function ActorCriticPolicy._build at 0x7f170b6e0680>",
|
12 |
+
"forward": "<function ActorCriticPolicy.forward at 0x7f170b6e0710>",
|
13 |
+
"_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7f170b6e07a0>",
|
14 |
+
"_predict": "<function ActorCriticPolicy._predict at 0x7f170b6e0830>",
|
15 |
+
"evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7f170b6e08c0>",
|
16 |
+
"get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7f170b6e0950>",
|
17 |
+
"predict_values": "<function ActorCriticPolicy.predict_values at 0x7f170b6e09e0>",
|
18 |
"__abstractmethods__": "frozenset()",
|
19 |
+
"_abc_impl": "<_abc_data object at 0x7f170b736270>"
|
20 |
},
|
21 |
"verbose": 0,
|
22 |
"policy_kwargs": {},
|
|
|
47 |
"_num_timesteps_at_start": 0,
|
48 |
"seed": null,
|
49 |
"action_noise": null,
|
50 |
+
"start_time": 1657068448.7295914,
|
51 |
"learning_rate": 0.0003,
|
52 |
"tensorboard_log": null,
|
53 |
"lr_schedule": {
|
|
|
56 |
},
|
57 |
"_last_obs": {
|
58 |
":type:": "<class 'numpy.ndarray'>",
|
59 |
+
":serialized:": "gASVjQIAAAAAAACMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMDF9yZWNvbnN0cnVjdJSTlIwFbnVtcHmUjAduZGFycmF5lJOUSwCFlEMBYpSHlFKUKEsBSxBLCIaUaAOMBWR0eXBllJOUjAJmNJSJiIeUUpQoSwOMATyUTk5OSv////9K/////0sAdJRiiUIAAgAATUVPPjHDkD/Y0uY+LXjhvixYzj76/7c+AAAAAAAAAAAAx5s8hzeQP4wEEr2B4+m+2i6hPCZ5nrwAAAAAAAAAABq8or28i0U+wX4TPl+3wr5tGI89+lgGPAAAAAAAAAAAmmN1vfZEc7qG+La5lhOdNXXobLqm+9U4AAAAAAAAAAAa18+924TgPZItZD5sXsa+Tx6uPfsHlz0AAAAAAAAAAJr7vbyBW4O88zenuyPwjDvvU+89yu+AvAAAgD8AAIA/M14qPUuEqD4wCga9JdbLvrJdZzy+qaS7AAAAAAAAAABAboQ9GghhPll/wr5Ker2++vRHvpsKnLwAAAAAAAAAAGYE4r3MpWw+GtcwPp4+rL4zWYs8Jr+HuwAAAAAAAAAADdQWvoqUpz/yOeS+PtL5vmWyjr5C/GW+AAAAAAAAAACak7m9j6JBPlGgwT0DxKm+8h/lvLljvrwAAAAAAAAAAHOApb0YPJQ/Kzf0vYmc9L4NKxK+SLGjPAAAAAAAAAAADVMZPgJ4Wj94Ses9R5DgvnPYez5sBji9AAAAAAAAAABmiGY8ztOGvGILczxK6pW9I83MPQ/MtD4AAIA/AACAPzMYmjySEY080XUaveecnb5qp3W9Gg4ZPQAAAAAAAAAAABcDPRIlsT8sKRs+9sGRvpTleD1imVA+AAAAAAAAAACUdJRiLg=="
|
60 |
},
|
61 |
"_last_episode_starts": {
|
62 |
":type:": "<class 'numpy.ndarray'>",
|
|
|
69 |
"_current_progress_remaining": -0.015808000000000044,
|
70 |
"ep_info_buffer": {
|
71 |
":type:": "<class 'collections.deque'>",
|
72 |
+
":serialized:": "gASVIxAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpSMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMBnNjYWxhcpSTlIwFbnVtcHmUjAVkdHlwZZSTlIwCZjiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYkMIjBL0F/qGc0CUhpRSlIwBbJRL9owBdJRHQHOkn3L3bmF1fZQoaAZoCWgPQwhj1LX2Pt1xQJSGlFKUaBVL42gWR0BzpbBMzuWsdX2UKGgGaAloD0MI1SMNbquic0CUhpRSlGgVS9toFkdAc6XoYNy5qnV9lChoBmgJaA9DCFBQilZuxHJAlIaUUpRoFUveaBZHQHOnVx0dRzl1fZQoaAZoCWgPQwgw2uOFdPVvQJSGlFKUaBVL/mgWR0Bzp7atcObzdX2UKGgGaAloD0MIr2Ab8WTfcUCUhpRSlGgVS95oFkdAc6fY1He7+XV9lChoBmgJaA9DCAzIXu9+LW5AlIaUUpRoFUvaaBZHQHOrRtgrpaB1fZQoaAZoCWgPQwgzbmqg+bBUQJSGlFKUaBVLrmgWR0Bzq+5wwTM8dX2UKGgGaAloD0MIWoC21Swjb0CUhpRSlGgVS99oFkdAc6x3Hq/ucHV9lChoBmgJaA9DCJwU5j0OM3JAlIaUUpRoFUvqaBZHQHOv+JtSAH51fZQoaAZoCWgPQwj7BiY3io5wQJSGlFKUaBVL7mgWR0BzsU+5e7cxdX2UKGgGaAloD0MIR+f8FMc/ckCUhpRSlGgVS+doFkdAc7Gy0rsjV3V9lChoBmgJaA9DCLPw9bWu8mdAlIaUUpRoFU3oA2gWR0Bzsd3s5XEJdX2UKGgGaAloD0MIS+SCM/i4cECUhpRSlGgVS9NoFkdAc7HbrTpgTnV9lChoBmgJaA9DCPPoRliUOXJAlIaUUpRoFUvSaBZHQHOyUsvqTr51fZQoaAZoCWgPQwgf+BisOCVyQJSGlFKUaBVLz2gWR0Bzt+wt8NQTdX2UKGgGaAloD0MI5nYv98kCckCUhpRSlGgVS/loFkdAc7iC+De0onV9lChoBmgJaA9DCHhCrz8JhnFAlIaUUpRoFUvlaBZHQHO4xXCCSRt1fZQoaAZoCWgPQwjZXaCkAMtzQJSGlFKUaBVL3GgWR0BzuPnq3VkMdX2UKGgGaAloD0MI+l+uRUuJc0CUhpRSlGgVS8ZoFkdAc7kb9qDbrXV9lChoBmgJaA9DCGJp4Ef1GXJAlIaUUpRoFUvWaBZHQHO55jUd7v51fZQoaAZoCWgPQwgXR+Umak9vQJSGlFKUaBVL32gWR0BzuuIyj59FdX2UKGgGaAloD0MIL1G9NbB4cUCUhpRSlGgVS8doFkdAc7wocaOxS3V9lChoBmgJaA9DCILIIk18QXJAlIaUUpRoFUviaBZHQHO+5VOsT391fZQoaAZoCWgPQwgrhxbZDn5yQJSGlFKUaBVL4mgWR0Bzv2TV2A5JdX2UKGgGaAloD0MILZljeVc2cECUhpRSlGgVS9hoFkdAdAxfeDWbw3V9lChoBmgJaA9DCLuZ0Y+G4nJAlIaUUpRoFUvraBZHQHQM+1OTJQt1fZQoaAZoCWgPQwhTtHIvMNNuQJSGlFKUaBVL2WgWR0B0DPonrpqzdX2UKGgGaAloD0MIN1X3yGY1bUCUhpRSlGgVS+BoFkdAdA125QP7N3V9lChoBmgJaA9DCLn6sUm+63JAlIaUUpRoFUvfaBZHQHQNhjJ+2E11fZQoaAZoCWgPQwiVRszsc1BzQJSGlFKUaBVL82gWR0B0D5yFPBSDdX2UKGgGaAloD0MIg4qqX+lSRUCUhpRSlGgVS59oFkdAdBMFxXGOuXV9lChoBmgJaA9DCLkzEwxn2nBAlIaUUpRoFUvSaBZHQHQTUy+HrQh1fZQoaAZoCWgPQwgKhJ1i1bBwQJSGlFKUaBVL2GgWR0B0FE4uK4x2dX2UKGgGaAloD0MIHJqy0w/jcECUhpRSlGgVS8xoFkdAdBRPgNwzcnV9lChoBmgJaA9DCHuFBfeDKXFAlIaUUpRoFUvaaBZHQHQUTJEH+qB1fZQoaAZoCWgPQwg6ArhZPItyQJSGlFKUaBVL52gWR0B0FKhf0EowdX2UKGgGaAloD0MI3bQZp2FgcUCUhpRSlGgVS/JoFkdAdBZmig00nHV9lChoBmgJaA9DCOQPBp77SHBAlIaUUpRoFU0FAWgWR0B0GelwcYIjdX2UKGgGaAloD0MIqKj6lQ6Hc0CUhpRSlGgVS9ZoFkdAdBrcB2fTTnV9lChoBmgJaA9DCBo09E8w6HFAlIaUUpRoFUvwaBZHQHQciVGCqZN1fZQoaAZoCWgPQwg6z9iX7GBwQJSGlFKUaBVL2WgWR0B0H9L9MsYmdX2UKGgGaAloD0MI7Pma5bKjb0CUhpRSlGgVS+BoFkdAdB/2VE/jbXV9lChoBmgJaA9DCPj578GrO3NAlIaUUpRoFUvmaBZHQHQf8YZVGTd1fZQoaAZoCWgPQwhDVyJQPd5yQJSGlFKUaBVL42gWR0B0IMTmGM4tdX2UKGgGaAloD0MI6nqi60LhcUCUhpRSlGgVS/hoFkdAdCIJSzgMt3V9lChoBmgJaA9DCBuC4zIuqnFAlIaUUpRoFUvaaBZHQHQiLHdXT3J1fZQoaAZoCWgPQwgvi4nNB5xxQJSGlFKUaBVLwmgWR0B0I90q6OHWdX2UKGgGaAloD0MIaYoApzcOcECUhpRSlGgVS9doFkdAdCVRmbsniXV9lChoBmgJaA9DCOV+h6IApHNAlIaUUpRoFUvMaBZHQHQljlDF6zF1fZQoaAZoCWgPQwiWIvlKIOpyQJSGlFKUaBVLzGgWR0B0Je3I+4b0dX2UKGgGaAloD0MIbVM8LmqLcECUhpRSlGgVS9JoFkdAdCYV+7UXpHV9lChoBmgJaA9DCI8YPbdQfnFAlIaUUpRoFU0FAWgWR0B0Kk4ZMtbtdX2UKGgGaAloD0MI3ZTyWgnzc0CUhpRSlGgVS/VoFkdAdCtRCx/us3V9lChoBmgJaA9DCIRiK2hannNAlIaUUpRoFUvdaBZHQHQty1iONo91fZQoaAZoCWgPQwglk1M7Q1NvQJSGlFKUaBVL22gWR0B0L3bmEGqxdX2UKGgGaAloD0MI/+cwX962ckCUhpRSlGgVTQMBaBZHQHQwSLQ5WBB1fZQoaAZoCWgPQwiPjquRXZNKQJSGlFKUaBVLuGgWR0B0MJKmKqGUdX2UKGgGaAloD0MITyMtlfdSckCUhpRSlGgVS8hoFkdAdDEOz6ab4XV9lChoBmgJaA9DCNo391dPH3BAlIaUUpRoFUvlaBZHQHQz5GWldkd1fZQoaAZoCWgPQwhcrn5sEhxyQJSGlFKUaBVL6GgWR0B0NDqC6H0sdX2UKGgGaAloD0MIOZz51ZywcUCUhpRSlGgVS9hoFkdAdDUDiwSrYHV9lChoBmgJaA9DCIRhwJLrEHJAlIaUUpRoFUvRaBZHQHQ3r/XGwRp1fZQoaAZoCWgPQwimRuhnqoVyQJSGlFKUaBVL+GgWR0B0N/5LytmudX2UKGgGaAloD0MIuHaiJCTscECUhpRSlGgVS+NoFkdAdDnwxFiKBXV9lChoBmgJaA9DCCxhbYzdMHFAlIaUUpRoFUvhaBZHQHQ58oMKCxx1fZQoaAZoCWgPQwjJkc7AyItxQJSGlFKUaBVNBwFoFkdAdDraGYa5w3V9lChoBmgJaA9DCMQGCydpeHJAlIaUUpRoFUv2aBZHQHQ7JezD4xl1fZQoaAZoCWgPQwiuYvGbwj1vQJSGlFKUaBVLzWgWR0B0PGcbzbvgdX2UKGgGaAloD0MI5Zgs7j98cECUhpRSlGgVS/loFkdAdEFn6l+Ey3V9lChoBmgJaA9DCNmVlpF63W5AlIaUUpRoFUvNaBZHQHRC6S9ugpV1fZQoaAZoCWgPQwj2mEhp9oJyQJSGlFKUaBVL9mgWR0B0RAM3IdU9dX2UKGgGaAloD0MIg2qDE9Gzb0CUhpRSlGgVS+9oFkdAdEUa3Zwn6XV9lChoBmgJaA9DCHoYWp2ccXNAlIaUUpRoFUvoaBZHQHRGD5bhWHV1fZQoaAZoCWgPQwiCWDZzCBh0QJSGlFKUaBVL+2gWR0B0Rwb1h9b5dX2UKGgGaAloD0MIKZMa2oBLcUCUhpRSlGgVS9doFkdAdEeJVKf4AXV9lChoBmgJaA9DCIicvp7vBnNAlIaUUpRoFUvYaBZHQHRIQeFL39J1fZQoaAZoCWgPQwgCYhIu5HZwQJSGlFKUaBVL6WgWR0B0SK6WgOBldX2UKGgGaAloD0MIFm75SEobc0CUhpRSlGgVS+hoFkdAdEwCFbmlqXV9lChoBmgJaA9DCA37PbFOy3FAlIaUUpRoFUvKaBZHQHRM4qkM1CR1fZQoaAZoCWgPQwiIZMixdfhuQJSGlFKUaBVL22gWR0B0TRkAggX/dX2UKGgGaAloD0MIeXjPgeUrc0CUhpRSlGgVS+ZoFkdAdE4Npudf9nV9lChoBmgJaA9DCBfZzvfTAHJAlIaUUpRoFUv9aBZHQHROJzYEnst1fZQoaAZoCWgPQwgPJzCd1pJxQJSGlFKUaBVLz2gWR0B0TqrXDm8vdX2UKGgGaAloD0MIjbPpCGBJcUCUhpRSlGgVS+toFkdAdE9RZU1hs3V9lChoBmgJaA9DCJxSXishcXJAlIaUUpRoFUvYaBZHQHRULbQC0Wx1fZQoaAZoCWgPQwhQFymUhVtvQJSGlFKUaBVL0mgWR0B0VQrFwT/RdX2UKGgGaAloD0MINuZ1xCHvS0CUhpRSlGgVS6VoFkdAdFaqyWzF/HV9lChoBmgJaA9DCJQXmYDfBXJAlIaUUpRoFUv1aBZHQHRZHV9Wp611fZQoaAZoCWgPQwhB8s6hDIdxQJSGlFKUaBVL3WgWR0B0WeiKziS8dX2UKGgGaAloD0MIY3rCEo+AckCUhpRSlGgVS/ZoFkdAdFpG+sYEXHV9lChoBmgJaA9DCGnIeJRKr21AlIaUUpRoFUvvaBZHQHRanXqZ+hJ1fZQoaAZoCWgPQwhngAuyZepxQJSGlFKUaBVL3mgWR0B0WpMEidJ8dX2UKGgGaAloD0MIBwySPq0HdECUhpRSlGgVS99oFkdAdFteYlY2bXV9lChoBmgJaA9DCAwepn0zm3BAlIaUUpRoFUvpaBZHQHRgUZNwiq11fZQoaAZoCWgPQwjoTNpU3eNuQJSGlFKUaBVL0WgWR0B0YQc5sCT2dX2UKGgGaAloD0MIdopVg7CKbkCUhpRSlGgVS99oFkdAdGGbtJFspHV9lChoBmgJaA9DCAIR4srZQ3NAlIaUUpRoFUvxaBZHQHRiSDujRD11fZQoaAZoCWgPQwiE9BQ5RChwQJSGlFKUaBVL5WgWR0B0YkvTPSlWdX2UKGgGaAloD0MIOdOE7SfMcUCUhpRSlGgVTQABaBZHQHRjUQwsXi11ZS4="
|
73 |
},
|
74 |
"ep_success_buffer": {
|
75 |
":type:": "<class 'collections.deque'>",
|
76 |
":serialized:": "gASVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="
|
77 |
},
|
78 |
+
"_n_updates": 620,
|
79 |
"n_steps": 1024,
|
80 |
"gamma": 0.999,
|
81 |
"gae_lambda": 0.98,
|
{username}//{model_architecture}-{env_id}/policy.optimizer.pth
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 84893
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:efd74cb46c874442800213602302c1eec4a718fd54152bcc0773fb4d4a845fc7
|
3 |
size 84893
|
{username}//{model_architecture}-{env_id}/policy.pth
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 43201
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:affc3aad75254a172272cb54fc892423908591860d74a8ab1994259b4ddc6a95
|
3 |
size 43201
|