xszhou committed on
Commit 72d502f · 1 Parent(s): 6ffbb4d

Upload PPO LunarLander-v2 trained agent with help from optuna

README.md ADDED
@@ -0,0 +1,37 @@
+ ---
+ library_name: stable-baselines3
+ tags:
+ - LunarLander-v2
+ - deep-reinforcement-learning
+ - reinforcement-learning
+ - stable-baselines3
+ model-index:
+ - name: PPO
+   results:
+   - task:
+       type: reinforcement-learning
+       name: reinforcement-learning
+     dataset:
+       name: LunarLander-v2
+       type: LunarLander-v2
+     metrics:
+     - type: mean_reward
+       value: 285.20 +/- 16.37
+       name: mean_reward
+       verified: false
+ ---
+
+ # **PPO** Agent playing **LunarLander-v2**
+ This is a trained model of a **PPO** agent playing **LunarLander-v2**
+ using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+
+ ## Usage (with Stable-baselines3)
+ TODO: Add your code
+
+
+ ```python
+ from stable_baselines3 import ...
+ from huggingface_sb3 import load_from_hub
+
+ ...
+ ```
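The usage block in this README is still the unfilled template. As a rough sketch of how a checkpoint like this one is typically loaded, assuming `huggingface_sb3`, `stable_baselines3`, and a Gymnasium install with Box2D support are available: `REPO_ID` is a hypothetical placeholder (the repository name is not shown in this commit), and `load_and_evaluate` is an illustrative helper, not part of either library.

```python
# Sketch only: the repository id is NOT shown in this commit, so REPO_ID is a
# hypothetical placeholder that must be replaced before running.
REPO_ID = "<user>/<repo-name>"  # hypothetical placeholder
MODEL_FILENAME = "optuned-ppo-LunarLander-v2.zip"  # file added in this commit


def load_and_evaluate(repo_id=REPO_ID, filename=MODEL_FILENAME, n_eval_episodes=10):
    """Download the checkpoint and return (mean_reward, std_reward)."""
    # Imported lazily so this module can be read without the RL stack installed.
    import gymnasium as gym
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import PPO
    from stable_baselines3.common.evaluation import evaluate_policy
    from stable_baselines3.common.monitor import Monitor

    # Fetch the zip from the Hub and rebuild the PPO agent from it.
    checkpoint = load_from_hub(repo_id=repo_id, filename=filename)
    model = PPO.load(checkpoint)

    # "LunarLander-v2" needs a Gymnasium version that still registers it
    # (0.28.1 was used for training according to system_info.txt).
    eval_env = Monitor(gym.make("LunarLander-v2"))
    return evaluate_policy(
        model, eval_env, n_eval_episodes=n_eval_episodes, deterministic=True
    )
```

With a real repository id, `load_and_evaluate()` should return a `(mean_reward, std_reward)` pair in the neighbourhood of the values in `results.json`, though individual evaluation runs will differ.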
config.json ADDED
@@ -0,0 +1 @@
+ {"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==", "__module__": "stable_baselines3.common.policies", "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. 
In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ", "__init__": "<function ActorCriticPolicy.__init__ at 0x7a6d183c85e0>", "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7a6d183c8670>", "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7a6d183c8700>", "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7a6d183c8790>", "_build": "<function ActorCriticPolicy._build at 0x7a6d183c8820>", "forward": "<function ActorCriticPolicy.forward at 0x7a6d183c88b0>", "extract_features": "<function ActorCriticPolicy.extract_features at 0x7a6d183c8940>", "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7a6d183c89d0>", "_predict": "<function ActorCriticPolicy._predict at 0x7a6d183c8a60>", "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7a6d183c8af0>", "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7a6d183c8b80>", "predict_values": "<function ActorCriticPolicy.predict_values at 0x7a6d183c8c10>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x7a6d183c5180>"}, "verbose": 1, "policy_kwargs": {":type:": "<class 'dict'>", ":serialized:": 
"gAWVlQAAAAAAAAB9lCiMDGxvZ19zdGRfaW5pdJRHP+l+K/qQGoiMCG5ldF9hcmNolH2UKIwCcGmUXZQoTQABTQABTQABZYwCdmaUXZQoTQABTQABTQABZXWMDWFjdGl2YXRpb25fZm6UjBt0b3JjaC5ubi5tb2R1bGVzLmFjdGl2YXRpb26UjARSZUxVlJOUjApvcnRob19pbml0lIh1Lg==", "log_std_init": 0.796651830082582, "net_arch": {"pi": [256, 256, 256], "vf": [256, 256, 256]}, "activation_fn": "<class 'torch.nn.modules.activation.ReLU'>", "ortho_init": true}, "num_timesteps": 2015232, "_total_timesteps": 2000000, "_num_timesteps_at_start": 0, "seed": null, "action_noise": null, "start_time": 1694060867728305936, "learning_rate": 0.0003, "tensorboard_log": null, "_last_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVdQIAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYAAgAAAAAAAHPNuj0AJKM/FhO3PsKjDr9C2xg+ll+EPgAAAAAAAAAALdZpPsLViD/iVwE/1LQ1vxDkxj5rTVg+AAAAAAAAAABm/DG9wzF2uhVRzjO0POGvuKiWOhWcsbMAAIA/AACAPzvigr6PzXY/17a6vktwIb9ZSf++dtBQvgAAAAAAAAAALb4GvtJfrLsVEly8PjbIur2YDD1MWKo7AACAPwAAgD/dGos+aDuNP4IPuT6EWRW/AkH9PgA6VD4AAAAAAAAAAM3gcjyKwLE/tXoQPRESBb8WwqI9F4FEPQAAAAAAAAAAc2GqvY9eNroC0UO48xeaslkDtrqtLGA3AACAPwAAgD/NrOa8FEaFuqc0SD1hnEE1VOFru+dBNTQAAIA/AAAAAFrPnT3hHti6HISJuvPSvDybagK8AFihPQAAgD8AAAAAZmYBuun7M7y6KxG9JTYbPTyiaz06ghk8AACAPwAAgD9mlw+9SC+dupaptzRAZJcvy0rMuRLHRLMAAIA/AACAP/r/Oj7SQAM/lEoSPilRTr/glJQ+I4kovAAAAAAAAAAAwOFyPqvugz54aKe+1NEdv9SWTz4dB16+AAAAAAAAAADjB1e+FnZyP5Exhr4RQ0q/HT7EvrbuGr0AAAAAAAAAAAC4pDuktku7NlCzvbVotjuSSok84wKqvAAAgD8AAIA/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwiGlIwBQ5R0lFKULg=="}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="}, "_last_original_obs": null, "_episode_num": 0, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": -0.007616000000000067, "_stats_window_size": 100, "ep_info_buffer": 
{":type:": "<class 'collections.deque'>", ":serialized:": "gAWV4AsAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQHIc6fJ3gUGMAWyUS5yMAXSUR0Csic5eqrBCdX2UKGgGR0Bw1OyeI2wWaAdLjmgIR0Csid42S+xodX2UKGgGR0ByVMo1DSgHaAdLs2gIR0CsiiawdKdydX2UKGgGR0Bz2WhIvrWzaAdL3mgIR0Csikxoh6jWdX2UKGgGR0BxjYTnJT2naAdLomgIR0CsikqbayrxdX2UKGgGR0Bx/ynGbTc7aAdLjmgIR0CsimAX/HYIdX2UKGgGR0BwMKrIYFaCaAdLn2gIR0CsimcQiA2AdX2UKGgGR0BzfBLuhK15aAdLxGgIR0CsmWdq+JxedX2UKGgGR0Bx52UNayKOaAdLsGgIR0CsmZFNUOurdX2UKGgGR0Bw6qAwwj+raAdLmWgIR0CsmdDZ13dLdX2UKGgGR0ByYA64lQdkaAdLm2gIR0CsmfN+LFXJdX2UKGgGR0ByaTBKtga4aAdLqWgIR0CsmiFgc94edX2UKGgGR0Bxvx9x6v7naAdLomgIR0Csmk0TL4etdX2UKGgGR0Bw8ZwNsnAqaAdLnmgIR0Csmm1uivgWdX2UKGgGR0BzBs4lyBClaAdLwWgIR0CsmmpGWldkdX2UKGgGR0By5CIUJv5yaAdLsmgIR0CsmnEXk5p8dX2UKGgGR0BwkR+rlvIfaAdLkGgIR0CsmqMvysjndX2UKGgGR0BzZh9Wp6yCaAdLu2gIR0CsmrCQLeANdX2UKGgGR0By3CR9w3o+aAdLtmgIR0CsmrswL3K0dX2UKGgGR0Bxba05U96kaAdLh2gIR0Csmsq77Kq5dX2UKGgGR0BwDg6xPfsNaAdLiWgIR0CsmtZkkKNRdX2UKGgGR0Bw00fIS13MaAdLlGgIR0CsmvYzJp35dX2UKGgGR0BxertKIznBaAdLqWgIR0Csmwb2tdRjdX2UKGgGR0BxEUFNcnmaaAdLsGgIR0Csm22zWwu/dX2UKGgGR0Bw3zebd8AraAdL0mgIR0Csm3SLqD9PdX2UKGgGR0Bxu+IZZSvUaAdLj2gIR0Csm3Zk078vdX2UKGgGR0BvojwMH8jzaAdLkmgIR0Csm6Xos7MgdX2UKGgGR0BymEpEx7AtaAdLjWgIR0Csm97fpD/mdX2UKGgGR0Bylxf7aZhKaAdLq2gIR0CsnBALZzxPdX2UKGgGR0ByyhsabWmQaAdL2mgIR0CsnCPE0iyIdX2UKGgGR0BxyulDWsijaAdLlWgIR0CsnC7L2YfGdX2UKGgGR0ByQnO4XoC/aAdLtWgIR0CsnEoF/x2CdX2UKGgGR0BzxJhqj8DTaAdLzGgIR0CsnIIAwPAgdX2UKGgGR0Bu+VvGZNO/aAdLj2gIR0CsnIbmU4aQdX2UKGgGR0Bx5oGSpzcRaAdLs2gIR0CsnKJGvwEydX2UKGgGR0By1THfdhy9aAdLvGgIR0CsnKCr92ovdX2UKGgGR0ByZg5IYm9haAdLuWgIR0CsnL6VdHDrdX2UKGgGR0BxpfwTdtVJaAdLrWgIR0CsnMIXj2i+dX2UKGgGR0BzKJoRIz3zaAdL7WgIR0CsnS034sVddX2UKGgGR0ByCE4lyBClaAdLq2gIR0CsnTiW/rSmdX2UKGgGR0BzkMjFAE+xaAdLxGgIR0CsnYVLamGedX2UKGgGR0BxeHFglWwNaAdLy2gIR0CsnZp9qk/KdX2UKGgGR0Bw44dT5wfhaAdLimgIR0CsnZ74agmJdX2UKGgGR0BxTOxt52QoaAdLpWgIR0CsnZvv0AcUdX2UKGgGR0BzG0XzlLezaAdL12gIR0Csne+evpyIdX2UKGgGR0ByotmGucMFaAdLuWgIR0CsngCQcPvsdX2UKGgGR0Bw/dC0F8ohaAdLkGgIR0CsngYWL
xZudX2UKGgGR0ByVNj/dZaFaAdLr2gIR0Csnh2joIOZdX2UKGgGR0BwHhBmf5DaaAdLkmgIR0CsnibulXRxdX2UKGgGR0BxzIood+5OaAdLxWgIR0Csni0aqCHzdX2UKGgGR0Bu9va+N96UaAdLq2gIR0CsnkkRaouPdX2UKGgGR0Bwx+Ml1KXfaAdLpWgIR0CsnlWphnandX2UKGgGR0BziwRVZLZjaAdLn2gIR0CsnmQQtjCpdX2UKGgGR0ByNFttQ9A5aAdLtWgIR0CsnpXvYvnKdX2UKGgGR0BxeoJHAh0RaAdLmGgIR0Csnr6isXBQdX2UKGgGR0BxavPQfIS2aAdLfmgIR0CsnsNEofCAdX2UKGgGR0BujjAYYR/WaAdLlWgIR0Csnxj1PFefdX2UKGgGR0Bxo/5HmRvFaAdLxWgIR0CsnzIWpIczdX2UKGgGR0BxkFSvTw2EaAdLsGgIR0Csn2qN6w+udX2UKGgGR0BzHpJ2+wkgaAdLt2gIR0Csn3uw5eZ5dX2UKGgGR0BxvOmelKsdaAdLo2gIR0Csn5opH7P6dX2UKGgGR0BxQV9iMHbAaAdLnGgIR0Csn9FWfbsXdX2UKGgGR0Bzib6l+EytaAdLpWgIR0Csn+kAHVwxdX2UKGgGR0Bx7W4oZydXaAdLuGgIR0Csn/d4mkWRdX2UKGgGR0ByCHNqxkd4aAdLpWgIR0CsoBzvqkdndX2UKGgGR0ByQ0H8jzI4aAdLwmgIR0CsoClsguAadX2UKGgGR0B0I6wTufEoaAdLz2gIR0CsoCy3b212dX2UKGgGR0ByRKhM8HObaAdLsmgIR0CsoDBTfixWdX2UKGgGR0ByLIjQiRnwaAdLr2gIR0CsoEK/dqL1dX2UKGgGR0BwxJCUornUaAdLlGgIR0CsoFmoR7JGdX2UKGgGR0Bxgj60pmVaaAdLpmgIR0CsoF2mxdIHdX2UKGgGR0BySODnNgSfaAdLmmgIR0CsoGsuWa+fdX2UKGgGR0Bx3a6DoQnQaAdLmmgIR0CsoMiJXQt0dX2UKGgGR0Bx7FrpJPIoaAdLpGgIR0CsoMx1xKg7dX2UKGgGR0BzPtu4wyqNaAdLn2gIR0CsoRh6KLsKdX2UKGgGR0BzXqG1x82KaAdLxWgIR0CsoXDm0VrRdX2UKGgGR0BwisNutOmBaAdLmWgIR0CsoXfOD8LsdX2UKGgGR0BxE9AB1cMWaAdLh2gIR0CsoYADzRQadX2UKGgGR0Bx8drIo3JgaAdLu2gIR0CsoYVyvLX+dX2UKGgGR0BzVkN4JNTMaAdLs2gIR0CsoaRCQcPwdX2UKGgGR0Bw4ZWfbsWwaAdLmmgIR0CsobzJp35fdX2UKGgGR0ByJrf8/D+BaAdLxGgIR0CsoeWDQJHBdX2UKGgGR0Bz20MWoFV1aAdLs2gIR0Csoe35N47jdX2UKGgGR0BxphQ+EAYIaAdLmmgIR0CsofN/vv0AdX2UKGgGR0BzqGcXm/34aAdLqWgIR0Csof9g4OtodX2UKGgGR0BwxEAtFrmAaAdLw2gIR0CsoiQj+rEMdX2UKGgGR0ByXAPAfuCxaAdLv2gIR0CsomPl2eQNdX2UKGgGR0B0WhSn+AEuaAdLyGgIR0Csomvq9oN/dX2UKGgGR0BwLQug6EJ0aAdLsGgIR0CsoqcE3bVSdX2UKGgGR0BynsvCdjG2aAdLlGgIR0CsoqqZ+hGpdX2UKGgGR0Bxw/DVH4GmaAdLtGgIR0Csoq6n752ydX2UKGgGR0ByM5Mj/uLKaAdLj2gIR0CsovbxusLfdX2UKGgGR0BxUgdFOO81aAdLgWgIR0Csow7UwztUdX2UKGgGR0BwbV3Roh6jaAdLlWgIR0Cso2OObRWtdX2UKGgGR0ByJJjOLR8daAdLtmgIR0Cso32nbZezdX2UKGgGR0Bw7WpDNQj2aAdLnWgIR0Cso4P4EfT1dX2UKGgGR0BxGosxwhnraAdLjmgIR0Cso4/T1
CgLdX2UKGgGR0Bx8AOLBKtgaAdLzGgIR0Cso5k25xzadX2UKGgGR0BxdQiml67eaAdLqGgIR0Cso53mV7hOdX2UKGgGR0Bz3Reu3c59aAdLsGgIR0Cso79ic5KfdWUu"}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="}, "_n_updates": 1230, "observation_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVcAIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lGgFk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoB4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoECiWCAAAAAAAAAABAQEBAQEBAZRoFEsIhZRoGHSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBAoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaApLCIWUaBh0lFKUjARoaWdolGgQKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgKSwiFlGgYdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=", "dtype": "float32", "bounded_below": "[ True True True True True True True True]", "bounded_above": "[ True True True True True True True True]", "_shape": [8], "low": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]", "low_repr": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high_repr": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. 
]", "_np_random": null}, "action_space": {":type:": "<class 'gymnasium.spaces.discrete.Discrete'>", ":serialized:": "gAWV1QAAAAAAAACMGWd5bW5hc2l1bS5zcGFjZXMuZGlzY3JldGWUjAhEaXNjcmV0ZZSTlCmBlH2UKIwBbpSMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMBnNjYWxhcpSTlIwFbnVtcHmUjAVkdHlwZZSTlIwCaTiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYkMIBAAAAAAAAACUhpRSlIwFc3RhcnSUaAhoDkMIAAAAAAAAAACUhpRSlIwGX3NoYXBllCloCmgOjApfbnBfcmFuZG9tlE51Yi4=", "n": "4", "start": "0", "_shape": [], "dtype": "int64", "_np_random": null}, "n_envs": 16, "n_steps": 1024, "gamma": 0.99, "gae_lambda": 0.92, "ent_coef": 0.04214859382089834, "vf_coef": 0.8884865667357631, "max_grad_norm": 0.3, "batch_size": 128, "n_epochs": 10, "clip_range": {":type:": "<class 'function'>", ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/ZmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "clip_range_vf": null, "normalize_advantage": true, "target_kl": null, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": 
"gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "system_info": {"OS": "Linux-5.15.109+-x86_64-with-glibc2.35 # 1 SMP Fri Jun 9 10:57:30 UTC 2023", "Python": "3.10.12", "Stable-Baselines3": "2.0.0a5", "PyTorch": "2.0.1+cu118", "GPU Enabled": "True", "Numpy": "1.23.5", "Cloudpickle": "2.2.1", "Gymnasium": "0.28.1", "OpenAI Gym": "0.25.2"}}
optuned-ppo-LunarLander-v2.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:be7aa34e01f264819838e7127f646dd3e29d3c92b6521c79f685e5ad0f11fcdd
+ size 3263162
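The three lines above are a Git LFS pointer: the repository stores only the hash and size, and the real zip lives in LFS storage. A minimal sketch of checking a downloaded file against such a pointer, using only the standard library; `parse_lfs_pointer` and `matches_pointer` are illustrative names, not Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Return True if the file has the size and hash recorded in the pointer."""
    data = path.read_bytes()
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# Pointer text copied from this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:be7aa34e01f264819838e7127f646dd3e29d3c92b6521c79f685e5ad0f11fcdd
size 3263162
"""

pointer = parse_lfs_pointer(POINTER)
```

After downloading the archive, `matches_pointer(Path("optuned-ppo-LunarLander-v2.zip"), pointer)` would confirm that the local copy is the one this commit refers to.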
optuned-ppo-LunarLander-v2/_stable_baselines3_version ADDED
@@ -0,0 +1 @@
+ 2.0.0a5
optuned-ppo-LunarLander-v2/data ADDED
@@ -0,0 +1,117 @@
+ {
+ "policy_class": {
+ ":type:": "<class 'abc.ABCMeta'>",
+ ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==",
+ "__module__": "stable_baselines3.common.policies",
+ "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ",
+ "__init__": "<function ActorCriticPolicy.__init__ at 0x7a6d183c85e0>",
+ "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7a6d183c8670>",
+ "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7a6d183c8700>",
+ "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7a6d183c8790>",
+ "_build": "<function ActorCriticPolicy._build at 0x7a6d183c8820>",
+ "forward": "<function ActorCriticPolicy.forward at 0x7a6d183c88b0>",
+ "extract_features": "<function ActorCriticPolicy.extract_features at 0x7a6d183c8940>",
+ "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7a6d183c89d0>",
+ "_predict": "<function ActorCriticPolicy._predict at 0x7a6d183c8a60>",
+ "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7a6d183c8af0>",
+ "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7a6d183c8b80>",
+ "predict_values": "<function ActorCriticPolicy.predict_values at 0x7a6d183c8c10>",
+ "__abstractmethods__": "frozenset()",
+ "_abc_impl": "<_abc._abc_data object at 0x7a6d183c5180>"
+ },
+ "verbose": 1,
+ "policy_kwargs": {
+ ":type:": "<class 'dict'>",
+ ":serialized:": "gAWVlQAAAAAAAAB9lCiMDGxvZ19zdGRfaW5pdJRHP+l+K/qQGoiMCG5ldF9hcmNolH2UKIwCcGmUXZQoTQABTQABTQABZYwCdmaUXZQoTQABTQABTQABZXWMDWFjdGl2YXRpb25fZm6UjBt0b3JjaC5ubi5tb2R1bGVzLmFjdGl2YXRpb26UjARSZUxVlJOUjApvcnRob19pbml0lIh1Lg==",
+ "log_std_init": 0.796651830082582,
+ "net_arch": {
+ "pi": [
+ 256,
+ 256,
+ 256
+ ],
+ "vf": [
+ 256,
+ 256,
+ 256
+ ]
+ },
+ "activation_fn": "<class 'torch.nn.modules.activation.ReLU'>",
+ "ortho_init": true
+ },
+ "num_timesteps": 2015232,
+ "_total_timesteps": 2000000,
+ "_num_timesteps_at_start": 0,
+ "seed": null,
+ "action_noise": null,
+ "start_time": 1694060867728305936,
+ "learning_rate": 0.0003,
+ "tensorboard_log": null,
+ "_last_obs": {
+ ":type:": "<class 'numpy.ndarray'>",
+ ":serialized:": "gAWVdQIAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYAAgAAAAAAAHPNuj0AJKM/FhO3PsKjDr9C2xg+ll+EPgAAAAAAAAAALdZpPsLViD/iVwE/1LQ1vxDkxj5rTVg+AAAAAAAAAABm/DG9wzF2uhVRzjO0POGvuKiWOhWcsbMAAIA/AACAPzvigr6PzXY/17a6vktwIb9ZSf++dtBQvgAAAAAAAAAALb4GvtJfrLsVEly8PjbIur2YDD1MWKo7AACAPwAAgD/dGos+aDuNP4IPuT6EWRW/AkH9PgA6VD4AAAAAAAAAAM3gcjyKwLE/tXoQPRESBb8WwqI9F4FEPQAAAAAAAAAAc2GqvY9eNroC0UO48xeaslkDtrqtLGA3AACAPwAAgD/NrOa8FEaFuqc0SD1hnEE1VOFru+dBNTQAAIA/AAAAAFrPnT3hHti6HISJuvPSvDybagK8AFihPQAAgD8AAAAAZmYBuun7M7y6KxG9JTYbPTyiaz06ghk8AACAPwAAgD9mlw+9SC+dupaptzRAZJcvy0rMuRLHRLMAAIA/AACAP/r/Oj7SQAM/lEoSPilRTr/glJQ+I4kovAAAAAAAAAAAwOFyPqvugz54aKe+1NEdv9SWTz4dB16+AAAAAAAAAADjB1e+FnZyP5Exhr4RQ0q/HT7EvrbuGr0AAAAAAAAAAAC4pDuktku7NlCzvbVotjuSSok84wKqvAAAgD8AAIA/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwiGlIwBQ5R0lFKULg=="
+ },
+ "_last_episode_starts": {
+ ":type:": "<class 'numpy.ndarray'>",
+ ":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="
+ },
+ "_last_original_obs": null,
+ "_episode_num": 0,
+ "use_sde": false,
+ "sde_sample_freq": -1,
+ "_current_progress_remaining": -0.007616000000000067,
+ "_stats_window_size": 100,
+ "ep_info_buffer": {
+ ":type:": "<class 'collections.deque'>",
+ ":serialized:": "gAWV4AsAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQHIc6fJ3gUGMAWyUS5yMAXSUR0Csic5eqrBCdX2UKGgGR0Bw1OyeI2wWaAdLjmgIR0Csid42S+xodX2UKGgGR0ByVMo1DSgHaAdLs2gIR0CsiiawdKdydX2UKGgGR0Bz2WhIvrWzaAdL3mgIR0Csikxoh6jWdX2UKGgGR0BxjYTnJT2naAdLomgIR0CsikqbayrxdX2UKGgGR0Bx/ynGbTc7aAdLjmgIR0CsimAX/HYIdX2UKGgGR0BwMKrIYFaCaAdLn2gIR0CsimcQiA2AdX2UKGgGR0BzfBLuhK15aAdLxGgIR0CsmWdq+JxedX2UKGgGR0Bx52UNayKOaAdLsGgIR0CsmZFNUOurdX2UKGgGR0Bw6qAwwj+raAdLmWgIR0CsmdDZ13dLdX2UKGgGR0ByYA64lQdkaAdLm2gIR0CsmfN+LFXJdX2UKGgGR0ByaTBKtga4aAdLqWgIR0CsmiFgc94edX2UKGgGR0Bxvx9x6v7naAdLomgIR0Csmk0TL4etdX2UKGgGR0Bw8ZwNsnAqaAdLnmgIR0Csmm1uivgWdX2UKGgGR0BzBs4lyBClaAdLwWgIR0CsmmpGWldkdX2UKGgGR0By5CIUJv5yaAdLsmgIR0CsmnEXk5p8dX2UKGgGR0BwkR+rlvIfaAdLkGgIR0CsmqMvysjndX2UKGgGR0BzZh9Wp6yCaAdLu2gIR0CsmrCQLeANdX2UKGgGR0By3CR9w3o+aAdLtmgIR0CsmrswL3K0dX2UKGgGR0Bxba05U96kaAdLh2gIR0Csmsq77Kq5dX2UKGgGR0BwDg6xPfsNaAdLiWgIR0CsmtZkkKNRdX2UKGgGR0Bw00fIS13MaAdLlGgIR0CsmvYzJp35dX2UKGgGR0BxertKIznBaAdLqWgIR0Csmwb2tdRjdX2UKGgGR0BxEUFNcnmaaAdLsGgIR0Csm22zWwu/dX2UKGgGR0Bw3zebd8AraAdL0mgIR0Csm3SLqD9PdX2UKGgGR0Bxu+IZZSvUaAdLj2gIR0Csm3Zk078vdX2UKGgGR0BvojwMH8jzaAdLkmgIR0Csm6Xos7MgdX2UKGgGR0BymEpEx7AtaAdLjWgIR0Csm97fpD/mdX2UKGgGR0Bylxf7aZhKaAdLq2gIR0CsnBALZzxPdX2UKGgGR0ByyhsabWmQaAdL2mgIR0CsnCPE0iyIdX2UKGgGR0BxyulDWsijaAdLlWgIR0CsnC7L2YfGdX2UKGgGR0ByQnO4XoC/aAdLtWgIR0CsnEoF/x2CdX2UKGgGR0BzxJhqj8DTaAdLzGgIR0CsnIIAwPAgdX2UKGgGR0Bu+VvGZNO/aAdLj2gIR0CsnIbmU4aQdX2UKGgGR0Bx5oGSpzcRaAdLs2gIR0CsnKJGvwEydX2UKGgGR0By1THfdhy9aAdLvGgIR0CsnKCr92ovdX2UKGgGR0ByZg5IYm9haAdLuWgIR0CsnL6VdHDrdX2UKGgGR0BxpfwTdtVJaAdLrWgIR0CsnMIXj2i+dX2UKGgGR0BzKJoRIz3zaAdL7WgIR0CsnS034sVddX2UKGgGR0ByCE4lyBClaAdLq2gIR0CsnTiW/rSmdX2UKGgGR0BzkMjFAE+xaAdLxGgIR0CsnYVLamGedX2UKGgGR0BxeHFglWwNaAdLy2gIR0CsnZp9qk/KdX2UKGgGR0Bw44dT5wfhaAdLimgIR0CsnZ74agmJdX2UKGgGR0BxTOxt52QoaAdLpWgIR0CsnZvv0AcUdX2UKGgGR0BzG0XzlLezaAdL12gIR0Csne+evpyIdX2UKGgGR0ByotmGucMFaAdLuWgIR0CsngCQcPvsdX2UKGgGR0Bw/dC0F8ohaAdLkGgIR0CsngYWLxZudX2UKGgGR0ByVNj/dZaFaAdLr2gIR0Csnh2jo
IOZdX2UKGgGR0BwHhBmf5DaaAdLkmgIR0CsnibulXRxdX2UKGgGR0BxzIood+5OaAdLxWgIR0Csni0aqCHzdX2UKGgGR0Bu9va+N96UaAdLq2gIR0CsnkkRaouPdX2UKGgGR0Bwx+Ml1KXfaAdLpWgIR0CsnlWphnandX2UKGgGR0BziwRVZLZjaAdLn2gIR0CsnmQQtjCpdX2UKGgGR0ByNFttQ9A5aAdLtWgIR0CsnpXvYvnKdX2UKGgGR0BxeoJHAh0RaAdLmGgIR0Csnr6isXBQdX2UKGgGR0BxavPQfIS2aAdLfmgIR0CsnsNEofCAdX2UKGgGR0BujjAYYR/WaAdLlWgIR0Csnxj1PFefdX2UKGgGR0Bxo/5HmRvFaAdLxWgIR0CsnzIWpIczdX2UKGgGR0BxkFSvTw2EaAdLsGgIR0Csn2qN6w+udX2UKGgGR0BzHpJ2+wkgaAdLt2gIR0Csn3uw5eZ5dX2UKGgGR0BxvOmelKsdaAdLo2gIR0Csn5opH7P6dX2UKGgGR0BxQV9iMHbAaAdLnGgIR0Csn9FWfbsXdX2UKGgGR0Bzib6l+EytaAdLpWgIR0Csn+kAHVwxdX2UKGgGR0Bx7W4oZydXaAdLuGgIR0Csn/d4mkWRdX2UKGgGR0ByCHNqxkd4aAdLpWgIR0CsoBzvqkdndX2UKGgGR0ByQ0H8jzI4aAdLwmgIR0CsoClsguAadX2UKGgGR0B0I6wTufEoaAdLz2gIR0CsoCy3b212dX2UKGgGR0ByRKhM8HObaAdLsmgIR0CsoDBTfixWdX2UKGgGR0ByLIjQiRnwaAdLr2gIR0CsoEK/dqL1dX2UKGgGR0BwxJCUornUaAdLlGgIR0CsoFmoR7JGdX2UKGgGR0Bxgj60pmVaaAdLpmgIR0CsoF2mxdIHdX2UKGgGR0BySODnNgSfaAdLmmgIR0CsoGsuWa+fdX2UKGgGR0Bx3a6DoQnQaAdLmmgIR0CsoMiJXQt0dX2UKGgGR0Bx7FrpJPIoaAdLpGgIR0CsoMx1xKg7dX2UKGgGR0BzPtu4wyqNaAdLn2gIR0CsoRh6KLsKdX2UKGgGR0BzXqG1x82KaAdLxWgIR0CsoXDm0VrRdX2UKGgGR0BwisNutOmBaAdLmWgIR0CsoXfOD8LsdX2UKGgGR0BxE9AB1cMWaAdLh2gIR0CsoYADzRQadX2UKGgGR0Bx8drIo3JgaAdLu2gIR0CsoYVyvLX+dX2UKGgGR0BzVkN4JNTMaAdLs2gIR0CsoaRCQcPwdX2UKGgGR0Bw4ZWfbsWwaAdLmmgIR0CsobzJp35fdX2UKGgGR0ByJrf8/D+BaAdLxGgIR0CsoeWDQJHBdX2UKGgGR0Bz20MWoFV1aAdLs2gIR0Csoe35N47jdX2UKGgGR0BxphQ+EAYIaAdLmmgIR0CsofN/vv0AdX2UKGgGR0BzqGcXm/34aAdLqWgIR0Csof9g4OtodX2UKGgGR0BwxEAtFrmAaAdLw2gIR0CsoiQj+rEMdX2UKGgGR0ByXAPAfuCxaAdLv2gIR0CsomPl2eQNdX2UKGgGR0B0WhSn+AEuaAdLyGgIR0Csomvq9oN/dX2UKGgGR0BwLQug6EJ0aAdLsGgIR0CsoqcE3bVSdX2UKGgGR0BynsvCdjG2aAdLlGgIR0CsoqqZ+hGpdX2UKGgGR0Bxw/DVH4GmaAdLtGgIR0Csoq6n752ydX2UKGgGR0ByM5Mj/uLKaAdLj2gIR0CsovbxusLfdX2UKGgGR0BxUgdFOO81aAdLgWgIR0Csow7UwztUdX2UKGgGR0BwbV3Roh6jaAdLlWgIR0Cso2OObRWtdX2UKGgGR0ByJJjOLR8daAdLtmgIR0Cso32nbZezdX2UKGgGR0Bw7WpDNQj2aAdLnWgIR0Cso4P4EfT1dX2UKGgGR0BxGosxwhnraAdLjmgIR0Cso4/T1CgLdX2UKGgGR0Bx8AOLBKtgaAdLzGgIR0Cso5k25
xzadX2UKGgGR0BxdQiml67eaAdLqGgIR0Cso53mV7hOdX2UKGgGR0Bz3Reu3c59aAdLsGgIR0Cso79ic5KfdWUu"
+ },
+ "ep_success_buffer": {
+ ":type:": "<class 'collections.deque'>",
+ ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="
+ },
+ "_n_updates": 1230,
+ "observation_space": {
+ ":type:": "<class 'gymnasium.spaces.box.Box'>",
+ ":serialized:": "gAWVcAIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lGgFk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoB4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoECiWCAAAAAAAAAABAQEBAQEBAZRoFEsIhZRoGHSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBAoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaApLCIWUaBh0lFKUjARoaWdolGgQKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgKSwiFlGgYdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=",
+ "dtype": "float32",
+ "bounded_below": "[ True True True True True True True True]",
+ "bounded_above": "[ True True True True True True True True]",
+ "_shape": [
+ 8
+ ],
+ "low": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]",
+ "high": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]",
+ "low_repr": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]",
+ "high_repr": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]",
+ "_np_random": null
+ },
+ "action_space": {
+ ":type:": "<class 'gymnasium.spaces.discrete.Discrete'>",
+ ":serialized:": "gAWV1QAAAAAAAACMGWd5bW5hc2l1bS5zcGFjZXMuZGlzY3JldGWUjAhEaXNjcmV0ZZSTlCmBlH2UKIwBbpSMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMBnNjYWxhcpSTlIwFbnVtcHmUjAVkdHlwZZSTlIwCaTiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYkMIBAAAAAAAAACUhpRSlIwFc3RhcnSUaAhoDkMIAAAAAAAAAACUhpRSlIwGX3NoYXBllCloCmgOjApfbnBfcmFuZG9tlE51Yi4=",
+ "n": "4",
+ "start": "0",
+ "_shape": [],
+ "dtype": "int64",
+ "_np_random": null
+ },
+ "n_envs": 16,
+ "n_steps": 1024,
+ "gamma": 0.99,
+ "gae_lambda": 0.92,
+ "ent_coef": 0.04214859382089834,
+ "vf_coef": 0.8884865667357631,
+ "max_grad_norm": 0.3,
+ "batch_size": 128,
+ "n_epochs": 10,
+ "clip_range": {
+ ":type:": "<class 'function'>",
+ ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/ZmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"
+ },
+ "clip_range_vf": null,
+ "normalize_advantage": true,
+ "target_kl": null,
+ "lr_schedule": {
+ ":type:": "<class 'function'>",
+ ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"
+ }
+ }
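Several values in this file can be cross-checked with the standard library alone. The sketch below makes three assumptions: PPO only stops at rollout boundaries and adds `n_epochs` to `_n_updates` once per rollout; `ep_success_buffer` is a plain pickled `deque`; and the constant captured by the `clip_range` closure is stored as a pickle `G` opcode followed by a big-endian float64. `DequeOnlyUnpickler` is an illustrative helper, not part of Stable-Baselines3.

```python
import base64
import io
import math
import pickle
import struct
from collections import deque

# Values copied from the data file above.
n_envs, n_steps, n_epochs = 16, 1024, 10
total_timesteps = 2_000_000

# 1) Step counters: each rollout collects n_envs * n_steps transitions, and
#    training stops at a rollout boundary, so the budget is overshot slightly.
rollout_size = n_envs * n_steps
n_rollouts = math.ceil(total_timesteps / rollout_size)
num_timesteps = n_rollouts * rollout_size
n_updates = n_rollouts * n_epochs
progress_remaining = 1.0 - num_timesteps / total_timesteps


# 2) Inspect a small pickle through an allow-list rather than a bare
#    pickle.loads, which would import whatever the payload names.
class DequeOnlyUnpickler(pickle.Unpickler):
    def find_class(self, module, name):
        if (module, name) == ("collections", "deque"):
            return deque
        raise pickle.UnpicklingError(f"blocked: {module}.{name}")


EP_SUCCESS_B64 = "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="
raw = base64.b64decode(EP_SUCCESS_B64)
ep_success_buffer = DequeOnlyUnpickler(io.BytesIO(raw)).load()

# 3) A 12-character slice of the clip_range ":serialized:" string: one opcode
#    byte ('G') followed by the 8-byte float captured by the closure.
opcode_and_float = base64.b64decode("Rz/ZmZmZmZma")
clip_range = struct.unpack(">d", opcode_and_float[1:])[0]

print(rollout_size, n_rollouts, num_timesteps, n_updates)
print(ep_success_buffer.maxlen, clip_range)
```

Under those assumptions the derived counters agree with the stored `num_timesteps`, `_n_updates`, and `_current_progress_remaining`, and the empty success buffer has the same length limit as `_stats_window_size`.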
optuned-ppo-LunarLander-v2/policy.optimizer.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c930f67d9592260572c1cea10acf323cbe1d505dd31bb6979cd6cee13ebf1084
+ size 2165333
optuned-ppo-LunarLander-v2/policy.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a48cb207f2827c59f6723be3b52c19c40738c8d8031e25cdbc3d8b5e1d676161
+ size 1081781
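The sizes of `policy.pth` and `policy.optimizer.pth` can be roughly explained from the network shape in `policy_kwargs`. This is a back-of-the-envelope sketch that assumes separate actor and critic MLPs with one linear head each, float32 weights, and an Adam optimizer holding two moment tensors per parameter; the exact on-disk layout is not shown in this commit.

```python
def mlp_params(sizes):
    """Weights plus biases of a stack of fully connected layers."""
    return sum(n_in * n_out + n_out for n_in, n_out in zip(sizes, sizes[1:]))


obs_dim, n_actions = 8, 4            # observation_space._shape and action_space.n
pi_arch = vf_arch = [256, 256, 256]  # policy_kwargs.net_arch

# Assumed layout: an MLP body per network, followed by a single linear head.
actor = mlp_params([obs_dim, *pi_arch]) + mlp_params([pi_arch[-1], n_actions])
critic = mlp_params([obs_dim, *vf_arch]) + mlp_params([vf_arch[-1], 1])
n_params = actor + critic

weights_bytes = n_params * 4          # float32
adam_state_bytes = 2 * weights_bytes  # first and second moment per parameter

print(n_params, weights_bytes, adam_state_bytes)
```

Both estimates fall a few kilobytes below the recorded sizes (1081781 and 2165333 bytes), which is consistent with serialization overhead on top of the raw tensors.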
optuned-ppo-LunarLander-v2/pytorch_variables.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
+ size 431
optuned-ppo-LunarLander-v2/system_info.txt ADDED
@@ -0,0 +1,9 @@
+ - OS: Linux-5.15.109+-x86_64-with-glibc2.35 # 1 SMP Fri Jun 9 10:57:30 UTC 2023
+ - Python: 3.10.12
+ - Stable-Baselines3: 2.0.0a5
+ - PyTorch: 2.0.1+cu118
+ - GPU Enabled: True
+ - Numpy: 1.23.5
+ - Cloudpickle: 2.2.1
+ - Gymnasium: 0.28.1
+ - OpenAI Gym: 0.25.2
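Because the checkpoint embeds cloudpickled objects, reloading it tends to work best with versions close to the ones listed above. A small sketch that turns this file into pinned requirement strings; the label-to-package mapping in `PIP_NAMES` is an assumption about the matching PyPI names, and dropping the `+cu118` local version tag is a simplification.

```python
SYSTEM_INFO = """\
- OS: Linux-5.15.109+-x86_64-with-glibc2.35 # 1 SMP Fri Jun 9 10:57:30 UTC 2023
- Python: 3.10.12
- Stable-Baselines3: 2.0.0a5
- PyTorch: 2.0.1+cu118
- GPU Enabled: True
- Numpy: 1.23.5
- Cloudpickle: 2.2.1
- Gymnasium: 0.28.1
- OpenAI Gym: 0.25.2
"""

# Assumed mapping from the labels above to PyPI distribution names.
PIP_NAMES = {
    "Stable-Baselines3": "stable-baselines3",
    "PyTorch": "torch",
    "Numpy": "numpy",
    "Cloudpickle": "cloudpickle",
    "Gymnasium": "gymnasium",
    "OpenAI Gym": "gym",
}


def parse_system_info(text):
    """Turn '- key: value' lines into a dict of strings."""
    info = {}
    for line in text.splitlines():
        key, _, value = line.removeprefix("- ").partition(": ")
        info[key] = value
    return info


def pinned_requirements(info):
    """Build 'package==version' strings, dropping local tags such as '+cu118'."""
    return [f"{pip}=={info[label].split('+')[0]}" for label, pip in PIP_NAMES.items()]


requirements = pinned_requirements(parse_system_info(SYSTEM_INFO))
print("\n".join(requirements))
```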
replay.mp4 ADDED
Binary file (162 kB).
 
results.json ADDED
@@ -0,0 +1 @@
+ {"mean_reward": 285.19568010636857, "std_reward": 16.36622508993008, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-09-07T05:02:12.348925"}
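The `mean_reward` value in the README front matter appears to be these two numbers rounded to two decimals. A short sketch of that relationship, assuming plain fixed-point formatting was used to build the string:

```python
import json

# Contents of results.json as committed.
RESULTS_JSON = (
    '{"mean_reward": 285.19568010636857, "std_reward": 16.36622508993008, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2023-09-07T05:02:12.348925"}'
)

results = json.loads(RESULTS_JSON)

# Same "mean +/- std" shape as the metric value in the README front matter.
metric_value = f"{results['mean_reward']:.2f} +/- {results['std_reward']:.2f}"
print(metric_value)
```

Note that the evaluation covers only 10 deterministic episodes, so the standard deviation is a coarse estimate.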