Battu007
/

V4_PPO2_LunarLander_v2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions Community

V4_PPO2_LunarLander_v2 / V4_PPO_LL

2 contributors

History: 1 commit

ASBattu

PPO Hyperparemeter tune 1M steps LL-2 agent

825d67c over 2 years ago

_stable_baselines3_version

5 Bytes

PPO Hyperparemeter tune 1M steps LL-2 agent over 2 years ago
data

17.5 kB

PPO Hyperparemeter tune 1M steps LL-2 agent over 2 years ago
policy.optimizer.pth
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
What is a pickle import?
84.6 kB
LFS

PPO Hyperparemeter tune 1M steps LL-2 agent over 2 years ago
policy.pth
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
What is a pickle import?
43.1 kB
LFS

PPO Hyperparemeter tune 1M steps LL-2 agent over 2 years ago
pytorch_variables.pth

431 Bytes
LFS

PPO Hyperparemeter tune 1M steps LL-2 agent over 2 years ago
system_info.txt

146 Bytes

PPO Hyperparemeter tune 1M steps LL-2 agent over 2 years ago