Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Battu007
/
V4_PPO2_LunarLander_v2
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
Community
Use this model
825d67c
V4_PPO2_LunarLander_v2
/
V4_PPO_LL
2 contributors
History:
1 commit
ASBattu
PPO Hyperparemeter tune 1M steps LL-2 agent
825d67c
over 2 years ago
_stable_baselines3_version
5 Bytes
PPO Hyperparemeter tune 1M steps LL-2 agent
over 2 years ago
data
17.5 kB
PPO Hyperparemeter tune 1M steps LL-2 agent
over 2 years ago
policy.optimizer.pth
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
84.6 kB
LFS
PPO Hyperparemeter tune 1M steps LL-2 agent
over 2 years ago
policy.pth
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
43.1 kB
LFS
PPO Hyperparemeter tune 1M steps LL-2 agent
over 2 years ago
pytorch_variables.pth
pickle
431 Bytes
LFS
PPO Hyperparemeter tune 1M steps LL-2 agent
over 2 years ago
system_info.txt
146 Bytes
PPO Hyperparemeter tune 1M steps LL-2 agent
over 2 years ago