coyotespike/lunarlanderV2
Reinforcement Learning
stable-baselines3
BipedalWalker-v3
deep-reinforcement-learning
Eval Results
lunarlanderV2/lunarLander
1 contributor
History: 2 commits
Latest commit: coyotespike, "Double training timesteps to 1 mil" (05e4b77), almost 2 years ago
File                        Size       Flags        Last commit message                    Updated
_stable_baselines3_version  5 Bytes    -            Upload PPO lunar lander trained agent  almost 2 years ago
data                        14.7 kB    -            Double training timesteps to 1 mil     almost 2 years ago
policy.optimizer.pth        87.9 kB    LFS          Double training timesteps to 1 mil     almost 2 years ago
policy.pth                  43.2 kB    LFS          Double training timesteps to 1 mil     almost 2 years ago
pytorch_variables.pth       431 Bytes  LFS, pickle  Upload PPO lunar lander trained agent  almost 2 years ago
system_info.txt             184 Bytes  -            Upload PPO lunar lander trained agent  almost 2 years ago