Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
lucascruz
/
ppo_lunarlander
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
Community
Use this model
main
ppo_lunarlander
/
PPO-LunarLander-v2
1 contributor
History:
1 commit
lucascruz
PPO in lunar lander with (~15h of) tunning of hyperparams
b25e08b
over 1 year ago
_stable_baselines3_version
5 Bytes
PPO in lunar lander with (~15h of) tunning of hyperparams
over 1 year ago
data
15.5 kB
PPO in lunar lander with (~15h of) tunning of hyperparams
over 1 year ago
policy.optimizer.pth
87.9 kB
LFS
PPO in lunar lander with (~15h of) tunning of hyperparams
over 1 year ago
policy.pth
43.2 kB
LFS
PPO in lunar lander with (~15h of) tunning of hyperparams
over 1 year ago
pytorch_variables.pth
pickle
431 Bytes
LFS
PPO in lunar lander with (~15h of) tunning of hyperparams
over 1 year ago
system_info.txt
208 Bytes
PPO in lunar lander with (~15h of) tunning of hyperparams
over 1 year ago