EvanMath/new-PPO-LunarLander-v2
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Branch: main
File: new-PPO-LunarLander-v2/PPO-LunarLander-v2/policy.optimizer.pth
Commit History
PPO trained on 500,000 steps.
e2eaf0e
EvanMath
committed on
Jul 22, 2022