Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
crossroderick
/
ppo-LunarLander-v2
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
Community
Use this model
257c5fb
ppo-LunarLander-v2
1 contributor
History:
11 commits
crossroderick
Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
257c5fb
verified
9 months ago
ppo-LunarLander-v2
Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
9 months ago
.gitattributes
1.52 kB
initial commit
9 months ago
README.md
784 Bytes
Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
9 months ago
config.json
16.7 kB
Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
9 months ago
ppo-LunarLander-v2.zip
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
150 kB
LFS
Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
9 months ago
replay.mp4
174 kB
Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
9 months ago
results.json
158 Bytes
Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
9 months ago