Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
LinasKo
/
RL-lander
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
xet
Community
Use this model
main
RL-lander
Ctrl+K
Ctrl+K
1 contributor
History:
5 commits
LinasKo
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
3f14e63
almost 3 years ago
DQN-1e6
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
almost 3 years ago
a2c-1e6
A2C trained on LunarLander-v2 for 1e6 timesteps
almost 3 years ago
lander-MlpPolicy-1
The initial run of the lander, after training for 1M timestamps.
almost 3 years ago
ppo-1e6
Trained on my local machine.
almost 3 years ago
.gitattributes
Safe
1.48 kB
initial commit
almost 3 years ago
DQN-1e6.zip
Safe
110 kB
xet
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
almost 3 years ago
README.md
Safe
784 Bytes
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
almost 3 years ago
a2c-1e6.zip
Safe
102 kB
xet
A2C trained on LunarLander-v2 for 1e6 timesteps
almost 3 years ago
config.json
Safe
19.4 kB
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
almost 3 years ago
lander-MlpPolicy-1.zip
Safe
147 kB
xet
The initial run of the lander, after training for 1M timestamps.
almost 3 years ago
ppo-1e6.zip
Safe
148 kB
xet
Trained on my local machine.
almost 3 years ago
replay.mp4
Safe
207 kB
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
almost 3 years ago
results.json
Safe
165 Bytes
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
almost 3 years ago