lander-go-fast / PPO-LunarLander-v2-5M-b256 /_stable_baselines3_version
jgerbscheid's picture
basic PPO model trained on local pc for 4M timesteps
f39177b
raw
history blame contribute delete
No virus
5 Bytes
1.5.0