lander-go-fast / PPO-LunarLander-v2-5M-b256 /_stable_baselines3_version

Commit History

basic PPO model trained on local pc for 4M timesteps
f39177b

jgerbscheid commited on