PPO-LunarLander-v2 / config.json

Commit History

Initial PPO model on 1000000 training steps
ad41807

shivr commited on