ppo-lunarlander-v2/3m-linlr-ppo-LunarLander-v2/_stable_baselines3_version
bartpotrykus
3m training steps with linear learning rate scheduler
05ea564
1.6.2