ppo-lunarlander-v2 / 30m-linlr_0_0002_after15m-ppo-LunarLander-v2.zip

Commit History

30m training steps with linear learning rate scheduler applied after 15m steps
6881971

bartpotrykus commited on