First version of the LunarLander-v2 model trained with PPO (2e5 training steps, no tuning).
b6157e7
cochaviz
committed on