PPO LunarLander-v2 trained agent - batchsize 32, total_timestaps 4M a0ccefd verified polyconnect commited on May 19
PPO LunarLander-v2 trained agent - batchsize 32, total_timestaps 4M 72b7312 verified polyconnect commited on May 19