deep-rl-class-unit01-LunarLander-v2 / ppo-LunarLander-v2-1000000-steps-hf

Commit History

Trained with 1000000 steps and HF's parameters
c2d7881
unverified

diskshima commited on