LearnHF-LunarLander-v2 / ppo-LunarLander-v2

Commit History

🥰📚
5b4930b

nbiish committed on

The key is a while loop on the mean_reward variable when evaluating your agent🥰📚
094b889

nbiish committed on
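
The commit message above mentions looping on mean_reward during evaluation. A minimal sketch of that idea follows, assuming the loop alternates a chunk of training with an evaluation until the mean reward reaches a target. The names train_until_target, train_chunk, evaluate, target_mean_reward and max_rounds are hypothetical and are not taken from this repo, and the 200-point target is an assumption. With Stable-Baselines3 (also an assumption about the setup), train_chunk could wrap model.learn(...) and evaluate could wrap evaluate_policy(...), which returns a (mean_reward, std_reward) pair.

```python
from typing import Callable, Tuple


def train_until_target(
    train_chunk: Callable[[], None],
    evaluate: Callable[[], Tuple[float, float]],
    target_mean_reward: float = 200.0,  # assumed target, not from the repo
    max_rounds: int = 50,               # safety cap so the loop always ends
) -> Tuple[float, int]:
    """Alternate training and evaluation until mean_reward hits the target."""
    mean_reward = float("-inf")
    rounds = 0
    # The while loop on mean_reward: keep training while the agent is
    # below the target and the round budget is not exhausted.
    while mean_reward < target_mean_reward and rounds < max_rounds:
        train_chunk()                    # e.g. one more batch of timesteps
        mean_reward, _std = evaluate()   # e.g. average over several episodes
        rounds += 1
    return mean_reward, rounds


# Stand-in agent for illustration only: each training chunk adds 50 reward.
state = {"reward": -100.0}


def fake_train() -> None:
    state["reward"] += 50.0


def fake_evaluate() -> Tuple[float, float]:
    return state["reward"], 0.0


final_reward, rounds_used = train_until_target(fake_train, fake_evaluate)
print(final_reward, rounds_used)
```

The max_rounds cap is a design choice of this sketch: without it, an agent that never reaches the target would keep the loop running forever.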

Training more and learning the routine
1a708dc

nbiish committed on

Trained for 1 million steps once - LunarLander-v2 trained agent
3db2cc2

nbiish committed on