Commit History

The key is a while loop on the mean_reward variable when evaluating your agent🥰📚
094b889

nbiish committed on

Training more and learning the routine
1a708dc

nbiish committed on

Trained 1mil steps once - LunarLander-v2 trained agent
3db2cc2

nbiish committed on
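
The most recent commit message (094b889) points at a while loop on mean_reward during evaluation. The repository's actual training script is not shown here, so the following is only a minimal sketch of that idea: keep training in chunks and re-evaluating until mean_reward reaches a target. ToyAgent, train_and_evaluate_until, the target of 200.0, and the max_rounds cap are all hypothetical names and values chosen for illustration. In a real setup the two callables would wrap the agent's own training and evaluation calls, which are assumed here to return a (mean_reward, std_reward) pair.

```python
import random


def train_and_evaluate_until(target_reward, train_chunk, evaluate, max_rounds=50):
    """Alternate training and evaluation until mean_reward >= target_reward.

    train_chunk and evaluate are caller-supplied callables (hypothetical here);
    evaluate must return a (mean_reward, std_reward) tuple.
    max_rounds is a safety cap so the loop cannot run forever.
    """
    mean_reward, std_reward = evaluate()
    rounds = 0
    while mean_reward < target_reward and rounds < max_rounds:
        train_chunk()                          # train a bit more
        mean_reward, std_reward = evaluate()   # then re-check the score
        rounds += 1
    return mean_reward, std_reward, rounds


class ToyAgent:
    """Hypothetical stand-in for a real agent: each training chunk adds skill."""

    def __init__(self, seed=0):
        self.skill = 0.0
        self.rng = random.Random(seed)

    def learn(self):
        # Pretend one chunk of training improves the agent by a fixed amount.
        self.skill += 25.0

    def evaluate(self, n_episodes=10):
        # Simulated episode rewards: a skill-dependent score plus small noise.
        rewards = [self.skill - 100.0 + self.rng.uniform(-5.0, 5.0)
                   for _ in range(n_episodes)]
        mean = sum(rewards) / len(rewards)
        variance = sum((r - mean) ** 2 for r in rewards) / len(rewards)
        return mean, variance ** 0.5


agent = ToyAgent()
mean_reward, std_reward, rounds = train_and_evaluate_until(
    target_reward=200.0, train_chunk=agent.learn, evaluate=agent.evaluate
)
print(f"stopped after {rounds} rounds: mean_reward={mean_reward:.1f} +/- {std_reward:.1f}")
```

The cap on rounds is a design choice of this sketch, not something stated in the commit message: without it, an agent that never reaches the target would keep the loop running indefinitely.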