lunar_lander_v2-ppo / README.md

Commit History

first trained agent with proximal policy optimization
ca4b3e9

jjpp3301 committed on