ppo-LunarLander-v2 / README.md

Commit History

Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
257c5fb
verified

crossroderick commited on

Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
958737f
verified

crossroderick commited on

Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
a7b84b9
verified

crossroderick commited on

Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
48759a4
verified

crossroderick commited on

Tuned PPO agent trained on LunarLander-v2 (1 million timesteps)
2ad0405
verified

crossroderick commited on

Standard PPO agent trained on LunarLander-v2 (3 million timesteps)
6507c03
verified

crossroderick commited on

Standard PPO agent trained on LunarLander-v2 (2 million timesteps)
7fef5cc
verified

crossroderick commited on

Standard PPO agent trained on LunarLander-v2 (3 million timesteps)
a4e534e
verified

crossroderick commited on

Standard PPO agent trained on LunarLander-v2 (3 million timesteps)
23e630c
verified

crossroderick commited on

Initial commit of a PPO agent trained on LunarLander-v2 (2 million timesteps)
747edb3
verified

crossroderick commited on