PPO (from scratch) - LunarLander-v2

A CleanRL-style PPO agent trained from scratch on LunarLander-v2.

Numbers are auto-generated from results.json so the card and results.json always match.

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Evaluation results