Reinforce Agent playing Pixelcopter-PLE-v0
This is a trained Reinforce (policy-gradient) agent playing Pixelcopter-PLE-v0, produced for Unit 4 of the Hugging Face Deep Reinforcement Learning Course.
Results
mean_reward: 17.25 +/- 16.31 over 50 evaluation episodes. Threshold: 5 (passed).
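For readers unfamiliar with the "mean +/- spread" notation, the sketch below shows one way such a figure could be computed from per-episode returns. It assumes the second number is the population standard deviation of the episode rewards; the function names and the sample rewards are hypothetical and are not taken from this model's evaluation run.

```python
from statistics import fmean, pstdev


def summarize_rewards(episode_rewards):
    """Return (mean, population standard deviation) of per-episode rewards."""
    if not episode_rewards:
        raise ValueError("need at least one evaluation episode")
    mean_reward = fmean(episode_rewards)
    # Assumption: population std (divide by N), not the sample std (N - 1).
    std_reward = pstdev(episode_rewards)
    return mean_reward, std_reward


def format_result(mean_reward, std_reward):
    """Format the summary the way it appears in the Results section."""
    return f"mean_reward: {mean_reward:.2f} +/- {std_reward:.2f}"


if __name__ == "__main__":
    # Hypothetical episode returns, for illustration only.
    rewards = [3.0, 12.0, 40.0, 7.0, 24.0]
    print(format_result(*summarize_rewards(rewards)))
```

A spread nearly as large as the mean, as reported above, indicates that individual episode scores vary widely from run to run.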