Reinforce agent playing CartPole-v1
This is a trained model of a Reinforce agent playing CartPole-v1. This is a result of the training from the Deep RL Class by Hugging Face.
See this repo for the algorithm implementation.
Evaluation results
- mean_reward on CartPole-v1self-reported209.20 +/- 97.11