tags: | |
- Pixelcopter-PLE-v0 | |
- reinforce | |
- reinforcement-learning | |
- custom-implementation | |
- deep-rl-class | |
model-index: | |
- name: Reinforce-Pixelcopter-PLE-v0 | |
results: | |
- task: | |
type: reinforcement-learning | |
name: reinforcement-learning | |
dataset: | |
name: Pixelcopter-PLE-v0 | |
type: Pixelcopter-PLE-v0 | |
metrics: | |
- type: mean_reward | |
value: 42.70 +/- 24.24 | |
name: mean_reward | |
verified: false | |
# **Reinforce** Agent playing **Pixelcopter-PLE-v0** | |
This is a trained model of a **Reinforce** agent playiscores = reinforce(pixelcopter_policy, | |
pixelcopter_optimizer, | |
pixelcopter_hyperparameters["n_training_episodes"], | |
pixelcopter_hyperparameters["max_t"], | |
pixelcopter_hyperparameters["gamma"], | |
1000)ng **Pixelcopter-PLE-v0** . | |
To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction | |