|
--- |
|
tags: |
|
- Pixelcopter-PLE-v0 |
|
- reinforce |
|
- reinforcement-learning |
|
- custom-implementation |
|
- deep-rl-class |
|
model-index: |
|
- name: Reinforce-Pixelcopter-PLE-v0 |
|
results: |
|
- task: |
|
type: reinforcement-learning |
|
name: reinforcement-learning |
|
dataset: |
|
name: Pixelcopter-PLE-v0 |
|
type: Pixelcopter-PLE-v0 |
|
metrics: |
|
- type: mean_reward |
|
value: 76.70 +/- 65.02 |
|
name: mean_reward |
|
verified: false |
|
--- |
|
|
|
# **Reinforce** Agent playing **Pixelcopter-PLE-v0** |
|
This is a trained model of a **Reinforce** agent playing **Pixelcopter-PLE-v0** . |
|
To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction |
|
|
|
## Training Time |
|
Trained on 50 000 timesteps for 4 hours and 20 minutes. |
|
|
|
## Hyperparameters |
|
```python |
|
pixelcopter_hyperparameters = { |
|
"h_size": 64, |
|
"n_training_episodes": 50000, |
|
"n_evaluation_episodes": 10, |
|
"max_t": 10000, |
|
"gamma": 0.99, |
|
"lr": 1e-4, |
|
"env_id": env_id, |
|
"state_space": s_size, |
|
"action_space": a_size, |
|
} |
|
``` |
|
|