chirbard/Reinforce-Pixelcopter-PLE-v0

---
tags:
- Pixelcopter-PLE-v0
- reinforce
- reinforcement-learning
- custom-implementation
- deep-rl-class
model-index:
- name: Reinforce-Pixelcopter-PLE-v0
  results:
  - task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: Pixelcopter-PLE-v0
      type: Pixelcopter-PLE-v0
    metrics:
    - type: mean_reward
      value: 76.70 +/- 65.02
      name: mean_reward
      verified: false
---

# Reinforce Agent playing Pixelcopter-PLE-v0
This is a trained model of a Reinforce agent playing Pixelcopter-PLE-v0.
To learn how to use this model and train your own, see Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
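
The `mean_reward` metric in this card's metadata is written as `76.70 +/- 65.02`, which is presumably the mean and standard deviation of the total reward over the evaluation episodes. A minimal sketch of that summary, assuming a population standard deviation and using made-up returns (the function name `summarize_returns` and the example values are hypothetical, not taken from this model's evaluation):

```python
from statistics import mean, pstdev


def summarize_returns(episode_returns):
    """Return (mean, population std) of per-episode total rewards."""
    return mean(episode_returns), pstdev(episode_returns)


# Hypothetical returns from 4 evaluation episodes (not this model's real data).
example_returns = [10.0, 20.0, 30.0, 40.0]
mean_reward, std_reward = summarize_returns(example_returns)
print(f"{mean_reward:.2f} +/- {std_reward:.2f}")  # prints "25.00 +/- 11.18"
```

A standard deviation close to the mean, as in the reported value, suggests that returns vary a lot from one episode to the next.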

## Training Time
Trained for 50,000 episodes (the `n_training_episodes` setting below), which took 4 hours and 20 minutes.

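## Hyperparameters
```python
# env_id, s_size and a_size are not defined in this snippet;
# they come from the training script.
pixelcopter_hyperparameters = {
    "h_size": 64,                   # hidden layer size of the policy network
    "n_training_episodes": 50000,
    "n_evaluation_episodes": 10,
    "max_t": 10000,                 # maximum steps per episode
    "gamma": 0.99,                  # discount factor
    "lr": 1e-4,                     # learning rate
    "env_id": env_id,
    "state_space": s_size,
    "action_space": a_size,
}
```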
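
To illustrate what `gamma` controls: Reinforce weights each action's log-probability by the discounted return that followed it. The sketch below shows only that return computation, in plain Python, with a hypothetical helper name (`discounted_returns`) and made-up rewards. It is not the training code behind this model.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one computed for the next step.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


# Hypothetical three-step episode with a reward of 1.0 at every step.
example_rewards = [1.0, 1.0, 1.0]
example_returns = discounted_returns(example_rewards, gamma=0.99)
print(example_returns)
```

With `gamma` at 0.99, rewards many steps ahead still contribute noticeably to the return of an early action, which matters in an environment where episodes can run for thousands of steps.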