Farbum/REINFORCE_Pixelcopter



	---
	tags:
	- Pixelcopter-PLE-v0
	- reinforce
	- deep-reinforcement-learning
	- reinforcement-learning
	- custom-implementation
	- deep-rl-course
	model-index:
	- name: REINFORCE_Pixelcopter
	  results:
	  - task:
	      type: reinforcement-learning
	      name: reinforcement-learning
	    dataset:
	      name: Pixelcopter-PLE-v0
	      type: Pixelcopter-PLE-v0
	    metrics:
	    - type: mean_reward
	      value: 49.30 +/- 37.85
	      name: mean_reward
	      verified: false
	---

	# REINFORCE Agent Playing Pixelcopter-PLE-v0

	This is a trained model of a REINFORCE agent playing Pixelcopter-PLE-v0.
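
The `mean_reward` value in the metadata is written as a mean followed by a spread. The sketch below shows one way such a string could be produced from per-episode total rewards. It assumes the spread is a population standard deviation; the card does not say how the figure was computed. The function name `summarize_rewards` and the sample rewards are hypothetical and are not results from this model.

```python
import statistics


def summarize_rewards(episode_rewards):
    """Format per-episode total rewards as 'mean +/- std'.

    Assumption: the spread is the population standard deviation.
    The model card does not state which estimator was used.
    """
    mean = statistics.fmean(episode_rewards)
    std = statistics.pstdev(episode_rewards)
    return f"{mean:.2f} +/- {std:.2f}"


# Hypothetical episode rewards, for illustration only.
example_rewards = [10.0, 20.0, 30.0]
print(summarize_rewards(example_rewards))
```

A spread that is large relative to the mean, as in the reported 49.30 +/- 37.85, indicates that individual episode scores vary widely around the average.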

	# Hyperparameters
	- hp_seed: 1
	- hp_torch_deterministic: True
	- hp_nb_frames: 8
	- hp_total_timesteps: 1005000
	- hp_learning_t: 500
	- hp_num_envs: 12
	- hp_learning_rate: 0.0001
	- hp_gamma: 0.99
	- hp_buffer_size: 10000
	- hp_batch_size: 32
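
To illustrate what `hp_gamma` plausibly controls, here is a minimal sketch of the discounted return that REINFORCE-style methods compute for each step of an episode. It assumes `hp_gamma` is the discount factor; the author's training script is not part of this card, so the function `discounted_returns` is a hypothetical helper and not the actual implementation.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode.

    `gamma` defaults to the card's hp_gamma value, under the assumption
    that hp_gamma is the discount factor.
    """
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


# Hypothetical three-step episode with a reward of 1 at every step.
print(discounted_returns([1.0, 1.0, 1.0]))
```

In a REINFORCE update these returns typically weight the log-probabilities of the actions taken, so a `gamma` close to 1 makes rewards from late in the episode count almost as much as immediate ones.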