DQN model applied to the this discrete environments CartPole-v1
Model Description
The model was trained from the CleanRl library using the DQN algorithm
Intended Use & Limitation
The model is intended to be used for the following environments CartPole-v1 and understand the implication of Quantization on this type of model from a pretrained state## Training Procdure
Training Hyperparameters
The folloing hyperparameters were used during training:
- exp_name: functional_dqn
- seed: 0
- torch_deterministic: True
- cuda: False
- track: True
- wandb_project_name: cleanRL
- wandb_entity: compress_rl
- capture_video: False
- env_id: CartPole-v1
- total_timesteps: 500000
- learning_rate: 0.00025
- buffer_size: 10000
- gamma: 0.99
- target_network_frequency: 500
- batch_size: 128
- start_e: 1
- end_e: 0.05
- exploration_fraction: 0.5
- learning_starts: 10000
- train_frequency: 10
- optimizer: Adam
- wandb_project: cleanrl
Framework and version
Pytorch 1.12.1+cu102
gym 0.23.1
Weights and Biases 0.13.3
Hugging Face Hub 0.11.1