vwxyzjn committed on
Commit
d71cf7d
1 Parent(s): 1582d49

pushing model

README.md CHANGED
@@ -15,7 +15,7 @@ model-index:
  type: CartPole-v1
  metrics:
  - type: mean_reward
- value: 70.00 +/- 19.16
+ value: 65.80 +/- 13.11
  name: mean_reward
  verified: false
  ---
@@ -26,6 +26,11 @@ This is a trained model of a DQN agent playing CartPole-v1.
  The model was trained by using [CleanRL](https://github.com/vwxyzjn/cleanrl) and the training code can be
  found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/dqn.py).
 
+ ## Command to reproduce the training
+
+ ```bash
+ dqn.py --cuda False --save-model --upload-model --total-timesteps 500
+
 
  # Hyperparameters
  ```python
@@ -46,7 +51,7 @@ found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/dqn.py).
  'start_e': 1,
  'target_network_frequency': 500,
  'torch_deterministic': True,
- 'total_timesteps': 10000,
+ 'total_timesteps': 500,
  'track': False,
  'train_frequency': 10,
  'upload_model': True,
events.out.tfevents.1668612313.pop-os.1028360.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:93e3b97f1606139c2589cd8db6c6a49be58d467d90cd8a2f6d4bce7795290fde
+ size 1805
events.out.tfevents.1668612314.pop-os.1028360.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a81d131d629717b87468b6c4231deffe55099b4f6b20aa9e7766c73eda75e4cb
+ size 618
q_network.pth CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:37569e0ac54f61ecf5dde3387ec06184d48efe905501c80dd239668e91bf5884
- size 45783
+ oid sha256:574158aedd6795b4ae67e7e916a05b146481d57fbb5e8a78fad770e289fb43fa
+ size 45927
replay.mp4 ADDED
Binary file (10.8 kB).
 
videos/CartPole-v1__dqn__1__1668612313-eval/rl-video-episode-0.mp4 ADDED
Binary file (12.6 kB).
 
videos/CartPole-v1__dqn__1__1668612313-eval/rl-video-episode-1.mp4 ADDED
Binary file (8.84 kB).
 
videos/CartPole-v1__dqn__1__1668612313-eval/rl-video-episode-8.mp4 ADDED
Binary file (10.8 kB).
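
The `q_network.pth` and `events.out.tfevents.*` entries in this commit are Git LFS pointer files: each records a spec version, a SHA-256 `oid`, and a byte `size` rather than the binary itself. As a rough sketch, a downloaded file could be checked against its pointer along the following lines. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of CleanRL or Git LFS; only the pointer text for the new `q_network.pth` is taken from the diff above.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into a dict with 'version', 'oid', 'size'."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return {
        "version": fields["version"],
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and SHA-256 in `pointer`."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    sha = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in 1 MiB chunks so large files are not read into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            sha.update(chunk)
    return sha.hexdigest() == pointer["oid"]


# Pointer text for the new q_network.pth, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:574158aedd6795b4ae67e7e916a05b146481d57fbb5e8a78fad770e289fb43fa\n"
    "size 45927\n"
)
print(pointer["size"])
```

Calling `matches_pointer("q_network.pth", pointer)` on a local copy (the path is an assumption) would then report whether that copy is the file this commit points to.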