vwxyzjn commited on
Commit
ac1efd8
1 Parent(s): 732489d

pushing model

Browse files
README.md CHANGED
@@ -10,14 +10,14 @@ tags:
10
 
11
  This is a trained model of a PPO agent playing CartPole-v1.
12
  The model was trained by using [CleanRL](https://github.com/vwxyzjn/cleanrl) and the training code can be
13
- found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/ppo.py
14
 
15
 
16
  # Hyperparameters
17
  ```python
18
  {'anneal_lr': True,
19
  'batch_size': 512,
20
- 'capture_video': False,
21
  'clip_coef': 0.2,
22
  'clip_vloss': True,
23
  'cuda': False,
@@ -34,6 +34,7 @@ found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/ppo.py
34
  'num_envs': 4,
35
  'num_minibatches': 4,
36
  'num_steps': 128,
 
37
  'seed': 1,
38
  'target_kl': None,
39
  'torch_deterministic': True,
 
10
 
11
  This is a trained model of a PPO agent playing CartPole-v1.
12
  The model was trained by using [CleanRL](https://github.com/vwxyzjn/cleanrl) and the training code can be
13
+ found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/ppo.py).
14
 
15
 
16
  # Hyperparameters
17
  ```python
18
  {'anneal_lr': True,
19
  'batch_size': 512,
20
+ 'capture_video': True,
21
  'clip_coef': 0.2,
22
  'clip_vloss': True,
23
  'cuda': False,
 
34
  'num_envs': 4,
35
  'num_minibatches': 4,
36
  'num_steps': 128,
37
+ 'save_model': True,
38
  'seed': 1,
39
  'target_kl': None,
40
  'torch_deterministic': True,
events.out.tfevents.1665691635.pop-os.2080612.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a370497eb2dc9c84622df4308f39c0df5ef3326f6cf4259f997d3c32fe55b97b
3
+ size 3493