cleanrl/ppo
vwxyzjn committed on Oct 13, 2022

Commit ac1efd8, 1 Parent(s): 732489d

pushing model

Files changed (2)

README.md CHANGED

@@ -10,14 +10,14 @@ tags:
 This is a trained model of a PPO agent playing CartPole-v1.
 The model was trained by using [CleanRL](https://github.com/vwxyzjn/cleanrl) and the training code can be
-found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/ppo.py
+found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/ppo.py).
 # Hyperparameters
 ```python
 {'anneal_lr': True,
  'batch_size': 512,
- 'capture_video': False,
+ 'capture_video': True,
  'clip_coef': 0.2,
  'clip_vloss': True,
  'cuda': False,
@@ -34,6 +34,7 @@ found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/ppo.py
  'num_envs': 4,
  'num_minibatches': 4,
  'num_steps': 128,
+ 'save_model': True,
  'seed': 1,
  'target_kl': None,
  'torch_deterministic': True,
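
Reviewer note: the hyperparameters in the README hunk above are not independent. In CleanRL's `ppo.py` the rollout batch size appears to be derived as `num_envs * num_steps`, and the minibatch size as `batch_size // num_minibatches`. The sketch below only redoes that arithmetic with the values shown in the diff. `minibatch_size` is not visible in the hunk and is computed here purely for illustration, and the `hyperparams` dict is a hand-copied subset, not the full README dump.

```python
# Values copied by hand from the README hyperparameter dump in the diff above.
hyperparams = {
    "num_envs": 4,
    "num_steps": 128,
    "num_minibatches": 4,
    "batch_size": 512,
}

# One rollout collects num_steps transitions from each of num_envs environments.
derived_batch_size = hyperparams["num_envs"] * hyperparams["num_steps"]

# Each update epoch splits the rollout into num_minibatches equal chunks.
# (Assumption: minibatch_size is derived this way; it is not shown in the hunk.)
minibatch_size = derived_batch_size // hyperparams["num_minibatches"]

print(derived_batch_size, minibatch_size)
```

The derived batch size matches the `'batch_size': 512` entry in the README, which is a quick consistency check on the dumped configuration.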

events.out.tfevents.1665691635.pop-os.2080612.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:a370497eb2dc9c84622df4308f39c0df5ef3326f6cf4259f997d3c32fe55b97b
+size 3493
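
Reviewer note: the three added lines are a Git LFS pointer, not the TensorBoard event file itself. The pointer records the spec version, the SHA-256 object ID of the real file, and its size in bytes. A minimal sketch of reading those fields follows. The helper name `parse_lfs_pointer` is hypothetical and not part of any Git LFS tooling, and the pointer text is copied from the diff with the leading `+` markers removed.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict.

    Hypothetical helper for illustration; not part of Git LFS itself.
    """
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is a key, one space, then the value.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid is written as '<hash-algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer content from the diff above, without the leading '+' diff markers.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:a370497eb2dc9c84622df4308f39c0df5ef3326f6cf4259f997d3c32fe55b97b
size 3493
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```

The `size` field says the actual event file is 3493 bytes; the pointer stored in the repository is only these three lines.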