Al020198zee commited on
Commit
3bded2a
1 Parent(s): 85a40ba

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -2
README.md CHANGED
@@ -25,8 +25,6 @@ This is a trained model of a **PPO** agent playing **AntBulletEnv-v0**
25
  using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
26
 
27
  ## Usage (with Stable-baselines3)
28
- TODO: Add your code
29
-
30
 
31
  ```python
32
  from stable_baselines3 import ...
@@ -34,3 +32,24 @@ from huggingface_sb3 import load_from_hub
34
 
35
  ...
36
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
  using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
26
 
27
  ## Usage (with Stable-baselines3)
 
 
28
 
29
  ```python
30
  from stable_baselines3 import ...
 
32
 
33
  ...
34
  ```
35
+ MODEL
36
+ model = PPO(policy = "MlpPolicy",
37
+ env = env,
38
+ batch_size = 256,
39
+ clip_range = 0.4,
40
+ ent_coef = 0.0,
41
+ gae_lambda = 0.92,
42
+ gamma = 0.99,
43
+ learning_rate = 3.0e-05,
44
+ max_grad_norm = 0.5,
45
+ n_epochs = 30,
46
+ n_steps = 512,
47
+ policy_kwargs = dict(log_std_init=-2, ortho_init=False, activation_fn=nn.ReLU, net_arch=[dict(pi=[256,
48
+ 256], vf=[256, 256])] ),
49
+ use_sde = True,
50
+ sde_sample_freq = 4,
51
+ vf_coef = 0.5,
52
+ tensorboard_log = "./tensorboard",
53
+ verbose=1)
54
+
55
+ model.learn(1_000_000)