MattStammers commited on
Commit
2835925
1 Parent(s): 46b399c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -36,8 +36,9 @@ from huggingface_sb3 import load_from_hub
36
  ...
37
  ```
38
 
39
- Well he does ok but still gets stuck on the rocks. Here are my hyperparameters not that they did me any good:
40
 
 
41
  def linear_schedule(initial_value, final_value=0.00001):
42
  def func(progress_remaining):
43
  """Progress will decrease from 1 (beginning) to 0 (end)"""
@@ -61,5 +62,6 @@ model = SAC(
61
  policy_kwargs=dict(net_arch=[400, 300]),
62
  verbose=1
63
  )
 
64
 
65
  These are pretty well tuned but SAC leads to too much exploration and the agent is unable to exploit the required actions to complete the course. I suspect TD3 will be more successful so plan to turn back to that
 
36
  ...
37
  ```
38
 
39
+ Well he does ok but still gets stuck on the rocks. Here are my hyperparameters not that they did me much good 😂:
40
 
41
+ ```python
42
  def linear_schedule(initial_value, final_value=0.00001):
43
  def func(progress_remaining):
44
  """Progress will decrease from 1 (beginning) to 0 (end)"""
 
62
  policy_kwargs=dict(net_arch=[400, 300]),
63
  verbose=1
64
  )
65
+ ```
66
 
67
  These are pretty well tuned but SAC leads to too much exploration and the agent is unable to exploit the required actions to complete the course. I suspect TD3 will be more successful so plan to turn back to that