MattStammers
commited on
Commit
•
2835925
1
Parent(s):
46b399c
Update README.md
Browse files
README.md
CHANGED
@@ -36,8 +36,9 @@ from huggingface_sb3 import load_from_hub
|
|
36 |
...
|
37 |
```
|
38 |
|
39 |
-
Well he does ok but still gets stuck on the rocks. Here are my hyperparameters not that they did me
|
40 |
|
|
|
41 |
def linear_schedule(initial_value, final_value=0.00001):
|
42 |
def func(progress_remaining):
|
43 |
"""Progress will decrease from 1 (beginning) to 0 (end)"""
|
@@ -61,5 +62,6 @@ model = SAC(
|
|
61 |
policy_kwargs=dict(net_arch=[400, 300]),
|
62 |
verbose=1
|
63 |
)
|
|
|
64 |
|
65 |
These are pretty well tuned but SAC leads to too much exploration and the agent is unable to exploit the required actions to complete the course. I suspect TD3 will be more successful so plan to turn back to that
|
|
|
36 |
...
|
37 |
```
|
38 |
|
39 |
+
Well he does ok but still gets stuck on the rocks. Here are my hyperparameters not that they did me much good 😂:
|
40 |
|
41 |
+
```python
|
42 |
def linear_schedule(initial_value, final_value=0.00001):
|
43 |
def func(progress_remaining):
|
44 |
"""Progress will decrease from 1 (beginning) to 0 (end)"""
|
|
|
62 |
policy_kwargs=dict(net_arch=[400, 300]),
|
63 |
verbose=1
|
64 |
)
|
65 |
+
```
|
66 |
|
67 |
These are pretty well tuned but SAC leads to too much exploration and the agent is unable to exploit the required actions to complete the course. I suspect TD3 will be more successful so plan to turn back to that
|