daenielkim-66
committed on
Commit
6a46663
1
Parent(s):
d57e89d
Update README.md
README.md
CHANGED
@@ -26,7 +26,7 @@ model-index:
 This is a trained model of a **A2C** agent playing **PandaReachDense-v3**
 using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3). It controls a robotic arm to pick up balls.
 
-### What I did:
+### What I did to improve the model:
 Manually tuned hyperparameters by adding "learning_rate=0.0007, n_steps=5, gamma=0.99, gae_lambda=0.95" to the A2C model.
 ```
 model = A2C(policy = "MultiInputPolicy",