MattBoraske commited on
Commit
60c6128
1 Parent(s): 3ddebdd

Upload PPO HumanoidStandup-v2 trained agent

Browse files
PPO-HumanoidStandup-v2.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7482dd2d1abe67e4420a98840128e8ab5b20bbe84c262d593fd3d295597cab9e
3
- size 809018
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8a24c1f85d54b319cc08b2e12b7ba2df08af5629b97bd83640cc22584e4fb443
3
+ size 809023
PPO-HumanoidStandup-v2/data CHANGED
The diff for this file is too large to render. See raw diff
 
PPO-HumanoidStandup-v2/policy.optimizer.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:10a4cc02f3ca36c28a7f4c0e009fb50ec4021abf89dad61a6a63983691ef5ecf
3
  size 472432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eacc071c5db5280a02641b08d8bc593d1cf76fd637029c4a64519e0e61fe553d
3
  size 472432
PPO-HumanoidStandup-v2/policy.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c580792332a97b94940ca21ecfbbb8e7e6e577d8c27bfcc7f5f7b0b0fcdad73f
3
  size 235518
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bc4b804bf8678329b10b85f862dde151f5030caef4ae69b04dd02efe2290a5e1
3
  size 235518
README.md CHANGED
@@ -16,7 +16,7 @@ model-index:
16
  type: HumanoidStandup-v2
17
  metrics:
18
  - type: mean_reward
19
- value: 113886.35 +/- 16081.21
20
  name: mean_reward
21
  verified: false
22
  ---
 
16
  type: HumanoidStandup-v2
17
  metrics:
18
  - type: mean_reward
19
+ value: 65822.31 +/- 10972.33
20
  name: mean_reward
21
  verified: false
22
  ---
config.json CHANGED
The diff for this file is too large to render. See raw diff
 
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
results.json CHANGED
@@ -1 +1 @@
1
- {"mean_reward": 113886.34566268921, "std_reward": 16081.211883029271, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-04-28T01:45:32.216881"}
 
1
+ {"mean_reward": 65822.30792961121, "std_reward": 10972.326574588238, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-04-28T02:58:34.726191"}