therealagni
commited on
Commit
•
b616d57
1
Parent(s):
573fd74
Upload PPO MontezumaRevenge-v5 trained agent 1M timesteps, CNN 0.01 LR
Browse files- config.json +0 -0
- ppo-MontezumaRevenge.zip +2 -2
- ppo-MontezumaRevenge/data +0 -0
- ppo-MontezumaRevenge/policy.optimizer.pth +1 -1
- ppo-MontezumaRevenge/policy.pth +1 -1
- replay.mp4 +0 -0
- results.json +1 -1
config.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
ppo-MontezumaRevenge.zip
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9ec2fb8c4b5ad2795ec2e44c70b4efcbd4a02b7151b3fa0d673c4b2c13bc9fc1
|
3 |
+
size 142167374
|
ppo-MontezumaRevenge/data
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
ppo-MontezumaRevenge/policy.optimizer.pth
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 92973881
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8735a1a3877d4ae8afe5962b0918a97d418693a0ca4aebe057b87072c901eb87
|
3 |
size 92973881
|
ppo-MontezumaRevenge/policy.pth
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 46486273
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:77e53672489b6743077b89799c3475a363aeed0c6b1c3e1a5c4d5c27c72875d6
|
3 |
size 46486273
|
replay.mp4
CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
|
|
results.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"mean_reward": 0.0, "std_reward": 0.0, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2022-12-
|
|
|
1 |
+
{"mean_reward": 0.0, "std_reward": 0.0, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2022-12-27T01:08:06.608569"}
|