therealagni commited on
Commit
e013694
1 Parent(s): 6802cc4

Upload PPO MontezumaRevenge-v5 trained agent

Browse files
README.md ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: stable-baselines3
3
+ tags:
4
+ - ALE/MontezumaRevenge-v5
5
+ - deep-reinforcement-learning
6
+ - reinforcement-learning
7
+ - stable-baselines3
8
+ model-index:
9
+ - name: PPO
10
+ results:
11
+ - task:
12
+ type: reinforcement-learning
13
+ name: reinforcement-learning
14
+ dataset:
15
+ name: ALE/MontezumaRevenge-v5
16
+ type: ALE/MontezumaRevenge-v5
17
+ metrics:
18
+ - type: mean_reward
19
+ value: 0.00 +/- 0.00
20
+ name: mean_reward
21
+ verified: false
22
+ ---
23
+
24
+ # **PPO** Agent playing **ALE/MontezumaRevenge-v5**
25
+ This is a trained model of a **PPO** agent playing **ALE/MontezumaRevenge-v5**
26
+ using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
27
+
28
+ ## Usage (with Stable-baselines3)
29
+ TODO: Add your code
30
+
31
+
32
+ ```python
33
+ from stable_baselines3 import ...
34
+ from huggingface_sb3 import load_from_hub
35
+
36
+ ...
37
+ ```
config.json ADDED
The diff for this file is too large to render. See raw diff
 
ppo-MontezumaRevenge.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:65948c65eaffa29bf4338a99226b49092f600a95d63d1665d236f6c188094e38
3
+ size 157659793
ppo-MontezumaRevenge/_stable_baselines3_version ADDED
@@ -0,0 +1 @@
 
 
1
+ 1.6.2
ppo-MontezumaRevenge/data ADDED
The diff for this file is too large to render. See raw diff
 
ppo-MontezumaRevenge/policy.optimizer.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4cef216c8479e32c0c7d74a2120e1ceaf2788aaee81d3b7eca1046f5babdb1ed
3
+ size 103306233
ppo-MontezumaRevenge/policy.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:84e0ff63d1751e951d877e4c5c3a1d1dc4c6811a5a7d78ce5d93e7b6d04a8c04
3
+ size 51652353
ppo-MontezumaRevenge/pytorch_variables.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
3
+ size 431
ppo-MontezumaRevenge/system_info.txt ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ OS: Linux-5.10.133+-x86_64-with-glibc2.27 #1 SMP Fri Aug 26 08:44:51 UTC 2022
2
+ Python: 3.8.16
3
+ Stable-Baselines3: 1.6.2
4
+ PyTorch: 1.13.0+cu116
5
+ GPU Enabled: True
6
+ Numpy: 1.21.6
7
+ Gym: 0.21.0
replay.mp4 ADDED
Binary file (144 kB). View file
 
results.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"mean_reward": 0.0, "std_reward": 0.0, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2022-12-26T23:26:05.292103"}