pietroluongo committed on
Commit
ebe655e
1 Parent(s): e00af70

Initial commit

README.md ADDED
@@ -0,0 +1,37 @@
+ ---
+ library_name: stable-baselines3
+ tags:
+ - udem1
+ - deep-reinforcement-learning
+ - reinforcement-learning
+ - stable-baselines3
+ model-index:
+ - name: PPO
+   results:
+   - task:
+       type: reinforcement-learning
+       name: reinforcement-learning
+     dataset:
+       name: udem1
+       type: udem1
+     metrics:
+     - type: mean_reward
+       value: -1081.15 +/- 86.12
+       name: mean_reward
+       verified: false
+ ---
+
+ # **PPO** Agent playing **udem1**
+ This is a trained model of a **PPO** agent playing **udem1**
+ using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+
+ ## Usage (with Stable-baselines3)
+ TODO: Add your code
+
+
+ ```python
+ from stable_baselines3 import ...
+ from huggingface_sb3 import load_from_hub
+
+ ...
+ ```
a2c-udem1.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5e66c18479993242b52f380bbdeffc73d22320556a417a58c937014e83fd8491
+ size 1680641732
a2c-udem1/_stable_baselines3_version ADDED
@@ -0,0 +1 @@
+ 2.1.0
a2c-udem1/data ADDED
The diff for this file is too large to render. See raw diff
 
a2c-udem1/policy.optimizer.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:56cc6b8566e83dd48b7aad8a264354cdfa6e182ae026b513162db4c62f0b53ae
+ size 1116319344
a2c-udem1/policy.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:624cebfa2773175b166fd4734c1ed5b7c5a058b2e1b0bf782e3e195b6bfa8fdf
+ size 558161726
a2c-udem1/pytorch_variables.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
+ size 431
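The `.zip` and `.pth` entries above are Git LFS pointer files rather than the binaries themselves: each holds three `key value` lines giving the spec version, the SHA-256 object ID, and the size in bytes. A minimal sketch of reading one, assuming exactly that three-line layout; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of Git LFS or any library used in this commit:

```python
# Pointer text copied from the a2c-udem1/pytorch_variables.pth entry above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
size 431
"""


def parse_lfs_pointer(text: str) -> dict:
    """Hypothetical helper: split each 'key value' line of an LFS pointer."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real object in bytes
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["hash_algo"], pointer["size"])
```

The same helper applies to the other pointers in this commit; only the digest and size differ.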
a2c-udem1/system_info.txt ADDED
@@ -0,0 +1,9 @@
+ - OS: Linux-5.15.90.1-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP Fri Jan 27 02:56:13 UTC 2023
+ - Python: 3.11.5
+ - Stable-Baselines3: 2.1.0
+ - PyTorch: 2.0.1+cu117
+ - GPU Enabled: True
+ - Numpy: 1.26.0
+ - Cloudpickle: 2.2.1
+ - Gymnasium: 0.29.1
+ - OpenAI Gym: 0.26.2
config.json ADDED
The diff for this file is too large to render. See raw diff
 
results.json ADDED
@@ -0,0 +1 @@
+ {"mean_reward": -1081.1514985211193, "std_reward": 86.1153949026763, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-11-03T01:18:22.251769"}
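The `mean_reward` value in the README metadata (`-1081.15 +/- 86.12`) appears to be the `mean_reward` and `std_reward` fields of `results.json` rounded to two decimals. A minimal sketch of that formatting, assuming two-decimal rounding is all that happens; `format_mean_reward` is a hypothetical helper, not a function from stable-baselines3 or huggingface_sb3:

```python
import json

# Contents of results.json as committed above.
RESULTS_JSON = (
    '{"mean_reward": -1081.1514985211193, "std_reward": 86.1153949026763, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2023-11-03T01:18:22.251769"}'
)


def format_mean_reward(results: dict) -> str:
    """Hypothetical helper: render 'mean +/- std' with two decimals."""
    return f"{results['mean_reward']:.2f} +/- {results['std_reward']:.2f}"


results = json.loads(RESULTS_JSON)
summary = format_mean_reward(results)
print(summary)
```

Under that assumption the printed string matches the `value` field in the model card, so the two files agree on the evaluation over 10 deterministic episodes.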