meln1k commited on
Commit
372c57d
1 Parent(s): 57e5eaf

a bit more training

Browse files
.gitattributes CHANGED
@@ -25,3 +25,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
25
  *.zip filter=lfs diff=lfs merge=lfs -text
26
  *.zstandard filter=lfs diff=lfs merge=lfs -text
27
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
25
  *.zip filter=lfs diff=lfs merge=lfs -text
26
  *.zstandard filter=lfs diff=lfs merge=lfs -text
27
  *tfevents* filter=lfs diff=lfs merge=lfs -text
28
+ *.mp4 filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: stable-baselines3
3
+ tags:
4
+ - CarRacing-v0
5
+ - deep-reinforcement-learning
6
+ - reinforcement-learning
7
+ - stable-baselines3
8
+ model-index:
9
+ - name: PPO
10
+ results:
11
+ - metrics:
12
+ - type: mean_reward
13
+ value: 800.67 +/- 46.00
14
+ name: mean_reward
15
+ task:
16
+ type: reinforcement-learning
17
+ name: reinforcement-learning
18
+ dataset:
19
+ name: CarRacing-v0
20
+ type: CarRacing-v0
21
+ ---
22
+
23
+ # **PPO** Agent playing **CarRacing-v0**
24
+ This is a trained model of a **PPO** agent playing **CarRacing-v0** using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
25
+
26
+ ## Usage (with Stable-baselines3)
27
+ TODO: Add your code
28
+
config.json ADDED
The diff for this file is too large to render. See raw diff
 
ppo-CarRacing-v0_v1.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:57c316e9da8bfa0644cd9c2f0e1b59fef355be532ccd5d6a77e536d99ba32926
3
+ size 26597790
ppo-CarRacing-v0_v1/_stable_baselines3_version ADDED
@@ -0,0 +1 @@
 
 
1
+ 1.5.0
ppo-CarRacing-v0_v1/data ADDED
The diff for this file is too large to render. See raw diff
 
ppo-CarRacing-v0_v1/policy.optimizer.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8de8baae8611aa6c0cf45023233991451afa58aa346dcd77c9d9d52d36d61fa7
3
+ size 17424599
ppo-CarRacing-v0_v1/policy.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9501574d790e9a80ced1b2636b86b7e3377de7e20f25475b993dcd05a993e57c
3
+ size 8713150
ppo-CarRacing-v0_v1/pytorch_variables.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
3
+ size 431
ppo-CarRacing-v0_v1/system_info.txt ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ OS: Linux-5.11.0-38-generic-x86_64-with-glibc2.31 #42~20.04.1-Ubuntu SMP Tue Sep 28 20:41:07 UTC 2021
2
+ Python: 3.9.12
3
+ Stable-Baselines3: 1.5.0
4
+ PyTorch: 1.11.0+cu102
5
+ GPU Enabled: True
6
+ Numpy: 1.22.3
7
+ Gym: 0.21.0
replay.mp4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:83215c8c825bfecb60e45ec04fed778c28d59676adfd13e2873453467d955131
3
+ size 873816
results.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"mean_reward": 800.6700507842004, "std_reward": 46.00085727339187, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-15T09:33:31.717609"}