eshwarprasadS committed on
Commit 0f47df2 · 1 Parent(s): 08fe545

Push PPO CNN Agent for CarRacing-v0

.gitattributes CHANGED
@@ -32,3 +32,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+replay.mp4 filter=lfs diff=lfs merge=lfs -text
PPO_CNN_For_CarRacing.zip ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ab25a8b82b56b5c4f3f1083508a76e0c0c2a8446f753fddaceb5ec67e750aa4f
+size 26587488
PPO_CNN_For_CarRacing/_stable_baselines3_version ADDED
@@ -0,0 +1 @@
+1.7.0
PPO_CNN_For_CarRacing/data ADDED
The diff for this file is too large to render. See raw diff
 
PPO_CNN_For_CarRacing/policy.optimizer.pth ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:89d1063feaf9930dee6a337e09cf603815cfe33de6d4bacef778cb122a7dd74a
+size 17415600
PPO_CNN_For_CarRacing/policy.pth ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:694f6685bc65c5de10f29439cf5db61fe41bf56ae7053e610e0ae82e7cb6dfce
+size 8709950
PPO_CNN_For_CarRacing/pytorch_variables.pth ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
+size 431
PPO_CNN_For_CarRacing/system_info.txt ADDED
@@ -0,0 +1,7 @@
+- OS: Linux-5.10.147+-x86_64-with-glibc2.29 # 1 SMP Sat Dec 10 16:00:40 UTC 2022
+- Python: 3.8.10
+- Stable-Baselines3: 1.7.0
+- PyTorch: 1.13.1+cu116
+- GPU Enabled: True
+- Numpy: 1.21.6
+- Gym: 0.21.0
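system_info.txt pins the exact versions used for training. The snippet below is a small sanity-check sketch, not part of this commit, for confirming that a local environment matches those versions before loading the checkpoint; the expected strings are copied from the file above.

```python
# Sanity-check sketch (not part of this commit): compare local package versions
# against the values recorded in system_info.txt above.
import gym
import numpy
import stable_baselines3
import torch

expected = {
    "Stable-Baselines3": (stable_baselines3.__version__, "1.7.0"),
    "PyTorch": (torch.__version__, "1.13.1+cu116"),
    "Numpy": (numpy.__version__, "1.21.6"),
    "Gym": (gym.__version__, "0.21.0"),
}
for name, (found, wanted) in expected.items():
    status = "OK" if found == wanted else "MISMATCH"
    print(f"{name}: found {found}, expected {wanted} -> {status}")

# system_info.txt reports "GPU Enabled: True"; this checks the same thing locally.
print("CUDA available:", torch.cuda.is_available())
```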
README.md ADDED
@@ -0,0 +1,37 @@
+---
+library_name: stable-baselines3
+tags:
+- CarRacing-v0
+- deep-reinforcement-learning
+- reinforcement-learning
+- stable-baselines3
+model-index:
+- name: PPO
+  results:
+  - task:
+      type: reinforcement-learning
+      name: reinforcement-learning
+    dataset:
+      name: CarRacing-v0
+      type: CarRacing-v0
+    metrics:
+    - type: mean_reward
+      value: 153.28 +/- 117.71
+      name: mean_reward
+      verified: false
+---
+
+# **PPO** Agent playing **CarRacing-v0**
+This is a trained model of a **PPO** agent playing **CarRacing-v0**
+using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+
+## Usage (with Stable-baselines3)
+TODO: Add your code
+
+
+```python
+from stable_baselines3 import ...
+from huggingface_sb3 import load_from_hub
+
+...
+```
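The README added above leaves the usage snippet as a TODO. Below is a minimal loading sketch, not taken from this repository: the repo_id passed to load_from_hub is an assumption and would need to match the actual Hub repository, while load_from_hub, PPO.load, and the rollout loop follow the huggingface_sb3 / stable-baselines3 1.7.0 and gym 0.21 APIs recorded in system_info.txt.

```python
# Minimal usage sketch, assuming huggingface_sb3, stable-baselines3 1.7.0,
# and gym 0.21 (with Box2D) are installed. The repo_id below is hypothetical.
import gym
from huggingface_sb3 import load_from_hub
from stable_baselines3 import PPO

checkpoint = load_from_hub(
    repo_id="eshwarprasadS/PPO_CNN_For_CarRacing",  # hypothetical repo id, adjust to the real repository
    filename="PPO_CNN_For_CarRacing.zip",
)
model = PPO.load(checkpoint)

# Roll out a single episode with the deterministic policy (gym 0.21 step API).
env = gym.make("CarRacing-v0")
obs = env.reset()
done = False
episode_return = 0.0
while not done:
    action, _states = model.predict(obs, deterministic=True)
    obs, reward, done, info = env.step(action)
    episode_return += reward
print(f"Episode return: {episode_return:.2f}")
```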
config.json ADDED
The diff for this file is too large to render. See raw diff
 
replay.mp4 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:067b264cd9c1a1373506fd67483def3f0550b1361928b2ef3e561ee28b55f6d6
+size 1084773
results.json ADDED
@@ -0,0 +1 @@
+{"mean_reward": 153.27774166315794, "std_reward": 117.7100965950417, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-02-06T11:07:48.947009"}