Upload PPO CarRacing-v0 trained agent

Files changed (10) hide show

README.md ADDED Viewed

+---
+library_name: stable-baselines3
+tags:
+- CarRacing-v0
+- deep-reinforcement-learning
+- reinforcement-learning
+- stable-baselines3
+model-index:
+- name: PPO
+  results:
+  - task:
+      type: reinforcement-learning
+      name: reinforcement-learning
+    dataset:
+      name: CarRacing-v0
+      type: CarRacing-v0
+    metrics:
+    - type: mean_reward
+      value: -63.85 +/- 2.86
+      name: mean_reward
+      verified: false
+---
+# **PPO** Agent playing **CarRacing-v0**
+This is a trained model of a **PPO** agent playing **CarRacing-v0**
+using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+## Usage (with Stable-baselines3)
+TODO: Add your code
+```python
+from stable_baselines3 import ...
+from huggingface_sb3 import load_from_hub
+...
+```

config.json ADDED Viewed

The diff for this file is too large to render. See raw diff

ppo-CarRacing-v0.zip ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ee49967a87f59cf674b48cdf4c9fceeac241506f38050d05c8490285f9d21710
+size 43336998

ppo-CarRacing-v0/_stable_baselines3_version ADDED Viewed

	@@ -0,0 +1 @@


1	+ 1.8.0

ppo-CarRacing-v0/data ADDED Viewed

The diff for this file is too large to render. See raw diff

ppo-CarRacing-v0/policy.optimizer.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:52978444b846112a71f37722265b84807020b17be77c3222829c6e6483bee859
+size 28391152

ppo-CarRacing-v0/policy.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:4fafc6ac2874f61d9a0c4aad2831e12ef4ada8afd6fd67f2cdea345273772ad2
+size 14194942

ppo-CarRacing-v0/pytorch_variables.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
+size 431

ppo-CarRacing-v0/system_info.txt ADDED Viewed

+- OS: Windows-10-10.0.22621-SP0 10.0.22621
+- Python: 3.10.11
+- Stable-Baselines3: 1.8.0
+- PyTorch: 2.0.0+cpu
+- GPU Enabled: False
+- Numpy: 1.24.1
+- Gym: 0.21.0

results.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"mean_reward": -63.84656130671501, "std_reward": 2.860821205018601, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-05-01T23:15:22.841791"}