ThomasSimonini/ppo-QbertNoFrameskip-v4

ThomasSimonini (HF staff) committed on Mar 1, 2022

Commit bc370b4 • 1 Parent(s): 0f84bf8

Update README.md

Files changed (1)

README.md +11 -0

@@ -3,6 +3,17 @@ tags:
 - deep-reinforcement-learning
 - reinforcement-learning
 - stable-baselines3
+model-index:
+- name: PPO Agent
+  results:
+  - task:
+      type: reinforcement-learning  # Required. Example: automatic-speech-recognition
+    dataset:
+      type: QbertNoFrameskip-v4  # Required. Example: common_voice. Use dataset id from https://hf.co/datasets
+      name: QbertNoFrameskip-v4  # Required. Example: Common Voice zh-CN
+    metrics:
+      - type: mean_reward    # Required. Example: wer
+        value: 15685.00 +/- 115.217  # Required. Example: 20.90
 ---
 # PPO Agent playing QbertNoFrameskip-v4
 This is a trained model of a **PPO agent playing QbertNoFrameskip-v4 using the [stable-baselines3 library](https://stable-baselines3.readthedocs.io/en/master/index.html)**.
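
The `mean_reward` metric added in this commit is stored as a single `mean +/- std` string rather than as two numbers. As a minimal sketch, assuming a consumer wants the two parts as floats, the value could be split like this. The helper name `parse_mean_reward` is hypothetical and is not part of stable-baselines3 or the Hugging Face Hub API.

```python
import re
from typing import Tuple


def parse_mean_reward(value: str) -> Tuple[float, float]:
    """Split a 'mean +/- std' string into (mean, std).

    Hypothetical helper for illustration only; it assumes the exact
    'number +/- number' layout used in the metadata above.
    """
    pattern = r"\s*([-+]?\d+(?:\.\d+)?)\s*\+/-\s*(\d+(?:\.\d+)?)\s*"
    match = re.fullmatch(pattern, value)
    if match is None:
        raise ValueError(f"unexpected mean_reward format: {value!r}")
    return float(match.group(1)), float(match.group(2))


# The value string from the model-index block in this commit.
mean, std = parse_mean_reward("15685.00 +/- 115.217")
print(mean, std)
```

A string in any other layout raises `ValueError` instead of being guessed at, which keeps malformed metadata from turning into a silently wrong score.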