ledmands
/

ALE-Pacman-v5

@@ -19,14 +19,15 @@ model-index:
       type: ALE/Pacman-v5
     metrics:
     - type: mean_reward
-      value: none
       name: mean_reward
       verified: false
 ---
 # Agent using DQN to play ALE/Pacman-v5
-## UPDATE 16 May 2024: Latest DQN model is version 2.8
 This is an agent that is trained using Stable Baselines3 as part of the capstone project for South Hills School in Spring 2024.
 The goal of this project is to gain familiarity with reinforcement learning concepts and tools, and to train an agent to score up into the 400-500 point range in Pacman.

       type: ALE/Pacman-v5
     metrics:
     - type: mean_reward
+      value: 455.60 +/- 40.10
       name: mean_reward
       verified: false
 ---
 # Agent using DQN to play ALE/Pacman-v5
+# Update 20 May 2024: Latest DQN model is version 2.8
+# NOTE: Video preview is version 2.8, best model playing for 10,000 steps. Evaluation metrics are self-reported based on 10 episodes of evaluation. Can be found in agents/dqn_v2-8/evals.txt
 This is an agent that is trained using Stable Baselines3 as part of the capstone project for South Hills School in Spring 2024.
 The goal of this project is to gain familiarity with reinforcement learning concepts and tools, and to train an agent to score up into the 400-500 point range in Pacman.