pushing model

Files changed (11)

README.md CHANGED

@@ -16,7 +16,7 @@ model-index:
       type: Hopper-v4
     metrics:
     - type: mean_reward
-      value: 3220.86 +/- 688.15
+      value: 861.03 +/- 23.36
       name: mean_reward
       verified: false
 ---

events.out.tfevents.1704452503.4090-171.104630.0 → events.out.tfevents.1705691776.3090-172.2535072.0 RENAMED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:718f46dc88f935f62ebb204ccc1f28fe00ff6439b13105de2fbd8df00136850d
-size 517443
+oid sha256:be7bbae0819e9f0bebc15e763d8fa883979f132b3d93f41dd41afbf913d2af27
+size 900823

ppo_fix_continuous_action.cleanrl_model CHANGED

Binary files a/ppo_fix_continuous_action.cleanrl_model and b/ppo_fix_continuous_action.cleanrl_model differ

ppo_fix_continuous_action.py CHANGED

@@ -198,7 +198,7 @@ class NormalizeReward(gym.core.Wrapper, gym.utils.RecordConstructorArgs):
         return obs, rews, terminateds, truncateds, infos
     def reset(self, **kwargs):
-        self.returns = np.zeros(self.num_envs)
+        # self.returns = np.zeros(self.num_envs)
         return self.env.reset(**kwargs)
     def normalize(self, rews):
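
Note on the change above: commenting out the assignment in `reset` means the wrapper's `self.returns` accumulator is no longer zeroed when the environment is reset. The sketch below is a rough illustration of that difference, not the code from `ppo_fix_continuous_action.py`. The `ReturnTracker` class, its `zero_on_reset` flag, and the discounted update rule in `step` are all assumptions made for the example, and plain Python lists stand in for the NumPy arrays used in the real file.

```python
class ReturnTracker:
    """Hypothetical stand-in for the running-return state of a reward-normalization wrapper."""

    def __init__(self, num_envs, gamma=0.99, zero_on_reset=True):
        self.num_envs = num_envs
        self.gamma = gamma
        # zero_on_reset=True mimics the removed line, False mimics the commented-out version.
        self.zero_on_reset = zero_on_reset
        self.returns = [0.0] * num_envs

    def step(self, rews, terminateds):
        # Assumed update rule: discount the previous return, drop it if the
        # episode terminated, then add the new reward.
        self.returns = [
            ret * self.gamma * (0.0 if done else 1.0) + rew
            for ret, rew, done in zip(self.returns, rews, terminateds)
        ]
        return list(self.returns)

    def reset(self):
        if self.zero_on_reset:
            self.returns = [0.0] * self.num_envs


old = ReturnTracker(num_envs=2, gamma=0.5, zero_on_reset=True)
new = ReturnTracker(num_envs=2, gamma=0.5, zero_on_reset=False)

# One step, then an explicit reset, for both variants.
for tracker in (old, new):
    tracker.step([1.0, 2.0], [False, False])
    tracker.reset()

after_old = old.step([1.0, 1.0], [False, False])
after_new = new.step([1.0, 1.0], [False, False])
print(after_old)  # accumulator restarted from zero
print(after_new)  # accumulator carried over the reset
```

Under these assumptions, the variant that zeroes on reset starts the next step from an empty accumulator, while the other variant keeps discounting the return gathered before the reset.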

replay.mp4 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a1cfd61c8162dc544a41ba35906fcb7c0326d62280eebf20fb7e48d6c18d164d
-size 1511663
+oid sha256:c42d53e5fcc007cfd1f8ffaac1a58ebd967421cc544857a5f8e7fc832943bdc4
+size 367610

videos/Hopper-v4__ppo_fix_continuous_action__4__1704452496-eval/rl-video-episode-0.mp4 DELETED

Binary file (807 kB)

videos/Hopper-v4__ppo_fix_continuous_action__4__1704452496-eval/rl-video-episode-1.mp4 DELETED

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:c04dc577bcd60edd71834a4908e587b63aa22dc438e67f05afd9fe411d86398d
-size 1347230

videos/Hopper-v4__ppo_fix_continuous_action__4__1704452496-eval/rl-video-episode-8.mp4 DELETED

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:a1cfd61c8162dc544a41ba35906fcb7c0326d62280eebf20fb7e48d6c18d164d
-size 1511663

videos/Hopper-v4__ppo_fix_continuous_action__4__1705691765-eval/rl-video-episode-0.mp4 ADDED

Binary file (352 kB)

videos/Hopper-v4__ppo_fix_continuous_action__4__1705691765-eval/rl-video-episode-1.mp4 ADDED

Binary file (373 kB)

videos/Hopper-v4__ppo_fix_continuous_action__4__1705691765-eval/rl-video-episode-8.mp4 ADDED

Binary file (368 kB)