[2023-03-08 12:52:39,709][434108] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/config.json...
[2023-03-08 12:52:39,724][434108] Rollout worker 0 uses device cpu
[2023-03-08 12:52:39,724][434108] Rollout worker 1 uses device cpu
[2023-03-08 12:52:39,724][434108] Rollout worker 2 uses device cpu
[2023-03-08 12:52:39,725][434108] Rollout worker 3 uses device cpu
[2023-03-08 12:52:39,725][434108] Rollout worker 4 uses device cpu
[2023-03-08 12:52:39,725][434108] Rollout worker 5 uses device cpu
[2023-03-08 12:52:39,725][434108] Rollout worker 6 uses device cpu
[2023-03-08 12:52:39,725][434108] Rollout worker 7 uses device cpu
[2023-03-08 12:52:39,725][434108] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1
[2023-03-08 12:52:39,736][434108] InferenceWorker_p0-w0: min num requests: 2
[2023-03-08 12:52:39,753][434108] Starting all processes...
[2023-03-08 12:52:39,753][434108] Starting process learner_proc0
[2023-03-08 12:52:39,803][434108] Starting all processes...
[2023-03-08 12:52:39,846][434108] Starting process inference_proc0-0
[2023-03-08 12:52:39,848][434108] Starting process rollout_proc0
[2023-03-08 12:52:39,848][434108] Starting process rollout_proc1
[2023-03-08 12:52:39,848][434108] Starting process rollout_proc2
[2023-03-08 12:52:39,848][434108] Starting process rollout_proc3
[2023-03-08 12:52:39,848][434108] Starting process rollout_proc4
[2023-03-08 12:52:39,848][434108] Starting process rollout_proc5
[2023-03-08 12:52:39,848][434108] Starting process rollout_proc6
[2023-03-08 12:52:39,848][434108] Starting process rollout_proc7
[2023-03-08 12:52:41,231][434347] Starting seed is not provided
[2023-03-08 12:52:41,232][434347] Initializing actor-critic model on device cpu
[2023-03-08 12:52:41,232][434347] RunningMeanStd input shape: (39,)
[2023-03-08 12:52:41,232][434347] RunningMeanStd input shape: (1,)
[2023-03-08 12:52:41,289][434347] Created Actor Critic model with architecture:
[2023-03-08 12:52:41,289][434347] ActorCriticSharedWeights(
  (obs_normalizer): ObservationNormalizer(
    (running_mean_std): RunningMeanStdDictInPlace(
      (running_mean_std): ModuleDict(
        (obs): RunningMeanStdInPlace()
      )
    )
  )
  (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace)
  (encoder): MultiInputEncoder(
    (encoders): ModuleDict(
      (obs): MlpEncoder(
        (mlp_head): RecursiveScriptModule(
          original_name=Sequential
          (0): RecursiveScriptModule(original_name=Linear)
          (1): RecursiveScriptModule(original_name=Tanh)
          (2): RecursiveScriptModule(original_name=Linear)
          (3): RecursiveScriptModule(original_name=Tanh)
        )
      )
    )
  )
  (core): ModelCoreIdentity()
  (decoder): MlpDecoder(
    (mlp): Identity()
  )
  (critic_linear): Linear(in_features=64, out_features=1, bias=True)
  (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev(
    (distribution_linear): Linear(in_features=64, out_features=4, bias=True)
  )
)
[2023-03-08 12:52:41,403][434397] Worker 6 uses CPU cores [24, 25, 26, 27]
[2023-03-08 12:52:41,527][434399] Worker 1 uses CPU cores [4, 5, 6, 7]
[2023-03-08 12:52:41,547][434402] Worker 2 uses CPU cores [8, 9, 10, 11]
[2023-03-08 12:52:41,591][434395] Worker 5 uses CPU cores [20, 21, 22, 23]
[2023-03-08 12:52:41,666][434347] Using optimizer
[2023-03-08 12:52:41,666][434347] No checkpoints found
[2023-03-08 12:52:41,667][434347] Did not load from checkpoint, starting from scratch!
[2023-03-08 12:52:41,667][434347] Initialized policy 0 weights for model version 0
[2023-03-08 12:52:41,667][434347] LearnerWorker_p0 finished initialization!
[2023-03-08 12:52:41,698][434398] Worker 0 uses CPU cores [0, 1, 2, 3]
[2023-03-08 12:52:41,797][434407] Worker 3 uses CPU cores [12, 13, 14, 15]
[2023-03-08 12:52:41,892][434108] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2023-03-08 12:52:41,921][434396] RunningMeanStd input shape: (39,)
[2023-03-08 12:52:41,922][434396] RunningMeanStd input shape: (1,)
[2023-03-08 12:52:41,977][434108] Inference worker 0-0 is ready!
[2023-03-08 12:52:41,978][434108] All inference workers are ready! Signal rollout workers to start!
[2023-03-08 12:52:41,980][434400] Worker 4 uses CPU cores [16, 17, 18, 19]
[2023-03-08 12:52:42,113][434401] Worker 7 uses CPU cores [28, 29, 30, 31]
[2023-03-08 12:52:46,893][434108] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2023-03-08 12:52:48,597][434402] Decorrelating experience for 0 frames...
[2023-03-08 12:52:48,607][434397] Decorrelating experience for 0 frames...
[2023-03-08 12:52:48,612][434395] Decorrelating experience for 0 frames...
[2023-03-08 12:52:48,612][434407] Decorrelating experience for 0 frames...
[2023-03-08 12:52:48,617][434402] Decorrelating experience for 64 frames...
[2023-03-08 12:52:48,620][434399] Decorrelating experience for 0 frames...
[2023-03-08 12:52:48,627][434397] Decorrelating experience for 64 frames... [2023-03-08 12:52:48,630][434398] Decorrelating experience for 0 frames... [2023-03-08 12:52:48,632][434407] Decorrelating experience for 64 frames... [2023-03-08 12:52:48,632][434395] Decorrelating experience for 64 frames... [2023-03-08 12:52:48,639][434399] Decorrelating experience for 64 frames... [2023-03-08 12:52:48,640][434400] Decorrelating experience for 0 frames... [2023-03-08 12:52:48,650][434398] Decorrelating experience for 64 frames... [2023-03-08 12:52:48,660][434400] Decorrelating experience for 64 frames... [2023-03-08 12:52:48,667][434402] Decorrelating experience for 128 frames... [2023-03-08 12:52:48,678][434397] Decorrelating experience for 128 frames... [2023-03-08 12:52:48,682][434407] Decorrelating experience for 128 frames... [2023-03-08 12:52:48,683][434395] Decorrelating experience for 128 frames... [2023-03-08 12:52:48,690][434399] Decorrelating experience for 128 frames... [2023-03-08 12:52:48,700][434398] Decorrelating experience for 128 frames... [2023-03-08 12:52:48,710][434400] Decorrelating experience for 128 frames... [2023-03-08 12:52:48,742][434401] Decorrelating experience for 0 frames... [2023-03-08 12:52:48,748][434402] Decorrelating experience for 192 frames... [2023-03-08 12:52:48,758][434397] Decorrelating experience for 192 frames... [2023-03-08 12:52:48,761][434401] Decorrelating experience for 64 frames... [2023-03-08 12:52:48,762][434395] Decorrelating experience for 192 frames... [2023-03-08 12:52:48,763][434407] Decorrelating experience for 192 frames... [2023-03-08 12:52:48,771][434399] Decorrelating experience for 192 frames... [2023-03-08 12:52:48,780][434398] Decorrelating experience for 192 frames... [2023-03-08 12:52:48,790][434400] Decorrelating experience for 192 frames... [2023-03-08 12:52:48,812][434401] Decorrelating experience for 128 frames... [2023-03-08 12:52:48,892][434401] Decorrelating experience for 192 frames... 
[2023-03-08 12:52:51,893][434108] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-08 12:52:51,894][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000000000_0.pth... [2023-03-08 12:52:55,332][434402] Decorrelating experience for 256 frames... [2023-03-08 12:52:55,345][434397] Decorrelating experience for 256 frames... [2023-03-08 12:52:55,359][434407] Decorrelating experience for 256 frames... [2023-03-08 12:52:55,363][434395] Decorrelating experience for 256 frames... [2023-03-08 12:52:55,375][434399] Decorrelating experience for 256 frames... [2023-03-08 12:52:55,390][434400] Decorrelating experience for 256 frames... [2023-03-08 12:52:55,393][434398] Decorrelating experience for 256 frames... [2023-03-08 12:52:55,472][434402] Decorrelating experience for 320 frames... [2023-03-08 12:52:55,483][434397] Decorrelating experience for 320 frames... [2023-03-08 12:52:55,484][434401] Decorrelating experience for 256 frames... [2023-03-08 12:52:55,500][434407] Decorrelating experience for 320 frames... [2023-03-08 12:52:55,503][434395] Decorrelating experience for 320 frames... [2023-03-08 12:52:55,516][434399] Decorrelating experience for 320 frames... [2023-03-08 12:52:55,531][434398] Decorrelating experience for 320 frames... [2023-03-08 12:52:55,532][434400] Decorrelating experience for 320 frames... [2023-03-08 12:52:55,624][434401] Decorrelating experience for 320 frames... [2023-03-08 12:52:55,643][434402] Decorrelating experience for 384 frames... [2023-03-08 12:52:55,651][434397] Decorrelating experience for 384 frames... [2023-03-08 12:52:55,672][434395] Decorrelating experience for 384 frames... [2023-03-08 12:52:55,675][434407] Decorrelating experience for 384 frames... [2023-03-08 12:52:55,686][434399] Decorrelating experience for 384 frames... 
[2023-03-08 12:52:55,708][434400] Decorrelating experience for 384 frames... [2023-03-08 12:52:55,708][434398] Decorrelating experience for 384 frames... [2023-03-08 12:52:55,797][434401] Decorrelating experience for 384 frames... [2023-03-08 12:52:55,843][434402] Decorrelating experience for 448 frames... [2023-03-08 12:52:55,850][434397] Decorrelating experience for 448 frames... [2023-03-08 12:52:55,872][434395] Decorrelating experience for 448 frames... [2023-03-08 12:52:55,874][434407] Decorrelating experience for 448 frames... [2023-03-08 12:52:55,884][434399] Decorrelating experience for 448 frames... [2023-03-08 12:52:55,915][434398] Decorrelating experience for 448 frames... [2023-03-08 12:52:55,922][434400] Decorrelating experience for 448 frames... [2023-03-08 12:52:55,997][434401] Decorrelating experience for 448 frames... [2023-03-08 12:52:56,892][434108] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-08 12:52:56,893][434108] Avg episode reward: [(0, '49.762')] [2023-03-08 12:52:59,732][434108] Heartbeat connected on Batcher_0 [2023-03-08 12:52:59,734][434108] Heartbeat connected on LearnerWorker_p0 [2023-03-08 12:52:59,740][434108] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-08 12:52:59,743][434108] Heartbeat connected on RolloutWorker_w0 [2023-03-08 12:52:59,743][434108] Heartbeat connected on RolloutWorker_w1 [2023-03-08 12:52:59,747][434108] Heartbeat connected on RolloutWorker_w2 [2023-03-08 12:52:59,749][434108] Heartbeat connected on RolloutWorker_w3 [2023-03-08 12:52:59,750][434108] Heartbeat connected on RolloutWorker_w4 [2023-03-08 12:52:59,752][434108] Heartbeat connected on RolloutWorker_w5 [2023-03-08 12:52:59,753][434108] Heartbeat connected on RolloutWorker_w6 [2023-03-08 12:52:59,755][434108] Heartbeat connected on RolloutWorker_w7 [2023-03-08 12:53:00,652][434396] Updated weights for policy 0, policy_version 80 
(0.0005) [2023-03-08 12:53:01,892][434108] Fps is (10 sec: 5324.9, 60 sec: 2662.4, 300 sec: 2662.4). Total num frames: 53248. Throughput: 0: 1702.4. Samples: 34048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:01,893][434108] Avg episode reward: [(0, '340.839')] [2023-03-08 12:53:04,528][434396] Updated weights for policy 0, policy_version 160 (0.0005) [2023-03-08 12:53:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 4096.0, 300 sec: 4096.0). Total num frames: 102400. Throughput: 0: 3891.0. Samples: 97276. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 12:53:06,893][434108] Avg episode reward: [(0, '445.457')] [2023-03-08 12:53:06,954][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000000208_106496.pth... [2023-03-08 12:53:06,956][434347] Saving new best policy, reward=445.457! [2023-03-08 12:53:08,485][434396] Updated weights for policy 0, policy_version 240 (0.0005) [2023-03-08 12:53:11,892][434108] Fps is (10 sec: 10649.5, 60 sec: 5324.8, 300 sec: 5324.8). Total num frames: 159744. Throughput: 0: 4275.6. Samples: 128268. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 12:53:11,893][434108] Avg episode reward: [(0, '467.200')] [2023-03-08 12:53:11,894][434347] Saving new best policy, reward=467.200! [2023-03-08 12:53:12,209][434396] Updated weights for policy 0, policy_version 320 (0.0005) [2023-03-08 12:53:16,005][434396] Updated weights for policy 0, policy_version 400 (0.0005) [2023-03-08 12:53:16,892][434108] Fps is (10 sec: 11059.1, 60 sec: 6085.5, 300 sec: 6085.5). Total num frames: 212992. Throughput: 0: 5558.6. Samples: 194552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:16,893][434108] Avg episode reward: [(0, '461.062')] [2023-03-08 12:53:19,901][434396] Updated weights for policy 0, policy_version 480 (0.0005) [2023-03-08 12:53:21,892][434108] Fps is (10 sec: 10240.1, 60 sec: 6553.6, 300 sec: 6553.6). Total num frames: 262144. 
Throughput: 0: 6430.5. Samples: 257220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:21,893][434108] Avg episode reward: [(0, '470.000')] [2023-03-08 12:53:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000000512_262144.pth... [2023-03-08 12:53:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000000000_0.pth [2023-03-08 12:53:21,897][434347] Saving new best policy, reward=470.000! [2023-03-08 12:53:24,128][434396] Updated weights for policy 0, policy_version 560 (0.0004) [2023-03-08 12:53:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 6917.7, 300 sec: 6917.7). Total num frames: 311296. Throughput: 0: 6345.7. Samples: 285556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:26,893][434108] Avg episode reward: [(0, '477.598')] [2023-03-08 12:53:26,893][434347] Saving new best policy, reward=477.598! [2023-03-08 12:53:28,213][434396] Updated weights for policy 0, policy_version 640 (0.0005) [2023-03-08 12:53:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 7290.9, 300 sec: 7290.9). Total num frames: 364544. Throughput: 0: 7731.1. Samples: 347900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:31,893][434108] Avg episode reward: [(0, '466.877')] [2023-03-08 12:53:31,955][434396] Updated weights for policy 0, policy_version 720 (0.0005) [2023-03-08 12:53:36,027][434396] Updated weights for policy 0, policy_version 800 (0.0005) [2023-03-08 12:53:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 7596.2, 300 sec: 7596.2). Total num frames: 417792. Throughput: 0: 9102.4. Samples: 409608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:36,893][434108] Avg episode reward: [(0, '469.307')] [2023-03-08 12:53:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000000816_417792.pth... 
[2023-03-08 12:53:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000000208_106496.pth [2023-03-08 12:53:40,262][434396] Updated weights for policy 0, policy_version 880 (0.0005) [2023-03-08 12:53:41,892][434108] Fps is (10 sec: 9830.4, 60 sec: 7714.1, 300 sec: 7714.1). Total num frames: 462848. Throughput: 0: 9745.1. Samples: 438528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:41,893][434108] Avg episode reward: [(0, '478.544')] [2023-03-08 12:53:41,954][434347] Saving new best policy, reward=478.544! [2023-03-08 12:53:44,619][434396] Updated weights for policy 0, policy_version 960 (0.0005) [2023-03-08 12:53:46,892][434108] Fps is (10 sec: 9420.8, 60 sec: 8533.4, 300 sec: 7876.9). Total num frames: 512000. Throughput: 0: 10257.4. Samples: 495632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:46,893][434108] Avg episode reward: [(0, '472.494')] [2023-03-08 12:53:48,791][434396] Updated weights for policy 0, policy_version 1040 (0.0005) [2023-03-08 12:53:51,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9420.8, 300 sec: 8075.0). Total num frames: 565248. Throughput: 0: 10210.5. Samples: 556748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:51,893][434108] Avg episode reward: [(0, '486.445')] [2023-03-08 12:53:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000001104_565248.pth... [2023-03-08 12:53:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000000512_262144.pth [2023-03-08 12:53:51,899][434347] Saving new best policy, reward=486.445! [2023-03-08 12:53:52,628][434396] Updated weights for policy 0, policy_version 1120 (0.0005) [2023-03-08 12:53:56,396][434396] Updated weights for policy 0, policy_version 1200 (0.0005) [2023-03-08 12:53:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 8246.6). 
Total num frames: 618496. Throughput: 0: 10237.7. Samples: 588964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:53:56,893][434108] Avg episode reward: [(0, '485.469')] [2023-03-08 12:54:00,171][434396] Updated weights for policy 0, policy_version 1280 (0.0005) [2023-03-08 12:54:01,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 8396.8). Total num frames: 671744. Throughput: 0: 10202.0. Samples: 653640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:54:01,893][434108] Avg episode reward: [(0, '466.644')] [2023-03-08 12:54:03,959][434396] Updated weights for policy 0, policy_version 1360 (0.0005) [2023-03-08 12:54:06,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 8529.3). Total num frames: 724992. Throughput: 0: 10264.5. Samples: 719124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:54:06,893][434108] Avg episode reward: [(0, '493.825')] [2023-03-08 12:54:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000001416_724992.pth... [2023-03-08 12:54:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000000816_417792.pth [2023-03-08 12:54:06,899][434347] Saving new best policy, reward=493.825! [2023-03-08 12:54:07,763][434396] Updated weights for policy 0, policy_version 1440 (0.0005) [2023-03-08 12:54:11,545][434396] Updated weights for policy 0, policy_version 1520 (0.0005) [2023-03-08 12:54:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 8647.1). Total num frames: 778240. Throughput: 0: 10342.3. Samples: 750960. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 12:54:11,893][434108] Avg episode reward: [(0, '461.099')] [2023-03-08 12:54:15,364][434396] Updated weights for policy 0, policy_version 1600 (0.0005) [2023-03-08 12:54:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 8752.5). Total num frames: 831488. 
Throughput: 0: 10384.0. Samples: 815180. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 12:54:16,893][434108] Avg episode reward: [(0, '490.717')] [2023-03-08 12:54:19,278][434396] Updated weights for policy 0, policy_version 1680 (0.0005) [2023-03-08 12:54:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 8888.3). Total num frames: 888832. Throughput: 0: 10445.5. Samples: 879656. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 12:54:21,893][434108] Avg episode reward: [(0, '445.907')] [2023-03-08 12:54:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000001736_888832.pth... [2023-03-08 12:54:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000001104_565248.pth [2023-03-08 12:54:23,047][434396] Updated weights for policy 0, policy_version 1760 (0.0004) [2023-03-08 12:54:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 8933.2). Total num frames: 937984. Throughput: 0: 10520.5. Samples: 911952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:54:26,894][434396] Updated weights for policy 0, policy_version 1840 (0.0005) [2023-03-08 12:54:26,903][434108] Avg episode reward: [(0, '493.869')] [2023-03-08 12:54:26,904][434347] Saving new best policy, reward=493.869! [2023-03-08 12:54:30,703][434396] Updated weights for policy 0, policy_version 1920 (0.0005) [2023-03-08 12:54:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 9048.4). Total num frames: 995328. Throughput: 0: 10667.5. Samples: 975668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:54:31,903][434108] Avg episode reward: [(0, '483.584')] [2023-03-08 12:54:34,503][434396] Updated weights for policy 0, policy_version 2000 (0.0005) [2023-03-08 12:54:36,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10513.1, 300 sec: 9118.1). Total num frames: 1048576. Throughput: 0: 10747.7. Samples: 1040396. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:54:36,904][434108] Avg episode reward: [(0, '478.191')] [2023-03-08 12:54:36,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000002048_1048576.pth... [2023-03-08 12:54:36,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000001416_724992.pth [2023-03-08 12:54:38,318][434396] Updated weights for policy 0, policy_version 2080 (0.0004) [2023-03-08 12:54:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 9181.9). Total num frames: 1101824. Throughput: 0: 10759.7. Samples: 1073152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:54:41,903][434108] Avg episode reward: [(0, '468.614')] [2023-03-08 12:54:42,014][434396] Updated weights for policy 0, policy_version 2160 (0.0005) [2023-03-08 12:54:45,804][434396] Updated weights for policy 0, policy_version 2240 (0.0005) [2023-03-08 12:54:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 9240.6). Total num frames: 1155072. Throughput: 0: 10780.3. Samples: 1138752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:54:46,906][434108] Avg episode reward: [(0, '451.458')] [2023-03-08 12:54:49,698][434396] Updated weights for policy 0, policy_version 2320 (0.0005) [2023-03-08 12:54:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 9294.8). Total num frames: 1208320. Throughput: 0: 10723.8. Samples: 1201696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:54:51,893][434108] Avg episode reward: [(0, '432.707')] [2023-03-08 12:54:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000002360_1208320.pth... 
[2023-03-08 12:54:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000001736_888832.pth [2023-03-08 12:54:53,500][434396] Updated weights for policy 0, policy_version 2400 (0.0005) [2023-03-08 12:54:56,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.8, 300 sec: 9344.9). Total num frames: 1261568. Throughput: 0: 10740.4. Samples: 1234280. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 12:54:56,893][434108] Avg episode reward: [(0, '450.544')] [2023-03-08 12:54:57,334][434396] Updated weights for policy 0, policy_version 2480 (0.0005) [2023-03-08 12:55:01,146][434396] Updated weights for policy 0, policy_version 2560 (0.0005) [2023-03-08 12:55:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 9391.5). Total num frames: 1314816. Throughput: 0: 10740.5. Samples: 1298504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:55:01,893][434108] Avg episode reward: [(0, '465.511')] [2023-03-08 12:55:05,038][434396] Updated weights for policy 0, policy_version 2640 (0.0005) [2023-03-08 12:55:06,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 9434.9). Total num frames: 1368064. Throughput: 0: 10720.0. Samples: 1362056. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 12:55:06,893][434108] Avg episode reward: [(0, '458.882')] [2023-03-08 12:55:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000002672_1368064.pth... [2023-03-08 12:55:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000002048_1048576.pth [2023-03-08 12:55:09,029][434396] Updated weights for policy 0, policy_version 2720 (0.0004) [2023-03-08 12:55:11,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 9475.4). Total num frames: 1421312. Throughput: 0: 10682.2. Samples: 1392648. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 12:55:11,893][434108] Avg episode reward: [(0, '463.453')] [2023-03-08 12:55:12,997][434396] Updated weights for policy 0, policy_version 2800 (0.0005) [2023-03-08 12:55:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 9486.9). Total num frames: 1470464. Throughput: 0: 10631.4. Samples: 1454080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:55:16,893][434108] Avg episode reward: [(0, '466.510')] [2023-03-08 12:55:17,011][434396] Updated weights for policy 0, policy_version 2880 (0.0005) [2023-03-08 12:55:20,969][434396] Updated weights for policy 0, policy_version 2960 (0.0005) [2023-03-08 12:55:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 9523.2). Total num frames: 1523712. Throughput: 0: 10561.9. Samples: 1515680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:55:21,893][434108] Avg episode reward: [(0, '480.465')] [2023-03-08 12:55:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000002976_1523712.pth... [2023-03-08 12:55:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000002360_1208320.pth [2023-03-08 12:55:24,758][434396] Updated weights for policy 0, policy_version 3040 (0.0004) [2023-03-08 12:55:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 9557.3). Total num frames: 1576960. Throughput: 0: 10560.2. Samples: 1548360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:55:26,893][434108] Avg episode reward: [(0, '490.370')] [2023-03-08 12:55:28,538][434396] Updated weights for policy 0, policy_version 3120 (0.0005) [2023-03-08 12:55:31,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 9589.5). Total num frames: 1630208. Throughput: 0: 10557.1. Samples: 1613824. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:55:31,893][434108] Avg episode reward: [(0, '474.193')] [2023-03-08 12:55:32,330][434396] Updated weights for policy 0, policy_version 3200 (0.0005) [2023-03-08 12:55:36,016][434396] Updated weights for policy 0, policy_version 3280 (0.0004) [2023-03-08 12:55:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 9643.2). Total num frames: 1687552. Throughput: 0: 10616.2. Samples: 1679424. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 12:55:36,893][434108] Avg episode reward: [(0, '477.162')] [2023-03-08 12:55:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000003296_1687552.pth... [2023-03-08 12:55:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000002672_1368064.pth [2023-03-08 12:55:39,770][434396] Updated weights for policy 0, policy_version 3360 (0.0005) [2023-03-08 12:55:41,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 9671.1). Total num frames: 1740800. Throughput: 0: 10620.3. Samples: 1712192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:55:41,893][434108] Avg episode reward: [(0, '470.931')] [2023-03-08 12:55:43,544][434396] Updated weights for policy 0, policy_version 3440 (0.0005) [2023-03-08 12:55:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 9697.6). Total num frames: 1794048. Throughput: 0: 10640.9. Samples: 1777344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:55:46,893][434108] Avg episode reward: [(0, '475.999')] [2023-03-08 12:55:47,371][434396] Updated weights for policy 0, policy_version 3520 (0.0005) [2023-03-08 12:55:51,094][434396] Updated weights for policy 0, policy_version 3600 (0.0005) [2023-03-08 12:55:51,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 9744.2). Total num frames: 1851392. Throughput: 0: 10687.9. Samples: 1843012. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 12:55:51,893][434108] Avg episode reward: [(0, '504.901')] [2023-03-08 12:55:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000003616_1851392.pth... [2023-03-08 12:55:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000002976_1523712.pth [2023-03-08 12:55:51,899][434347] Saving new best policy, reward=504.901! [2023-03-08 12:55:54,877][434396] Updated weights for policy 0, policy_version 3680 (0.0005) [2023-03-08 12:55:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 9767.4). Total num frames: 1904640. Throughput: 0: 10734.9. Samples: 1875720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:55:56,893][434108] Avg episode reward: [(0, '578.503')] [2023-03-08 12:55:56,893][434347] Saving new best policy, reward=578.503! [2023-03-08 12:55:58,630][434396] Updated weights for policy 0, policy_version 3760 (0.0004) [2023-03-08 12:56:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.2, 300 sec: 9809.9). Total num frames: 1961984. Throughput: 0: 10822.5. Samples: 1941092. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 12:56:01,893][434108] Avg episode reward: [(0, '585.936')] [2023-03-08 12:56:01,893][434347] Saving new best policy, reward=585.936! [2023-03-08 12:56:02,202][434396] Updated weights for policy 0, policy_version 3840 (0.0004) [2023-03-08 12:56:05,830][434396] Updated weights for policy 0, policy_version 3920 (0.0004) [2023-03-08 12:56:06,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 9830.4). Total num frames: 2015232. Throughput: 0: 10960.9. Samples: 2008920. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:56:06,893][434108] Avg episode reward: [(0, '644.724')] [2023-03-08 12:56:06,925][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000003944_2019328.pth... [2023-03-08 12:56:06,927][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000003296_1687552.pth [2023-03-08 12:56:06,927][434347] Saving new best policy, reward=644.724! [2023-03-08 12:56:09,448][434396] Updated weights for policy 0, policy_version 4000 (0.0005) [2023-03-08 12:56:11,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 9869.4). Total num frames: 2072576. Throughput: 0: 10996.2. Samples: 2043188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:56:11,893][434108] Avg episode reward: [(0, '534.543')] [2023-03-08 12:56:13,397][434396] Updated weights for policy 0, policy_version 4080 (0.0005) [2023-03-08 12:56:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 9868.5). Total num frames: 2121728. Throughput: 0: 10934.2. Samples: 2105860. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 12:56:16,893][434108] Avg episode reward: [(0, '554.559')] [2023-03-08 12:56:17,334][434396] Updated weights for policy 0, policy_version 4160 (0.0005) [2023-03-08 12:56:21,255][434396] Updated weights for policy 0, policy_version 4240 (0.0005) [2023-03-08 12:56:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 9886.3). Total num frames: 2174976. Throughput: 0: 10859.0. Samples: 2168080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:56:21,893][434108] Avg episode reward: [(0, '536.273')] [2023-03-08 12:56:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000004248_2174976.pth... 
[2023-03-08 12:56:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000003616_1851392.pth [2023-03-08 12:56:25,182][434396] Updated weights for policy 0, policy_version 4320 (0.0005) [2023-03-08 12:56:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 9903.2). Total num frames: 2228224. Throughput: 0: 10831.8. Samples: 2199624. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 12:56:26,893][434108] Avg episode reward: [(0, '632.931')] [2023-03-08 12:56:29,150][434396] Updated weights for policy 0, policy_version 4400 (0.0005) [2023-03-08 12:56:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10786.2, 300 sec: 9901.6). Total num frames: 2277376. Throughput: 0: 10749.3. Samples: 2261064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:56:31,893][434108] Avg episode reward: [(0, '596.357')] [2023-03-08 12:56:33,070][434396] Updated weights for policy 0, policy_version 4480 (0.0004) [2023-03-08 12:56:36,887][434396] Updated weights for policy 0, policy_version 4560 (0.0004) [2023-03-08 12:56:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 9935.0). Total num frames: 2334720. Throughput: 0: 10718.8. Samples: 2325360. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 12:56:36,893][434108] Avg episode reward: [(0, '597.979')] [2023-03-08 12:56:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000004560_2334720.pth... [2023-03-08 12:56:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000003944_2019328.pth [2023-03-08 12:56:40,757][434396] Updated weights for policy 0, policy_version 4640 (0.0005) [2023-03-08 12:56:41,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 9949.9). Total num frames: 2387968. Throughput: 0: 10707.8. Samples: 2357572. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:56:41,893][434108] Avg episode reward: [(0, '592.428')] [2023-03-08 12:56:44,666][434396] Updated weights for policy 0, policy_version 4720 (0.0005) [2023-03-08 12:56:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 9947.4). Total num frames: 2437120. Throughput: 0: 10655.9. Samples: 2420608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:56:46,893][434108] Avg episode reward: [(0, '589.043')] [2023-03-08 12:56:48,632][434396] Updated weights for policy 0, policy_version 4800 (0.0005) [2023-03-08 12:56:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 9961.5). Total num frames: 2490368. Throughput: 0: 10539.4. Samples: 2483192. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 12:56:51,893][434108] Avg episode reward: [(0, '561.667')] [2023-03-08 12:56:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000004864_2490368.pth... [2023-03-08 12:56:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000004248_2174976.pth [2023-03-08 12:56:52,461][434396] Updated weights for policy 0, policy_version 4880 (0.0004) [2023-03-08 12:56:56,404][434396] Updated weights for policy 0, policy_version 4960 (0.0005) [2023-03-08 12:56:56,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 9975.0). Total num frames: 2543616. Throughput: 0: 10484.9. Samples: 2515008. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 12:56:56,893][434108] Avg episode reward: [(0, '603.868')] [2023-03-08 12:57:00,291][434396] Updated weights for policy 0, policy_version 5040 (0.0005) [2023-03-08 12:57:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 9987.9). Total num frames: 2596864. Throughput: 0: 10479.9. Samples: 2577456. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 12:57:01,893][434108] Avg episode reward: [(0, '674.525')] [2023-03-08 12:57:01,893][434347] Saving new best policy, reward=674.525! [2023-03-08 12:57:04,243][434396] Updated weights for policy 0, policy_version 5120 (0.0005) [2023-03-08 12:57:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 9985.0). Total num frames: 2646016. Throughput: 0: 10459.0. Samples: 2638732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:57:06,893][434108] Avg episode reward: [(0, '754.441')] [2023-03-08 12:57:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000005168_2646016.pth... [2023-03-08 12:57:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000004560_2334720.pth [2023-03-08 12:57:06,898][434347] Saving new best policy, reward=754.441! [2023-03-08 12:57:08,414][434396] Updated weights for policy 0, policy_version 5200 (0.0005) [2023-03-08 12:57:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 9982.1). Total num frames: 2695168. Throughput: 0: 10406.7. Samples: 2667924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:57:11,893][434108] Avg episode reward: [(0, '597.939')] [2023-03-08 12:57:12,669][434396] Updated weights for policy 0, policy_version 5280 (0.0005) [2023-03-08 12:57:16,892][434108] Fps is (10 sec: 9420.8, 60 sec: 10308.3, 300 sec: 9964.5). Total num frames: 2740224. Throughput: 0: 10312.5. Samples: 2725128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:57:16,893][434108] Avg episode reward: [(0, '837.289')] [2023-03-08 12:57:16,923][434347] Saving new best policy, reward=837.289! 
[2023-03-08 12:57:16,923][434396] Updated weights for policy 0, policy_version 5360 (0.0005) [2023-03-08 12:57:21,169][434396] Updated weights for policy 0, policy_version 5440 (0.0005) [2023-03-08 12:57:21,892][434108] Fps is (10 sec: 9420.8, 60 sec: 10240.0, 300 sec: 9962.1). Total num frames: 2789376. Throughput: 0: 10189.5. Samples: 2783888. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 12:57:21,893][434108] Avg episode reward: [(0, '769.025')] [2023-03-08 12:57:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000005448_2789376.pth... [2023-03-08 12:57:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000004864_2490368.pth [2023-03-08 12:57:25,548][434396] Updated weights for policy 0, policy_version 5520 (0.0005) [2023-03-08 12:57:26,892][434108] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 9945.4). Total num frames: 2834432. Throughput: 0: 10108.5. Samples: 2812456. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 12:57:26,893][434108] Avg episode reward: [(0, '852.736')] [2023-03-08 12:57:26,902][434347] Saving new best policy, reward=852.736! [2023-03-08 12:57:29,969][434396] Updated weights for policy 0, policy_version 5600 (0.0005) [2023-03-08 12:57:31,892][434108] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 9943.4). Total num frames: 2883584. Throughput: 0: 9927.2. Samples: 2867332. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 12:57:31,893][434108] Avg episode reward: [(0, '899.623')] [2023-03-08 12:57:31,894][434347] Saving new best policy, reward=899.623! [2023-03-08 12:57:34,415][434396] Updated weights for policy 0, policy_version 5680 (0.0006) [2023-03-08 12:57:36,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9927.6). Total num frames: 2928640. Throughput: 0: 9787.3. Samples: 2923620. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:57:36,893][434108] Avg episode reward: [(0, '991.319')] [2023-03-08 12:57:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000005720_2928640.pth... [2023-03-08 12:57:36,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000005168_2646016.pth [2023-03-08 12:57:36,898][434347] Saving new best policy, reward=991.319! [2023-03-08 12:57:38,751][434396] Updated weights for policy 0, policy_version 5760 (0.0005) [2023-03-08 12:57:41,892][434108] Fps is (10 sec: 9011.2, 60 sec: 9762.1, 300 sec: 10080.3). Total num frames: 2973696. Throughput: 0: 9703.2. Samples: 2951652. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 12:57:41,893][434108] Avg episode reward: [(0, '990.581')] [2023-03-08 12:57:43,334][434396] Updated weights for policy 0, policy_version 5840 (0.0006) [2023-03-08 12:57:46,892][434108] Fps is (10 sec: 9011.2, 60 sec: 9693.9, 300 sec: 10233.1). Total num frames: 3018752. Throughput: 0: 9516.0. Samples: 3005676. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 12:57:46,893][434108] Avg episode reward: [(0, '1235.727')] [2023-03-08 12:57:46,923][434347] Saving new best policy, reward=1235.727! [2023-03-08 12:57:47,735][434396] Updated weights for policy 0, policy_version 5920 (0.0005) [2023-03-08 12:57:51,893][434108] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 10399.7). Total num frames: 3067904. Throughput: 0: 9427.7. Samples: 3062980. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 12:57:51,904][434108] Avg episode reward: [(0, '1121.948')] [2023-03-08 12:57:51,908][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000005992_3067904.pth... 
[2023-03-08 12:57:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000005448_2789376.pth [2023-03-08 12:57:51,981][434396] Updated weights for policy 0, policy_version 6000 (0.0003) [2023-03-08 12:57:56,303][434396] Updated weights for policy 0, policy_version 6080 (0.0004) [2023-03-08 12:57:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 10385.8). Total num frames: 3117056. Throughput: 0: 9420.3. Samples: 3091836. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 12:57:56,893][434108] Avg episode reward: [(0, '1074.237')] [2023-03-08 12:58:00,747][434396] Updated weights for policy 0, policy_version 6160 (0.0004) [2023-03-08 12:58:01,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 10371.9). Total num frames: 3162112. Throughput: 0: 9390.3. Samples: 3147692. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 12:58:01,893][434108] Avg episode reward: [(0, '1065.614')] [2023-03-08 12:58:05,378][434396] Updated weights for policy 0, policy_version 6240 (0.0005) [2023-03-08 12:58:06,892][434108] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 10330.3). Total num frames: 3207168. Throughput: 0: 9266.5. Samples: 3200880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:58:06,893][434108] Avg episode reward: [(0, '1003.980')] [2023-03-08 12:58:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000006264_3207168.pth... [2023-03-08 12:58:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000005720_2928640.pth [2023-03-08 12:58:09,808][434396] Updated weights for policy 0, policy_version 6320 (0.0005) [2023-03-08 12:58:11,892][434108] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 10302.5). Total num frames: 3252224. Throughput: 0: 9233.2. Samples: 3227948. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 12:58:11,893][434108] Avg episode reward: [(0, '1144.291')] [2023-03-08 12:58:14,108][434396] Updated weights for policy 0, policy_version 6400 (0.0005) [2023-03-08 12:58:16,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 10302.5). Total num frames: 3301376. Throughput: 0: 9282.9. Samples: 3285064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:58:16,893][434108] Avg episode reward: [(0, '1111.900')] [2023-03-08 12:58:18,488][434396] Updated weights for policy 0, policy_version 6480 (0.0005) [2023-03-08 12:58:21,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 10288.6). Total num frames: 3346432. Throughput: 0: 9285.2. Samples: 3341456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:58:21,893][434108] Avg episode reward: [(0, '1000.668')] [2023-03-08 12:58:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000006536_3346432.pth... [2023-03-08 12:58:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000005992_3067904.pth [2023-03-08 12:58:22,850][434396] Updated weights for policy 0, policy_version 6560 (0.0004) [2023-03-08 12:58:26,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 10274.7). Total num frames: 3395584. Throughput: 0: 9300.4. Samples: 3370168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:58:26,893][434108] Avg episode reward: [(0, '1173.744')] [2023-03-08 12:58:27,215][434396] Updated weights for policy 0, policy_version 6640 (0.0005) [2023-03-08 12:58:31,582][434396] Updated weights for policy 0, policy_version 6720 (0.0005) [2023-03-08 12:58:31,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 10246.9). Total num frames: 3440640. Throughput: 0: 9347.7. Samples: 3426324. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 12:58:31,893][434108] Avg episode reward: [(0, '1080.494')] [2023-03-08 12:58:35,802][434396] Updated weights for policy 0, policy_version 6800 (0.0005) [2023-03-08 12:58:36,892][434108] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 10260.8). Total num frames: 3489792. Throughput: 0: 9350.4. Samples: 3483748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:58:36,893][434108] Avg episode reward: [(0, '1147.732')] [2023-03-08 12:58:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000006816_3489792.pth... [2023-03-08 12:58:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000006264_3207168.pth [2023-03-08 12:58:40,075][434396] Updated weights for policy 0, policy_version 6880 (0.0005) [2023-03-08 12:58:41,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 10260.8). Total num frames: 3538944. Throughput: 0: 9355.7. Samples: 3512844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:58:41,893][434108] Avg episode reward: [(0, '1093.229')] [2023-03-08 12:58:44,465][434396] Updated weights for policy 0, policy_version 6960 (0.0005) [2023-03-08 12:58:46,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 10233.1). Total num frames: 3584000. Throughput: 0: 9345.7. Samples: 3568248. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 12:58:46,893][434108] Avg episode reward: [(0, '1134.614')] [2023-03-08 12:58:48,842][434396] Updated weights for policy 0, policy_version 7040 (0.0005) [2023-03-08 12:58:51,892][434108] Fps is (10 sec: 9011.2, 60 sec: 9352.6, 300 sec: 10205.3). Total num frames: 3629056. Throughput: 0: 9406.4. Samples: 3624168. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 12:58:51,893][434108] Avg episode reward: [(0, '1088.649')] [2023-03-08 12:58:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000007088_3629056.pth... [2023-03-08 12:58:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000006536_3346432.pth [2023-03-08 12:58:53,335][434396] Updated weights for policy 0, policy_version 7120 (0.0005) [2023-03-08 12:58:56,892][434108] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 10177.5). Total num frames: 3674112. Throughput: 0: 9409.7. Samples: 3651384. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 12:58:56,893][434108] Avg episode reward: [(0, '1220.635')] [2023-03-08 12:58:57,911][434396] Updated weights for policy 0, policy_version 7200 (0.0005) [2023-03-08 12:59:01,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 10163.6). Total num frames: 3723264. Throughput: 0: 9374.0. Samples: 3706892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:01,893][434108] Avg episode reward: [(0, '1218.156')] [2023-03-08 12:59:02,197][434396] Updated weights for policy 0, policy_version 7280 (0.0004) [2023-03-08 12:59:06,628][434396] Updated weights for policy 0, policy_version 7360 (0.0005) [2023-03-08 12:59:06,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 10135.9). Total num frames: 3768320. Throughput: 0: 9363.4. Samples: 3762808. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 12:59:06,893][434108] Avg episode reward: [(0, '1359.159')] [2023-03-08 12:59:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000007360_3768320.pth... 
[2023-03-08 12:59:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000006816_3489792.pth [2023-03-08 12:59:06,899][434347] Saving new best policy, reward=1359.159! [2023-03-08 12:59:10,897][434396] Updated weights for policy 0, policy_version 7440 (0.0005) [2023-03-08 12:59:11,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 10122.0). Total num frames: 3817472. Throughput: 0: 9355.9. Samples: 3791184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:11,893][434108] Avg episode reward: [(0, '1476.213')] [2023-03-08 12:59:11,893][434347] Saving new best policy, reward=1476.213! [2023-03-08 12:59:15,171][434396] Updated weights for policy 0, policy_version 7520 (0.0005) [2023-03-08 12:59:16,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 10080.3). Total num frames: 3862528. Throughput: 0: 9383.9. Samples: 3848600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:16,903][434108] Avg episode reward: [(0, '1538.204')] [2023-03-08 12:59:16,906][434347] Saving new best policy, reward=1538.204! [2023-03-08 12:59:19,511][434396] Updated weights for policy 0, policy_version 7600 (0.0005) [2023-03-08 12:59:21,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 10080.3). Total num frames: 3911680. Throughput: 0: 9367.7. Samples: 3905292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:21,903][434108] Avg episode reward: [(0, '1575.586')] [2023-03-08 12:59:21,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000007640_3911680.pth... [2023-03-08 12:59:21,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000007088_3629056.pth [2023-03-08 12:59:21,909][434347] Saving new best policy, reward=1575.586! 
[2023-03-08 12:59:23,851][434396] Updated weights for policy 0, policy_version 7680 (0.0005) [2023-03-08 12:59:26,892][434108] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 10038.7). Total num frames: 3956736. Throughput: 0: 9352.5. Samples: 3933708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:26,893][434108] Avg episode reward: [(0, '1636.163')] [2023-03-08 12:59:26,894][434347] Saving new best policy, reward=1636.163! [2023-03-08 12:59:28,292][434396] Updated weights for policy 0, policy_version 7760 (0.0005) [2023-03-08 12:59:31,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 10024.8). Total num frames: 4005888. Throughput: 0: 9361.2. Samples: 3989504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:31,903][434108] Avg episode reward: [(0, '1564.938')] [2023-03-08 12:59:32,618][434396] Updated weights for policy 0, policy_version 7840 (0.0005) [2023-03-08 12:59:36,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9997.0). Total num frames: 4050944. Throughput: 0: 9364.9. Samples: 4045588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:36,903][434108] Avg episode reward: [(0, '1598.377')] [2023-03-08 12:59:36,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000007912_4050944.pth... [2023-03-08 12:59:36,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000007360_3768320.pth [2023-03-08 12:59:37,095][434396] Updated weights for policy 0, policy_version 7920 (0.0005) [2023-03-08 12:59:41,804][434396] Updated weights for policy 0, policy_version 8000 (0.0005) [2023-03-08 12:59:41,892][434108] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9969.2). Total num frames: 4096000. Throughput: 0: 9336.5. Samples: 4071528. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 12:59:41,893][434108] Avg episode reward: [(0, '1552.071')] [2023-03-08 12:59:46,573][434396] Updated weights for policy 0, policy_version 8080 (0.0005) [2023-03-08 12:59:46,892][434108] Fps is (10 sec: 8601.6, 60 sec: 9216.0, 300 sec: 9927.6). Total num frames: 4136960. Throughput: 0: 9263.6. Samples: 4123756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:46,893][434108] Avg episode reward: [(0, '1516.752')] [2023-03-08 12:59:51,167][434396] Updated weights for policy 0, policy_version 8160 (0.0005) [2023-03-08 12:59:51,892][434108] Fps is (10 sec: 8601.5, 60 sec: 9216.0, 300 sec: 9899.8). Total num frames: 4182016. Throughput: 0: 9199.5. Samples: 4176784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:51,893][434108] Avg episode reward: [(0, '1499.967')] [2023-03-08 12:59:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000008168_4182016.pth... [2023-03-08 12:59:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000007640_3911680.pth [2023-03-08 12:59:55,740][434396] Updated weights for policy 0, policy_version 8240 (0.0005) [2023-03-08 12:59:56,892][434108] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9872.1). Total num frames: 4227072. Throughput: 0: 9164.4. Samples: 4203580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 12:59:56,893][434108] Avg episode reward: [(0, '1519.105')] [2023-03-08 13:00:00,735][434396] Updated weights for policy 0, policy_version 8320 (0.0005) [2023-03-08 13:00:01,892][434108] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9830.4). Total num frames: 4268032. Throughput: 0: 9022.3. Samples: 4254604. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:01,893][434108] Avg episode reward: [(0, '1582.005')] [2023-03-08 13:00:05,431][434396] Updated weights for policy 0, policy_version 8400 (0.0004) [2023-03-08 13:00:06,892][434108] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9802.6). Total num frames: 4313088. Throughput: 0: 8906.6. Samples: 4306088. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:00:06,893][434108] Avg episode reward: [(0, '1485.233')] [2023-03-08 13:00:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000008424_4313088.pth... [2023-03-08 13:00:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000007912_4050944.pth [2023-03-08 13:00:09,882][434396] Updated weights for policy 0, policy_version 8480 (0.0003) [2023-03-08 13:00:11,892][434108] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9788.7). Total num frames: 4358144. Throughput: 0: 8898.1. Samples: 4334124. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:00:11,893][434108] Avg episode reward: [(0, '1386.124')] [2023-03-08 13:00:14,143][434396] Updated weights for policy 0, policy_version 8560 (0.0005) [2023-03-08 13:00:16,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9774.9). Total num frames: 4407296. Throughput: 0: 8921.8. Samples: 4390984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:16,893][434108] Avg episode reward: [(0, '1443.178')] [2023-03-08 13:00:18,443][434396] Updated weights for policy 0, policy_version 8640 (0.0005) [2023-03-08 13:00:21,892][434108] Fps is (10 sec: 9420.7, 60 sec: 9011.2, 300 sec: 9747.1). Total num frames: 4452352. Throughput: 0: 8948.3. Samples: 4448264. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:21,893][434108] Avg episode reward: [(0, '1561.297')] [2023-03-08 13:00:21,903][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000008704_4456448.pth... [2023-03-08 13:00:21,904][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000008168_4182016.pth [2023-03-08 13:00:22,789][434396] Updated weights for policy 0, policy_version 8720 (0.0005) [2023-03-08 13:00:26,892][434108] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9719.3). Total num frames: 4497408. Throughput: 0: 8989.0. Samples: 4476032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:26,893][434108] Avg episode reward: [(0, '1573.027')] [2023-03-08 13:00:27,384][434396] Updated weights for policy 0, policy_version 8800 (0.0005) [2023-03-08 13:00:31,892][434108] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9677.7). Total num frames: 4542464. Throughput: 0: 8977.8. Samples: 4527756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:31,893][434108] Avg episode reward: [(0, '1640.779')] [2023-03-08 13:00:31,893][434347] Saving new best policy, reward=1640.779! [2023-03-08 13:00:32,232][434396] Updated weights for policy 0, policy_version 8880 (0.0005) [2023-03-08 13:00:36,664][434396] Updated weights for policy 0, policy_version 8960 (0.0005) [2023-03-08 13:00:36,892][434108] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9649.9). Total num frames: 4587520. Throughput: 0: 8988.6. Samples: 4581272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:36,893][434108] Avg episode reward: [(0, '1727.429')] [2023-03-08 13:00:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000008960_4587520.pth... 
[2023-03-08 13:00:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000008424_4313088.pth [2023-03-08 13:00:36,899][434347] Saving new best policy, reward=1727.429! [2023-03-08 13:00:40,576][434396] Updated weights for policy 0, policy_version 9040 (0.0005) [2023-03-08 13:00:41,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9079.5, 300 sec: 9649.9). Total num frames: 4640768. Throughput: 0: 9075.4. Samples: 4611972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:41,893][434108] Avg episode reward: [(0, '1884.699')] [2023-03-08 13:00:41,894][434347] Saving new best policy, reward=1884.699! [2023-03-08 13:00:44,567][434396] Updated weights for policy 0, policy_version 9120 (0.0005) [2023-03-08 13:00:46,893][434108] Fps is (10 sec: 10239.9, 60 sec: 9216.0, 300 sec: 9622.1). Total num frames: 4689920. Throughput: 0: 9318.3. Samples: 4673928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:46,893][434108] Avg episode reward: [(0, '1928.833')] [2023-03-08 13:00:46,943][434347] Saving new best policy, reward=1928.833! [2023-03-08 13:00:48,619][434396] Updated weights for policy 0, policy_version 9200 (0.0004) [2023-03-08 13:00:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9608.2). Total num frames: 4739072. Throughput: 0: 9519.0. Samples: 4734444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:51,893][434108] Avg episode reward: [(0, '1917.277')] [2023-03-08 13:00:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000009264_4743168.pth... 
[2023-03-08 13:00:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000008704_4456448.pth [2023-03-08 13:00:52,693][434396] Updated weights for policy 0, policy_version 9280 (0.0005) [2023-03-08 13:00:56,669][434396] Updated weights for policy 0, policy_version 9360 (0.0005) [2023-03-08 13:00:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9420.8, 300 sec: 9594.4). Total num frames: 4792320. Throughput: 0: 9564.8. Samples: 4764540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:00:56,893][434108] Avg episode reward: [(0, '1931.176')] [2023-03-08 13:00:56,894][434347] Saving new best policy, reward=1931.176! [2023-03-08 13:01:00,674][434396] Updated weights for policy 0, policy_version 9440 (0.0005) [2023-03-08 13:01:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 4845568. Throughput: 0: 9678.3. Samples: 4826508. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:01:01,893][434108] Avg episode reward: [(0, '1984.781')] [2023-03-08 13:01:01,894][434347] Saving new best policy, reward=1984.781! [2023-03-08 13:01:04,695][434396] Updated weights for policy 0, policy_version 9520 (0.0004) [2023-03-08 13:01:06,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9552.7). Total num frames: 4890624. Throughput: 0: 9739.2. Samples: 4886528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:06,893][434108] Avg episode reward: [(0, '1956.443')] [2023-03-08 13:01:06,954][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000009560_4894720.pth... [2023-03-08 13:01:06,956][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000008960_4587520.pth [2023-03-08 13:01:09,094][434396] Updated weights for policy 0, policy_version 9600 (0.0006) [2023-03-08 13:01:11,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9552.7). 
Total num frames: 4939776. Throughput: 0: 9747.5. Samples: 4914668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:11,893][434108] Avg episode reward: [(0, '1922.663')] [2023-03-08 13:01:13,407][434396] Updated weights for policy 0, policy_version 9680 (0.0005) [2023-03-08 13:01:16,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9538.8). Total num frames: 4988928. Throughput: 0: 9881.7. Samples: 4972432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:16,893][434108] Avg episode reward: [(0, '1936.588')] [2023-03-08 13:01:17,704][434396] Updated weights for policy 0, policy_version 9760 (0.0005) [2023-03-08 13:01:21,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9511.1). Total num frames: 5033984. Throughput: 0: 9969.2. Samples: 5029884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:21,903][434108] Avg episode reward: [(0, '1909.067')] [2023-03-08 13:01:21,912][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000009840_5038080.pth... [2023-03-08 13:01:21,913][434396] Updated weights for policy 0, policy_version 9840 (0.0004) [2023-03-08 13:01:21,915][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000009264_4743168.pth [2023-03-08 13:01:26,211][434396] Updated weights for policy 0, policy_version 9920 (0.0005) [2023-03-08 13:01:26,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9511.0). Total num frames: 5083136. Throughput: 0: 9909.9. Samples: 5057916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:26,893][434108] Avg episode reward: [(0, '1935.840')] [2023-03-08 13:01:30,285][434396] Updated weights for policy 0, policy_version 10000 (0.0004) [2023-03-08 13:01:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9483.3). Total num frames: 5132288. Throughput: 0: 9853.7. Samples: 5117344. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:31,893][434108] Avg episode reward: [(0, '1901.820')] [2023-03-08 13:01:34,428][434396] Updated weights for policy 0, policy_version 10080 (0.0004) [2023-03-08 13:01:36,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9469.4). Total num frames: 5181440. Throughput: 0: 9810.0. Samples: 5175896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:36,893][434108] Avg episode reward: [(0, '1895.552')] [2023-03-08 13:01:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000010120_5181440.pth... [2023-03-08 13:01:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000009560_4894720.pth [2023-03-08 13:01:38,841][434396] Updated weights for policy 0, policy_version 10160 (0.0005) [2023-03-08 13:01:41,893][434108] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9455.5). Total num frames: 5226496. Throughput: 0: 9756.3. Samples: 5203576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:41,893][434108] Avg episode reward: [(0, '1900.994')] [2023-03-08 13:01:43,287][434396] Updated weights for policy 0, policy_version 10240 (0.0006) [2023-03-08 13:01:46,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9762.2, 300 sec: 9441.6). Total num frames: 5275648. Throughput: 0: 9611.7. Samples: 5259036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:46,893][434108] Avg episode reward: [(0, '1945.815')] [2023-03-08 13:01:47,690][434396] Updated weights for policy 0, policy_version 10320 (0.0005) [2023-03-08 13:01:51,893][434108] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9413.9). Total num frames: 5320704. Throughput: 0: 9557.5. Samples: 5316616. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:01:51,893][434108] Avg episode reward: [(0, '2009.986')] [2023-03-08 13:01:51,901][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000010400_5324800.pth... [2023-03-08 13:01:51,902][434396] Updated weights for policy 0, policy_version 10400 (0.0005) [2023-03-08 13:01:51,904][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000009840_5038080.pth [2023-03-08 13:01:51,904][434347] Saving new best policy, reward=2009.986! [2023-03-08 13:01:56,078][434396] Updated weights for policy 0, policy_version 10480 (0.0005) [2023-03-08 13:01:56,893][434108] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 5369856. Throughput: 0: 9570.7. Samples: 5345352. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:01:56,893][434108] Avg episode reward: [(0, '1993.209')] [2023-03-08 13:02:00,385][434396] Updated weights for policy 0, policy_version 10560 (0.0006) [2023-03-08 13:02:01,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9400.0). Total num frames: 5419008. Throughput: 0: 9566.2. Samples: 5402912. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:02:01,893][434108] Avg episode reward: [(0, '1874.929')] [2023-03-08 13:02:04,606][434396] Updated weights for policy 0, policy_version 10640 (0.0005) [2023-03-08 13:02:06,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 5468160. Throughput: 0: 9597.0. Samples: 5461748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:02:06,893][434108] Avg episode reward: [(0, '1970.039')] [2023-03-08 13:02:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000010680_5468160.pth... 
[2023-03-08 13:02:06,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000010120_5181440.pth [2023-03-08 13:02:08,682][434396] Updated weights for policy 0, policy_version 10720 (0.0005) [2023-03-08 13:02:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9427.7). Total num frames: 5521408. Throughput: 0: 9651.6. Samples: 5492240. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:02:11,893][434108] Avg episode reward: [(0, '1982.838')] [2023-03-08 13:02:12,731][434396] Updated weights for policy 0, policy_version 10800 (0.0005) [2023-03-08 13:02:16,893][434108] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9413.9). Total num frames: 5566464. Throughput: 0: 9640.0. Samples: 5551144. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:02:16,893][434108] Avg episode reward: [(0, '1972.038')] [2023-03-08 13:02:16,940][434396] Updated weights for policy 0, policy_version 10880 (0.0006) [2023-03-08 13:02:21,116][434396] Updated weights for policy 0, policy_version 10960 (0.0005) [2023-03-08 13:02:21,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9427.7). Total num frames: 5615616. Throughput: 0: 9676.6. Samples: 5611344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:02:21,893][434108] Avg episode reward: [(0, '1934.808')] [2023-03-08 13:02:21,917][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000010976_5619712.pth... [2023-03-08 13:02:21,919][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000010400_5324800.pth [2023-03-08 13:02:25,352][434396] Updated weights for policy 0, policy_version 11040 (0.0005) [2023-03-08 13:02:26,892][434108] Fps is (10 sec: 9830.6, 60 sec: 9693.9, 300 sec: 9427.7). Total num frames: 5664768. Throughput: 0: 9701.8. Samples: 5640156. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:02:26,893][434108] Avg episode reward: [(0, '1963.958')] [2023-03-08 13:02:29,518][434396] Updated weights for policy 0, policy_version 11120 (0.0005) [2023-03-08 13:02:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9441.6). Total num frames: 5713920. Throughput: 0: 9751.6. Samples: 5697860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:02:31,893][434108] Avg episode reward: [(0, '1933.387')] [2023-03-08 13:02:33,812][434396] Updated weights for policy 0, policy_version 11200 (0.0005) [2023-03-08 13:02:36,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9455.5). Total num frames: 5763072. Throughput: 0: 9776.5. Samples: 5756556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:02:36,893][434108] Avg episode reward: [(0, '1947.836')] [2023-03-08 13:02:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000011256_5763072.pth... [2023-03-08 13:02:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000010680_5468160.pth [2023-03-08 13:02:37,951][434396] Updated weights for policy 0, policy_version 11280 (0.0005) [2023-03-08 13:02:41,879][434396] Updated weights for policy 0, policy_version 11360 (0.0004) [2023-03-08 13:02:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9483.3). Total num frames: 5816320. Throughput: 0: 9806.7. Samples: 5786652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:02:41,893][434108] Avg episode reward: [(0, '1961.856')] [2023-03-08 13:02:45,838][434396] Updated weights for policy 0, policy_version 11440 (0.0004) [2023-03-08 13:02:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9483.3). Total num frames: 5865472. Throughput: 0: 9899.6. Samples: 5848392. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:02:46,893][434108] Avg episode reward: [(0, '1891.825')] [2023-03-08 13:02:49,825][434396] Updated weights for policy 0, policy_version 11520 (0.0003) [2023-03-08 13:02:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9483.3). Total num frames: 5914624. Throughput: 0: 9973.1. Samples: 5910536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:02:51,893][434108] Avg episode reward: [(0, '1919.041')] [2023-03-08 13:02:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000011560_5918720.pth... [2023-03-08 13:02:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000010976_5619712.pth [2023-03-08 13:02:53,884][434396] Updated weights for policy 0, policy_version 11600 (0.0004) [2023-03-08 13:02:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9967.0, 300 sec: 9511.1). Total num frames: 5967872. Throughput: 0: 9955.0. Samples: 5940216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:02:56,893][434108] Avg episode reward: [(0, '1951.508')] [2023-03-08 13:02:58,088][434396] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-08 13:03:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9524.9). Total num frames: 6017024. Throughput: 0: 9969.9. Samples: 5999788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:03:01,893][434108] Avg episode reward: [(0, '1957.268')] [2023-03-08 13:03:02,228][434396] Updated weights for policy 0, policy_version 11760 (0.0005) [2023-03-08 13:03:06,338][434396] Updated weights for policy 0, policy_version 11840 (0.0006) [2023-03-08 13:03:06,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9538.8). Total num frames: 6066176. Throughput: 0: 9938.7. Samples: 6058584. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:03:06,893][434108] Avg episode reward: [(0, '1964.519')] [2023-03-08 13:03:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000011848_6066176.pth... [2023-03-08 13:03:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000011256_5763072.pth [2023-03-08 13:03:10,576][434396] Updated weights for policy 0, policy_version 11920 (0.0006) [2023-03-08 13:03:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9538.8). Total num frames: 6115328. Throughput: 0: 9946.6. Samples: 6087752. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:03:11,893][434108] Avg episode reward: [(0, '1949.042')] [2023-03-08 13:03:14,662][434396] Updated weights for policy 0, policy_version 12000 (0.0005) [2023-03-08 13:03:16,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9552.7). Total num frames: 6164480. Throughput: 0: 9999.7. Samples: 6147848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:03:16,893][434108] Avg episode reward: [(0, '1966.758')] [2023-03-08 13:03:18,621][434396] Updated weights for policy 0, policy_version 12080 (0.0005) [2023-03-08 13:03:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9566.6). Total num frames: 6217728. Throughput: 0: 10066.2. Samples: 6209536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:03:21,893][434108] Avg episode reward: [(0, '1936.815')] [2023-03-08 13:03:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000012144_6217728.pth... 
[2023-03-08 13:03:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000011560_5918720.pth [2023-03-08 13:03:22,587][434396] Updated weights for policy 0, policy_version 12160 (0.0004) [2023-03-08 13:03:26,790][434396] Updated weights for policy 0, policy_version 12240 (0.0005) [2023-03-08 13:03:26,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9580.5). Total num frames: 6266880. Throughput: 0: 10083.9. Samples: 6240428. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:03:26,893][434108] Avg episode reward: [(0, '1945.033')] [2023-03-08 13:03:31,171][434396] Updated weights for policy 0, policy_version 12320 (0.0006) [2023-03-08 13:03:31,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9566.6). Total num frames: 6311936. Throughput: 0: 9963.5. Samples: 6296748. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:03:31,893][434108] Avg episode reward: [(0, '1871.447')] [2023-03-08 13:03:35,550][434396] Updated weights for policy 0, policy_version 12400 (0.0005) [2023-03-08 13:03:36,892][434108] Fps is (10 sec: 9420.7, 60 sec: 9966.9, 300 sec: 9566.6). Total num frames: 6361088. Throughput: 0: 9831.8. Samples: 6352968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:03:36,893][434108] Avg episode reward: [(0, '1913.826')] [2023-03-08 13:03:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000012424_6361088.pth... [2023-03-08 13:03:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000011848_6066176.pth [2023-03-08 13:03:39,800][434396] Updated weights for policy 0, policy_version 12480 (0.0005) [2023-03-08 13:03:41,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9566.6). Total num frames: 6406144. Throughput: 0: 9810.7. Samples: 6381700. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:03:41,893][434108] Avg episode reward: [(0, '1950.227')] [2023-03-08 13:03:44,070][434396] Updated weights for policy 0, policy_version 12560 (0.0005) [2023-03-08 13:03:46,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9580.5). Total num frames: 6455296. Throughput: 0: 9771.0. Samples: 6439484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:03:46,893][434108] Avg episode reward: [(0, '1912.530')] [2023-03-08 13:03:48,373][434396] Updated weights for policy 0, policy_version 12640 (0.0004) [2023-03-08 13:03:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9594.4). Total num frames: 6504448. Throughput: 0: 9753.9. Samples: 6497508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:03:51,893][434108] Avg episode reward: [(0, '1940.793')] [2023-03-08 13:03:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000012704_6504448.pth... [2023-03-08 13:03:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000012144_6217728.pth [2023-03-08 13:03:52,537][434396] Updated weights for policy 0, policy_version 12720 (0.0004) [2023-03-08 13:03:56,659][434396] Updated weights for policy 0, policy_version 12800 (0.0005) [2023-03-08 13:03:56,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9594.4). Total num frames: 6553600. Throughput: 0: 9781.2. Samples: 6527904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:03:56,893][434108] Avg episode reward: [(0, '1971.723')] [2023-03-08 13:04:00,826][434396] Updated weights for policy 0, policy_version 12880 (0.0005) [2023-03-08 13:04:01,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9608.2). Total num frames: 6602752. Throughput: 0: 9745.2. Samples: 6586380. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:04:01,893][434108] Avg episode reward: [(0, '1974.141')] [2023-03-08 13:04:04,982][434396] Updated weights for policy 0, policy_version 12960 (0.0005) [2023-03-08 13:04:06,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9608.2). Total num frames: 6651904. Throughput: 0: 9713.7. Samples: 6646652. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:04:06,893][434108] Avg episode reward: [(0, '1918.958')] [2023-03-08 13:04:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000013000_6656000.pth... [2023-03-08 13:04:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000012424_6361088.pth [2023-03-08 13:04:08,967][434396] Updated weights for policy 0, policy_version 13040 (0.0005) [2023-03-08 13:04:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9622.1). Total num frames: 6701056. Throughput: 0: 9693.9. Samples: 6676652. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:04:11,893][434108] Avg episode reward: [(0, '1931.721')] [2023-03-08 13:04:13,312][434396] Updated weights for policy 0, policy_version 13120 (0.0005) [2023-03-08 13:04:16,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9622.1). Total num frames: 6750208. Throughput: 0: 9712.8. Samples: 6733824. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:04:16,893][434108] Avg episode reward: [(0, '1935.381')] [2023-03-08 13:04:17,690][434396] Updated weights for policy 0, policy_version 13200 (0.0005) [2023-03-08 13:04:21,844][434396] Updated weights for policy 0, policy_version 13280 (0.0005) [2023-03-08 13:04:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9636.0). Total num frames: 6799360. Throughput: 0: 9739.2. Samples: 6791232. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:04:21,893][434108] Avg episode reward: [(0, '1945.887')] [2023-03-08 13:04:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000013280_6799360.pth... [2023-03-08 13:04:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000012704_6504448.pth [2023-03-08 13:04:26,109][434396] Updated weights for policy 0, policy_version 13360 (0.0005) [2023-03-08 13:04:26,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9622.1). Total num frames: 6844416. Throughput: 0: 9738.1. Samples: 6819912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:04:26,893][434108] Avg episode reward: [(0, '1960.499')] [2023-03-08 13:04:30,267][434396] Updated weights for policy 0, policy_version 13440 (0.0006) [2023-03-08 13:04:31,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9636.0). Total num frames: 6893568. Throughput: 0: 9761.6. Samples: 6878756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:04:31,893][434108] Avg episode reward: [(0, '1939.759')] [2023-03-08 13:04:34,455][434396] Updated weights for policy 0, policy_version 13520 (0.0005) [2023-03-08 13:04:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9663.8). Total num frames: 6946816. Throughput: 0: 9789.6. Samples: 6938040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:04:36,893][434108] Avg episode reward: [(0, '1925.715')] [2023-03-08 13:04:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000013568_6946816.pth... 
[2023-03-08 13:04:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000013000_6656000.pth [2023-03-08 13:04:38,532][434396] Updated weights for policy 0, policy_version 13600 (0.0004) [2023-03-08 13:04:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9691.6). Total num frames: 6995968. Throughput: 0: 9776.5. Samples: 6967844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:04:41,893][434108] Avg episode reward: [(0, '1982.613')] [2023-03-08 13:04:42,650][434396] Updated weights for policy 0, policy_version 13680 (0.0005) [2023-03-08 13:04:46,576][434396] Updated weights for policy 0, policy_version 13760 (0.0003) [2023-03-08 13:04:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9705.4). Total num frames: 7045120. Throughput: 0: 9833.1. Samples: 7028868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:04:46,893][434108] Avg episode reward: [(0, '1925.523')] [2023-03-08 13:04:50,740][434396] Updated weights for policy 0, policy_version 13840 (0.0005) [2023-03-08 13:04:51,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9719.3). Total num frames: 7094272. Throughput: 0: 9828.6. Samples: 7088940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:04:51,893][434108] Avg episode reward: [(0, '1981.764')] [2023-03-08 13:04:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000013856_7094272.pth... [2023-03-08 13:04:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000013280_6799360.pth [2023-03-08 13:04:54,640][434396] Updated weights for policy 0, policy_version 13920 (0.0004) [2023-03-08 13:04:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9761.0). Total num frames: 7147520. Throughput: 0: 9857.2. Samples: 7120224. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:04:56,893][434108] Avg episode reward: [(0, '2002.460')] [2023-03-08 13:04:58,666][434396] Updated weights for policy 0, policy_version 14000 (0.0005) [2023-03-08 13:05:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 7196672. Throughput: 0: 9943.9. Samples: 7181300. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:05:01,893][434108] Avg episode reward: [(0, '1915.105')] [2023-03-08 13:05:02,787][434396] Updated weights for policy 0, policy_version 14080 (0.0005) [2023-03-08 13:05:06,553][434396] Updated weights for policy 0, policy_version 14160 (0.0005) [2023-03-08 13:05:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 7249920. Throughput: 0: 10070.6. Samples: 7244408. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:05:06,893][434108] Avg episode reward: [(0, '1335.586')] [2023-03-08 13:05:06,925][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000014168_7254016.pth... [2023-03-08 13:05:06,926][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000013568_6946816.pth [2023-03-08 13:05:10,364][434396] Updated weights for policy 0, policy_version 14240 (0.0005) [2023-03-08 13:05:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 7303168. Throughput: 0: 10140.6. Samples: 7276240. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:05:11,893][434108] Avg episode reward: [(0, '894.597')] [2023-03-08 13:05:14,372][434396] Updated weights for policy 0, policy_version 14320 (0.0005) [2023-03-08 13:05:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10103.5, 300 sec: 9844.3). Total num frames: 7356416. Throughput: 0: 10223.5. Samples: 7338816. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:05:16,893][434108] Avg episode reward: [(0, '1197.408')] [2023-03-08 13:05:18,348][434396] Updated weights for policy 0, policy_version 14400 (0.0005) [2023-03-08 13:05:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 9872.1). Total num frames: 7409664. Throughput: 0: 10280.3. Samples: 7400652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:05:21,893][434108] Avg episode reward: [(0, '1885.335')] [2023-03-08 13:05:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000014472_7409664.pth... [2023-03-08 13:05:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000013856_7094272.pth [2023-03-08 13:05:22,272][434396] Updated weights for policy 0, policy_version 14480 (0.0005) [2023-03-08 13:05:26,150][434396] Updated weights for policy 0, policy_version 14560 (0.0004) [2023-03-08 13:05:26,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9885.9). Total num frames: 7458816. Throughput: 0: 10317.3. Samples: 7432120. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:05:26,893][434108] Avg episode reward: [(0, '1946.870')] [2023-03-08 13:05:30,442][434396] Updated weights for policy 0, policy_version 14640 (0.0005) [2023-03-08 13:05:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 9899.8). Total num frames: 7507968. Throughput: 0: 10284.0. Samples: 7491648. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:05:31,893][434108] Avg episode reward: [(0, '1917.121')] [2023-03-08 13:05:34,662][434396] Updated weights for policy 0, policy_version 14720 (0.0005) [2023-03-08 13:05:36,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 9885.9). Total num frames: 7557120. Throughput: 0: 10240.4. Samples: 7549760. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:05:36,893][434108] Avg episode reward: [(0, '1974.050')] [2023-03-08 13:05:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000014760_7557120.pth... [2023-03-08 13:05:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000014168_7254016.pth [2023-03-08 13:05:38,812][434396] Updated weights for policy 0, policy_version 14800 (0.0006) [2023-03-08 13:05:41,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9885.9). Total num frames: 7606272. Throughput: 0: 10211.9. Samples: 7579760. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:05:41,893][434108] Avg episode reward: [(0, '1956.084')] [2023-03-08 13:05:42,949][434396] Updated weights for policy 0, policy_version 14880 (0.0005) [2023-03-08 13:05:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9885.9). Total num frames: 7655424. Throughput: 0: 10174.8. Samples: 7639164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:05:46,893][434108] Avg episode reward: [(0, '1995.058')] [2023-03-08 13:05:47,035][434396] Updated weights for policy 0, policy_version 14960 (0.0004) [2023-03-08 13:05:51,151][434396] Updated weights for policy 0, policy_version 15040 (0.0005) [2023-03-08 13:05:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9872.1). Total num frames: 7704576. Throughput: 0: 10111.7. Samples: 7699436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:05:51,893][434108] Avg episode reward: [(0, '1996.930')] [2023-03-08 13:05:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000015048_7704576.pth... 
[2023-03-08 13:05:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000014472_7409664.pth [2023-03-08 13:05:55,299][434396] Updated weights for policy 0, policy_version 15120 (0.0005) [2023-03-08 13:05:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9858.2). Total num frames: 7753728. Throughput: 0: 10066.1. Samples: 7729216. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:05:56,893][434108] Avg episode reward: [(0, '1967.453')] [2023-03-08 13:05:59,277][434396] Updated weights for policy 0, policy_version 15200 (0.0004) [2023-03-08 13:06:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9885.9). Total num frames: 7806976. Throughput: 0: 10029.7. Samples: 7790152. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:06:01,893][434108] Avg episode reward: [(0, '1964.643')] [2023-03-08 13:06:03,196][434396] Updated weights for policy 0, policy_version 15280 (0.0004) [2023-03-08 13:06:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 9899.8). Total num frames: 7860224. Throughput: 0: 10030.8. Samples: 7852040. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:06:06,893][434108] Avg episode reward: [(0, '1977.075')] [2023-03-08 13:06:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000015352_7860224.pth... [2023-03-08 13:06:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000014760_7557120.pth [2023-03-08 13:06:07,203][434396] Updated weights for policy 0, policy_version 15360 (0.0004) [2023-03-08 13:06:11,355][434396] Updated weights for policy 0, policy_version 15440 (0.0005) [2023-03-08 13:06:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 7909376. Throughput: 0: 10013.3. Samples: 7882720. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:06:11,893][434108] Avg episode reward: [(0, '1996.341')] [2023-03-08 13:06:15,465][434396] Updated weights for policy 0, policy_version 15520 (0.0005) [2023-03-08 13:06:16,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 7958528. Throughput: 0: 10011.0. Samples: 7942144. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:06:16,893][434108] Avg episode reward: [(0, '1992.307')] [2023-03-08 13:06:19,508][434396] Updated weights for policy 0, policy_version 15600 (0.0005) [2023-03-08 13:06:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 8007680. Throughput: 0: 10062.1. Samples: 8002556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:06:21,893][434108] Avg episode reward: [(0, '1994.365')] [2023-03-08 13:06:21,928][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000015648_8011776.pth... [2023-03-08 13:06:21,929][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000015048_7704576.pth [2023-03-08 13:06:23,438][434396] Updated weights for policy 0, policy_version 15680 (0.0005) [2023-03-08 13:06:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 8060928. Throughput: 0: 10095.6. Samples: 8034064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:06:26,893][434108] Avg episode reward: [(0, '1994.570')] [2023-03-08 13:06:27,468][434396] Updated weights for policy 0, policy_version 15760 (0.0005) [2023-03-08 13:06:31,390][434396] Updated weights for policy 0, policy_version 15840 (0.0005) [2023-03-08 13:06:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 8114176. Throughput: 0: 10145.5. Samples: 8095712. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:06:31,893][434108] Avg episode reward: [(0, '1969.849')] [2023-03-08 13:06:35,333][434396] Updated weights for policy 0, policy_version 15920 (0.0004) [2023-03-08 13:06:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 8163328. Throughput: 0: 10198.9. Samples: 8158388. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:06:36,893][434108] Avg episode reward: [(0, '1979.536')] [2023-03-08 13:06:36,913][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000015952_8167424.pth... [2023-03-08 13:06:36,915][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000015352_7860224.pth [2023-03-08 13:06:39,236][434396] Updated weights for policy 0, policy_version 16000 (0.0005) [2023-03-08 13:06:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9969.2). Total num frames: 8216576. Throughput: 0: 10228.3. Samples: 8189488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:06:41,893][434108] Avg episode reward: [(0, '1985.722')] [2023-03-08 13:06:43,255][434396] Updated weights for policy 0, policy_version 16080 (0.0005) [2023-03-08 13:06:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 8265728. Throughput: 0: 10220.9. Samples: 8250092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:06:46,893][434108] Avg episode reward: [(0, '1977.649')] [2023-03-08 13:06:47,379][434396] Updated weights for policy 0, policy_version 16160 (0.0005) [2023-03-08 13:06:51,550][434396] Updated weights for policy 0, policy_version 16240 (0.0005) [2023-03-08 13:06:51,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 8314880. Throughput: 0: 10168.4. Samples: 8309620. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:06:51,893][434108] Avg episode reward: [(0, '1979.529')] [2023-03-08 13:06:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000016240_8314880.pth... [2023-03-08 13:06:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000015648_8011776.pth [2023-03-08 13:06:55,744][434396] Updated weights for policy 0, policy_version 16320 (0.0005) [2023-03-08 13:06:56,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 8364032. Throughput: 0: 10151.1. Samples: 8339520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:06:56,893][434108] Avg episode reward: [(0, '1973.008')] [2023-03-08 13:06:59,904][434396] Updated weights for policy 0, policy_version 16400 (0.0005) [2023-03-08 13:07:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9997.0). Total num frames: 8417280. Throughput: 0: 10121.8. Samples: 8397624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:07:01,893][434108] Avg episode reward: [(0, '1983.573')] [2023-03-08 13:07:03,746][434396] Updated weights for policy 0, policy_version 16480 (0.0004) [2023-03-08 13:07:06,893][434108] Fps is (10 sec: 10239.8, 60 sec: 10103.4, 300 sec: 9983.1). Total num frames: 8466432. Throughput: 0: 10197.6. Samples: 8461448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:07:06,893][434108] Avg episode reward: [(0, '1967.493')] [2023-03-08 13:07:06,929][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000016544_8470528.pth... 
[2023-03-08 13:07:06,930][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000015952_8167424.pth [2023-03-08 13:07:07,763][434396] Updated weights for policy 0, policy_version 16560 (0.0005) [2023-03-08 13:07:11,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 8515584. Throughput: 0: 10154.3. Samples: 8491008. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:07:11,893][434108] Avg episode reward: [(0, '1989.567')] [2023-03-08 13:07:12,034][434396] Updated weights for policy 0, policy_version 16640 (0.0005) [2023-03-08 13:07:16,296][434396] Updated weights for policy 0, policy_version 16720 (0.0004) [2023-03-08 13:07:16,892][434108] Fps is (10 sec: 9830.6, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 8564736. Throughput: 0: 10058.8. Samples: 8548360. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:07:16,893][434108] Avg episode reward: [(0, '1989.239')] [2023-03-08 13:07:20,605][434396] Updated weights for policy 0, policy_version 16800 (0.0005) [2023-03-08 13:07:21,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 8613888. Throughput: 0: 9940.4. Samples: 8605704. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:07:21,893][434108] Avg episode reward: [(0, '1983.396')] [2023-03-08 13:07:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000016824_8613888.pth... [2023-03-08 13:07:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000016240_8314880.pth [2023-03-08 13:07:24,927][434396] Updated weights for policy 0, policy_version 16880 (0.0005) [2023-03-08 13:07:26,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9966.9, 300 sec: 9983.1). Total num frames: 8658944. Throughput: 0: 9887.8. Samples: 8634440. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:07:26,893][434108] Avg episode reward: [(0, '1797.126')] [2023-03-08 13:07:29,274][434396] Updated weights for policy 0, policy_version 16960 (0.0005) [2023-03-08 13:07:31,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9983.1). Total num frames: 8708096. Throughput: 0: 9802.1. Samples: 8691188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:07:31,893][434108] Avg episode reward: [(0, '1724.821')] [2023-03-08 13:07:33,342][434396] Updated weights for policy 0, policy_version 17040 (0.0005) [2023-03-08 13:07:36,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 8757248. Throughput: 0: 9787.7. Samples: 8750064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:07:36,893][434108] Avg episode reward: [(0, '1905.312')] [2023-03-08 13:07:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000017104_8757248.pth... [2023-03-08 13:07:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000016544_8470528.pth [2023-03-08 13:07:37,612][434396] Updated weights for policy 0, policy_version 17120 (0.0005) [2023-03-08 13:07:41,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 8802304. Throughput: 0: 9759.8. Samples: 8778708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:07:41,893][434108] Avg episode reward: [(0, '1861.721')] [2023-03-08 13:07:41,978][434396] Updated weights for policy 0, policy_version 17200 (0.0005) [2023-03-08 13:07:46,220][434396] Updated weights for policy 0, policy_version 17280 (0.0005) [2023-03-08 13:07:46,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 8851456. Throughput: 0: 9722.7. Samples: 8835144. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:07:46,893][434108] Avg episode reward: [(0, '1946.266')] [2023-03-08 13:07:50,561][434396] Updated weights for policy 0, policy_version 17360 (0.0005) [2023-03-08 13:07:51,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 8900608. Throughput: 0: 9578.9. Samples: 8892496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:07:51,893][434108] Avg episode reward: [(0, '1967.975')] [2023-03-08 13:07:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000017384_8900608.pth... [2023-03-08 13:07:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000016824_8613888.pth [2023-03-08 13:07:54,823][434396] Updated weights for policy 0, policy_version 17440 (0.0005) [2023-03-08 13:07:56,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 8945664. Throughput: 0: 9558.8. Samples: 8921152. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:07:56,893][434108] Avg episode reward: [(0, '1990.152')] [2023-03-08 13:07:59,071][434396] Updated weights for policy 0, policy_version 17520 (0.0005) [2023-03-08 13:08:01,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9927.6). Total num frames: 8994816. Throughput: 0: 9576.7. Samples: 8979312. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:08:01,893][434108] Avg episode reward: [(0, '1999.182')] [2023-03-08 13:08:03,296][434396] Updated weights for policy 0, policy_version 17600 (0.0005) [2023-03-08 13:08:06,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9927.6). Total num frames: 9043968. Throughput: 0: 9592.1. Samples: 9037348. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:08:06,893][434108] Avg episode reward: [(0, '2008.934')] [2023-03-08 13:08:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000017664_9043968.pth... [2023-03-08 13:08:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000017104_8757248.pth [2023-03-08 13:08:07,550][434396] Updated weights for policy 0, policy_version 17680 (0.0005) [2023-03-08 13:08:11,770][434396] Updated weights for policy 0, policy_version 17760 (0.0005) [2023-03-08 13:08:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9927.6). Total num frames: 9093120. Throughput: 0: 9599.6. Samples: 9066424. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:08:11,893][434108] Avg episode reward: [(0, '1994.009')] [2023-03-08 13:08:16,034][434396] Updated weights for policy 0, policy_version 17840 (0.0005) [2023-03-08 13:08:16,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9899.8). Total num frames: 9138176. Throughput: 0: 9633.4. Samples: 9124692. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:08:16,893][434108] Avg episode reward: [(0, '1978.651')] [2023-03-08 13:08:20,231][434396] Updated weights for policy 0, policy_version 17920 (0.0005) [2023-03-08 13:08:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9913.7). Total num frames: 9191424. Throughput: 0: 9621.1. Samples: 9183012. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:08:21,893][434108] Avg episode reward: [(0, '1942.333')] [2023-03-08 13:08:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000017952_9191424.pth... 
[2023-03-08 13:08:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000017384_8900608.pth [2023-03-08 13:08:24,325][434396] Updated weights for policy 0, policy_version 18000 (0.0005) [2023-03-08 13:08:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 9240576. Throughput: 0: 9646.3. Samples: 9212792. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:08:26,893][434108] Avg episode reward: [(0, '1921.454')] [2023-03-08 13:08:28,482][434396] Updated weights for policy 0, policy_version 18080 (0.0005) [2023-03-08 13:08:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 9289728. Throughput: 0: 9713.9. Samples: 9272272. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:08:31,893][434108] Avg episode reward: [(0, '1887.489')] [2023-03-08 13:08:32,632][434396] Updated weights for policy 0, policy_version 18160 (0.0005) [2023-03-08 13:08:36,859][434396] Updated weights for policy 0, policy_version 18240 (0.0005) [2023-03-08 13:08:36,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 9338880. Throughput: 0: 9739.2. Samples: 9330760. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:08:36,893][434108] Avg episode reward: [(0, '1943.825')] [2023-03-08 13:08:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000018240_9338880.pth... [2023-03-08 13:08:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000017664_9043968.pth [2023-03-08 13:08:40,955][434396] Updated weights for policy 0, policy_version 18320 (0.0004) [2023-03-08 13:08:41,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 9388032. Throughput: 0: 9757.6. Samples: 9360244. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:08:41,893][434108] Avg episode reward: [(0, '1922.584')] [2023-03-08 13:08:45,120][434396] Updated weights for policy 0, policy_version 18400 (0.0005) [2023-03-08 13:08:46,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 9437184. Throughput: 0: 9793.8. Samples: 9420032. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:08:46,893][434108] Avg episode reward: [(0, '1893.820')] [2023-03-08 13:08:49,236][434396] Updated weights for policy 0, policy_version 18480 (0.0005) [2023-03-08 13:08:51,893][434108] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 9486336. Throughput: 0: 9848.9. Samples: 9480548. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:08:51,893][434108] Avg episode reward: [(0, '1909.230')] [2023-03-08 13:08:51,911][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000018536_9490432.pth... [2023-03-08 13:08:51,912][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000017952_9191424.pth [2023-03-08 13:08:53,097][434396] Updated weights for policy 0, policy_version 18560 (0.0004) [2023-03-08 13:08:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 9539584. Throughput: 0: 9890.2. Samples: 9511484. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:08:56,903][434108] Avg episode reward: [(0, '1939.413')] [2023-03-08 13:08:57,139][434396] Updated weights for policy 0, policy_version 18640 (0.0005) [2023-03-08 13:09:01,498][434396] Updated weights for policy 0, policy_version 18720 (0.0005) [2023-03-08 13:09:01,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9941.5). Total num frames: 9584640. Throughput: 0: 9913.4. Samples: 9570796. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:09:01,893][434108] Avg episode reward: [(0, '1818.700')] [2023-03-08 13:09:05,590][434396] Updated weights for policy 0, policy_version 18800 (0.0005) [2023-03-08 13:09:06,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 9637888. Throughput: 0: 9926.6. Samples: 9629708. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:09:06,893][434108] Avg episode reward: [(0, '1573.836')] [2023-03-08 13:09:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000018824_9637888.pth... [2023-03-08 13:09:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000018240_9338880.pth [2023-03-08 13:09:09,768][434396] Updated weights for policy 0, policy_version 18880 (0.0005) [2023-03-08 13:09:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 9687040. Throughput: 0: 9911.4. Samples: 9658804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:11,893][434108] Avg episode reward: [(0, '1760.190')] [2023-03-08 13:09:13,726][434396] Updated weights for policy 0, policy_version 18960 (0.0005) [2023-03-08 13:09:16,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 9736192. Throughput: 0: 9964.1. Samples: 9720656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:16,893][434108] Avg episode reward: [(0, '1673.260')] [2023-03-08 13:09:17,721][434396] Updated weights for policy 0, policy_version 19040 (0.0005) [2023-03-08 13:09:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 9785344. Throughput: 0: 10010.8. Samples: 9781244. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:21,893][434108] Avg episode reward: [(0, '1848.472')] [2023-03-08 13:09:21,951][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000019120_9789440.pth... [2023-03-08 13:09:21,952][434396] Updated weights for policy 0, policy_version 19120 (0.0005) [2023-03-08 13:09:21,953][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000018536_9490432.pth [2023-03-08 13:09:26,199][434396] Updated weights for policy 0, policy_version 19200 (0.0005) [2023-03-08 13:09:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 9834496. Throughput: 0: 9976.1. Samples: 9809168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:26,893][434108] Avg episode reward: [(0, '1813.005')] [2023-03-08 13:09:30,523][434396] Updated weights for policy 0, policy_version 19280 (0.0005) [2023-03-08 13:09:31,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 9883648. Throughput: 0: 9924.8. Samples: 9866648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:31,893][434108] Avg episode reward: [(0, '1922.384')] [2023-03-08 13:09:34,655][434396] Updated weights for policy 0, policy_version 19360 (0.0005) [2023-03-08 13:09:36,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 9932800. Throughput: 0: 9908.6. Samples: 9926436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:36,893][434108] Avg episode reward: [(0, '1923.762')] [2023-03-08 13:09:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000019400_9932800.pth... 
[2023-03-08 13:09:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000018824_9637888.pth [2023-03-08 13:09:38,738][434396] Updated weights for policy 0, policy_version 19440 (0.0005) [2023-03-08 13:09:41,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 9981952. Throughput: 0: 9882.1. Samples: 9956180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:41,893][434108] Avg episode reward: [(0, '1894.021')] [2023-03-08 13:09:42,795][434396] Updated weights for policy 0, policy_version 19520 (0.0005) [2023-03-08 13:09:46,725][434396] Updated weights for policy 0, policy_version 19600 (0.0005) [2023-03-08 13:09:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 10035200. Throughput: 0: 9918.2. Samples: 10017116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:46,893][434108] Avg episode reward: [(0, '1906.474')] [2023-03-08 13:09:50,688][434396] Updated weights for policy 0, policy_version 19680 (0.0005) [2023-03-08 13:09:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9967.0, 300 sec: 9955.4). Total num frames: 10084352. Throughput: 0: 10005.6. Samples: 10079960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:51,893][434108] Avg episode reward: [(0, '1955.399')] [2023-03-08 13:09:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000019696_10084352.pth... [2023-03-08 13:09:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000019120_9789440.pth [2023-03-08 13:09:54,946][434396] Updated weights for policy 0, policy_version 19760 (0.0006) [2023-03-08 13:09:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 10133504. Throughput: 0: 9992.7. Samples: 10108476. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:09:56,893][434108] Avg episode reward: [(0, '1966.880')] [2023-03-08 13:09:59,026][434396] Updated weights for policy 0, policy_version 19840 (0.0004) [2023-03-08 13:10:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 10186752. Throughput: 0: 9945.3. Samples: 10168192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:10:01,893][434108] Avg episode reward: [(0, '1862.590')] [2023-03-08 13:10:03,008][434396] Updated weights for policy 0, policy_version 19920 (0.0004) [2023-03-08 13:10:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 10235904. Throughput: 0: 9921.7. Samples: 10227720. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:10:06,893][434108] Avg episode reward: [(0, '1863.055')] [2023-03-08 13:10:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000019992_10235904.pth... [2023-03-08 13:10:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000019400_9932800.pth [2023-03-08 13:10:07,178][434396] Updated weights for policy 0, policy_version 20000 (0.0006) [2023-03-08 13:10:11,134][434396] Updated weights for policy 0, policy_version 20080 (0.0005) [2023-03-08 13:10:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 10285056. Throughput: 0: 10012.6. Samples: 10259736. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:10:11,893][434108] Avg episode reward: [(0, '1920.294')] [2023-03-08 13:10:15,317][434396] Updated weights for policy 0, policy_version 20160 (0.0005) [2023-03-08 13:10:16,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 10334208. Throughput: 0: 10046.9. Samples: 10318756. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:10:16,893][434108] Avg episode reward: [(0, '1876.567')] [2023-03-08 13:10:19,516][434396] Updated weights for policy 0, policy_version 20240 (0.0004) [2023-03-08 13:10:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 10383360. Throughput: 0: 10024.9. Samples: 10377556. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:10:21,893][434108] Avg episode reward: [(0, '1891.188')] [2023-03-08 13:10:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000020280_10383360.pth... [2023-03-08 13:10:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000019696_10084352.pth [2023-03-08 13:10:23,704][434396] Updated weights for policy 0, policy_version 20320 (0.0005) [2023-03-08 13:10:26,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 10432512. Throughput: 0: 10028.1. Samples: 10407444. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:10:26,893][434108] Avg episode reward: [(0, '1906.333')] [2023-03-08 13:10:27,815][434396] Updated weights for policy 0, policy_version 20400 (0.0005) [2023-03-08 13:10:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 10481664. Throughput: 0: 9983.6. Samples: 10466380. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:10:31,893][434108] Avg episode reward: [(0, '1918.942')] [2023-03-08 13:10:31,947][434396] Updated weights for policy 0, policy_version 20480 (0.0005) [2023-03-08 13:10:36,188][434396] Updated weights for policy 0, policy_version 20560 (0.0005) [2023-03-08 13:10:36,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 10530816. Throughput: 0: 9895.6. Samples: 10525264. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:10:36,893][434108] Avg episode reward: [(0, '1979.932')] [2023-03-08 13:10:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000020568_10530816.pth... [2023-03-08 13:10:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000019992_10235904.pth [2023-03-08 13:10:40,242][434396] Updated weights for policy 0, policy_version 20640 (0.0005) [2023-03-08 13:10:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 10584064. Throughput: 0: 9932.9. Samples: 10555456. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:10:41,893][434108] Avg episode reward: [(0, '1976.326')] [2023-03-08 13:10:44,202][434396] Updated weights for policy 0, policy_version 20720 (0.0005) [2023-03-08 13:10:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 10633216. Throughput: 0: 9976.3. Samples: 10617124. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:10:46,893][434108] Avg episode reward: [(0, '1940.134')] [2023-03-08 13:10:48,355][434396] Updated weights for policy 0, policy_version 20800 (0.0005) [2023-03-08 13:10:51,893][434108] Fps is (10 sec: 9830.2, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 10682368. Throughput: 0: 9960.0. Samples: 10675920. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:10:51,893][434108] Avg episode reward: [(0, '1958.841')] [2023-03-08 13:10:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000020864_10682368.pth... 
[2023-03-08 13:10:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000020280_10383360.pth [2023-03-08 13:10:52,466][434396] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-08 13:10:56,348][434396] Updated weights for policy 0, policy_version 20960 (0.0005) [2023-03-08 13:10:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 10735616. Throughput: 0: 9939.4. Samples: 10707008. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:10:56,893][434108] Avg episode reward: [(0, '1963.128')] [2023-03-08 13:11:00,262][434396] Updated weights for policy 0, policy_version 21040 (0.0005) [2023-03-08 13:11:01,892][434108] Fps is (10 sec: 10649.9, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 10788864. Throughput: 0: 10022.6. Samples: 10769772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:01,893][434108] Avg episode reward: [(0, '1987.949')] [2023-03-08 13:11:04,194][434396] Updated weights for policy 0, policy_version 21120 (0.0005) [2023-03-08 13:11:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 10838016. Throughput: 0: 10115.0. Samples: 10832732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:06,893][434108] Avg episode reward: [(0, '1926.998')] [2023-03-08 13:11:06,899][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000021176_10842112.pth... [2023-03-08 13:11:06,901][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000020568_10530816.pth [2023-03-08 13:11:08,095][434396] Updated weights for policy 0, policy_version 21200 (0.0004) [2023-03-08 13:11:11,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 10891264. Throughput: 0: 10133.6. Samples: 10863456. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:11,893][434108] Avg episode reward: [(0, '1946.551')] [2023-03-08 13:11:12,131][434396] Updated weights for policy 0, policy_version 21280 (0.0004) [2023-03-08 13:11:16,182][434396] Updated weights for policy 0, policy_version 21360 (0.0004) [2023-03-08 13:11:16,892][434108] Fps is (10 sec: 10240.2, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 10940416. Throughput: 0: 10182.8. Samples: 10924604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:16,893][434108] Avg episode reward: [(0, '1980.361')] [2023-03-08 13:11:20,334][434396] Updated weights for policy 0, policy_version 21440 (0.0005) [2023-03-08 13:11:21,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9927.6). Total num frames: 10989568. Throughput: 0: 10191.0. Samples: 10983860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:21,893][434108] Avg episode reward: [(0, '1977.630')] [2023-03-08 13:11:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000021464_10989568.pth... [2023-03-08 13:11:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000020864_10682368.pth [2023-03-08 13:11:24,512][434396] Updated weights for policy 0, policy_version 21520 (0.0005) [2023-03-08 13:11:26,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 9913.7). Total num frames: 11038720. Throughput: 0: 10184.9. Samples: 11013776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:26,893][434108] Avg episode reward: [(0, '1904.160')] [2023-03-08 13:11:28,658][434396] Updated weights for policy 0, policy_version 21600 (0.0005) [2023-03-08 13:11:31,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 9913.7). Total num frames: 11087872. Throughput: 0: 10105.5. Samples: 11071872. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:31,893][434108] Avg episode reward: [(0, '1967.911')] [2023-03-08 13:11:32,838][434396] Updated weights for policy 0, policy_version 21680 (0.0005) [2023-03-08 13:11:36,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 11137024. Throughput: 0: 10128.6. Samples: 11131708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:36,893][434108] Avg episode reward: [(0, '1981.819')] [2023-03-08 13:11:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000021752_11137024.pth... [2023-03-08 13:11:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000021176_10842112.pth [2023-03-08 13:11:37,017][434396] Updated weights for policy 0, policy_version 21760 (0.0005) [2023-03-08 13:11:40,890][434396] Updated weights for policy 0, policy_version 21840 (0.0004) [2023-03-08 13:11:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9913.7). Total num frames: 11190272. Throughput: 0: 10117.3. Samples: 11162288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:41,893][434108] Avg episode reward: [(0, '1955.065')] [2023-03-08 13:11:44,956][434396] Updated weights for policy 0, policy_version 21920 (0.0005) [2023-03-08 13:11:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9913.7). Total num frames: 11239424. Throughput: 0: 10079.4. Samples: 11223348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:46,893][434108] Avg episode reward: [(0, '1951.717')] [2023-03-08 13:11:49,118][434396] Updated weights for policy 0, policy_version 22000 (0.0006) [2023-03-08 13:11:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9913.7). Total num frames: 11288576. Throughput: 0: 9991.8. Samples: 11282364. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:51,893][434108] Avg episode reward: [(0, '1900.432')] [2023-03-08 13:11:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000022048_11288576.pth... [2023-03-08 13:11:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000021464_10989568.pth [2023-03-08 13:11:53,400][434396] Updated weights for policy 0, policy_version 22080 (0.0005) [2023-03-08 13:11:56,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 11337728. Throughput: 0: 9941.2. Samples: 11310808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:11:56,893][434108] Avg episode reward: [(0, '1868.611')] [2023-03-08 13:11:57,547][434396] Updated weights for policy 0, policy_version 22160 (0.0004) [2023-03-08 13:12:01,443][434396] Updated weights for policy 0, policy_version 22240 (0.0004) [2023-03-08 13:12:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 11390976. Throughput: 0: 9935.8. Samples: 11371716. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:12:01,893][434108] Avg episode reward: [(0, '1944.466')] [2023-03-08 13:12:05,531][434396] Updated weights for policy 0, policy_version 22320 (0.0005) [2023-03-08 13:12:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9913.7). Total num frames: 11440128. Throughput: 0: 9961.1. Samples: 11432108. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:12:06,893][434108] Avg episode reward: [(0, '1889.920')] [2023-03-08 13:12:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000022344_11440128.pth... 
[2023-03-08 13:12:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000021752_11137024.pth [2023-03-08 13:12:09,778][434396] Updated weights for policy 0, policy_version 22400 (0.0005) [2023-03-08 13:12:11,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 11489280. Throughput: 0: 9943.8. Samples: 11461248. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:12:11,893][434108] Avg episode reward: [(0, '1921.264')] [2023-03-08 13:12:13,790][434396] Updated weights for policy 0, policy_version 22480 (0.0004) [2023-03-08 13:12:16,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 11538432. Throughput: 0: 10005.5. Samples: 11522120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:12:16,893][434108] Avg episode reward: [(0, '1736.453')] [2023-03-08 13:12:17,850][434396] Updated weights for policy 0, policy_version 22560 (0.0005) [2023-03-08 13:12:21,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 11587584. Throughput: 0: 9985.0. Samples: 11581032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:12:21,893][434108] Avg episode reward: [(0, '1828.511')] [2023-03-08 13:12:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000022632_11587584.pth... [2023-03-08 13:12:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000022048_11288576.pth [2023-03-08 13:12:22,171][434396] Updated weights for policy 0, policy_version 22640 (0.0006) [2023-03-08 13:12:26,346][434396] Updated weights for policy 0, policy_version 22720 (0.0005) [2023-03-08 13:12:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 11636736. Throughput: 0: 9948.2. Samples: 11609956. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:12:26,893][434108] Avg episode reward: [(0, '1890.480')] [2023-03-08 13:12:30,588][434396] Updated weights for policy 0, policy_version 22800 (0.0005) [2023-03-08 13:12:31,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 11685888. Throughput: 0: 9889.0. Samples: 11668352. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:12:31,893][434108] Avg episode reward: [(0, '1926.311')] [2023-03-08 13:12:34,646][434396] Updated weights for policy 0, policy_version 22880 (0.0005) [2023-03-08 13:12:36,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 11735040. Throughput: 0: 9897.4. Samples: 11727748. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:12:36,893][434108] Avg episode reward: [(0, '1830.485')] [2023-03-08 13:12:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000022920_11735040.pth... [2023-03-08 13:12:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000022344_11440128.pth [2023-03-08 13:12:38,803][434396] Updated weights for policy 0, policy_version 22960 (0.0004) [2023-03-08 13:12:41,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 11784192. Throughput: 0: 9933.8. Samples: 11757828. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:12:41,893][434108] Avg episode reward: [(0, '1702.692')] [2023-03-08 13:12:42,943][434396] Updated weights for policy 0, policy_version 23040 (0.0005) [2023-03-08 13:12:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 11833344. Throughput: 0: 9886.9. Samples: 11816628. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:12:46,893][434108] Avg episode reward: [(0, '1829.011')] [2023-03-08 13:12:47,178][434396] Updated weights for policy 0, policy_version 23120 (0.0005) [2023-03-08 13:12:51,576][434396] Updated weights for policy 0, policy_version 23200 (0.0005) [2023-03-08 13:12:51,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9941.5). Total num frames: 11878400. Throughput: 0: 9808.6. Samples: 11873496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:12:51,893][434108] Avg episode reward: [(0, '1680.303')] [2023-03-08 13:12:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000023200_11878400.pth... [2023-03-08 13:12:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000022632_11587584.pth [2023-03-08 13:12:55,782][434396] Updated weights for policy 0, policy_version 23280 (0.0006) [2023-03-08 13:12:56,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9941.5). Total num frames: 11927552. Throughput: 0: 9795.8. Samples: 11902060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:12:56,893][434108] Avg episode reward: [(0, '1913.871')] [2023-03-08 13:13:00,057][434396] Updated weights for policy 0, policy_version 23360 (0.0005) [2023-03-08 13:13:01,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 11976704. Throughput: 0: 9737.7. Samples: 11960316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:01,893][434108] Avg episode reward: [(0, '1932.097')] [2023-03-08 13:13:04,503][434396] Updated weights for policy 0, policy_version 23440 (0.0005) [2023-03-08 13:13:06,892][434108] Fps is (10 sec: 9420.7, 60 sec: 9693.8, 300 sec: 9927.6). Total num frames: 12021760. Throughput: 0: 9669.2. Samples: 12016144. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:06,893][434108] Avg episode reward: [(0, '1940.271')] [2023-03-08 13:13:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000023480_12021760.pth... [2023-03-08 13:13:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000022920_11735040.pth [2023-03-08 13:13:08,583][434396] Updated weights for policy 0, policy_version 23520 (0.0005) [2023-03-08 13:13:11,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 12070912. Throughput: 0: 9699.0. Samples: 12046408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:11,893][434108] Avg episode reward: [(0, '1971.079')] [2023-03-08 13:13:12,771][434396] Updated weights for policy 0, policy_version 23600 (0.0005) [2023-03-08 13:13:16,824][434396] Updated weights for policy 0, policy_version 23680 (0.0005) [2023-03-08 13:13:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 12124160. Throughput: 0: 9735.4. Samples: 12106444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:16,893][434108] Avg episode reward: [(0, '1784.320')] [2023-03-08 13:13:20,803][434396] Updated weights for policy 0, policy_version 23760 (0.0005) [2023-03-08 13:13:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 12173312. Throughput: 0: 9770.5. Samples: 12167420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:21,893][434108] Avg episode reward: [(0, '1811.330')] [2023-03-08 13:13:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000023776_12173312.pth... 
[2023-03-08 13:13:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000023200_11878400.pth [2023-03-08 13:13:24,745][434396] Updated weights for policy 0, policy_version 23840 (0.0004) [2023-03-08 13:13:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 12226560. Throughput: 0: 9793.5. Samples: 12198536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:26,893][434108] Avg episode reward: [(0, '1863.060')] [2023-03-08 13:13:28,912][434396] Updated weights for policy 0, policy_version 23920 (0.0005) [2023-03-08 13:13:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 12275712. Throughput: 0: 9806.7. Samples: 12257932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:31,893][434108] Avg episode reward: [(0, '1866.851')] [2023-03-08 13:13:32,974][434396] Updated weights for policy 0, policy_version 24000 (0.0005) [2023-03-08 13:13:36,870][434396] Updated weights for policy 0, policy_version 24080 (0.0005) [2023-03-08 13:13:36,893][434108] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 12328960. Throughput: 0: 9922.7. Samples: 12320020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:36,893][434108] Avg episode reward: [(0, '1870.194')] [2023-03-08 13:13:36,898][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000024080_12328960.pth... [2023-03-08 13:13:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000023480_12021760.pth [2023-03-08 13:13:40,831][434396] Updated weights for policy 0, policy_version 24160 (0.0005) [2023-03-08 13:13:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 12378112. Throughput: 0: 9977.0. Samples: 12351024. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:41,893][434108] Avg episode reward: [(0, '1856.944')] [2023-03-08 13:13:45,092][434396] Updated weights for policy 0, policy_version 24240 (0.0006) [2023-03-08 13:13:46,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9969.3). Total num frames: 12427264. Throughput: 0: 10007.6. Samples: 12410656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:46,893][434108] Avg episode reward: [(0, '1844.612')] [2023-03-08 13:13:49,320][434396] Updated weights for policy 0, policy_version 24320 (0.0005) [2023-03-08 13:13:51,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 12472320. Throughput: 0: 10046.2. Samples: 12468224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:13:51,893][434108] Avg episode reward: [(0, '1889.123')] [2023-03-08 13:13:51,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000024368_12476416.pth... [2023-03-08 13:13:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000023776_12173312.pth [2023-03-08 13:13:53,571][434396] Updated weights for policy 0, policy_version 24400 (0.0005) [2023-03-08 13:13:56,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 12521472. Throughput: 0: 10012.2. Samples: 12496960. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:13:56,893][434108] Avg episode reward: [(0, '1867.496')] [2023-03-08 13:13:57,759][434396] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-08 13:13:58,594][434347] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000007 [2023-03-08 13:14:01,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 12570624. Throughput: 0: 9961.2. Samples: 12554700. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:14:01,893][434108] Avg episode reward: [(0, '1882.825')] [2023-03-08 13:14:02,206][434396] Updated weights for policy 0, policy_version 24560 (0.0005) [2023-03-08 13:14:06,472][434396] Updated weights for policy 0, policy_version 24640 (0.0005) [2023-03-08 13:14:06,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 12619776. Throughput: 0: 9865.9. Samples: 12611384. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:14:06,893][434108] Avg episode reward: [(0, '1789.544')] [2023-03-08 13:14:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000024648_12619776.pth... [2023-03-08 13:14:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000024080_12328960.pth [2023-03-08 13:14:10,436][434396] Updated weights for policy 0, policy_version 24720 (0.0004) [2023-03-08 13:14:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 12668928. Throughput: 0: 9861.8. Samples: 12642316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:14:11,893][434108] Avg episode reward: [(0, '1878.132')] [2023-03-08 13:14:14,643][434396] Updated weights for policy 0, policy_version 24800 (0.0005) [2023-03-08 13:14:16,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 12718080. Throughput: 0: 9858.7. Samples: 12701572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:14:16,893][434108] Avg episode reward: [(0, '1902.935')] [2023-03-08 13:14:18,757][434396] Updated weights for policy 0, policy_version 24880 (0.0005) [2023-03-08 13:14:21,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 12763136. Throughput: 0: 9729.4. Samples: 12757844. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:14:21,893][434108] Avg episode reward: [(0, '1901.712')] [2023-03-08 13:14:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000024928_12763136.pth... [2023-03-08 13:14:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000024368_12476416.pth [2023-03-08 13:14:23,339][434396] Updated weights for policy 0, policy_version 24960 (0.0005) [2023-03-08 13:14:26,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 12812288. Throughput: 0: 9682.4. Samples: 12786732. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:14:26,893][434108] Avg episode reward: [(0, '1920.205')] [2023-03-08 13:14:27,489][434396] Updated weights for policy 0, policy_version 25040 (0.0005) [2023-03-08 13:14:31,740][434396] Updated weights for policy 0, policy_version 25120 (0.0005) [2023-03-08 13:14:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 12861440. Throughput: 0: 9644.6. Samples: 12844664. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:14:31,893][434108] Avg episode reward: [(0, '1912.814')] [2023-03-08 13:14:36,001][434396] Updated weights for policy 0, policy_version 25200 (0.0005) [2023-03-08 13:14:36,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 12910592. Throughput: 0: 9649.8. Samples: 12902464. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:14:36,893][434108] Avg episode reward: [(0, '1943.230')] [2023-03-08 13:14:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000025216_12910592.pth... 
[2023-03-08 13:14:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000024648_12619776.pth [2023-03-08 13:14:40,298][434396] Updated weights for policy 0, policy_version 25280 (0.0005) [2023-03-08 13:14:41,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9899.8). Total num frames: 12955648. Throughput: 0: 9647.1. Samples: 12931080. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:14:41,893][434108] Avg episode reward: [(0, '1956.615')] [2023-03-08 13:14:44,497][434396] Updated weights for policy 0, policy_version 25360 (0.0005) [2023-03-08 13:14:46,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9899.8). Total num frames: 13004800. Throughput: 0: 9649.2. Samples: 12988916. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:14:46,893][434108] Avg episode reward: [(0, '1959.123')] [2023-03-08 13:14:48,694][434396] Updated weights for policy 0, policy_version 25440 (0.0005) [2023-03-08 13:14:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9899.8). Total num frames: 13053952. Throughput: 0: 9710.9. Samples: 13048376. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:14:51,893][434108] Avg episode reward: [(0, '1963.127')] [2023-03-08 13:14:51,927][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000025504_13058048.pth... [2023-03-08 13:14:51,929][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000024928_12763136.pth [2023-03-08 13:14:52,756][434396] Updated weights for policy 0, policy_version 25520 (0.0004) [2023-03-08 13:14:56,832][434396] Updated weights for policy 0, policy_version 25600 (0.0005) [2023-03-08 13:14:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 13107200. Throughput: 0: 9695.2. Samples: 13078600. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:14:56,893][434108] Avg episode reward: [(0, '1948.219')] [2023-03-08 13:15:00,919][434396] Updated weights for policy 0, policy_version 25680 (0.0005) [2023-03-08 13:15:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 13156352. Throughput: 0: 9725.2. Samples: 13139204. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:15:01,893][434108] Avg episode reward: [(0, '1921.040')] [2023-03-08 13:15:04,967][434396] Updated weights for policy 0, policy_version 25760 (0.0005) [2023-03-08 13:15:06,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 13205504. Throughput: 0: 9818.3. Samples: 13199668. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:15:06,893][434108] Avg episode reward: [(0, '1915.225')] [2023-03-08 13:15:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000025792_13205504.pth... [2023-03-08 13:15:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000025216_12910592.pth [2023-03-08 13:15:09,112][434396] Updated weights for policy 0, policy_version 25840 (0.0005) [2023-03-08 13:15:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 13254656. Throughput: 0: 9829.2. Samples: 13229048. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:15:11,893][434108] Avg episode reward: [(0, '1933.411')] [2023-03-08 13:15:13,156][434396] Updated weights for policy 0, policy_version 25920 (0.0005) [2023-03-08 13:15:16,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 13303808. Throughput: 0: 9840.5. Samples: 13287488. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:15:16,893][434108] Avg episode reward: [(0, '1927.850')] [2023-03-08 13:15:17,480][434396] Updated weights for policy 0, policy_version 26000 (0.0006) [2023-03-08 13:15:21,743][434396] Updated weights for policy 0, policy_version 26080 (0.0005) [2023-03-08 13:15:21,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 13352960. Throughput: 0: 9858.5. Samples: 13346096. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:15:21,893][434108] Avg episode reward: [(0, '1937.723')] [2023-03-08 13:15:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000026080_13352960.pth... [2023-03-08 13:15:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000025504_13058048.pth [2023-03-08 13:15:26,047][434396] Updated weights for policy 0, policy_version 26160 (0.0005) [2023-03-08 13:15:26,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 13402112. Throughput: 0: 9845.2. Samples: 13374112. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:15:26,893][434108] Avg episode reward: [(0, '1954.904')] [2023-03-08 13:15:30,294][434396] Updated weights for policy 0, policy_version 26240 (0.0004) [2023-03-08 13:15:31,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9885.9). Total num frames: 13447168. Throughput: 0: 9843.2. Samples: 13431860. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:15:31,893][434108] Avg episode reward: [(0, '1927.748')] [2023-03-08 13:15:34,561][434396] Updated weights for policy 0, policy_version 26320 (0.0005) [2023-03-08 13:15:36,893][434108] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9872.1). Total num frames: 13496320. Throughput: 0: 9820.1. Samples: 13490280. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:15:36,893][434108] Avg episode reward: [(0, '1924.859')] [2023-03-08 13:15:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000026360_13496320.pth... [2023-03-08 13:15:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000025792_13205504.pth [2023-03-08 13:15:38,578][434396] Updated weights for policy 0, policy_version 26400 (0.0005) [2023-03-08 13:15:41,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 13545472. Throughput: 0: 9830.4. Samples: 13520968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:15:41,893][434108] Avg episode reward: [(0, '1969.212')] [2023-03-08 13:15:42,743][434396] Updated weights for policy 0, policy_version 26480 (0.0005) [2023-03-08 13:15:46,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9872.1). Total num frames: 13594624. Throughput: 0: 9786.2. Samples: 13579584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:15:46,893][434108] Avg episode reward: [(0, '1954.814')] [2023-03-08 13:15:46,955][434396] Updated weights for policy 0, policy_version 26560 (0.0005) [2023-03-08 13:15:51,106][434396] Updated weights for policy 0, policy_version 26640 (0.0004) [2023-03-08 13:15:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 13643776. Throughput: 0: 9773.6. Samples: 13639480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:15:51,893][434108] Avg episode reward: [(0, '1953.292')] [2023-03-08 13:15:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000026648_13643776.pth... 
[2023-03-08 13:15:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000026080_13352960.pth [2023-03-08 13:15:55,313][434396] Updated weights for policy 0, policy_version 26720 (0.0005) [2023-03-08 13:15:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9844.3). Total num frames: 13692928. Throughput: 0: 9762.5. Samples: 13668360. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:15:56,893][434108] Avg episode reward: [(0, '1966.559')] [2023-03-08 13:15:59,511][434396] Updated weights for policy 0, policy_version 26800 (0.0005) [2023-03-08 13:16:01,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9844.3). Total num frames: 13742080. Throughput: 0: 9751.5. Samples: 13726304. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:16:01,893][434108] Avg episode reward: [(0, '1941.234')] [2023-03-08 13:16:03,636][434396] Updated weights for policy 0, policy_version 26880 (0.0004) [2023-03-08 13:16:06,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9844.3). Total num frames: 13795328. Throughput: 0: 9798.6. Samples: 13787032. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:16:06,893][434108] Avg episode reward: [(0, '1965.560')] [2023-03-08 13:16:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000026944_13795328.pth... [2023-03-08 13:16:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000026360_13496320.pth [2023-03-08 13:16:07,764][434396] Updated weights for policy 0, policy_version 26960 (0.0004) [2023-03-08 13:16:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 13840384. Throughput: 0: 9816.9. Samples: 13815872. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:16:11,893][434108] Avg episode reward: [(0, '1957.691')] [2023-03-08 13:16:11,920][434396] Updated weights for policy 0, policy_version 27040 (0.0005) [2023-03-08 13:16:16,327][434396] Updated weights for policy 0, policy_version 27120 (0.0005) [2023-03-08 13:16:16,892][434108] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 13889536. Throughput: 0: 9808.1. Samples: 13873224. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:16:16,893][434108] Avg episode reward: [(0, '1967.294')] [2023-03-08 13:16:20,570][434396] Updated weights for policy 0, policy_version 27200 (0.0005) [2023-03-08 13:16:21,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 13938688. Throughput: 0: 9784.2. Samples: 13930568. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:16:21,893][434108] Avg episode reward: [(0, '1992.264')] [2023-03-08 13:16:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000027224_13938688.pth... [2023-03-08 13:16:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000026648_13643776.pth [2023-03-08 13:16:24,686][434396] Updated weights for policy 0, policy_version 27280 (0.0005) [2023-03-08 13:16:26,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9830.4). Total num frames: 13987840. Throughput: 0: 9768.5. Samples: 13960548. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:16:26,893][434108] Avg episode reward: [(0, '1997.839')] [2023-03-08 13:16:28,885][434396] Updated weights for policy 0, policy_version 27360 (0.0005) [2023-03-08 13:16:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 14036992. Throughput: 0: 9776.2. Samples: 14019512. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:16:31,893][434108] Avg episode reward: [(0, '1990.787')] [2023-03-08 13:16:33,081][434396] Updated weights for policy 0, policy_version 27440 (0.0005) [2023-03-08 13:16:36,892][434108] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 14082048. Throughput: 0: 9736.6. Samples: 14077628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:16:36,893][434108] Avg episode reward: [(0, '1989.588')] [2023-03-08 13:16:36,912][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000027512_14086144.pth... [2023-03-08 13:16:36,914][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000026944_13795328.pth [2023-03-08 13:16:37,324][434396] Updated weights for policy 0, policy_version 27520 (0.0005) [2023-03-08 13:16:41,500][434396] Updated weights for policy 0, policy_version 27600 (0.0005) [2023-03-08 13:16:41,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 14131200. Throughput: 0: 9741.3. Samples: 14106720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:16:41,893][434108] Avg episode reward: [(0, '1614.136')] [2023-03-08 13:16:45,738][434396] Updated weights for policy 0, policy_version 27680 (0.0005) [2023-03-08 13:16:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 14180352. Throughput: 0: 9747.0. Samples: 14164920. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:16:46,893][434108] Avg episode reward: [(0, '1907.804')] [2023-03-08 13:16:49,711][434396] Updated weights for policy 0, policy_version 27760 (0.0004) [2023-03-08 13:16:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9816.5). Total num frames: 14233600. Throughput: 0: 9779.0. Samples: 14227088. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:16:51,893][434108] Avg episode reward: [(0, '1952.448')] [2023-03-08 13:16:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000027800_14233600.pth... [2023-03-08 13:16:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000027224_13938688.pth [2023-03-08 13:16:53,530][434396] Updated weights for policy 0, policy_version 27840 (0.0003) [2023-03-08 13:16:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 14286848. Throughput: 0: 9842.8. Samples: 14258800. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:16:56,893][434108] Avg episode reward: [(0, '1924.714')] [2023-03-08 13:16:57,635][434396] Updated weights for policy 0, policy_version 27920 (0.0005) [2023-03-08 13:17:01,542][434396] Updated weights for policy 0, policy_version 28000 (0.0005) [2023-03-08 13:17:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 14336000. Throughput: 0: 9920.0. Samples: 14319624. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:17:01,893][434108] Avg episode reward: [(0, '2026.044')] [2023-03-08 13:17:01,893][434347] Saving new best policy, reward=2026.044! [2023-03-08 13:17:05,446][434396] Updated weights for policy 0, policy_version 28080 (0.0005) [2023-03-08 13:17:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9898.6, 300 sec: 9830.4). Total num frames: 14389248. Throughput: 0: 10063.6. Samples: 14383432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:17:06,893][434108] Avg episode reward: [(0, '2249.155')] [2023-03-08 13:17:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000028104_14389248.pth... 
[2023-03-08 13:17:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000027512_14086144.pth [2023-03-08 13:17:06,898][434347] Saving new best policy, reward=2249.155! [2023-03-08 13:17:09,281][434396] Updated weights for policy 0, policy_version 28160 (0.0005) [2023-03-08 13:17:11,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 9844.3). Total num frames: 14442496. Throughput: 0: 10092.9. Samples: 14414728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:17:11,893][434108] Avg episode reward: [(0, '2268.072')] [2023-03-08 13:17:11,922][434347] Saving new best policy, reward=2268.072! [2023-03-08 13:17:13,012][434396] Updated weights for policy 0, policy_version 28240 (0.0004) [2023-03-08 13:17:16,690][434396] Updated weights for policy 0, policy_version 28320 (0.0004) [2023-03-08 13:17:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10171.7, 300 sec: 9872.1). Total num frames: 14499840. Throughput: 0: 10252.9. Samples: 14480892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:17:16,893][434108] Avg episode reward: [(0, '2354.424')] [2023-03-08 13:17:16,893][434347] Saving new best policy, reward=2354.424! [2023-03-08 13:17:20,758][434396] Updated weights for policy 0, policy_version 28400 (0.0005) [2023-03-08 13:17:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 9872.1). Total num frames: 14548992. Throughput: 0: 10356.5. Samples: 14543672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:17:21,893][434108] Avg episode reward: [(0, '2342.684')] [2023-03-08 13:17:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000028416_14548992.pth... 
[2023-03-08 13:17:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000027800_14233600.pth [2023-03-08 13:17:24,799][434396] Updated weights for policy 0, policy_version 28480 (0.0005) [2023-03-08 13:17:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9885.9). Total num frames: 14602240. Throughput: 0: 10376.0. Samples: 14573640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:17:26,893][434108] Avg episode reward: [(0, '2376.214')] [2023-03-08 13:17:26,894][434347] Saving new best policy, reward=2376.214! [2023-03-08 13:17:28,680][434396] Updated weights for policy 0, policy_version 28560 (0.0005) [2023-03-08 13:17:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 9899.8). Total num frames: 14655488. Throughput: 0: 10486.8. Samples: 14636824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:17:31,893][434108] Avg episode reward: [(0, '2394.934')] [2023-03-08 13:17:31,893][434347] Saving new best policy, reward=2394.934! [2023-03-08 13:17:32,578][434396] Updated weights for policy 0, policy_version 28640 (0.0004) [2023-03-08 13:17:36,516][434396] Updated weights for policy 0, policy_version 28720 (0.0005) [2023-03-08 13:17:36,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 9899.8). Total num frames: 14704640. Throughput: 0: 10501.4. Samples: 14699652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:17:36,893][434108] Avg episode reward: [(0, '2237.474')] [2023-03-08 13:17:36,927][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000028728_14708736.pth... 
[2023-03-08 13:17:36,928][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000028104_14389248.pth [2023-03-08 13:17:40,491][434396] Updated weights for policy 0, policy_version 28800 (0.0005) [2023-03-08 13:17:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 9913.7). Total num frames: 14757888. Throughput: 0: 10472.3. Samples: 14730052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:17:41,893][434108] Avg episode reward: [(0, '2284.677')] [2023-03-08 13:17:44,507][434396] Updated weights for policy 0, policy_version 28880 (0.0005) [2023-03-08 13:17:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 9927.6). Total num frames: 14807040. Throughput: 0: 10486.0. Samples: 14791492. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:17:46,893][434108] Avg episode reward: [(0, '2238.871')] [2023-03-08 13:17:48,511][434396] Updated weights for policy 0, policy_version 28960 (0.0005) [2023-03-08 13:17:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 9941.5). Total num frames: 14860288. Throughput: 0: 10473.0. Samples: 14854716. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:17:51,893][434108] Avg episode reward: [(0, '2403.669')] [2023-03-08 13:17:51,959][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000029032_14864384.pth... [2023-03-08 13:17:51,961][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000028416_14548992.pth [2023-03-08 13:17:51,961][434347] Saving new best policy, reward=2403.669! [2023-03-08 13:17:52,321][434396] Updated weights for policy 0, policy_version 29040 (0.0005) [2023-03-08 13:17:56,344][434396] Updated weights for policy 0, policy_version 29120 (0.0005) [2023-03-08 13:17:56,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 9955.4). Total num frames: 14913536. Throughput: 0: 10462.8. 
Samples: 14885552. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:17:56,893][434108] Avg episode reward: [(0, '2373.037')] [2023-03-08 13:18:00,324][434396] Updated weights for policy 0, policy_version 29200 (0.0005) [2023-03-08 13:18:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 9969.3). Total num frames: 14962688. Throughput: 0: 10358.2. Samples: 14947012. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:18:01,893][434108] Avg episode reward: [(0, '2403.623')] [2023-03-08 13:18:04,291][434396] Updated weights for policy 0, policy_version 29280 (0.0005) [2023-03-08 13:18:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 9983.1). Total num frames: 15015936. Throughput: 0: 10344.4. Samples: 15009172. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:18:06,893][434108] Avg episode reward: [(0, '2446.054')] [2023-03-08 13:18:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000029328_15015936.pth... [2023-03-08 13:18:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000028728_14708736.pth [2023-03-08 13:18:06,899][434347] Saving new best policy, reward=2446.054! [2023-03-08 13:18:08,270][434396] Updated weights for policy 0, policy_version 29360 (0.0005) [2023-03-08 13:18:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 9983.1). Total num frames: 15069184. Throughput: 0: 10369.8. Samples: 15040280. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:18:11,893][434108] Avg episode reward: [(0, '2459.858')] [2023-03-08 13:18:11,893][434347] Saving new best policy, reward=2459.858! 
[2023-03-08 13:18:12,277][434396] Updated weights for policy 0, policy_version 29440 (0.0005) [2023-03-08 13:18:16,248][434396] Updated weights for policy 0, policy_version 29520 (0.0005) [2023-03-08 13:18:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 9983.1). Total num frames: 15118336. Throughput: 0: 10336.4. Samples: 15101960. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:18:16,893][434108] Avg episode reward: [(0, '2488.090')] [2023-03-08 13:18:16,893][434347] Saving new best policy, reward=2488.090! [2023-03-08 13:18:20,219][434396] Updated weights for policy 0, policy_version 29600 (0.0005) [2023-03-08 13:18:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 9983.1). Total num frames: 15171584. Throughput: 0: 10306.9. Samples: 15163464. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:18:21,893][434108] Avg episode reward: [(0, '2526.010')] [2023-03-08 13:18:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000029632_15171584.pth... [2023-03-08 13:18:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000029032_14864384.pth [2023-03-08 13:18:21,898][434347] Saving new best policy, reward=2526.010! [2023-03-08 13:18:24,184][434396] Updated weights for policy 0, policy_version 29680 (0.0005) [2023-03-08 13:18:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9983.1). Total num frames: 15220736. Throughput: 0: 10323.1. Samples: 15194592. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:18:26,893][434108] Avg episode reward: [(0, '2589.658')] [2023-03-08 13:18:26,949][434347] Saving new best policy, reward=2589.658! [2023-03-08 13:18:28,195][434396] Updated weights for policy 0, policy_version 29760 (0.0005) [2023-03-08 13:18:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9983.1). Total num frames: 15273984. Throughput: 0: 10317.2. 
Samples: 15255764. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:18:31,893][434108] Avg episode reward: [(0, '2537.605')] [2023-03-08 13:18:32,212][434396] Updated weights for policy 0, policy_version 29840 (0.0005) [2023-03-08 13:18:36,184][434396] Updated weights for policy 0, policy_version 29920 (0.0005) [2023-03-08 13:18:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 9983.1). Total num frames: 15323136. Throughput: 0: 10284.5. Samples: 15317516. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:18:36,893][434108] Avg episode reward: [(0, '2597.020')] [2023-03-08 13:18:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000029928_15323136.pth... [2023-03-08 13:18:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000029328_15015936.pth [2023-03-08 13:18:36,899][434347] Saving new best policy, reward=2597.020! [2023-03-08 13:18:40,155][434396] Updated weights for policy 0, policy_version 30000 (0.0005) [2023-03-08 13:18:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9997.0). Total num frames: 15376384. Throughput: 0: 10271.8. Samples: 15347784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:18:41,893][434108] Avg episode reward: [(0, '2672.571')] [2023-03-08 13:18:41,893][434347] Saving new best policy, reward=2672.571! [2023-03-08 13:18:44,135][434396] Updated weights for policy 0, policy_version 30080 (0.0005) [2023-03-08 13:18:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10010.9). Total num frames: 15425536. Throughput: 0: 10289.5. Samples: 15410040. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:18:46,893][434108] Avg episode reward: [(0, '2661.722')] [2023-03-08 13:18:48,166][434396] Updated weights for policy 0, policy_version 30160 (0.0005) [2023-03-08 13:18:51,893][434108] Fps is (10 sec: 10239.8, 60 sec: 10308.3, 300 sec: 10024.8). Total num frames: 15478784. Throughput: 0: 10277.7. Samples: 15471668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:18:51,893][434108] Avg episode reward: [(0, '2688.022')] [2023-03-08 13:18:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000030232_15478784.pth... [2023-03-08 13:18:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000029632_15171584.pth [2023-03-08 13:18:51,900][434347] Saving new best policy, reward=2688.022! [2023-03-08 13:18:52,153][434396] Updated weights for policy 0, policy_version 30240 (0.0005) [2023-03-08 13:18:56,171][434396] Updated weights for policy 0, policy_version 30320 (0.0005) [2023-03-08 13:18:56,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10024.8). Total num frames: 15527936. Throughput: 0: 10250.7. Samples: 15501564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:18:56,893][434108] Avg episode reward: [(0, '2687.983')] [2023-03-08 13:19:00,193][434396] Updated weights for policy 0, policy_version 30400 (0.0005) [2023-03-08 13:19:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10038.7). Total num frames: 15581184. Throughput: 0: 10248.1. Samples: 15563124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:19:01,893][434108] Avg episode reward: [(0, '2688.411')] [2023-03-08 13:19:01,894][434347] Saving new best policy, reward=2688.411! [2023-03-08 13:19:04,320][434396] Updated weights for policy 0, policy_version 30480 (0.0006) [2023-03-08 13:19:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10038.7). 
Total num frames: 15630336. Throughput: 0: 10195.9. Samples: 15622280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:19:06,893][434108] Avg episode reward: [(0, '2712.809')] [2023-03-08 13:19:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000030528_15630336.pth... [2023-03-08 13:19:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000029928_15323136.pth [2023-03-08 13:19:06,899][434347] Saving new best policy, reward=2712.809! [2023-03-08 13:19:08,252][434396] Updated weights for policy 0, policy_version 30560 (0.0005) [2023-03-08 13:19:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 15683584. Throughput: 0: 10229.5. Samples: 15654920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:19:11,903][434108] Avg episode reward: [(0, '2534.561')] [2023-03-08 13:19:12,233][434396] Updated weights for policy 0, policy_version 30640 (0.0005) [2023-03-08 13:19:16,154][434396] Updated weights for policy 0, policy_version 30720 (0.0005) [2023-03-08 13:19:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 15732736. Throughput: 0: 10241.1. Samples: 15716616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:19:16,903][434108] Avg episode reward: [(0, '2696.227')] [2023-03-08 13:19:20,185][434396] Updated weights for policy 0, policy_version 30800 (0.0005) [2023-03-08 13:19:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 15785984. Throughput: 0: 10229.9. Samples: 15777864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:19:21,893][434108] Avg episode reward: [(0, '2772.066')] [2023-03-08 13:19:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000030832_15785984.pth... 
[2023-03-08 13:19:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000030232_15478784.pth [2023-03-08 13:19:21,898][434347] Saving new best policy, reward=2772.066! [2023-03-08 13:19:24,156][434396] Updated weights for policy 0, policy_version 30880 (0.0005) [2023-03-08 13:19:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 15839232. Throughput: 0: 10260.3. Samples: 15809500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:19:26,893][434108] Avg episode reward: [(0, '2754.683')] [2023-03-08 13:19:28,017][434396] Updated weights for policy 0, policy_version 30960 (0.0005) [2023-03-08 13:19:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 15888384. Throughput: 0: 10267.4. Samples: 15872072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:19:31,893][434108] Avg episode reward: [(0, '2726.005')] [2023-03-08 13:19:31,932][434396] Updated weights for policy 0, policy_version 31040 (0.0005) [2023-03-08 13:19:35,959][434396] Updated weights for policy 0, policy_version 31120 (0.0005) [2023-03-08 13:19:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 15941632. Throughput: 0: 10267.8. Samples: 15933720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:19:36,893][434108] Avg episode reward: [(0, '2706.114')] [2023-03-08 13:19:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000031136_15941632.pth... [2023-03-08 13:19:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000030528_15630336.pth [2023-03-08 13:19:39,995][434396] Updated weights for policy 0, policy_version 31200 (0.0005) [2023-03-08 13:19:41,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 15990784. 
Throughput: 0: 10292.0. Samples: 15964704. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:19:41,893][434108] Avg episode reward: [(0, '2680.375')] [2023-03-08 13:19:43,779][434396] Updated weights for policy 0, policy_version 31280 (0.0005) [2023-03-08 13:19:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10149.7). Total num frames: 16048128. Throughput: 0: 10347.2. Samples: 16028748. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:19:46,893][434108] Avg episode reward: [(0, '2640.590')] [2023-03-08 13:19:47,570][434396] Updated weights for policy 0, policy_version 31360 (0.0004) [2023-03-08 13:19:51,600][434396] Updated weights for policy 0, policy_version 31440 (0.0005) [2023-03-08 13:19:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 16097280. Throughput: 0: 10426.1. Samples: 16091456. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:19:51,893][434108] Avg episode reward: [(0, '2665.031')] [2023-03-08 13:19:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000031440_16097280.pth... [2023-03-08 13:19:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000030832_15785984.pth [2023-03-08 13:19:55,643][434396] Updated weights for policy 0, policy_version 31520 (0.0005) [2023-03-08 13:19:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10149.7). Total num frames: 16150528. Throughput: 0: 10386.1. Samples: 16122296. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:19:56,893][434108] Avg episode reward: [(0, '2719.113')] [2023-03-08 13:19:59,766][434396] Updated weights for policy 0, policy_version 31600 (0.0006) [2023-03-08 13:20:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10149.8). Total num frames: 16199680. Throughput: 0: 10336.5. Samples: 16181756. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:20:01,893][434108] Avg episode reward: [(0, '2759.532')] [2023-03-08 13:20:04,030][434396] Updated weights for policy 0, policy_version 31680 (0.0005) [2023-03-08 13:20:06,893][434108] Fps is (10 sec: 9420.7, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 16244736. Throughput: 0: 10286.0. Samples: 16240736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:20:06,893][434108] Avg episode reward: [(0, '2789.262')] [2023-03-08 13:20:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000031728_16244736.pth... [2023-03-08 13:20:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000031136_15941632.pth [2023-03-08 13:20:06,900][434347] Saving new best policy, reward=2789.262! [2023-03-08 13:20:10,115][434396] Updated weights for policy 0, policy_version 31760 (0.0005) [2023-03-08 13:20:11,893][434108] Fps is (10 sec: 6553.5, 60 sec: 9693.9, 300 sec: 10038.7). Total num frames: 16265216. Throughput: 0: 9942.6. Samples: 16256920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:20:11,893][434108] Avg episode reward: [(0, '2763.182')] [2023-03-08 13:20:16,124][434396] Updated weights for policy 0, policy_version 31840 (0.0005) [2023-03-08 13:20:16,892][434108] Fps is (10 sec: 6553.7, 60 sec: 9625.6, 300 sec: 10024.8). Total num frames: 16310272. Throughput: 0: 9284.1. Samples: 16289856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:20:16,893][434108] Avg episode reward: [(0, '2697.383')] [2023-03-08 13:20:20,085][434396] Updated weights for policy 0, policy_version 31920 (0.0004) [2023-03-08 13:20:21,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 10024.8). Total num frames: 16359424. Throughput: 0: 9278.2. Samples: 16351240. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:20:21,893][434108] Avg episode reward: [(0, '2620.999')] [2023-03-08 13:20:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000031952_16359424.pth... [2023-03-08 13:20:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000031440_16097280.pth [2023-03-08 13:20:24,303][434396] Updated weights for policy 0, policy_version 32000 (0.0006) [2023-03-08 13:20:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 10038.7). Total num frames: 16408576. Throughput: 0: 9253.6. Samples: 16381116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:20:26,893][434108] Avg episode reward: [(0, '2747.455')] [2023-03-08 13:20:28,281][434396] Updated weights for policy 0, policy_version 32080 (0.0005) [2023-03-08 13:20:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9557.3, 300 sec: 10052.6). Total num frames: 16461824. Throughput: 0: 9212.7. Samples: 16443320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:20:31,893][434108] Avg episode reward: [(0, '2751.392')] [2023-03-08 13:20:32,051][434396] Updated weights for policy 0, policy_version 32160 (0.0004) [2023-03-08 13:20:35,915][434396] Updated weights for policy 0, policy_version 32240 (0.0005) [2023-03-08 13:20:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 9557.3, 300 sec: 10066.4). Total num frames: 16515072. Throughput: 0: 9250.7. Samples: 16507736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:20:36,893][434108] Avg episode reward: [(0, '2735.848')] [2023-03-08 13:20:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000032256_16515072.pth... 
[2023-03-08 13:20:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000031728_16244736.pth [2023-03-08 13:20:39,755][434396] Updated weights for policy 0, policy_version 32320 (0.0004) [2023-03-08 13:20:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 9625.6, 300 sec: 10080.3). Total num frames: 16568320. Throughput: 0: 9275.9. Samples: 16539712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:20:41,893][434108] Avg episode reward: [(0, '2819.434')] [2023-03-08 13:20:41,893][434347] Saving new best policy, reward=2819.434! [2023-03-08 13:20:43,585][434396] Updated weights for policy 0, policy_version 32400 (0.0005) [2023-03-08 13:20:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 9557.3, 300 sec: 10094.2). Total num frames: 16621568. Throughput: 0: 9381.6. Samples: 16603928. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:20:46,893][434108] Avg episode reward: [(0, '3075.932')] [2023-03-08 13:20:46,893][434347] Saving new best policy, reward=3075.932! [2023-03-08 13:20:47,469][434396] Updated weights for policy 0, policy_version 32480 (0.0004) [2023-03-08 13:20:51,495][434396] Updated weights for policy 0, policy_version 32560 (0.0005) [2023-03-08 13:20:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9557.3, 300 sec: 10094.2). Total num frames: 16670720. Throughput: 0: 9457.5. Samples: 16666324. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:20:51,893][434108] Avg episode reward: [(0, '3249.951')] [2023-03-08 13:20:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000032568_16674816.pth... [2023-03-08 13:20:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000031952_16359424.pth [2023-03-08 13:20:51,898][434347] Saving new best policy, reward=3249.951! 
[2023-03-08 13:20:55,524][434396] Updated weights for policy 0, policy_version 32640 (0.0005) [2023-03-08 13:20:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9557.3, 300 sec: 10108.1). Total num frames: 16723968. Throughput: 0: 9770.4. Samples: 16696588. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:20:56,893][434108] Avg episode reward: [(0, '3107.092')] [2023-03-08 13:20:59,429][434396] Updated weights for policy 0, policy_version 32720 (0.0005) [2023-03-08 13:21:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9557.3, 300 sec: 10094.2). Total num frames: 16773120. Throughput: 0: 10416.3. Samples: 16758592. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:21:01,893][434108] Avg episode reward: [(0, '3317.456')] [2023-03-08 13:21:01,916][434347] Saving new best policy, reward=3317.456! [2023-03-08 13:21:03,764][434396] Updated weights for policy 0, policy_version 32800 (0.0005) [2023-03-08 13:21:06,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 10108.1). Total num frames: 16822272. Throughput: 0: 10289.6. Samples: 16814272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:21:06,893][434108] Avg episode reward: [(0, '3190.500')] [2023-03-08 13:21:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000032856_16822272.pth... [2023-03-08 13:21:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000032256_16515072.pth [2023-03-08 13:21:08,168][434396] Updated weights for policy 0, policy_version 32880 (0.0006) [2023-03-08 13:21:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10108.1). Total num frames: 16871424. Throughput: 0: 10260.0. Samples: 16842816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:21:11,893][434108] Avg episode reward: [(0, '3673.648')] [2023-03-08 13:21:11,893][434347] Saving new best policy, reward=3673.648! 
[2023-03-08 13:21:12,299][434396] Updated weights for policy 0, policy_version 32960 (0.0006) [2023-03-08 13:21:16,381][434396] Updated weights for policy 0, policy_version 33040 (0.0006) [2023-03-08 13:21:16,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 16920576. Throughput: 0: 10219.4. Samples: 16903192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:21:16,893][434108] Avg episode reward: [(0, '3866.202')] [2023-03-08 13:21:16,893][434347] Saving new best policy, reward=3866.202! [2023-03-08 13:21:20,295][434396] Updated weights for policy 0, policy_version 33120 (0.0004) [2023-03-08 13:21:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 16973824. Throughput: 0: 10163.0. Samples: 16965072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:21:21,893][434108] Avg episode reward: [(0, '3917.448')] [2023-03-08 13:21:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000033152_16973824.pth... [2023-03-08 13:21:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000032568_16674816.pth [2023-03-08 13:21:21,899][434347] Saving new best policy, reward=3917.448! [2023-03-08 13:21:24,296][434396] Updated weights for policy 0, policy_version 33200 (0.0004) [2023-03-08 13:21:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 17022976. Throughput: 0: 10129.0. Samples: 16995516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:21:26,893][434108] Avg episode reward: [(0, '4045.118')] [2023-03-08 13:21:26,893][434347] Saving new best policy, reward=4045.118! [2023-03-08 13:21:28,526][434396] Updated weights for policy 0, policy_version 33280 (0.0005) [2023-03-08 13:21:31,892][434108] Fps is (10 sec: 9420.9, 60 sec: 10103.5, 300 sec: 10122.0). Total num frames: 17068032. Throughput: 0: 9995.6. 
Samples: 17053732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:21:31,893][434108] Avg episode reward: [(0, '4129.369')] [2023-03-08 13:21:31,900][434347] Saving new best policy, reward=4129.369! [2023-03-08 13:21:32,750][434396] Updated weights for policy 0, policy_version 33360 (0.0005) [2023-03-08 13:21:36,834][434396] Updated weights for policy 0, policy_version 33440 (0.0005) [2023-03-08 13:21:36,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10135.9). Total num frames: 17121280. Throughput: 0: 9929.5. Samples: 17113152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:21:36,893][434108] Avg episode reward: [(0, '4270.345')] [2023-03-08 13:21:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000033440_17121280.pth... [2023-03-08 13:21:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000032856_16822272.pth [2023-03-08 13:21:36,899][434347] Saving new best policy, reward=4270.345! [2023-03-08 13:21:40,812][434396] Updated weights for policy 0, policy_version 33520 (0.0005) [2023-03-08 13:21:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 17170432. Throughput: 0: 9934.6. Samples: 17143648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:21:41,893][434108] Avg episode reward: [(0, '4410.085')] [2023-03-08 13:21:41,893][434347] Saving new best policy, reward=4410.085! [2023-03-08 13:21:44,751][434396] Updated weights for policy 0, policy_version 33600 (0.0004) [2023-03-08 13:21:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 17223680. Throughput: 0: 9940.6. Samples: 17205920. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:21:46,893][434108] Avg episode reward: [(0, '4323.316')] [2023-03-08 13:21:49,011][434396] Updated weights for policy 0, policy_version 33680 (0.0005) [2023-03-08 13:21:51,893][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 17268736. Throughput: 0: 10008.2. Samples: 17264640. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:21:51,893][434108] Avg episode reward: [(0, '4262.254')] [2023-03-08 13:21:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000033728_17268736.pth... [2023-03-08 13:21:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000033152_16973824.pth [2023-03-08 13:21:53,259][434396] Updated weights for policy 0, policy_version 33760 (0.0004) [2023-03-08 13:21:56,892][434108] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 17317888. Throughput: 0: 9984.3. Samples: 17292108. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:21:56,893][434108] Avg episode reward: [(0, '4206.975')] [2023-03-08 13:21:57,549][434396] Updated weights for policy 0, policy_version 33840 (0.0004) [2023-03-08 13:22:01,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 10080.3). Total num frames: 17362944. Throughput: 0: 9913.1. Samples: 17349280. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:22:01,893][434108] Avg episode reward: [(0, '4316.464')] [2023-03-08 13:22:01,949][434396] Updated weights for policy 0, policy_version 33920 (0.0005) [2023-03-08 13:22:06,315][434396] Updated weights for policy 0, policy_version 34000 (0.0006) [2023-03-08 13:22:06,893][434108] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 10066.4). Total num frames: 17412096. Throughput: 0: 9769.8. Samples: 17404716. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:22:06,893][434108] Avg episode reward: [(0, '4424.835')] [2023-03-08 13:22:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000034008_17412096.pth... [2023-03-08 13:22:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000033440_17121280.pth [2023-03-08 13:22:06,899][434347] Saving new best policy, reward=4424.835! [2023-03-08 13:22:10,486][434396] Updated weights for policy 0, policy_version 34080 (0.0006) [2023-03-08 13:22:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 17461248. Throughput: 0: 9772.6. Samples: 17435284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:11,893][434108] Avg episode reward: [(0, '4414.307')] [2023-03-08 13:22:14,630][434396] Updated weights for policy 0, policy_version 34160 (0.0005) [2023-03-08 13:22:16,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 17510400. Throughput: 0: 9784.1. Samples: 17494016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:16,893][434108] Avg episode reward: [(0, '4453.278')] [2023-03-08 13:22:16,894][434347] Saving new best policy, reward=4453.278! [2023-03-08 13:22:18,884][434396] Updated weights for policy 0, policy_version 34240 (0.0006) [2023-03-08 13:22:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10024.8). Total num frames: 17559552. Throughput: 0: 9742.3. Samples: 17551556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:21,893][434108] Avg episode reward: [(0, '4399.372')] [2023-03-08 13:22:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000034296_17559552.pth... 
[2023-03-08 13:22:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000033728_17268736.pth [2023-03-08 13:22:23,084][434396] Updated weights for policy 0, policy_version 34320 (0.0005) [2023-03-08 13:22:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10010.9). Total num frames: 17608704. Throughput: 0: 9713.2. Samples: 17580740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:26,893][434108] Avg episode reward: [(0, '4106.062')] [2023-03-08 13:22:27,244][434396] Updated weights for policy 0, policy_version 34400 (0.0005) [2023-03-08 13:22:31,327][434396] Updated weights for policy 0, policy_version 34480 (0.0005) [2023-03-08 13:22:31,893][434108] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 17657856. Throughput: 0: 9658.5. Samples: 17640552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:31,893][434108] Avg episode reward: [(0, '3973.042')] [2023-03-08 13:22:35,519][434396] Updated weights for policy 0, policy_version 34560 (0.0005) [2023-03-08 13:22:36,893][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 17707008. Throughput: 0: 9661.2. Samples: 17699392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:36,893][434108] Avg episode reward: [(0, '4165.199')] [2023-03-08 13:22:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000034584_17707008.pth... [2023-03-08 13:22:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000034008_17412096.pth [2023-03-08 13:22:39,844][434396] Updated weights for policy 0, policy_version 34640 (0.0005) [2023-03-08 13:22:41,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9997.0). Total num frames: 17756160. Throughput: 0: 9682.1. Samples: 17727804. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:41,893][434108] Avg episode reward: [(0, '3972.288')] [2023-03-08 13:22:43,868][434396] Updated weights for policy 0, policy_version 34720 (0.0005) [2023-03-08 13:22:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9983.1). Total num frames: 17805312. Throughput: 0: 9765.2. Samples: 17788716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:46,893][434108] Avg episode reward: [(0, '4259.301')] [2023-03-08 13:22:47,937][434396] Updated weights for policy 0, policy_version 34800 (0.0005) [2023-03-08 13:22:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9969.2). Total num frames: 17854464. Throughput: 0: 9847.0. Samples: 17847832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:51,893][434108] Avg episode reward: [(0, '4328.312')] [2023-03-08 13:22:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000034872_17854464.pth... [2023-03-08 13:22:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000034296_17559552.pth [2023-03-08 13:22:52,178][434396] Updated weights for policy 0, policy_version 34880 (0.0005) [2023-03-08 13:22:56,571][434396] Updated weights for policy 0, policy_version 34960 (0.0005) [2023-03-08 13:22:56,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9955.4). Total num frames: 17899520. Throughput: 0: 9786.8. Samples: 17875692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:22:56,893][434108] Avg episode reward: [(0, '4358.150')] [2023-03-08 13:23:00,720][434396] Updated weights for policy 0, policy_version 35040 (0.0005) [2023-03-08 13:23:01,892][434108] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 17948672. Throughput: 0: 9751.3. Samples: 17932824. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:23:01,893][434108] Avg episode reward: [(0, '4412.182')] [2023-03-08 13:23:04,904][434396] Updated weights for policy 0, policy_version 35120 (0.0005) [2023-03-08 13:23:06,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 17997824. Throughput: 0: 9802.5. Samples: 17992668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:23:06,893][434108] Avg episode reward: [(0, '4486.390')] [2023-03-08 13:23:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000035152_17997824.pth... [2023-03-08 13:23:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000034584_17707008.pth [2023-03-08 13:23:06,899][434347] Saving new best policy, reward=4486.390! [2023-03-08 13:23:09,023][434396] Updated weights for policy 0, policy_version 35200 (0.0005) [2023-03-08 13:23:11,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 18046976. Throughput: 0: 9814.9. Samples: 18022408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:23:11,893][434108] Avg episode reward: [(0, '4421.680')] [2023-03-08 13:23:13,116][434396] Updated weights for policy 0, policy_version 35280 (0.0005) [2023-03-08 13:23:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 18100224. Throughput: 0: 9824.0. Samples: 18082632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:23:16,893][434108] Avg episode reward: [(0, '4316.279')] [2023-03-08 13:23:17,072][434396] Updated weights for policy 0, policy_version 35360 (0.0005) [2023-03-08 13:23:20,855][434396] Updated weights for policy 0, policy_version 35440 (0.0004) [2023-03-08 13:23:21,893][434108] Fps is (10 sec: 10649.5, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 18153472. Throughput: 0: 9945.6. Samples: 18146944. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:23:21,893][434108] Avg episode reward: [(0, '4371.979')] [2023-03-08 13:23:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000035456_18153472.pth... [2023-03-08 13:23:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000034872_17854464.pth [2023-03-08 13:23:24,677][434396] Updated weights for policy 0, policy_version 35520 (0.0004) [2023-03-08 13:23:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 18206720. Throughput: 0: 10024.4. Samples: 18178904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:23:26,893][434108] Avg episode reward: [(0, '4381.071')] [2023-03-08 13:23:28,495][434396] Updated weights for policy 0, policy_version 35600 (0.0004) [2023-03-08 13:23:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 18259968. Throughput: 0: 10103.7. Samples: 18243380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:23:31,893][434108] Avg episode reward: [(0, '4513.486')] [2023-03-08 13:23:31,893][434347] Saving new best policy, reward=4513.486! [2023-03-08 13:23:32,452][434396] Updated weights for policy 0, policy_version 35680 (0.0005) [2023-03-08 13:23:36,588][434396] Updated weights for policy 0, policy_version 35760 (0.0005) [2023-03-08 13:23:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 18309120. Throughput: 0: 10134.2. Samples: 18303872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:23:36,893][434108] Avg episode reward: [(0, '4401.237')] [2023-03-08 13:23:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000035760_18309120.pth... 
[2023-03-08 13:23:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000035152_17997824.pth [2023-03-08 13:23:40,723][434396] Updated weights for policy 0, policy_version 35840 (0.0005) [2023-03-08 13:23:41,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 18358272. Throughput: 0: 10165.5. Samples: 18333140. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:23:41,893][434108] Avg episode reward: [(0, '4538.886')] [2023-03-08 13:23:41,894][434347] Saving new best policy, reward=4538.886! [2023-03-08 13:23:44,819][434396] Updated weights for policy 0, policy_version 35920 (0.0004) [2023-03-08 13:23:46,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 18407424. Throughput: 0: 10223.7. Samples: 18392888. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:23:46,893][434108] Avg episode reward: [(0, '4215.123')] [2023-03-08 13:23:48,951][434396] Updated weights for policy 0, policy_version 36000 (0.0005) [2023-03-08 13:23:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 18460672. Throughput: 0: 10219.5. Samples: 18452544. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:23:51,893][434108] Avg episode reward: [(0, '4121.644')] [2023-03-08 13:23:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000036056_18460672.pth... [2023-03-08 13:23:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000035456_18153472.pth [2023-03-08 13:23:53,032][434396] Updated weights for policy 0, policy_version 36080 (0.0005) [2023-03-08 13:23:56,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 9927.6). Total num frames: 18509824. Throughput: 0: 10251.6. Samples: 18483732. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:23:56,893][434108] Avg episode reward: [(0, '3945.800')] [2023-03-08 13:23:56,952][434396] Updated weights for policy 0, policy_version 36160 (0.0004) [2023-03-08 13:24:00,952][434396] Updated weights for policy 0, policy_version 36240 (0.0005) [2023-03-08 13:24:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9941.5). Total num frames: 18563072. Throughput: 0: 10288.0. Samples: 18545592. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:24:01,893][434108] Avg episode reward: [(0, '4264.802')] [2023-03-08 13:24:05,028][434396] Updated weights for policy 0, policy_version 36320 (0.0004) [2023-03-08 13:24:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9927.6). Total num frames: 18612224. Throughput: 0: 10211.0. Samples: 18606440. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:24:06,893][434108] Avg episode reward: [(0, '4337.315')] [2023-03-08 13:24:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000036352_18612224.pth... [2023-03-08 13:24:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000035760_18309120.pth [2023-03-08 13:24:09,016][434396] Updated weights for policy 0, policy_version 36400 (0.0005) [2023-03-08 13:24:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9941.5). Total num frames: 18665472. Throughput: 0: 10176.9. Samples: 18636864. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:24:11,893][434108] Avg episode reward: [(0, '4351.129')] [2023-03-08 13:24:12,748][434396] Updated weights for policy 0, policy_version 36480 (0.0003) [2023-03-08 13:24:16,678][434396] Updated weights for policy 0, policy_version 36560 (0.0004) [2023-03-08 13:24:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 9941.5). Total num frames: 18718720. Throughput: 0: 10177.3. 
Samples: 18701360. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:24:16,893][434108] Avg episode reward: [(0, '4197.834')] [2023-03-08 13:24:20,707][434396] Updated weights for policy 0, policy_version 36640 (0.0005) [2023-03-08 13:24:21,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9927.6). Total num frames: 18767872. Throughput: 0: 10210.6. Samples: 18763348. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:24:21,893][434108] Avg episode reward: [(0, '4176.625')] [2023-03-08 13:24:21,908][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000036664_18771968.pth... [2023-03-08 13:24:21,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000036056_18460672.pth [2023-03-08 13:24:24,789][434396] Updated weights for policy 0, policy_version 36720 (0.0005) [2023-03-08 13:24:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9941.5). Total num frames: 18821120. Throughput: 0: 10208.3. Samples: 18792512. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:24:26,893][434108] Avg episode reward: [(0, '3873.677')] [2023-03-08 13:24:28,679][434396] Updated weights for policy 0, policy_version 36800 (0.0004) [2023-03-08 13:24:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 9941.5). Total num frames: 18874368. Throughput: 0: 10280.1. Samples: 18855492. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:24:31,893][434108] Avg episode reward: [(0, '4243.542')] [2023-03-08 13:24:32,511][434396] Updated weights for policy 0, policy_version 36880 (0.0005) [2023-03-08 13:24:36,619][434396] Updated weights for policy 0, policy_version 36960 (0.0005) [2023-03-08 13:24:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9941.5). Total num frames: 18923520. Throughput: 0: 10336.1. Samples: 18917668. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:24:36,893][434108] Avg episode reward: [(0, '3817.765')] [2023-03-08 13:24:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000036960_18923520.pth... [2023-03-08 13:24:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000036352_18612224.pth [2023-03-08 13:24:40,487][434396] Updated weights for policy 0, policy_version 37040 (0.0004) [2023-03-08 13:24:41,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 9927.6). Total num frames: 18976768. Throughput: 0: 10332.3. Samples: 18948684. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:24:41,893][434108] Avg episode reward: [(0, '4197.754')] [2023-03-08 13:24:44,345][434396] Updated weights for policy 0, policy_version 37120 (0.0003) [2023-03-08 13:24:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9941.5). Total num frames: 19030016. Throughput: 0: 10374.7. Samples: 19012452. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:24:46,893][434108] Avg episode reward: [(0, '4373.778')] [2023-03-08 13:24:48,346][434396] Updated weights for policy 0, policy_version 37200 (0.0004) [2023-03-08 13:24:51,893][434108] Fps is (10 sec: 10649.4, 60 sec: 10376.5, 300 sec: 9941.5). Total num frames: 19083264. Throughput: 0: 10408.3. Samples: 19074816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:24:51,893][434108] Avg episode reward: [(0, '4232.530')] [2023-03-08 13:24:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000037272_19083264.pth... 
[2023-03-08 13:24:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000036664_18771968.pth [2023-03-08 13:24:52,212][434396] Updated weights for policy 0, policy_version 37280 (0.0004) [2023-03-08 13:24:56,271][434396] Updated weights for policy 0, policy_version 37360 (0.0005) [2023-03-08 13:24:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 9941.5). Total num frames: 19132416. Throughput: 0: 10412.1. Samples: 19105408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:24:56,893][434108] Avg episode reward: [(0, '4315.334')] [2023-03-08 13:25:00,323][434396] Updated weights for policy 0, policy_version 37440 (0.0005) [2023-03-08 13:25:01,892][434108] Fps is (10 sec: 9830.6, 60 sec: 10308.3, 300 sec: 9955.4). Total num frames: 19181568. Throughput: 0: 10329.7. Samples: 19166196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:01,903][434108] Avg episode reward: [(0, '4236.408')] [2023-03-08 13:25:04,348][434396] Updated weights for policy 0, policy_version 37520 (0.0005) [2023-03-08 13:25:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10066.4). Total num frames: 19234816. Throughput: 0: 10324.1. Samples: 19227932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:06,903][434108] Avg episode reward: [(0, '4278.281')] [2023-03-08 13:25:06,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000037568_19234816.pth... [2023-03-08 13:25:06,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000036960_18923520.pth [2023-03-08 13:25:08,226][434396] Updated weights for policy 0, policy_version 37600 (0.0005) [2023-03-08 13:25:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10094.2). Total num frames: 19288064. Throughput: 0: 10376.5. Samples: 19259456. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:11,904][434108] Avg episode reward: [(0, '4208.210')] [2023-03-08 13:25:12,156][434396] Updated weights for policy 0, policy_version 37680 (0.0005) [2023-03-08 13:25:16,159][434396] Updated weights for policy 0, policy_version 37760 (0.0005) [2023-03-08 13:25:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 19337216. Throughput: 0: 10342.5. Samples: 19320904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:16,904][434108] Avg episode reward: [(0, '4445.201')] [2023-03-08 13:25:19,941][434396] Updated weights for policy 0, policy_version 37840 (0.0005) [2023-03-08 13:25:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10122.0). Total num frames: 19394560. Throughput: 0: 10412.5. Samples: 19386228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:21,893][434108] Avg episode reward: [(0, '4308.466')] [2023-03-08 13:25:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000037880_19394560.pth... [2023-03-08 13:25:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000037272_19083264.pth [2023-03-08 13:25:23,685][434396] Updated weights for policy 0, policy_version 37920 (0.0004) [2023-03-08 13:25:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10122.0). Total num frames: 19447808. Throughput: 0: 10445.4. Samples: 19418728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:26,893][434108] Avg episode reward: [(0, '4400.766')] [2023-03-08 13:25:27,512][434396] Updated weights for policy 0, policy_version 38000 (0.0003) [2023-03-08 13:25:31,258][434396] Updated weights for policy 0, policy_version 38080 (0.0004) [2023-03-08 13:25:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10122.0). Total num frames: 19501056. Throughput: 0: 10456.7. 
Samples: 19483004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:31,893][434108] Avg episode reward: [(0, '4325.797')] [2023-03-08 13:25:35,010][434396] Updated weights for policy 0, policy_version 38160 (0.0003) [2023-03-08 13:25:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10122.0). Total num frames: 19554304. Throughput: 0: 10526.8. Samples: 19548520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:36,893][434108] Avg episode reward: [(0, '4222.988')] [2023-03-08 13:25:36,902][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000038200_19558400.pth... [2023-03-08 13:25:36,903][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000037568_19234816.pth [2023-03-08 13:25:38,789][434396] Updated weights for policy 0, policy_version 38240 (0.0005) [2023-03-08 13:25:41,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10135.9). Total num frames: 19611648. Throughput: 0: 10573.9. Samples: 19581232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:41,893][434108] Avg episode reward: [(0, '4356.321')] [2023-03-08 13:25:42,553][434396] Updated weights for policy 0, policy_version 38320 (0.0004) [2023-03-08 13:25:46,276][434396] Updated weights for policy 0, policy_version 38400 (0.0004) [2023-03-08 13:25:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10581.3, 300 sec: 10149.8). Total num frames: 19664896. Throughput: 0: 10686.8. Samples: 19647104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:46,893][434108] Avg episode reward: [(0, '4349.322')] [2023-03-08 13:25:50,061][434396] Updated weights for policy 0, policy_version 38480 (0.0004) [2023-03-08 13:25:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.4, 300 sec: 10149.7). Total num frames: 19718144. Throughput: 0: 10755.2. Samples: 19711916. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:51,893][434108] Avg episode reward: [(0, '4391.423')] [2023-03-08 13:25:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000038512_19718144.pth... [2023-03-08 13:25:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000037880_19394560.pth [2023-03-08 13:25:53,969][434396] Updated weights for policy 0, policy_version 38560 (0.0005) [2023-03-08 13:25:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10163.6). Total num frames: 19771392. Throughput: 0: 10742.4. Samples: 19742864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:25:56,893][434108] Avg episode reward: [(0, '4542.086')] [2023-03-08 13:25:56,893][434347] Saving new best policy, reward=4542.086! [2023-03-08 13:25:58,089][434396] Updated weights for policy 0, policy_version 38640 (0.0005) [2023-03-08 13:26:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10163.6). Total num frames: 19820544. Throughput: 0: 10735.6. Samples: 19804008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:26:01,893][434108] Avg episode reward: [(0, '4226.559')] [2023-03-08 13:26:02,090][434396] Updated weights for policy 0, policy_version 38720 (0.0005) [2023-03-08 13:26:06,126][434396] Updated weights for policy 0, policy_version 38800 (0.0005) [2023-03-08 13:26:06,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10581.4, 300 sec: 10163.6). Total num frames: 19869696. Throughput: 0: 10631.6. Samples: 19864648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:26:06,893][434108] Avg episode reward: [(0, '3998.697')] [2023-03-08 13:26:06,910][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000038816_19873792.pth... 
[2023-03-08 13:26:06,911][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000038200_19558400.pth [2023-03-08 13:26:10,122][434396] Updated weights for policy 0, policy_version 38880 (0.0005) [2023-03-08 13:26:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10177.5). Total num frames: 19922944. Throughput: 0: 10596.9. Samples: 19895588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:26:11,893][434108] Avg episode reward: [(0, '4104.197')] [2023-03-08 13:26:14,144][434396] Updated weights for policy 0, policy_version 38960 (0.0005) [2023-03-08 13:26:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10177.5). Total num frames: 19976192. Throughput: 0: 10520.5. Samples: 19956428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:26:16,893][434108] Avg episode reward: [(0, '4088.277')] [2023-03-08 13:26:18,066][434396] Updated weights for policy 0, policy_version 39040 (0.0005) [2023-03-08 13:26:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10177.5). Total num frames: 20025344. Throughput: 0: 10456.4. Samples: 20019056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:26:21,893][434108] Avg episode reward: [(0, '4143.198')] [2023-03-08 13:26:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000039112_20025344.pth... [2023-03-08 13:26:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000038512_19718144.pth [2023-03-08 13:26:21,996][434396] Updated weights for policy 0, policy_version 39120 (0.0005) [2023-03-08 13:26:25,962][434396] Updated weights for policy 0, policy_version 39200 (0.0005) [2023-03-08 13:26:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10205.3). Total num frames: 20078592. Throughput: 0: 10426.9. Samples: 20050440. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:26:26,893][434108] Avg episode reward: [(0, '4201.579')] [2023-03-08 13:26:29,839][434396] Updated weights for policy 0, policy_version 39280 (0.0005) [2023-03-08 13:26:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10205.3). Total num frames: 20131840. Throughput: 0: 10353.9. Samples: 20113028. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:26:31,893][434108] Avg episode reward: [(0, '4305.274')] [2023-03-08 13:26:33,719][434396] Updated weights for policy 0, policy_version 39360 (0.0004) [2023-03-08 13:26:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10219.2). Total num frames: 20185088. Throughput: 0: 10332.5. Samples: 20176880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:26:36,893][434108] Avg episode reward: [(0, '4367.869')] [2023-03-08 13:26:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000039424_20185088.pth... [2023-03-08 13:26:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000038816_19873792.pth [2023-03-08 13:26:37,643][434396] Updated weights for policy 0, policy_version 39440 (0.0005) [2023-03-08 13:26:41,574][434396] Updated weights for policy 0, policy_version 39520 (0.0005) [2023-03-08 13:26:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 20234240. Throughput: 0: 10328.6. Samples: 20207652. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:26:41,893][434108] Avg episode reward: [(0, '4267.583')] [2023-03-08 13:26:45,522][434396] Updated weights for policy 0, policy_version 39600 (0.0004) [2023-03-08 13:26:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 20287488. Throughput: 0: 10351.5. Samples: 20269824. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:26:46,893][434108] Avg episode reward: [(0, '4186.757')] [2023-03-08 13:26:49,269][434396] Updated weights for policy 0, policy_version 39680 (0.0004) [2023-03-08 13:26:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10246.9). Total num frames: 20340736. Throughput: 0: 10454.8. Samples: 20335116. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:26:51,893][434108] Avg episode reward: [(0, '4393.836')] [2023-03-08 13:26:51,910][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000039736_20344832.pth... [2023-03-08 13:26:51,912][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000039112_20025344.pth [2023-03-08 13:26:52,997][434396] Updated weights for policy 0, policy_version 39760 (0.0004) [2023-03-08 13:26:56,659][434396] Updated weights for policy 0, policy_version 39840 (0.0004) [2023-03-08 13:26:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10288.6). Total num frames: 20398080. Throughput: 0: 10507.6. Samples: 20368432. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:26:56,893][434108] Avg episode reward: [(0, '4397.802')] [2023-03-08 13:27:00,333][434396] Updated weights for policy 0, policy_version 39920 (0.0004) [2023-03-08 13:27:01,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10581.3, 300 sec: 10316.4). Total num frames: 20455424. Throughput: 0: 10635.3. Samples: 20435016. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:27:01,893][434108] Avg episode reward: [(0, '4524.220')] [2023-03-08 13:27:04,022][434396] Updated weights for policy 0, policy_version 40000 (0.0005) [2023-03-08 13:27:06,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10330.3). Total num frames: 20508672. Throughput: 0: 10725.6. Samples: 20501708. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:27:06,893][434108] Avg episode reward: [(0, '4496.721')] [2023-03-08 13:27:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000040056_20508672.pth... [2023-03-08 13:27:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000039424_20185088.pth [2023-03-08 13:27:07,783][434396] Updated weights for policy 0, policy_version 40080 (0.0005) [2023-03-08 13:27:11,478][434396] Updated weights for policy 0, policy_version 40160 (0.0004) [2023-03-08 13:27:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10358.0). Total num frames: 20566016. Throughput: 0: 10742.5. Samples: 20533852. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:27:11,893][434108] Avg episode reward: [(0, '4445.023')] [2023-03-08 13:27:15,106][434396] Updated weights for policy 0, policy_version 40240 (0.0003) [2023-03-08 13:27:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10371.9). Total num frames: 20619264. Throughput: 0: 10860.3. Samples: 20601740. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:27:16,893][434108] Avg episode reward: [(0, '4378.260')] [2023-03-08 13:27:18,763][434396] Updated weights for policy 0, policy_version 40320 (0.0004) [2023-03-08 13:27:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10399.7). Total num frames: 20676608. Throughput: 0: 10924.4. Samples: 20668480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:27:21,893][434108] Avg episode reward: [(0, '4567.885')] [2023-03-08 13:27:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000040384_20676608.pth... 
[2023-03-08 13:27:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000039736_20344832.pth [2023-03-08 13:27:21,898][434347] Saving new best policy, reward=4567.885! [2023-03-08 13:27:22,448][434396] Updated weights for policy 0, policy_version 40400 (0.0004) [2023-03-08 13:27:26,160][434396] Updated weights for policy 0, policy_version 40480 (0.0004) [2023-03-08 13:27:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10413.6). Total num frames: 20729856. Throughput: 0: 10990.1. Samples: 20702208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:27:26,893][434108] Avg episode reward: [(0, '4486.812')] [2023-03-08 13:27:29,874][434396] Updated weights for policy 0, policy_version 40560 (0.0004) [2023-03-08 13:27:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10441.3). Total num frames: 20787200. Throughput: 0: 11071.1. Samples: 20768024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:27:31,893][434108] Avg episode reward: [(0, '4517.564')] [2023-03-08 13:27:33,833][434396] Updated weights for policy 0, policy_version 40640 (0.0005) [2023-03-08 13:27:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10854.4, 300 sec: 10441.3). Total num frames: 20836352. Throughput: 0: 11005.8. Samples: 20830376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:27:36,893][434108] Avg episode reward: [(0, '4523.626')] [2023-03-08 13:27:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000040696_20836352.pth... 
[2023-03-08 13:27:36,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000040056_20508672.pth [2023-03-08 13:27:37,690][434396] Updated weights for policy 0, policy_version 40720 (0.0005) [2023-03-08 13:27:41,572][434396] Updated weights for policy 0, policy_version 40800 (0.0005) [2023-03-08 13:27:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10922.7, 300 sec: 10455.2). Total num frames: 20889600. Throughput: 0: 11003.0. Samples: 20863568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:27:41,893][434108] Avg episode reward: [(0, '4519.670')] [2023-03-08 13:27:45,329][434396] Updated weights for policy 0, policy_version 40880 (0.0004) [2023-03-08 13:27:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10483.0). Total num frames: 20946944. Throughput: 0: 10934.2. Samples: 20927056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:27:46,893][434108] Avg episode reward: [(0, '4564.230')] [2023-03-08 13:27:49,276][434396] Updated weights for policy 0, policy_version 40960 (0.0005) [2023-03-08 13:27:51,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10510.8). Total num frames: 21000192. Throughput: 0: 10870.6. Samples: 20990884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:27:51,893][434108] Avg episode reward: [(0, '4556.566')] [2023-03-08 13:27:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000041016_21000192.pth... [2023-03-08 13:27:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000040384_20676608.pth [2023-03-08 13:27:53,017][434396] Updated weights for policy 0, policy_version 41040 (0.0005) [2023-03-08 13:27:56,720][434396] Updated weights for policy 0, policy_version 41120 (0.0004) [2023-03-08 13:27:56,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10922.7, 300 sec: 10524.6). Total num frames: 21053440. 
Throughput: 0: 10882.9. Samples: 21023584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:27:56,893][434108] Avg episode reward: [(0, '4544.030')] [2023-03-08 13:28:00,424][434396] Updated weights for policy 0, policy_version 41200 (0.0004) [2023-03-08 13:28:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10538.5). Total num frames: 21106688. Throughput: 0: 10851.0. Samples: 21090036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:28:01,893][434108] Avg episode reward: [(0, '4513.755')] [2023-03-08 13:28:04,110][434396] Updated weights for policy 0, policy_version 41280 (0.0004) [2023-03-08 13:28:06,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 10566.3). Total num frames: 21164032. Throughput: 0: 10835.4. Samples: 21156076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:28:06,893][434108] Avg episode reward: [(0, '4593.323')] [2023-03-08 13:28:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000041336_21164032.pth... [2023-03-08 13:28:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000040696_20836352.pth [2023-03-08 13:28:06,900][434347] Saving new best policy, reward=4593.323! [2023-03-08 13:28:07,856][434396] Updated weights for policy 0, policy_version 41360 (0.0004) [2023-03-08 13:28:11,543][434396] Updated weights for policy 0, policy_version 41440 (0.0005) [2023-03-08 13:28:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10566.3). Total num frames: 21217280. Throughput: 0: 10813.1. Samples: 21188800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:28:11,893][434108] Avg episode reward: [(0, '4519.989')] [2023-03-08 13:28:15,271][434396] Updated weights for policy 0, policy_version 41520 (0.0004) [2023-03-08 13:28:16,892][434108] Fps is (10 sec: 11059.4, 60 sec: 10922.7, 300 sec: 10580.2). Total num frames: 21274624. 
Throughput: 0: 10825.2. Samples: 21255156. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:28:16,893][434108] Avg episode reward: [(0, '4496.962')] [2023-03-08 13:28:19,044][434396] Updated weights for policy 0, policy_version 41600 (0.0005) [2023-03-08 13:28:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10580.2). Total num frames: 21327872. Throughput: 0: 10896.0. Samples: 21320696. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:28:21,893][434108] Avg episode reward: [(0, '4324.537')] [2023-03-08 13:28:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000041656_21327872.pth... [2023-03-08 13:28:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000041016_21000192.pth [2023-03-08 13:28:22,718][434396] Updated weights for policy 0, policy_version 41680 (0.0004) [2023-03-08 13:28:26,433][434396] Updated weights for policy 0, policy_version 41760 (0.0004) [2023-03-08 13:28:26,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 10594.1). Total num frames: 21385216. Throughput: 0: 10910.4. Samples: 21354536. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:28:26,893][434108] Avg episode reward: [(0, '4495.708')] [2023-03-08 13:28:30,462][434396] Updated weights for policy 0, policy_version 41840 (0.0004) [2023-03-08 13:28:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10594.1). Total num frames: 21434368. Throughput: 0: 10909.5. Samples: 21417984. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:28:31,893][434108] Avg episode reward: [(0, '4381.695')] [2023-03-08 13:28:34,426][434396] Updated weights for policy 0, policy_version 41920 (0.0004) [2023-03-08 13:28:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10854.4, 300 sec: 10607.9). Total num frames: 21487616. Throughput: 0: 10858.0. Samples: 21479496. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:28:36,893][434108] Avg episode reward: [(0, '4442.018')] [2023-03-08 13:28:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000041968_21487616.pth... [2023-03-08 13:28:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000041336_21164032.pth [2023-03-08 13:28:38,281][434396] Updated weights for policy 0, policy_version 42000 (0.0004) [2023-03-08 13:28:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10621.8). Total num frames: 21540864. Throughput: 0: 10859.4. Samples: 21512256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:28:41,893][434108] Avg episode reward: [(0, '4231.532')] [2023-03-08 13:28:41,942][434396] Updated weights for policy 0, policy_version 42080 (0.0004) [2023-03-08 13:28:45,758][434396] Updated weights for policy 0, policy_version 42160 (0.0004) [2023-03-08 13:28:46,892][434108] Fps is (10 sec: 10649.8, 60 sec: 10786.1, 300 sec: 10621.8). Total num frames: 21594112. Throughput: 0: 10837.6. Samples: 21577728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:28:46,893][434108] Avg episode reward: [(0, '4467.005')] [2023-03-08 13:28:49,567][434396] Updated weights for policy 0, policy_version 42240 (0.0005) [2023-03-08 13:28:51,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 10649.6). Total num frames: 21651456. Throughput: 0: 10814.1. Samples: 21642712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:28:51,893][434108] Avg episode reward: [(0, '4289.156')] [2023-03-08 13:28:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000042288_21651456.pth... 
[2023-03-08 13:28:51,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000041656_21327872.pth [2023-03-08 13:28:53,380][434396] Updated weights for policy 0, policy_version 42320 (0.0005) [2023-03-08 13:28:56,892][434108] Fps is (10 sec: 11059.0, 60 sec: 10854.4, 300 sec: 10649.6). Total num frames: 21704704. Throughput: 0: 10798.7. Samples: 21674740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:28:56,893][434108] Avg episode reward: [(0, '3985.817')] [2023-03-08 13:28:57,048][434396] Updated weights for policy 0, policy_version 42400 (0.0004) [2023-03-08 13:29:00,876][434396] Updated weights for policy 0, policy_version 42480 (0.0005) [2023-03-08 13:29:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10663.5). Total num frames: 21757952. Throughput: 0: 10798.8. Samples: 21741104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:29:01,893][434108] Avg episode reward: [(0, '3995.772')] [2023-03-08 13:29:04,839][434396] Updated weights for policy 0, policy_version 42560 (0.0005) [2023-03-08 13:29:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10663.5). Total num frames: 21811200. Throughput: 0: 10719.5. Samples: 21803072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:29:06,893][434108] Avg episode reward: [(0, '4440.352')] [2023-03-08 13:29:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000042600_21811200.pth... [2023-03-08 13:29:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000041968_21487616.pth [2023-03-08 13:29:08,645][434396] Updated weights for policy 0, policy_version 42640 (0.0005) [2023-03-08 13:29:11,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10854.4, 300 sec: 10677.4). Total num frames: 21868544. Throughput: 0: 10694.4. Samples: 21835784. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:29:11,893][434108] Avg episode reward: [(0, '4497.851')] [2023-03-08 13:29:12,228][434396] Updated weights for policy 0, policy_version 42720 (0.0004) [2023-03-08 13:29:15,832][434396] Updated weights for policy 0, policy_version 42800 (0.0004) [2023-03-08 13:29:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10691.3). Total num frames: 21921792. Throughput: 0: 10814.8. Samples: 21904652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:29:16,893][434108] Avg episode reward: [(0, '4506.328')] [2023-03-08 13:29:19,662][434396] Updated weights for policy 0, policy_version 42880 (0.0005) [2023-03-08 13:29:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.2, 300 sec: 10691.3). Total num frames: 21975040. Throughput: 0: 10881.4. Samples: 21969156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:29:21,893][434108] Avg episode reward: [(0, '4588.352')] [2023-03-08 13:29:21,919][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000042928_21979136.pth... [2023-03-08 13:29:21,920][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000042288_21651456.pth [2023-03-08 13:29:23,417][434396] Updated weights for policy 0, policy_version 42960 (0.0004) [2023-03-08 13:29:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10691.3). Total num frames: 22028288. Throughput: 0: 10887.6. Samples: 22002196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:29:26,893][434108] Avg episode reward: [(0, '4588.274')] [2023-03-08 13:29:27,319][434396] Updated weights for policy 0, policy_version 43040 (0.0005) [2023-03-08 13:29:31,173][434396] Updated weights for policy 0, policy_version 43120 (0.0005) [2023-03-08 13:29:31,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10705.1). Total num frames: 22081536. Throughput: 0: 10831.4. 
Samples: 22065144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:29:31,893][434108] Avg episode reward: [(0, '4560.485')] [2023-03-08 13:29:35,015][434396] Updated weights for policy 0, policy_version 43200 (0.0005) [2023-03-08 13:29:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 10719.0). Total num frames: 22138880. Throughput: 0: 10822.5. Samples: 22129724. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:29:36,893][434108] Avg episode reward: [(0, '4437.538')] [2023-03-08 13:29:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000043240_22138880.pth... [2023-03-08 13:29:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000042600_21811200.pth [2023-03-08 13:29:38,722][434396] Updated weights for policy 0, policy_version 43280 (0.0004) [2023-03-08 13:29:41,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10854.4, 300 sec: 10719.0). Total num frames: 22192128. Throughput: 0: 10843.1. Samples: 22162680. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:29:41,893][434108] Avg episode reward: [(0, '4552.367')] [2023-03-08 13:29:42,425][434396] Updated weights for policy 0, policy_version 43360 (0.0004) [2023-03-08 13:29:46,141][434396] Updated weights for policy 0, policy_version 43440 (0.0004) [2023-03-08 13:29:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10719.0). Total num frames: 22245376. Throughput: 0: 10843.4. Samples: 22229056. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:29:46,893][434108] Avg episode reward: [(0, '4554.118')] [2023-03-08 13:29:50,128][434396] Updated weights for policy 0, policy_version 43520 (0.0005) [2023-03-08 13:29:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10732.9). Total num frames: 22298624. Throughput: 0: 10842.0. Samples: 22290964. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:29:51,893][434108] Avg episode reward: [(0, '4617.311')] [2023-03-08 13:29:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000043552_22298624.pth... [2023-03-08 13:29:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000042928_21979136.pth [2023-03-08 13:29:51,900][434347] Saving new best policy, reward=4617.311! [2023-03-08 13:29:54,052][434396] Updated weights for policy 0, policy_version 43600 (0.0005) [2023-03-08 13:29:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10746.8). Total num frames: 22351872. Throughput: 0: 10831.3. Samples: 22323192. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:29:56,893][434108] Avg episode reward: [(0, '4623.508')] [2023-03-08 13:29:56,894][434347] Saving new best policy, reward=4623.508! [2023-03-08 13:29:57,952][434396] Updated weights for policy 0, policy_version 43680 (0.0005) [2023-03-08 13:30:01,885][434396] Updated weights for policy 0, policy_version 43760 (0.0005) [2023-03-08 13:30:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10746.8). Total num frames: 22405120. Throughput: 0: 10687.7. Samples: 22385600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:01,893][434108] Avg episode reward: [(0, '4590.402')] [2023-03-08 13:30:05,682][434396] Updated weights for policy 0, policy_version 43840 (0.0004) [2023-03-08 13:30:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10746.8). Total num frames: 22458368. Throughput: 0: 10684.0. Samples: 22449936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:06,893][434108] Avg episode reward: [(0, '4489.124')] [2023-03-08 13:30:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000043864_22458368.pth... 
[2023-03-08 13:30:06,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000043240_22138880.pth [2023-03-08 13:30:09,381][434396] Updated weights for policy 0, policy_version 43920 (0.0004) [2023-03-08 13:30:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10760.7). Total num frames: 22511616. Throughput: 0: 10684.7. Samples: 22483008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:11,893][434108] Avg episode reward: [(0, '4403.048')] [2023-03-08 13:30:13,095][434396] Updated weights for policy 0, policy_version 44000 (0.0005) [2023-03-08 13:30:16,816][434396] Updated weights for policy 0, policy_version 44080 (0.0005) [2023-03-08 13:30:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10760.7). Total num frames: 22568960. Throughput: 0: 10744.5. Samples: 22548648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:16,893][434108] Avg episode reward: [(0, '4541.938')] [2023-03-08 13:30:20,474][434396] Updated weights for policy 0, policy_version 44160 (0.0004) [2023-03-08 13:30:21,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10786.1, 300 sec: 10760.7). Total num frames: 22622208. Throughput: 0: 10803.0. Samples: 22615860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:21,893][434108] Avg episode reward: [(0, '4487.248')] [2023-03-08 13:30:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000044184_22622208.pth... [2023-03-08 13:30:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000043552_22298624.pth [2023-03-08 13:30:24,150][434396] Updated weights for policy 0, policy_version 44240 (0.0004) [2023-03-08 13:30:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10774.6). Total num frames: 22679552. Throughput: 0: 10816.8. Samples: 22649436. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:26,893][434108] Avg episode reward: [(0, '4562.421')] [2023-03-08 13:30:27,765][434396] Updated weights for policy 0, policy_version 44320 (0.0004) [2023-03-08 13:30:31,741][434396] Updated weights for policy 0, policy_version 44400 (0.0005) [2023-03-08 13:30:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10774.6). Total num frames: 22732800. Throughput: 0: 10814.3. Samples: 22715700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:31,893][434108] Avg episode reward: [(0, '4574.590')] [2023-03-08 13:30:35,688][434396] Updated weights for policy 0, policy_version 44480 (0.0005) [2023-03-08 13:30:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10760.7). Total num frames: 22786048. Throughput: 0: 10813.7. Samples: 22777580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:36,893][434108] Avg episode reward: [(0, '4494.017')] [2023-03-08 13:30:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000044504_22786048.pth... [2023-03-08 13:30:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000043864_22458368.pth [2023-03-08 13:30:39,608][434396] Updated weights for policy 0, policy_version 44560 (0.0005) [2023-03-08 13:30:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10746.8). Total num frames: 22835200. Throughput: 0: 10786.0. Samples: 22808564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:41,893][434108] Avg episode reward: [(0, '4482.692')] [2023-03-08 13:30:43,549][434396] Updated weights for policy 0, policy_version 44640 (0.0005) [2023-03-08 13:30:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10746.8). Total num frames: 22888448. Throughput: 0: 10793.2. Samples: 22871292. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:46,893][434108] Avg episode reward: [(0, '4427.857')] [2023-03-08 13:30:47,509][434396] Updated weights for policy 0, policy_version 44720 (0.0005) [2023-03-08 13:30:51,503][434396] Updated weights for policy 0, policy_version 44800 (0.0005) [2023-03-08 13:30:51,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10732.9). Total num frames: 22937600. Throughput: 0: 10733.5. Samples: 22932944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:30:51,893][434108] Avg episode reward: [(0, '4417.573')] [2023-03-08 13:30:51,915][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000044808_22941696.pth... [2023-03-08 13:30:51,917][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000044184_22622208.pth [2023-03-08 13:30:55,471][434396] Updated weights for policy 0, policy_version 44880 (0.0005) [2023-03-08 13:30:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10746.8). Total num frames: 22990848. Throughput: 0: 10673.9. Samples: 22963332. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:30:56,893][434108] Avg episode reward: [(0, '4554.193')] [2023-03-08 13:30:59,447][434396] Updated weights for policy 0, policy_version 44960 (0.0005) [2023-03-08 13:31:01,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10760.7). Total num frames: 23044096. Throughput: 0: 10599.8. Samples: 23025640. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:31:01,893][434108] Avg episode reward: [(0, '4563.151')] [2023-03-08 13:31:03,307][434396] Updated weights for policy 0, policy_version 45040 (0.0005) [2023-03-08 13:31:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10760.7). Total num frames: 23097344. Throughput: 0: 10512.7. Samples: 23088932. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:31:06,893][434108] Avg episode reward: [(0, '4592.662')] [2023-03-08 13:31:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000045112_23097344.pth... [2023-03-08 13:31:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000044504_22786048.pth [2023-03-08 13:31:07,276][434396] Updated weights for policy 0, policy_version 45120 (0.0005) [2023-03-08 13:31:11,317][434396] Updated weights for policy 0, policy_version 45200 (0.0005) [2023-03-08 13:31:11,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10746.8). Total num frames: 23146496. Throughput: 0: 10438.2. Samples: 23119156. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:31:11,893][434108] Avg episode reward: [(0, '4536.771')] [2023-03-08 13:31:15,367][434396] Updated weights for policy 0, policy_version 45280 (0.0005) [2023-03-08 13:31:16,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10444.8, 300 sec: 10746.8). Total num frames: 23195648. Throughput: 0: 10303.0. Samples: 23179336. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:31:16,893][434108] Avg episode reward: [(0, '4527.536')] [2023-03-08 13:31:19,323][434396] Updated weights for policy 0, policy_version 45360 (0.0005) [2023-03-08 13:31:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10746.8). Total num frames: 23248896. Throughput: 0: 10316.5. Samples: 23241824. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:31:21,893][434108] Avg episode reward: [(0, '4508.066')] [2023-03-08 13:31:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000045408_23248896.pth... 
[2023-03-08 13:31:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000044808_22941696.pth [2023-03-08 13:31:23,302][434396] Updated weights for policy 0, policy_version 45440 (0.0005) [2023-03-08 13:31:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10746.8). Total num frames: 23302144. Throughput: 0: 10323.6. Samples: 23273124. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:31:26,893][434108] Avg episode reward: [(0, '4586.707')] [2023-03-08 13:31:27,282][434396] Updated weights for policy 0, policy_version 45520 (0.0005) [2023-03-08 13:31:31,073][434396] Updated weights for policy 0, policy_version 45600 (0.0004) [2023-03-08 13:31:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10746.8). Total num frames: 23355392. Throughput: 0: 10304.1. Samples: 23334976. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:31:31,893][434108] Avg episode reward: [(0, '4605.309')] [2023-03-08 13:31:34,806][434396] Updated weights for policy 0, policy_version 45680 (0.0005) [2023-03-08 13:31:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10760.7). Total num frames: 23408640. Throughput: 0: 10411.5. Samples: 23401464. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:31:36,903][434108] Avg episode reward: [(0, '4533.489')] [2023-03-08 13:31:36,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000045720_23408640.pth... [2023-03-08 13:31:36,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000045112_23097344.pth [2023-03-08 13:31:38,587][434396] Updated weights for policy 0, policy_version 45760 (0.0004) [2023-03-08 13:31:41,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10760.7). Total num frames: 23461888. Throughput: 0: 10443.3. Samples: 23433280. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:31:41,904][434108] Avg episode reward: [(0, '4576.382')] [2023-03-08 13:31:42,587][434396] Updated weights for policy 0, policy_version 45840 (0.0005) [2023-03-08 13:31:46,528][434396] Updated weights for policy 0, policy_version 45920 (0.0005) [2023-03-08 13:31:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10746.8). Total num frames: 23511040. Throughput: 0: 10429.2. Samples: 23494952. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:31:46,903][434108] Avg episode reward: [(0, '4559.311')] [2023-03-08 13:31:50,482][434396] Updated weights for policy 0, policy_version 46000 (0.0005) [2023-03-08 13:31:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10732.9). Total num frames: 23564288. Throughput: 0: 10414.8. Samples: 23557600. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:31:51,893][434108] Avg episode reward: [(0, '4522.102')] [2023-03-08 13:31:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000046024_23564288.pth... [2023-03-08 13:31:51,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000045408_23248896.pth [2023-03-08 13:31:54,505][434396] Updated weights for policy 0, policy_version 46080 (0.0005) [2023-03-08 13:31:56,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10705.1). Total num frames: 23613440. Throughput: 0: 10417.9. Samples: 23587960. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:31:56,893][434108] Avg episode reward: [(0, '4397.356')] [2023-03-08 13:31:58,519][434396] Updated weights for policy 0, policy_version 46160 (0.0005) [2023-03-08 13:32:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10705.1). Total num frames: 23666688. Throughput: 0: 10461.2. Samples: 23650092. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:32:01,893][434108] Avg episode reward: [(0, '4278.537')] [2023-03-08 13:32:02,503][434396] Updated weights for policy 0, policy_version 46240 (0.0006) [2023-03-08 13:32:06,494][434396] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-08 13:32:06,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10677.4). Total num frames: 23715840. Throughput: 0: 10437.9. Samples: 23711528. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:32:06,893][434108] Avg episode reward: [(0, '4400.592')] [2023-03-08 13:32:06,905][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000046328_23719936.pth... [2023-03-08 13:32:06,907][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000045720_23408640.pth [2023-03-08 13:32:10,447][434396] Updated weights for policy 0, policy_version 46400 (0.0005) [2023-03-08 13:32:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10677.4). Total num frames: 23769088. Throughput: 0: 10411.9. Samples: 23741660. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:32:11,893][434108] Avg episode reward: [(0, '4355.892')] [2023-03-08 13:32:14,271][434396] Updated weights for policy 0, policy_version 46480 (0.0004) [2023-03-08 13:32:16,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10663.5). Total num frames: 23822336. Throughput: 0: 10466.5. Samples: 23805968. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:32:16,893][434108] Avg episode reward: [(0, '4540.454')] [2023-03-08 13:32:18,235][434396] Updated weights for policy 0, policy_version 46560 (0.0005) [2023-03-08 13:32:21,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10663.5). Total num frames: 23875584. Throughput: 0: 10381.1. Samples: 23868616. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:32:21,893][434108] Avg episode reward: [(0, '4594.130')] [2023-03-08 13:32:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000046632_23875584.pth... [2023-03-08 13:32:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000046024_23564288.pth [2023-03-08 13:32:22,056][434396] Updated weights for policy 0, policy_version 46640 (0.0004) [2023-03-08 13:32:26,273][434396] Updated weights for policy 0, policy_version 46720 (0.0006) [2023-03-08 13:32:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10635.7). Total num frames: 23924736. Throughput: 0: 10359.2. Samples: 23899444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:32:26,893][434108] Avg episode reward: [(0, '4449.883')] [2023-03-08 13:32:30,106][434396] Updated weights for policy 0, policy_version 46800 (0.0005) [2023-03-08 13:32:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10649.6). Total num frames: 23977984. Throughput: 0: 10345.0. Samples: 23960476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:32:31,893][434108] Avg episode reward: [(0, '4452.980')] [2023-03-08 13:32:33,980][434396] Updated weights for policy 0, policy_version 46880 (0.0005) [2023-03-08 13:32:36,893][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10649.6). Total num frames: 24031232. Throughput: 0: 10344.5. Samples: 24023104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:32:36,893][434108] Avg episode reward: [(0, '4245.820')] [2023-03-08 13:32:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000046936_24031232.pth... 
[2023-03-08 13:32:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000046328_23719936.pth [2023-03-08 13:32:37,991][434396] Updated weights for policy 0, policy_version 46960 (0.0005) [2023-03-08 13:32:41,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10621.8). Total num frames: 24080384. Throughput: 0: 10358.5. Samples: 24054092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:32:41,893][434108] Avg episode reward: [(0, '4045.059')] [2023-03-08 13:32:42,043][434396] Updated weights for policy 0, policy_version 47040 (0.0005) [2023-03-08 13:32:46,072][434396] Updated weights for policy 0, policy_version 47120 (0.0005) [2023-03-08 13:32:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10621.8). Total num frames: 24133632. Throughput: 0: 10329.1. Samples: 24114900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:32:46,893][434108] Avg episode reward: [(0, '3897.921')] [2023-03-08 13:32:50,171][434396] Updated weights for policy 0, policy_version 47200 (0.0005) [2023-03-08 13:32:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10607.9). Total num frames: 24182784. Throughput: 0: 10292.6. Samples: 24174696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:32:51,893][434108] Avg episode reward: [(0, '3602.427')] [2023-03-08 13:32:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000047232_24182784.pth... [2023-03-08 13:32:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000046632_23875584.pth [2023-03-08 13:32:54,161][434396] Updated weights for policy 0, policy_version 47280 (0.0005) [2023-03-08 13:32:56,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10607.9). Total num frames: 24236032. Throughput: 0: 10316.5. Samples: 24205904. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:32:56,893][434108] Avg episode reward: [(0, '3786.963')] [2023-03-08 13:32:58,090][434396] Updated weights for policy 0, policy_version 47360 (0.0005) [2023-03-08 13:33:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10580.2). Total num frames: 24285184. Throughput: 0: 10286.6. Samples: 24268864. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:33:01,893][434108] Avg episode reward: [(0, '3895.377')] [2023-03-08 13:33:02,036][434396] Updated weights for policy 0, policy_version 47440 (0.0005) [2023-03-08 13:33:06,033][434396] Updated weights for policy 0, policy_version 47520 (0.0005) [2023-03-08 13:33:06,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10580.2). Total num frames: 24338432. Throughput: 0: 10258.3. Samples: 24330240. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:33:06,893][434108] Avg episode reward: [(0, '3803.146')] [2023-03-08 13:33:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000047536_24338432.pth... [2023-03-08 13:33:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000046936_24031232.pth [2023-03-08 13:33:09,790][434396] Updated weights for policy 0, policy_version 47600 (0.0004) [2023-03-08 13:33:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10566.3). Total num frames: 24391680. Throughput: 0: 10301.6. Samples: 24363016. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:33:11,893][434108] Avg episode reward: [(0, '4093.204')] [2023-03-08 13:33:13,457][434396] Updated weights for policy 0, policy_version 47680 (0.0004) [2023-03-08 13:33:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10566.3). Total num frames: 24444928. Throughput: 0: 10397.2. Samples: 24428352. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:33:16,893][434108] Avg episode reward: [(0, '4084.460')] [2023-03-08 13:33:17,349][434396] Updated weights for policy 0, policy_version 47760 (0.0005) [2023-03-08 13:33:21,355][434396] Updated weights for policy 0, policy_version 47840 (0.0005) [2023-03-08 13:33:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10552.4). Total num frames: 24498176. Throughput: 0: 10378.0. Samples: 24490112. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:33:21,893][434108] Avg episode reward: [(0, '4061.738')] [2023-03-08 13:33:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000047848_24498176.pth... [2023-03-08 13:33:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000047232_24182784.pth [2023-03-08 13:33:25,069][434396] Updated weights for policy 0, policy_version 47920 (0.0004) [2023-03-08 13:33:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 24551424. Throughput: 0: 10426.2. Samples: 24523272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:33:26,893][434108] Avg episode reward: [(0, '4228.818')] [2023-03-08 13:33:29,056][434396] Updated weights for policy 0, policy_version 48000 (0.0005) [2023-03-08 13:33:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 24604672. Throughput: 0: 10458.1. Samples: 24585516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:33:31,893][434108] Avg episode reward: [(0, '4107.421')] [2023-03-08 13:33:32,931][434396] Updated weights for policy 0, policy_version 48080 (0.0005) [2023-03-08 13:33:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10552.4). Total num frames: 24653824. Throughput: 0: 10510.6. Samples: 24647672. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:33:36,893][434108] Avg episode reward: [(0, '4217.011')] [2023-03-08 13:33:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000048152_24653824.pth... [2023-03-08 13:33:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000047536_24338432.pth [2023-03-08 13:33:37,039][434396] Updated weights for policy 0, policy_version 48160 (0.0006) [2023-03-08 13:33:40,945][434396] Updated weights for policy 0, policy_version 48240 (0.0005) [2023-03-08 13:33:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 24707072. Throughput: 0: 10500.1. Samples: 24678408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:33:41,893][434108] Avg episode reward: [(0, '4210.206')] [2023-03-08 13:33:44,967][434396] Updated weights for policy 0, policy_version 48320 (0.0005) [2023-03-08 13:33:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 24756224. Throughput: 0: 10470.0. Samples: 24740016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:33:46,893][434108] Avg episode reward: [(0, '4369.886')] [2023-03-08 13:33:48,959][434396] Updated weights for policy 0, policy_version 48400 (0.0005) [2023-03-08 13:33:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 24809472. Throughput: 0: 10479.4. Samples: 24801812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:33:51,893][434108] Avg episode reward: [(0, '4305.975')] [2023-03-08 13:33:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000048456_24809472.pth... 
[2023-03-08 13:33:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000047848_24498176.pth [2023-03-08 13:33:52,917][434396] Updated weights for policy 0, policy_version 48480 (0.0005) [2023-03-08 13:33:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.6, 300 sec: 10510.8). Total num frames: 24858624. Throughput: 0: 10454.3. Samples: 24833460. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:33:56,893][434108] Avg episode reward: [(0, '4095.562')] [2023-03-08 13:33:56,904][434396] Updated weights for policy 0, policy_version 48560 (0.0005) [2023-03-08 13:34:00,999][434396] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-08 13:34:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 24911872. Throughput: 0: 10353.8. Samples: 24894272. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:34:01,893][434108] Avg episode reward: [(0, '4312.847')] [2023-03-08 13:34:05,084][434396] Updated weights for policy 0, policy_version 48720 (0.0006) [2023-03-08 13:34:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 24961024. Throughput: 0: 10311.5. Samples: 24954128. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:34:06,893][434108] Avg episode reward: [(0, '4374.911')] [2023-03-08 13:34:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000048752_24961024.pth... [2023-03-08 13:34:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000048152_24653824.pth [2023-03-08 13:34:09,126][434396] Updated weights for policy 0, policy_version 48800 (0.0006) [2023-03-08 13:34:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 25014272. Throughput: 0: 10261.4. Samples: 24985036. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:34:11,893][434108] Avg episode reward: [(0, '4397.871')] [2023-03-08 13:34:13,095][434396] Updated weights for policy 0, policy_version 48880 (0.0005) [2023-03-08 13:34:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 25063424. Throughput: 0: 10238.7. Samples: 25046256. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:34:16,893][434108] Avg episode reward: [(0, '4374.999')] [2023-03-08 13:34:17,176][434396] Updated weights for policy 0, policy_version 48960 (0.0006) [2023-03-08 13:34:21,135][434396] Updated weights for policy 0, policy_version 49040 (0.0005) [2023-03-08 13:34:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10455.2). Total num frames: 25112576. Throughput: 0: 10223.6. Samples: 25107732. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:34:21,893][434108] Avg episode reward: [(0, '4289.388')] [2023-03-08 13:34:21,914][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000049056_25116672.pth... [2023-03-08 13:34:21,917][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000048456_24809472.pth [2023-03-08 13:34:25,097][434396] Updated weights for policy 0, policy_version 49120 (0.0006) [2023-03-08 13:34:26,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10455.2). Total num frames: 25165824. Throughput: 0: 10216.4. Samples: 25138148. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:34:26,893][434108] Avg episode reward: [(0, '4458.467')] [2023-03-08 13:34:29,059][434396] Updated weights for policy 0, policy_version 49200 (0.0005) [2023-03-08 13:34:31,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10441.3). Total num frames: 25219072. Throughput: 0: 10226.3. Samples: 25200200. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:34:31,893][434108] Avg episode reward: [(0, '4402.278')] [2023-03-08 13:34:33,144][434396] Updated weights for policy 0, policy_version 49280 (0.0005) [2023-03-08 13:34:36,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 25268224. Throughput: 0: 10182.8. Samples: 25260040. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:34:36,893][434108] Avg episode reward: [(0, '4469.438')] [2023-03-08 13:34:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000049352_25268224.pth... [2023-03-08 13:34:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000048752_24961024.pth [2023-03-08 13:34:37,237][434396] Updated weights for policy 0, policy_version 49360 (0.0005) [2023-03-08 13:34:41,276][434396] Updated weights for policy 0, policy_version 49440 (0.0006) [2023-03-08 13:34:41,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10413.6). Total num frames: 25317376. Throughput: 0: 10157.6. Samples: 25290552. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:34:41,893][434108] Avg episode reward: [(0, '4403.045')] [2023-03-08 13:34:45,301][434396] Updated weights for policy 0, policy_version 49520 (0.0005) [2023-03-08 13:34:46,892][434108] Fps is (10 sec: 9830.6, 60 sec: 10171.7, 300 sec: 10399.7). Total num frames: 25366528. Throughput: 0: 10158.4. Samples: 25351400. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:34:46,893][434108] Avg episode reward: [(0, '4398.456')] [2023-03-08 13:34:49,405][434396] Updated weights for policy 0, policy_version 49600 (0.0006) [2023-03-08 13:34:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10399.7). Total num frames: 25419776. Throughput: 0: 10165.8. Samples: 25411592. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:34:51,893][434108] Avg episode reward: [(0, '4366.384')] [2023-03-08 13:34:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000049648_25419776.pth... [2023-03-08 13:34:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000049056_25116672.pth [2023-03-08 13:34:53,461][434396] Updated weights for policy 0, policy_version 49680 (0.0006) [2023-03-08 13:34:56,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10385.8). Total num frames: 25468928. Throughput: 0: 10147.3. Samples: 25441664. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:34:56,893][434108] Avg episode reward: [(0, '4495.097')] [2023-03-08 13:34:57,520][434396] Updated weights for policy 0, policy_version 49760 (0.0006) [2023-03-08 13:35:01,519][434396] Updated weights for policy 0, policy_version 49840 (0.0006) [2023-03-08 13:35:01,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10371.9). Total num frames: 25518080. Throughput: 0: 10147.2. Samples: 25502880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:01,893][434108] Avg episode reward: [(0, '4525.094')] [2023-03-08 13:35:05,584][434396] Updated weights for policy 0, policy_version 49920 (0.0006) [2023-03-08 13:35:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10371.9). Total num frames: 25571328. Throughput: 0: 10121.7. Samples: 25563208. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:06,893][434108] Avg episode reward: [(0, '4460.762')] [2023-03-08 13:35:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000049944_25571328.pth... 
[2023-03-08 13:35:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000049352_25268224.pth [2023-03-08 13:35:09,603][434396] Updated weights for policy 0, policy_version 50000 (0.0006) [2023-03-08 13:35:11,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10344.1). Total num frames: 25620480. Throughput: 0: 10128.6. Samples: 25593936. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:11,893][434108] Avg episode reward: [(0, '4555.488')] [2023-03-08 13:35:13,672][434396] Updated weights for policy 0, policy_version 50080 (0.0005) [2023-03-08 13:35:16,892][434108] Fps is (10 sec: 10240.2, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 25673728. Throughput: 0: 10097.7. Samples: 25654596. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:16,893][434108] Avg episode reward: [(0, '4582.866')] [2023-03-08 13:35:17,636][434396] Updated weights for policy 0, policy_version 50160 (0.0005) [2023-03-08 13:35:21,422][434396] Updated weights for policy 0, policy_version 50240 (0.0005) [2023-03-08 13:35:21,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10330.2). Total num frames: 25726976. Throughput: 0: 10194.7. Samples: 25718800. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:21,893][434108] Avg episode reward: [(0, '4538.162')] [2023-03-08 13:35:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000050248_25726976.pth... [2023-03-08 13:35:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000049648_25419776.pth [2023-03-08 13:35:25,310][434396] Updated weights for policy 0, policy_version 50320 (0.0005) [2023-03-08 13:35:26,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10330.3). Total num frames: 25780224. Throughput: 0: 10225.9. Samples: 25750716. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:26,893][434108] Avg episode reward: [(0, '4537.412')] [2023-03-08 13:35:29,094][434396] Updated weights for policy 0, policy_version 50400 (0.0005) [2023-03-08 13:35:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10330.3). Total num frames: 25833472. Throughput: 0: 10305.0. Samples: 25815124. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:31,893][434108] Avg episode reward: [(0, '4443.412')] [2023-03-08 13:35:32,808][434396] Updated weights for policy 0, policy_version 50480 (0.0005) [2023-03-08 13:35:36,533][434396] Updated weights for policy 0, policy_version 50560 (0.0005) [2023-03-08 13:35:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10376.6, 300 sec: 10358.0). Total num frames: 25890816. Throughput: 0: 10421.5. Samples: 25880560. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:36,893][434108] Avg episode reward: [(0, '4486.250')] [2023-03-08 13:35:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000050568_25890816.pth... [2023-03-08 13:35:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000049944_25571328.pth [2023-03-08 13:35:40,184][434396] Updated weights for policy 0, policy_version 50640 (0.0004) [2023-03-08 13:35:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 25944064. Throughput: 0: 10504.8. Samples: 25914380. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:41,893][434108] Avg episode reward: [(0, '4475.734')] [2023-03-08 13:35:43,833][434396] Updated weights for policy 0, policy_version 50720 (0.0005) [2023-03-08 13:35:46,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10385.8). Total num frames: 26001408. Throughput: 0: 10640.9. Samples: 25981720. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:46,904][434108] Avg episode reward: [(0, '4531.409')] [2023-03-08 13:35:47,488][434396] Updated weights for policy 0, policy_version 50800 (0.0004) [2023-03-08 13:35:51,482][434396] Updated weights for policy 0, policy_version 50880 (0.0005) [2023-03-08 13:35:51,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10385.8). Total num frames: 26054656. Throughput: 0: 10732.7. Samples: 26046180. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:51,893][434108] Avg episode reward: [(0, '4587.697')] [2023-03-08 13:35:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000050888_26054656.pth... [2023-03-08 13:35:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000050248_25726976.pth [2023-03-08 13:35:55,452][434396] Updated weights for policy 0, policy_version 50960 (0.0005) [2023-03-08 13:35:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10371.9). Total num frames: 26103808. Throughput: 0: 10736.2. Samples: 26077064. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:35:56,893][434108] Avg episode reward: [(0, '4469.990')] [2023-03-08 13:35:59,387][434396] Updated weights for policy 0, policy_version 51040 (0.0005) [2023-03-08 13:36:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10371.9). Total num frames: 26157056. Throughput: 0: 10773.2. Samples: 26139392. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:36:01,893][434108] Avg episode reward: [(0, '4546.278')] [2023-03-08 13:36:03,391][434396] Updated weights for policy 0, policy_version 51120 (0.0005) [2023-03-08 13:36:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10371.9). Total num frames: 26206208. Throughput: 0: 10700.5. Samples: 26200320. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:36:06,893][434108] Avg episode reward: [(0, '4528.107')] [2023-03-08 13:36:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000051184_26206208.pth... [2023-03-08 13:36:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000050568_25890816.pth [2023-03-08 13:36:07,392][434396] Updated weights for policy 0, policy_version 51200 (0.0005) [2023-03-08 13:36:11,474][434396] Updated weights for policy 0, policy_version 51280 (0.0004) [2023-03-08 13:36:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 26259456. Throughput: 0: 10668.3. Samples: 26230792. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:36:11,903][434108] Avg episode reward: [(0, '4433.135')] [2023-03-08 13:36:15,488][434396] Updated weights for policy 0, policy_version 51360 (0.0005) [2023-03-08 13:36:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10371.9). Total num frames: 26308608. Throughput: 0: 10593.6. Samples: 26291836. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:36:16,893][434108] Avg episode reward: [(0, '4554.482')] [2023-03-08 13:36:19,453][434396] Updated weights for policy 0, policy_version 51440 (0.0005) [2023-03-08 13:36:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10371.9). Total num frames: 26361856. Throughput: 0: 10523.9. Samples: 26354136. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:36:21,893][434108] Avg episode reward: [(0, '4458.053')] [2023-03-08 13:36:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000051488_26361856.pth... 
[2023-03-08 13:36:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000050888_26054656.pth [2023-03-08 13:36:23,132][434396] Updated weights for policy 0, policy_version 51520 (0.0004) [2023-03-08 13:36:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10371.9). Total num frames: 26415104. Throughput: 0: 10515.3. Samples: 26387568. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:36:26,893][434108] Avg episode reward: [(0, '4323.191')] [2023-03-08 13:36:27,000][434396] Updated weights for policy 0, policy_version 51600 (0.0005) [2023-03-08 13:36:31,114][434396] Updated weights for policy 0, policy_version 51680 (0.0005) [2023-03-08 13:36:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10358.0). Total num frames: 26464256. Throughput: 0: 10380.1. Samples: 26448824. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:36:31,893][434108] Avg episode reward: [(0, '4614.014')] [2023-03-08 13:36:35,115][434396] Updated weights for policy 0, policy_version 51760 (0.0005) [2023-03-08 13:36:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 26517504. Throughput: 0: 10319.7. Samples: 26510564. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:36:36,893][434108] Avg episode reward: [(0, '4580.569')] [2023-03-08 13:36:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000051792_26517504.pth... [2023-03-08 13:36:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000051184_26206208.pth [2023-03-08 13:36:39,091][434396] Updated weights for policy 0, policy_version 51840 (0.0005) [2023-03-08 13:36:41,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 26566656. Throughput: 0: 10330.8. Samples: 26541948. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:36:41,893][434108] Avg episode reward: [(0, '4483.938')] [2023-03-08 13:36:43,239][434396] Updated weights for policy 0, policy_version 51920 (0.0005) [2023-03-08 13:36:46,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10344.1). Total num frames: 26615808. Throughput: 0: 10239.5. Samples: 26600172. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:36:46,893][434108] Avg episode reward: [(0, '4491.090')] [2023-03-08 13:36:47,342][434396] Updated weights for policy 0, policy_version 52000 (0.0005) [2023-03-08 13:36:51,280][434396] Updated weights for policy 0, policy_version 52080 (0.0005) [2023-03-08 13:36:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 26669056. Throughput: 0: 10259.1. Samples: 26661980. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:36:51,893][434108] Avg episode reward: [(0, '4505.231')] [2023-03-08 13:36:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000052088_26669056.pth... [2023-03-08 13:36:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000051488_26361856.pth [2023-03-08 13:36:55,057][434396] Updated weights for policy 0, policy_version 52160 (0.0005) [2023-03-08 13:36:56,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 26722304. Throughput: 0: 10314.1. Samples: 26694928. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:36:56,893][434108] Avg episode reward: [(0, '4453.629')] [2023-03-08 13:36:59,108][434396] Updated weights for policy 0, policy_version 52240 (0.0005) [2023-03-08 13:37:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 26771456. Throughput: 0: 10315.7. Samples: 26756044. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:37:01,893][434108] Avg episode reward: [(0, '4351.692')] [2023-03-08 13:37:03,150][434396] Updated weights for policy 0, policy_version 52320 (0.0005) [2023-03-08 13:37:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 26824704. Throughput: 0: 10303.5. Samples: 26817792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:37:06,893][434108] Avg episode reward: [(0, '4527.548')] [2023-03-08 13:37:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000052392_26824704.pth... [2023-03-08 13:37:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000051792_26517504.pth [2023-03-08 13:37:07,065][434396] Updated weights for policy 0, policy_version 52400 (0.0005) [2023-03-08 13:37:10,944][434396] Updated weights for policy 0, policy_version 52480 (0.0005) [2023-03-08 13:37:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 26877952. Throughput: 0: 10264.4. Samples: 26849464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:37:11,893][434108] Avg episode reward: [(0, '4508.945')] [2023-03-08 13:37:14,843][434396] Updated weights for policy 0, policy_version 52560 (0.0005) [2023-03-08 13:37:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 26931200. Throughput: 0: 10299.5. Samples: 26912300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:37:16,893][434108] Avg episode reward: [(0, '4490.040')] [2023-03-08 13:37:18,884][434396] Updated weights for policy 0, policy_version 52640 (0.0005) [2023-03-08 13:37:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 26980352. Throughput: 0: 10312.0. Samples: 26974604. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:37:21,893][434108] Avg episode reward: [(0, '4473.730')] [2023-03-08 13:37:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000052696_26980352.pth... [2023-03-08 13:37:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000052088_26669056.pth [2023-03-08 13:37:22,798][434396] Updated weights for policy 0, policy_version 52720 (0.0005) [2023-03-08 13:37:26,791][434396] Updated weights for policy 0, policy_version 52800 (0.0005) [2023-03-08 13:37:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 27033600. Throughput: 0: 10292.3. Samples: 27005100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:37:26,893][434108] Avg episode reward: [(0, '4274.147')] [2023-03-08 13:37:30,747][434396] Updated weights for policy 0, policy_version 52880 (0.0005) [2023-03-08 13:37:31,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10344.1). Total num frames: 27082752. Throughput: 0: 10371.5. Samples: 27066888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:37:31,893][434108] Avg episode reward: [(0, '4404.142')] [2023-03-08 13:37:34,755][434396] Updated weights for policy 0, policy_version 52960 (0.0005) [2023-03-08 13:37:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 27136000. Throughput: 0: 10373.0. Samples: 27128764. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:37:36,893][434108] Avg episode reward: [(0, '4424.653')] [2023-03-08 13:37:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000053000_27136000.pth... 
[2023-03-08 13:37:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000052392_26824704.pth [2023-03-08 13:37:38,565][434396] Updated weights for policy 0, policy_version 53040 (0.0005) [2023-03-08 13:37:41,892][434108] Fps is (10 sec: 10649.8, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 27189248. Throughput: 0: 10355.9. Samples: 27160944. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:37:41,893][434108] Avg episode reward: [(0, '4507.327')] [2023-03-08 13:37:42,495][434396] Updated weights for policy 0, policy_version 53120 (0.0005) [2023-03-08 13:37:46,489][434396] Updated weights for policy 0, policy_version 53200 (0.0005) [2023-03-08 13:37:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 27242496. Throughput: 0: 10381.4. Samples: 27223208. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:37:46,893][434108] Avg episode reward: [(0, '4563.001')] [2023-03-08 13:37:50,487][434396] Updated weights for policy 0, policy_version 53280 (0.0005) [2023-03-08 13:37:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 27291648. Throughput: 0: 10368.1. Samples: 27284356. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:37:51,893][434108] Avg episode reward: [(0, '4588.153')] [2023-03-08 13:37:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000053304_27291648.pth... [2023-03-08 13:37:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000052696_26980352.pth [2023-03-08 13:37:54,542][434396] Updated weights for policy 0, policy_version 53360 (0.0005) [2023-03-08 13:37:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 27340800. Throughput: 0: 10361.3. Samples: 27315724. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:37:56,903][434108] Avg episode reward: [(0, '4604.127')] [2023-03-08 13:37:58,440][434396] Updated weights for policy 0, policy_version 53440 (0.0005) [2023-03-08 13:38:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 27398144. Throughput: 0: 10353.8. Samples: 27378220. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:38:01,903][434108] Avg episode reward: [(0, '4595.860')] [2023-03-08 13:38:02,170][434396] Updated weights for policy 0, policy_version 53520 (0.0004) [2023-03-08 13:38:06,003][434396] Updated weights for policy 0, policy_version 53600 (0.0004) [2023-03-08 13:38:06,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 27451392. Throughput: 0: 10414.7. Samples: 27443264. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:38:06,903][434108] Avg episode reward: [(0, '4568.077')] [2023-03-08 13:38:06,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000053616_27451392.pth... [2023-03-08 13:38:06,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000053000_27136000.pth [2023-03-08 13:38:09,998][434396] Updated weights for policy 0, policy_version 53680 (0.0005) [2023-03-08 13:38:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 27500544. Throughput: 0: 10429.5. Samples: 27474428. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:38:11,893][434108] Avg episode reward: [(0, '4497.255')] [2023-03-08 13:38:14,027][434396] Updated weights for policy 0, policy_version 53760 (0.0005) [2023-03-08 13:38:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 27553792. Throughput: 0: 10413.7. Samples: 27535504. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:38:16,893][434108] Avg episode reward: [(0, '4400.900')] [2023-03-08 13:38:17,837][434396] Updated weights for policy 0, policy_version 53840 (0.0005) [2023-03-08 13:38:21,449][434396] Updated weights for policy 0, policy_version 53920 (0.0004) [2023-03-08 13:38:21,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10513.1, 300 sec: 10371.9). Total num frames: 27611136. Throughput: 0: 10513.3. Samples: 27601864. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:38:21,893][434108] Avg episode reward: [(0, '4465.820')] [2023-03-08 13:38:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000053928_27611136.pth... [2023-03-08 13:38:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000053304_27291648.pth [2023-03-08 13:38:25,272][434396] Updated weights for policy 0, policy_version 54000 (0.0005) [2023-03-08 13:38:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10371.9). Total num frames: 27664384. Throughput: 0: 10507.2. Samples: 27633768. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:38:26,893][434108] Avg episode reward: [(0, '4352.405')] [2023-03-08 13:38:28,950][434396] Updated weights for policy 0, policy_version 54080 (0.0004) [2023-03-08 13:38:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10399.7). Total num frames: 27721728. Throughput: 0: 10618.5. Samples: 27701040. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:38:31,893][434108] Avg episode reward: [(0, '4462.344')] [2023-03-08 13:38:32,598][434396] Updated weights for policy 0, policy_version 54160 (0.0004) [2023-03-08 13:38:36,344][434396] Updated weights for policy 0, policy_version 54240 (0.0005) [2023-03-08 13:38:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10399.7). Total num frames: 27774976. Throughput: 0: 10727.0. 
Samples: 27767072. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:38:36,893][434108] Avg episode reward: [(0, '4498.936')] [2023-03-08 13:38:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000054248_27774976.pth... [2023-03-08 13:38:36,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000053616_27451392.pth [2023-03-08 13:38:40,319][434396] Updated weights for policy 0, policy_version 54320 (0.0005) [2023-03-08 13:38:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10399.7). Total num frames: 27824128. Throughput: 0: 10731.2. Samples: 27798628. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 13:38:41,893][434108] Avg episode reward: [(0, '4444.647')] [2023-03-08 13:38:44,245][434396] Updated weights for policy 0, policy_version 54400 (0.0005) [2023-03-08 13:38:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10399.7). Total num frames: 27877376. Throughput: 0: 10729.7. Samples: 27861056. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:38:46,893][434108] Avg episode reward: [(0, '4434.336')] [2023-03-08 13:38:48,215][434396] Updated weights for policy 0, policy_version 54480 (0.0005) [2023-03-08 13:38:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10413.6). Total num frames: 27930624. Throughput: 0: 10695.0. Samples: 27924540. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:38:51,893][434108] Avg episode reward: [(0, '4502.874')] [2023-03-08 13:38:51,926][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000054560_27934720.pth... 
[2023-03-08 13:38:51,926][434396] Updated weights for policy 0, policy_version 54560 (0.0004) [2023-03-08 13:38:51,927][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000053928_27611136.pth [2023-03-08 13:38:55,809][434396] Updated weights for policy 0, policy_version 54640 (0.0005) [2023-03-08 13:38:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10413.6). Total num frames: 27983872. Throughput: 0: 10736.2. Samples: 27957556. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:38:56,893][434108] Avg episode reward: [(0, '4564.779')] [2023-03-08 13:38:59,815][434396] Updated weights for policy 0, policy_version 54720 (0.0005) [2023-03-08 13:39:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10427.4). Total num frames: 28037120. Throughput: 0: 10743.3. Samples: 28018952. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:39:01,893][434108] Avg episode reward: [(0, '4617.973')] [2023-03-08 13:39:03,775][434396] Updated weights for policy 0, policy_version 54800 (0.0005) [2023-03-08 13:39:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10427.4). Total num frames: 28090368. Throughput: 0: 10673.6. Samples: 28082176. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:39:06,893][434108] Avg episode reward: [(0, '4550.386')] [2023-03-08 13:39:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000054864_28090368.pth... [2023-03-08 13:39:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000054248_27774976.pth [2023-03-08 13:39:07,719][434396] Updated weights for policy 0, policy_version 54880 (0.0005) [2023-03-08 13:39:11,540][434396] Updated weights for policy 0, policy_version 54960 (0.0005) [2023-03-08 13:39:11,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10427.4). Total num frames: 28139520. 
Throughput: 0: 10617.2. Samples: 28111540. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:39:11,893][434108] Avg episode reward: [(0, '4590.366')] [2023-03-08 13:39:15,171][434396] Updated weights for policy 0, policy_version 55040 (0.0004) [2023-03-08 13:39:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10455.2). Total num frames: 28196864. Throughput: 0: 10614.4. Samples: 28178688. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:39:16,893][434108] Avg episode reward: [(0, '4601.328')] [2023-03-08 13:39:18,838][434396] Updated weights for policy 0, policy_version 55120 (0.0004) [2023-03-08 13:39:21,892][434108] Fps is (10 sec: 11468.7, 60 sec: 10717.9, 300 sec: 10469.1). Total num frames: 28254208. Throughput: 0: 10642.4. Samples: 28245980. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:39:21,893][434108] Avg episode reward: [(0, '4635.585')] [2023-03-08 13:39:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000055184_28254208.pth... [2023-03-08 13:39:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000054560_27934720.pth [2023-03-08 13:39:21,899][434347] Saving new best policy, reward=4635.585! [2023-03-08 13:39:22,505][434396] Updated weights for policy 0, policy_version 55200 (0.0004) [2023-03-08 13:39:26,123][434396] Updated weights for policy 0, policy_version 55280 (0.0004) [2023-03-08 13:39:26,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10786.1, 300 sec: 10483.0). Total num frames: 28311552. Throughput: 0: 10683.6. Samples: 28279388. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:39:26,893][434108] Avg episode reward: [(0, '4646.037')] [2023-03-08 13:39:26,893][434347] Saving new best policy, reward=4646.037! 
[2023-03-08 13:39:29,806][434396] Updated weights for policy 0, policy_version 55360 (0.0004) [2023-03-08 13:39:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10496.9). Total num frames: 28364800. Throughput: 0: 10793.5. Samples: 28346764. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:39:31,893][434108] Avg episode reward: [(0, '4584.405')] [2023-03-08 13:39:33,702][434396] Updated weights for policy 0, policy_version 55440 (0.0005) [2023-03-08 13:39:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10510.8). Total num frames: 28418048. Throughput: 0: 10780.3. Samples: 28409652. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:39:36,893][434108] Avg episode reward: [(0, '4596.772')] [2023-03-08 13:39:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000055504_28418048.pth... [2023-03-08 13:39:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000054864_28090368.pth [2023-03-08 13:39:37,623][434396] Updated weights for policy 0, policy_version 55520 (0.0005) [2023-03-08 13:39:41,584][434396] Updated weights for policy 0, policy_version 55600 (0.0005) [2023-03-08 13:39:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10510.7). Total num frames: 28467200. Throughput: 0: 10741.4. Samples: 28440920. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:39:41,893][434108] Avg episode reward: [(0, '4632.739')] [2023-03-08 13:39:45,526][434396] Updated weights for policy 0, policy_version 55680 (0.0004) [2023-03-08 13:39:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10510.8). Total num frames: 28520448. Throughput: 0: 10758.5. Samples: 28503084. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:39:46,893][434108] Avg episode reward: [(0, '4628.036')] [2023-03-08 13:39:49,466][434396] Updated weights for policy 0, policy_version 55760 (0.0005) [2023-03-08 13:39:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 28573696. Throughput: 0: 10740.8. Samples: 28565512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:39:51,893][434108] Avg episode reward: [(0, '4616.979')] [2023-03-08 13:39:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000055808_28573696.pth... [2023-03-08 13:39:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000055184_28254208.pth [2023-03-08 13:39:53,368][434396] Updated weights for policy 0, policy_version 55840 (0.0005) [2023-03-08 13:39:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 28622848. Throughput: 0: 10789.2. Samples: 28597052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:39:56,893][434108] Avg episode reward: [(0, '4602.218')] [2023-03-08 13:39:57,291][434396] Updated weights for policy 0, policy_version 55920 (0.0005) [2023-03-08 13:40:01,188][434396] Updated weights for policy 0, policy_version 56000 (0.0005) [2023-03-08 13:40:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 28676096. Throughput: 0: 10685.2. Samples: 28659520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:01,893][434108] Avg episode reward: [(0, '4471.107')] [2023-03-08 13:40:04,881][434396] Updated weights for policy 0, policy_version 56080 (0.0004) [2023-03-08 13:40:06,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10552.4). Total num frames: 28733440. Throughput: 0: 10652.0. Samples: 28725320. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:06,893][434108] Avg episode reward: [(0, '4563.940')] [2023-03-08 13:40:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000056120_28733440.pth... [2023-03-08 13:40:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000055504_28418048.pth [2023-03-08 13:40:08,532][434396] Updated weights for policy 0, policy_version 56160 (0.0004) [2023-03-08 13:40:11,892][434108] Fps is (10 sec: 11468.7, 60 sec: 10854.4, 300 sec: 10566.3). Total num frames: 28790784. Throughput: 0: 10665.4. Samples: 28759332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:11,893][434108] Avg episode reward: [(0, '4606.232')] [2023-03-08 13:40:12,203][434396] Updated weights for policy 0, policy_version 56240 (0.0004) [2023-03-08 13:40:15,967][434396] Updated weights for policy 0, policy_version 56320 (0.0004) [2023-03-08 13:40:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10566.3). Total num frames: 28844032. Throughput: 0: 10646.0. Samples: 28825836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:16,893][434108] Avg episode reward: [(0, '4594.170')] [2023-03-08 13:40:19,675][434396] Updated weights for policy 0, policy_version 56400 (0.0004) [2023-03-08 13:40:21,893][434108] Fps is (10 sec: 11059.0, 60 sec: 10786.1, 300 sec: 10580.2). Total num frames: 28901376. Throughput: 0: 10717.2. Samples: 28891928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:21,893][434108] Avg episode reward: [(0, '4543.297')] [2023-03-08 13:40:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000056448_28901376.pth... 
[2023-03-08 13:40:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000055808_28573696.pth [2023-03-08 13:40:23,322][434396] Updated weights for policy 0, policy_version 56480 (0.0004) [2023-03-08 13:40:26,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 28954624. Throughput: 0: 10778.0. Samples: 28925928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:26,893][434108] Avg episode reward: [(0, '4501.772')] [2023-03-08 13:40:26,912][434396] Updated weights for policy 0, policy_version 56560 (0.0004) [2023-03-08 13:40:30,792][434396] Updated weights for policy 0, policy_version 56640 (0.0005) [2023-03-08 13:40:31,892][434108] Fps is (10 sec: 10649.9, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 29007872. Throughput: 0: 10854.9. Samples: 28991552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:31,893][434108] Avg episode reward: [(0, '4573.822')] [2023-03-08 13:40:34,683][434396] Updated weights for policy 0, policy_version 56720 (0.0005) [2023-03-08 13:40:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 29061120. Throughput: 0: 10865.3. Samples: 29054452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:36,893][434108] Avg episode reward: [(0, '4588.054')] [2023-03-08 13:40:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000056760_29061120.pth... [2023-03-08 13:40:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000056120_28733440.pth [2023-03-08 13:40:38,632][434396] Updated weights for policy 0, policy_version 56800 (0.0006) [2023-03-08 13:40:41,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10552.4). Total num frames: 29114368. Throughput: 0: 10860.3. Samples: 29085768. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:41,893][434108] Avg episode reward: [(0, '4572.497')] [2023-03-08 13:40:42,584][434396] Updated weights for policy 0, policy_version 56880 (0.0005) [2023-03-08 13:40:46,326][434396] Updated weights for policy 0, policy_version 56960 (0.0005) [2023-03-08 13:40:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10552.4). Total num frames: 29167616. Throughput: 0: 10864.4. Samples: 29148420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:46,903][434108] Avg episode reward: [(0, '4512.781')] [2023-03-08 13:40:49,981][434396] Updated weights for policy 0, policy_version 57040 (0.0005) [2023-03-08 13:40:51,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10580.2). Total num frames: 29224960. Throughput: 0: 10907.9. Samples: 29216176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:51,893][434108] Avg episode reward: [(0, '4474.480')] [2023-03-08 13:40:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000057080_29224960.pth... [2023-03-08 13:40:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000056448_28901376.pth [2023-03-08 13:40:53,659][434396] Updated weights for policy 0, policy_version 57120 (0.0004) [2023-03-08 13:40:56,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10922.6, 300 sec: 10580.2). Total num frames: 29278208. Throughput: 0: 10893.6. Samples: 29249544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:40:56,893][434108] Avg episode reward: [(0, '4545.673')] [2023-03-08 13:40:57,611][434396] Updated weights for policy 0, policy_version 57200 (0.0005) [2023-03-08 13:41:01,463][434396] Updated weights for policy 0, policy_version 57280 (0.0005) [2023-03-08 13:41:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10594.1). Total num frames: 29331456. Throughput: 0: 10782.5. 
Samples: 29311048. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:41:01,893][434108] Avg episode reward: [(0, '4611.215')] [2023-03-08 13:41:05,265][434396] Updated weights for policy 0, policy_version 57360 (0.0005) [2023-03-08 13:41:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10594.1). Total num frames: 29384704. Throughput: 0: 10770.0. Samples: 29376576. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:41:06,893][434108] Avg episode reward: [(0, '4602.568')] [2023-03-08 13:41:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000057392_29384704.pth... [2023-03-08 13:41:06,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000056760_29061120.pth [2023-03-08 13:41:09,107][434396] Updated weights for policy 0, policy_version 57440 (0.0005) [2023-03-08 13:41:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 29437952. Throughput: 0: 10732.3. Samples: 29408884. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:41:11,893][434108] Avg episode reward: [(0, '4606.801')] [2023-03-08 13:41:13,083][434396] Updated weights for policy 0, policy_version 57520 (0.0005) [2023-03-08 13:41:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 29487104. Throughput: 0: 10648.3. Samples: 29470728. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:41:16,893][434108] Avg episode reward: [(0, '4601.762')] [2023-03-08 13:41:17,011][434396] Updated weights for policy 0, policy_version 57600 (0.0005) [2023-03-08 13:41:20,897][434396] Updated weights for policy 0, policy_version 57680 (0.0005) [2023-03-08 13:41:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 29540352. Throughput: 0: 10641.2. Samples: 29533304. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:41:21,893][434108] Avg episode reward: [(0, '4405.003')] [2023-03-08 13:41:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000057696_29540352.pth... [2023-03-08 13:41:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000057080_29224960.pth [2023-03-08 13:41:24,645][434396] Updated weights for policy 0, policy_version 57760 (0.0005) [2023-03-08 13:41:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 29597696. Throughput: 0: 10671.6. Samples: 29565992. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:41:26,893][434108] Avg episode reward: [(0, '4460.539')] [2023-03-08 13:41:28,313][434396] Updated weights for policy 0, policy_version 57840 (0.0004) [2023-03-08 13:41:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 29650944. Throughput: 0: 10787.7. Samples: 29633868. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:41:31,893][434108] Avg episode reward: [(0, '4473.461')] [2023-03-08 13:41:31,995][434396] Updated weights for policy 0, policy_version 57920 (0.0004) [2023-03-08 13:41:35,787][434396] Updated weights for policy 0, policy_version 58000 (0.0005) [2023-03-08 13:41:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10635.7). Total num frames: 29704192. Throughput: 0: 10723.4. Samples: 29698728. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:41:36,893][434108] Avg episode reward: [(0, '4157.084')] [2023-03-08 13:41:36,919][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000058024_29708288.pth... 
[2023-03-08 13:41:36,921][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000057392_29384704.pth [2023-03-08 13:41:39,473][434396] Updated weights for policy 0, policy_version 58080 (0.0004) [2023-03-08 13:41:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10663.5). Total num frames: 29761536. Throughput: 0: 10726.9. Samples: 29732252. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:41:41,893][434108] Avg episode reward: [(0, '4464.711')] [2023-03-08 13:41:43,135][434396] Updated weights for policy 0, policy_version 58160 (0.0004) [2023-03-08 13:41:46,798][434396] Updated weights for policy 0, policy_version 58240 (0.0004) [2023-03-08 13:41:46,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10854.4, 300 sec: 10677.4). Total num frames: 29818880. Throughput: 0: 10831.7. Samples: 29798472. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:41:46,893][434108] Avg episode reward: [(0, '4205.133')] [2023-03-08 13:41:50,537][434396] Updated weights for policy 0, policy_version 58320 (0.0004) [2023-03-08 13:41:51,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10677.4). Total num frames: 29872128. Throughput: 0: 10849.3. Samples: 29864796. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:41:51,893][434108] Avg episode reward: [(0, '4296.309')] [2023-03-08 13:41:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000058344_29872128.pth... [2023-03-08 13:41:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000057696_29540352.pth [2023-03-08 13:41:54,315][434396] Updated weights for policy 0, policy_version 58400 (0.0005) [2023-03-08 13:41:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10705.1). Total num frames: 29929472. Throughput: 0: 10853.8. Samples: 29897304. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:41:56,893][434108] Avg episode reward: [(0, '4389.169')] [2023-03-08 13:41:57,999][434396] Updated weights for policy 0, policy_version 58480 (0.0004) [2023-03-08 13:42:01,715][434396] Updated weights for policy 0, policy_version 58560 (0.0004) [2023-03-08 13:42:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10854.4, 300 sec: 10705.1). Total num frames: 29982720. Throughput: 0: 10952.1. Samples: 29963572. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:42:01,893][434108] Avg episode reward: [(0, '4330.819')] [2023-03-08 13:42:05,422][434396] Updated weights for policy 0, policy_version 58640 (0.0004) [2023-03-08 13:42:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10854.4, 300 sec: 10705.1). Total num frames: 30035968. Throughput: 0: 11042.1. Samples: 30030200. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:42:06,893][434108] Avg episode reward: [(0, '4356.616')] [2023-03-08 13:42:06,924][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000058672_30040064.pth... [2023-03-08 13:42:06,925][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000058024_29708288.pth [2023-03-08 13:42:09,131][434396] Updated weights for policy 0, policy_version 58720 (0.0004) [2023-03-08 13:42:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10719.0). Total num frames: 30093312. Throughput: 0: 11059.7. Samples: 30063680. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:42:11,903][434108] Avg episode reward: [(0, '4449.748')] [2023-03-08 13:42:12,813][434396] Updated weights for policy 0, policy_version 58800 (0.0004) [2023-03-08 13:42:16,540][434396] Updated weights for policy 0, policy_version 58880 (0.0004) [2023-03-08 13:42:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10732.9). Total num frames: 30146560. Throughput: 0: 11029.2. 
Samples: 30130184. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:42:16,903][434108] Avg episode reward: [(0, '4253.614')] [2023-03-08 13:42:20,276][434396] Updated weights for policy 0, policy_version 58960 (0.0005) [2023-03-08 13:42:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10746.8). Total num frames: 30203904. Throughput: 0: 11045.5. Samples: 30195776. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:42:21,903][434108] Avg episode reward: [(0, '4461.681')] [2023-03-08 13:42:21,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000058992_30203904.pth... [2023-03-08 13:42:21,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000058344_29872128.pth [2023-03-08 13:42:23,925][434396] Updated weights for policy 0, policy_version 59040 (0.0005) [2023-03-08 13:42:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10760.7). Total num frames: 30257152. Throughput: 0: 11044.0. Samples: 30229232. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:42:26,903][434108] Avg episode reward: [(0, '4404.566')] [2023-03-08 13:42:27,637][434396] Updated weights for policy 0, policy_version 59120 (0.0004) [2023-03-08 13:42:31,345][434396] Updated weights for policy 0, policy_version 59200 (0.0004) [2023-03-08 13:42:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 11059.2, 300 sec: 10774.6). Total num frames: 30314496. Throughput: 0: 11052.4. Samples: 30295832. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:42:31,903][434108] Avg episode reward: [(0, '4347.839')] [2023-03-08 13:42:35,075][434396] Updated weights for policy 0, policy_version 59280 (0.0004) [2023-03-08 13:42:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10774.6). Total num frames: 30367744. Throughput: 0: 11036.5. Samples: 30361440. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:42:36,903][434108] Avg episode reward: [(0, '4424.079')] [2023-03-08 13:42:36,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000059312_30367744.pth... [2023-03-08 13:42:36,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000058672_30040064.pth [2023-03-08 13:42:38,798][434396] Updated weights for policy 0, policy_version 59360 (0.0004) [2023-03-08 13:42:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10788.4). Total num frames: 30425088. Throughput: 0: 11056.0. Samples: 30394824. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:42:41,903][434108] Avg episode reward: [(0, '4271.848')] [2023-03-08 13:42:42,528][434396] Updated weights for policy 0, policy_version 59440 (0.0004) [2023-03-08 13:42:46,189][434396] Updated weights for policy 0, policy_version 59520 (0.0004) [2023-03-08 13:42:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10802.3). Total num frames: 30478336. Throughput: 0: 11050.6. Samples: 30460848. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 13:42:46,903][434108] Avg episode reward: [(0, '4340.818')] [2023-03-08 13:42:49,910][434396] Updated weights for policy 0, policy_version 59600 (0.0004) [2023-03-08 13:42:51,892][434108] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10830.1). Total num frames: 30535680. Throughput: 0: 11052.4. Samples: 30527560. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:42:51,903][434108] Avg episode reward: [(0, '4415.680')] [2023-03-08 13:42:51,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000059640_30535680.pth... 
[2023-03-08 13:42:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000058992_30203904.pth [2023-03-08 13:42:53,629][434396] Updated weights for policy 0, policy_version 59680 (0.0004) [2023-03-08 13:42:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10816.2). Total num frames: 30588928. Throughput: 0: 11036.6. Samples: 30560328. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:42:56,893][434108] Avg episode reward: [(0, '4381.406')] [2023-03-08 13:42:57,389][434396] Updated weights for policy 0, policy_version 59760 (0.0004) [2023-03-08 13:43:01,136][434396] Updated weights for policy 0, policy_version 59840 (0.0004) [2023-03-08 13:43:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 10816.2). Total num frames: 30642176. Throughput: 0: 11013.7. Samples: 30625800. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:43:01,893][434108] Avg episode reward: [(0, '4301.898')] [2023-03-08 13:43:04,823][434396] Updated weights for policy 0, policy_version 59920 (0.0004) [2023-03-08 13:43:06,893][434108] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10844.0). Total num frames: 30699520. Throughput: 0: 11018.8. Samples: 30691624. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:43:06,893][434108] Avg episode reward: [(0, '4377.998')] [2023-03-08 13:43:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000059960_30699520.pth... [2023-03-08 13:43:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000059312_30367744.pth [2023-03-08 13:43:08,678][434396] Updated weights for policy 0, policy_version 60000 (0.0005) [2023-03-08 13:43:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10844.0). Total num frames: 30752768. Throughput: 0: 10993.6. Samples: 30723944. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:43:11,893][434108] Avg episode reward: [(0, '4363.197')] [2023-03-08 13:43:12,595][434396] Updated weights for policy 0, policy_version 60080 (0.0005) [2023-03-08 13:43:16,482][434396] Updated weights for policy 0, policy_version 60160 (0.0005) [2023-03-08 13:43:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 10830.1). Total num frames: 30806016. Throughput: 0: 10894.9. Samples: 30786104. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:43:16,893][434108] Avg episode reward: [(0, '4350.205')] [2023-03-08 13:43:20,452][434396] Updated weights for policy 0, policy_version 60240 (0.0005) [2023-03-08 13:43:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10816.2). Total num frames: 30855168. Throughput: 0: 10828.9. Samples: 30848740. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 13:43:21,893][434108] Avg episode reward: [(0, '4199.807')] [2023-03-08 13:43:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000060264_30855168.pth... [2023-03-08 13:43:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000059640_30535680.pth [2023-03-08 13:43:24,438][434396] Updated weights for policy 0, policy_version 60320 (0.0005) [2023-03-08 13:43:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10802.3). Total num frames: 30908416. Throughput: 0: 10776.2. Samples: 30879752. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:43:26,893][434108] Avg episode reward: [(0, '4234.326')] [2023-03-08 13:43:28,295][434396] Updated weights for policy 0, policy_version 60400 (0.0005) [2023-03-08 13:43:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10802.3). Total num frames: 30961664. Throughput: 0: 10705.7. Samples: 30942604. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:43:31,893][434108] Avg episode reward: [(0, '4422.991')] [2023-03-08 13:43:32,252][434396] Updated weights for policy 0, policy_version 60480 (0.0005) [2023-03-08 13:43:36,179][434396] Updated weights for policy 0, policy_version 60560 (0.0005) [2023-03-08 13:43:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10717.9, 300 sec: 10802.3). Total num frames: 31010816. Throughput: 0: 10608.7. Samples: 31004952. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:43:36,893][434108] Avg episode reward: [(0, '4279.667')] [2023-03-08 13:43:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000060568_31010816.pth... [2023-03-08 13:43:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000059960_30699520.pth [2023-03-08 13:43:40,135][434396] Updated weights for policy 0, policy_version 60640 (0.0005) [2023-03-08 13:43:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10802.3). Total num frames: 31064064. Throughput: 0: 10564.0. Samples: 31035708. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:43:41,893][434108] Avg episode reward: [(0, '4238.674')] [2023-03-08 13:43:44,148][434396] Updated weights for policy 0, policy_version 60720 (0.0005) [2023-03-08 13:43:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10788.5). Total num frames: 31113216. Throughput: 0: 10478.2. Samples: 31097320. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:43:46,893][434108] Avg episode reward: [(0, '4321.424')] [2023-03-08 13:43:48,117][434396] Updated weights for policy 0, policy_version 60800 (0.0005) [2023-03-08 13:43:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10788.4). Total num frames: 31166464. Throughput: 0: 10397.9. Samples: 31159528. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:43:51,893][434108] Avg episode reward: [(0, '4131.718')] [2023-03-08 13:43:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000060872_31166464.pth... [2023-03-08 13:43:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000060264_30855168.pth [2023-03-08 13:43:52,084][434396] Updated weights for policy 0, policy_version 60880 (0.0005) [2023-03-08 13:43:56,061][434396] Updated weights for policy 0, policy_version 60960 (0.0005) [2023-03-08 13:43:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10788.4). Total num frames: 31219712. Throughput: 0: 10379.9. Samples: 31191040. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:43:56,893][434108] Avg episode reward: [(0, '4201.648')] [2023-03-08 13:44:00,026][434396] Updated weights for policy 0, policy_version 61040 (0.0005) [2023-03-08 13:44:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10774.6). Total num frames: 31268864. Throughput: 0: 10364.1. Samples: 31252488. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:44:01,893][434108] Avg episode reward: [(0, '4541.452')] [2023-03-08 13:44:03,932][434396] Updated weights for policy 0, policy_version 61120 (0.0005) [2023-03-08 13:44:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10788.4). Total num frames: 31322112. Throughput: 0: 10362.6. Samples: 31315056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:06,893][434108] Avg episode reward: [(0, '4477.347')] [2023-03-08 13:44:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000061176_31322112.pth... 
[2023-03-08 13:44:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000060568_31010816.pth [2023-03-08 13:44:07,894][434396] Updated weights for policy 0, policy_version 61200 (0.0005) [2023-03-08 13:44:11,735][434396] Updated weights for policy 0, policy_version 61280 (0.0005) [2023-03-08 13:44:11,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10774.6). Total num frames: 31375360. Throughput: 0: 10376.5. Samples: 31346696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:11,893][434108] Avg episode reward: [(0, '4514.290')] [2023-03-08 13:44:15,455][434396] Updated weights for policy 0, policy_version 61360 (0.0004) [2023-03-08 13:44:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10760.7). Total num frames: 31428608. Throughput: 0: 10425.1. Samples: 31411732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:16,893][434108] Avg episode reward: [(0, '4478.379')] [2023-03-08 13:44:19,228][434396] Updated weights for policy 0, policy_version 61440 (0.0005) [2023-03-08 13:44:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10746.8). Total num frames: 31481856. Throughput: 0: 10455.4. Samples: 31475444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:21,893][434108] Avg episode reward: [(0, '4550.683')] [2023-03-08 13:44:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000061488_31481856.pth... [2023-03-08 13:44:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000060872_31166464.pth [2023-03-08 13:44:23,195][434396] Updated weights for policy 0, policy_version 61520 (0.0005) [2023-03-08 13:44:26,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10746.8). Total num frames: 31535104. Throughput: 0: 10474.2. Samples: 31507048. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:26,903][434108] Avg episode reward: [(0, '4541.117')] [2023-03-08 13:44:26,955][434396] Updated weights for policy 0, policy_version 61600 (0.0004) [2023-03-08 13:44:31,007][434396] Updated weights for policy 0, policy_version 61680 (0.0005) [2023-03-08 13:44:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10746.8). Total num frames: 31588352. Throughput: 0: 10524.3. Samples: 31570916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:31,903][434108] Avg episode reward: [(0, '4540.151')] [2023-03-08 13:44:34,944][434396] Updated weights for policy 0, policy_version 61760 (0.0005) [2023-03-08 13:44:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10746.8). Total num frames: 31637504. Throughput: 0: 10493.2. Samples: 31631724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:36,903][434108] Avg episode reward: [(0, '4576.008')] [2023-03-08 13:44:36,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000061792_31637504.pth... [2023-03-08 13:44:36,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000061176_31322112.pth [2023-03-08 13:44:39,084][434396] Updated weights for policy 0, policy_version 61840 (0.0005) [2023-03-08 13:44:41,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10732.9). Total num frames: 31686656. Throughput: 0: 10464.4. Samples: 31661940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:41,903][434108] Avg episode reward: [(0, '4410.518')] [2023-03-08 13:44:43,088][434396] Updated weights for policy 0, policy_version 61920 (0.0005) [2023-03-08 13:44:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10732.9). Total num frames: 31739904. Throughput: 0: 10458.2. Samples: 31723108. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:46,903][434108] Avg episode reward: [(0, '4540.042')] [2023-03-08 13:44:47,049][434396] Updated weights for policy 0, policy_version 62000 (0.0005) [2023-03-08 13:44:51,048][434396] Updated weights for policy 0, policy_version 62080 (0.0005) [2023-03-08 13:44:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10746.8). Total num frames: 31793152. Throughput: 0: 10442.3. Samples: 31784960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:51,903][434108] Avg episode reward: [(0, '4541.256')] [2023-03-08 13:44:51,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000062096_31793152.pth... [2023-03-08 13:44:51,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000061488_31481856.pth [2023-03-08 13:44:55,018][434396] Updated weights for policy 0, policy_version 62160 (0.0005) [2023-03-08 13:44:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10732.9). Total num frames: 31842304. Throughput: 0: 10417.3. Samples: 31815476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:44:56,903][434108] Avg episode reward: [(0, '4550.299')] [2023-03-08 13:44:58,840][434396] Updated weights for policy 0, policy_version 62240 (0.0004) [2023-03-08 13:45:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10732.9). Total num frames: 31899648. Throughput: 0: 10402.6. Samples: 31879848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:01,903][434108] Avg episode reward: [(0, '4596.489')] [2023-03-08 13:45:02,502][434396] Updated weights for policy 0, policy_version 62320 (0.0004) [2023-03-08 13:45:06,260][434396] Updated weights for policy 0, policy_version 62400 (0.0005) [2023-03-08 13:45:06,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10719.0). Total num frames: 31952896. Throughput: 0: 10459.3. 
Samples: 31946112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:06,903][434108] Avg episode reward: [(0, '4597.011')] [2023-03-08 13:45:06,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000062408_31952896.pth... [2023-03-08 13:45:06,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000061792_31637504.pth [2023-03-08 13:45:10,126][434396] Updated weights for policy 0, policy_version 62480 (0.0005) [2023-03-08 13:45:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10719.0). Total num frames: 32006144. Throughput: 0: 10474.8. Samples: 31978412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:11,903][434108] Avg episode reward: [(0, '4555.149')] [2023-03-08 13:45:13,922][434396] Updated weights for policy 0, policy_version 62560 (0.0004) [2023-03-08 13:45:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10705.1). Total num frames: 32059392. Throughput: 0: 10492.4. Samples: 32043072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:16,893][434108] Avg episode reward: [(0, '4534.965')] [2023-03-08 13:45:17,683][434396] Updated weights for policy 0, policy_version 62640 (0.0004) [2023-03-08 13:45:21,502][434396] Updated weights for policy 0, policy_version 62720 (0.0005) [2023-03-08 13:45:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10705.1). Total num frames: 32112640. Throughput: 0: 10580.9. Samples: 32107864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:21,893][434108] Avg episode reward: [(0, '4455.064')] [2023-03-08 13:45:21,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000062728_32116736.pth... 
[2023-03-08 13:45:21,907][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000062096_31793152.pth [2023-03-08 13:45:25,514][434396] Updated weights for policy 0, policy_version 62800 (0.0005) [2023-03-08 13:45:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10705.1). Total num frames: 32165888. Throughput: 0: 10576.9. Samples: 32137900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:26,893][434108] Avg episode reward: [(0, '4500.917')] [2023-03-08 13:45:29,450][434396] Updated weights for policy 0, policy_version 62880 (0.0005) [2023-03-08 13:45:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10705.1). Total num frames: 32219136. Throughput: 0: 10599.4. Samples: 32200080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:31,893][434108] Avg episode reward: [(0, '4510.981')] [2023-03-08 13:45:33,442][434396] Updated weights for policy 0, policy_version 62960 (0.0005) [2023-03-08 13:45:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10691.3). Total num frames: 32268288. Throughput: 0: 10627.7. Samples: 32263204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:36,893][434108] Avg episode reward: [(0, '4515.489')] [2023-03-08 13:45:36,950][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000063032_32272384.pth... [2023-03-08 13:45:36,952][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000062408_31952896.pth [2023-03-08 13:45:37,361][434396] Updated weights for policy 0, policy_version 63040 (0.0005) [2023-03-08 13:45:41,307][434396] Updated weights for policy 0, policy_version 63120 (0.0005) [2023-03-08 13:45:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10691.3). Total num frames: 32321536. Throughput: 0: 10621.0. Samples: 32293420. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:41,893][434108] Avg episode reward: [(0, '4545.080')] [2023-03-08 13:45:45,233][434396] Updated weights for policy 0, policy_version 63200 (0.0004) [2023-03-08 13:45:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10677.4). Total num frames: 32374784. Throughput: 0: 10586.8. Samples: 32356252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:46,893][434108] Avg episode reward: [(0, '4604.568')] [2023-03-08 13:45:49,161][434396] Updated weights for policy 0, policy_version 63280 (0.0004) [2023-03-08 13:45:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10677.4). Total num frames: 32428032. Throughput: 0: 10512.7. Samples: 32419184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:51,893][434108] Avg episode reward: [(0, '4467.673')] [2023-03-08 13:45:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000063336_32428032.pth... [2023-03-08 13:45:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000062728_32116736.pth [2023-03-08 13:45:52,990][434396] Updated weights for policy 0, policy_version 63360 (0.0005) [2023-03-08 13:45:56,800][434396] Updated weights for policy 0, policy_version 63440 (0.0005) [2023-03-08 13:45:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 32481280. Throughput: 0: 10510.4. Samples: 32451380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:45:56,893][434108] Avg episode reward: [(0, '4177.190')] [2023-03-08 13:46:00,541][434396] Updated weights for policy 0, policy_version 63520 (0.0004) [2023-03-08 13:46:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10677.4). Total num frames: 32534528. Throughput: 0: 10508.6. Samples: 32515960. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:01,893][434108] Avg episode reward: [(0, '4007.565')] [2023-03-08 13:46:04,286][434396] Updated weights for policy 0, policy_version 63600 (0.0004) [2023-03-08 13:46:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10677.4). Total num frames: 32587776. Throughput: 0: 10542.1. Samples: 32582256. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:06,893][434108] Avg episode reward: [(0, '3740.517')] [2023-03-08 13:46:06,913][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000063656_32591872.pth... [2023-03-08 13:46:06,914][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000063032_32272384.pth [2023-03-08 13:46:08,013][434396] Updated weights for policy 0, policy_version 63680 (0.0004) [2023-03-08 13:46:11,712][434396] Updated weights for policy 0, policy_version 63760 (0.0004) [2023-03-08 13:46:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10705.1). Total num frames: 32645120. Throughput: 0: 10612.8. Samples: 32615476. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:11,893][434108] Avg episode reward: [(0, '3872.963')] [2023-03-08 13:46:15,420][434396] Updated weights for policy 0, policy_version 63840 (0.0004) [2023-03-08 13:46:16,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10717.9, 300 sec: 10719.0). Total num frames: 32702464. Throughput: 0: 10696.6. Samples: 32681428. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:16,893][434108] Avg episode reward: [(0, '3981.194')] [2023-03-08 13:46:19,060][434396] Updated weights for policy 0, policy_version 63920 (0.0003) [2023-03-08 13:46:21,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10705.1). Total num frames: 32755712. Throughput: 0: 10780.1. Samples: 32748308. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:21,893][434108] Avg episode reward: [(0, '4272.099')] [2023-03-08 13:46:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000063976_32755712.pth... [2023-03-08 13:46:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000063336_32428032.pth [2023-03-08 13:46:22,894][434396] Updated weights for policy 0, policy_version 64000 (0.0005) [2023-03-08 13:46:26,849][434396] Updated weights for policy 0, policy_version 64080 (0.0005) [2023-03-08 13:46:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10705.1). Total num frames: 32808960. Throughput: 0: 10804.1. Samples: 32779604. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:26,893][434108] Avg episode reward: [(0, '4369.769')] [2023-03-08 13:46:30,644][434396] Updated weights for policy 0, policy_version 64160 (0.0005) [2023-03-08 13:46:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10705.1). Total num frames: 32862208. Throughput: 0: 10820.5. Samples: 32843176. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:31,893][434108] Avg episode reward: [(0, '4464.635')] [2023-03-08 13:46:34,625][434396] Updated weights for policy 0, policy_version 64240 (0.0005) [2023-03-08 13:46:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10677.4). Total num frames: 32911360. Throughput: 0: 10810.9. Samples: 32905672. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:36,893][434108] Avg episode reward: [(0, '4365.106')] [2023-03-08 13:46:36,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000064280_32911360.pth... 
[2023-03-08 13:46:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000063656_32591872.pth [2023-03-08 13:46:38,587][434396] Updated weights for policy 0, policy_version 64320 (0.0005) [2023-03-08 13:46:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10663.5). Total num frames: 32964608. Throughput: 0: 10769.3. Samples: 32936000. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:41,903][434108] Avg episode reward: [(0, '4491.730')] [2023-03-08 13:46:42,532][434396] Updated weights for policy 0, policy_version 64400 (0.0005) [2023-03-08 13:46:46,464][434396] Updated weights for policy 0, policy_version 64480 (0.0005) [2023-03-08 13:46:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10663.5). Total num frames: 33017856. Throughput: 0: 10735.3. Samples: 32999048. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:46,903][434108] Avg episode reward: [(0, '4460.081')] [2023-03-08 13:46:50,372][434396] Updated weights for policy 0, policy_version 64560 (0.0005) [2023-03-08 13:46:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 33067008. Throughput: 0: 10653.1. Samples: 33061648. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:51,903][434108] Avg episode reward: [(0, '4397.819')] [2023-03-08 13:46:51,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000064584_33067008.pth... [2023-03-08 13:46:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000063976_32755712.pth [2023-03-08 13:46:54,418][434396] Updated weights for policy 0, policy_version 64640 (0.0005) [2023-03-08 13:46:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 33120256. Throughput: 0: 10581.6. Samples: 33091648. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:46:56,893][434108] Avg episode reward: [(0, '4445.154')] [2023-03-08 13:46:58,384][434396] Updated weights for policy 0, policy_version 64720 (0.0005) [2023-03-08 13:47:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10621.8). Total num frames: 33169408. Throughput: 0: 10497.3. Samples: 33153808. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:47:01,893][434108] Avg episode reward: [(0, '4401.878')] [2023-03-08 13:47:02,337][434396] Updated weights for policy 0, policy_version 64800 (0.0005) [2023-03-08 13:47:06,307][434396] Updated weights for policy 0, policy_version 64880 (0.0005) [2023-03-08 13:47:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 33222656. Throughput: 0: 10378.0. Samples: 33215316. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:47:06,893][434108] Avg episode reward: [(0, '4199.851')] [2023-03-08 13:47:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000064888_33222656.pth... [2023-03-08 13:47:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000064280_32911360.pth [2023-03-08 13:47:10,308][434396] Updated weights for policy 0, policy_version 64960 (0.0005) [2023-03-08 13:47:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 33275904. Throughput: 0: 10388.1. Samples: 33247068. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:47:11,893][434108] Avg episode reward: [(0, '4359.222')] [2023-03-08 13:47:14,204][434396] Updated weights for policy 0, policy_version 65040 (0.0005) [2023-03-08 13:47:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10580.2). Total num frames: 33325056. Throughput: 0: 10345.8. Samples: 33308736. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 13:47:16,893][434108] Avg episode reward: [(0, '4532.693')] [2023-03-08 13:47:18,234][434396] Updated weights for policy 0, policy_version 65120 (0.0005) [2023-03-08 13:47:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10580.2). Total num frames: 33378304. Throughput: 0: 10320.9. Samples: 33370112. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:47:21,893][434108] Avg episode reward: [(0, '4549.279')] [2023-03-08 13:47:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000065192_33378304.pth... [2023-03-08 13:47:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000064584_33067008.pth [2023-03-08 13:47:22,252][434396] Updated weights for policy 0, policy_version 65200 (0.0005) [2023-03-08 13:47:26,210][434396] Updated weights for policy 0, policy_version 65280 (0.0005) [2023-03-08 13:47:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10552.4). Total num frames: 33427456. Throughput: 0: 10335.4. Samples: 33401092. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:47:26,893][434108] Avg episode reward: [(0, '4536.411')] [2023-03-08 13:47:30,252][434396] Updated weights for policy 0, policy_version 65360 (0.0005) [2023-03-08 13:47:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10552.4). Total num frames: 33480704. Throughput: 0: 10283.6. Samples: 33461812. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:47:31,893][434108] Avg episode reward: [(0, '4514.256')] [2023-03-08 13:47:34,235][434396] Updated weights for policy 0, policy_version 65440 (0.0005) [2023-03-08 13:47:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10524.6). Total num frames: 33529856. Throughput: 0: 10268.5. Samples: 33523728. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:47:36,893][434108] Avg episode reward: [(0, '4564.712')] [2023-03-08 13:47:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000065488_33529856.pth... [2023-03-08 13:47:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000064888_33222656.pth [2023-03-08 13:47:38,298][434396] Updated weights for policy 0, policy_version 65520 (0.0005) [2023-03-08 13:47:41,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10524.6). Total num frames: 33583104. Throughput: 0: 10278.4. Samples: 33554176. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:47:41,893][434108] Avg episode reward: [(0, '4492.352')] [2023-03-08 13:47:42,163][434396] Updated weights for policy 0, policy_version 65600 (0.0005) [2023-03-08 13:47:46,127][434396] Updated weights for policy 0, policy_version 65680 (0.0006) [2023-03-08 13:47:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10496.9). Total num frames: 33632256. Throughput: 0: 10274.6. Samples: 33616164. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:47:46,893][434108] Avg episode reward: [(0, '4432.969')] [2023-03-08 13:47:50,056][434396] Updated weights for policy 0, policy_version 65760 (0.0005) [2023-03-08 13:47:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10496.9). Total num frames: 33685504. Throughput: 0: 10310.8. Samples: 33679304. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:47:51,893][434108] Avg episode reward: [(0, '4353.824')] [2023-03-08 13:47:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000065792_33685504.pth... 
[2023-03-08 13:47:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000065192_33378304.pth [2023-03-08 13:47:54,022][434396] Updated weights for policy 0, policy_version 65840 (0.0005) [2023-03-08 13:47:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10483.0). Total num frames: 33734656. Throughput: 0: 10290.6. Samples: 33710144. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 13:47:56,893][434108] Avg episode reward: [(0, '4542.278')] [2023-03-08 13:47:58,089][434396] Updated weights for policy 0, policy_version 65920 (0.0005) [2023-03-08 13:48:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 33787904. Throughput: 0: 10284.4. Samples: 33771536. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:48:01,893][434108] Avg episode reward: [(0, '4529.276')] [2023-03-08 13:48:01,982][434396] Updated weights for policy 0, policy_version 66000 (0.0005) [2023-03-08 13:48:05,981][434396] Updated weights for policy 0, policy_version 66080 (0.0005) [2023-03-08 13:48:06,893][434108] Fps is (10 sec: 10649.4, 60 sec: 10308.2, 300 sec: 10469.1). Total num frames: 33841152. Throughput: 0: 10287.1. Samples: 33833032. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:48:06,893][434108] Avg episode reward: [(0, '4447.197')] [2023-03-08 13:48:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000066096_33841152.pth... [2023-03-08 13:48:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000065488_33529856.pth [2023-03-08 13:48:09,929][434396] Updated weights for policy 0, policy_version 66160 (0.0005) [2023-03-08 13:48:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 33894400. Throughput: 0: 10305.2. Samples: 33864828. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:48:11,893][434108] Avg episode reward: [(0, '4446.796')] [2023-03-08 13:48:13,690][434396] Updated weights for policy 0, policy_version 66240 (0.0004) [2023-03-08 13:48:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 33947648. Throughput: 0: 10382.3. Samples: 33929016. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:48:16,893][434108] Avg episode reward: [(0, '4537.316')] [2023-03-08 13:48:17,459][434396] Updated weights for policy 0, policy_version 66320 (0.0004) [2023-03-08 13:48:21,456][434396] Updated weights for policy 0, policy_version 66400 (0.0005) [2023-03-08 13:48:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 34000896. Throughput: 0: 10422.0. Samples: 33992716. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:48:21,893][434108] Avg episode reward: [(0, '4541.165')] [2023-03-08 13:48:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000066408_34000896.pth... [2023-03-08 13:48:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000065792_33685504.pth [2023-03-08 13:48:25,484][434396] Updated weights for policy 0, policy_version 66480 (0.0005) [2023-03-08 13:48:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 34050048. Throughput: 0: 10416.9. Samples: 34022936. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:48:26,893][434108] Avg episode reward: [(0, '4434.952')] [2023-03-08 13:48:29,469][434396] Updated weights for policy 0, policy_version 66560 (0.0005) [2023-03-08 13:48:31,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 34099200. Throughput: 0: 10394.2. Samples: 34083904. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:48:31,893][434108] Avg episode reward: [(0, '4532.582')] [2023-03-08 13:48:33,578][434396] Updated weights for policy 0, policy_version 66640 (0.0005) [2023-03-08 13:48:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 34152448. Throughput: 0: 10333.9. Samples: 34144328. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:48:36,893][434108] Avg episode reward: [(0, '4594.046')] [2023-03-08 13:48:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000066704_34152448.pth... [2023-03-08 13:48:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000066096_33841152.pth [2023-03-08 13:48:37,591][434396] Updated weights for policy 0, policy_version 66720 (0.0005) [2023-03-08 13:48:41,527][434396] Updated weights for policy 0, policy_version 66800 (0.0004) [2023-03-08 13:48:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 34201600. Throughput: 0: 10353.1. Samples: 34176032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:48:41,893][434108] Avg episode reward: [(0, '4612.879')] [2023-03-08 13:48:45,555][434396] Updated weights for policy 0, policy_version 66880 (0.0005) [2023-03-08 13:48:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 34254848. Throughput: 0: 10333.0. Samples: 34236520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:48:46,893][434108] Avg episode reward: [(0, '4612.811')] [2023-03-08 13:48:49,548][434396] Updated weights for policy 0, policy_version 66960 (0.0005) [2023-03-08 13:48:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 34304000. Throughput: 0: 10354.6. Samples: 34298988. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:48:51,893][434108] Avg episode reward: [(0, '4524.391')] [2023-03-08 13:48:51,937][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000067008_34308096.pth... [2023-03-08 13:48:51,939][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000066408_34000896.pth [2023-03-08 13:48:53,514][434396] Updated weights for policy 0, policy_version 67040 (0.0005) [2023-03-08 13:48:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 34357248. Throughput: 0: 10321.8. Samples: 34329308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:48:56,903][434108] Avg episode reward: [(0, '4508.921')] [2023-03-08 13:48:57,423][434396] Updated weights for policy 0, policy_version 67120 (0.0004) [2023-03-08 13:49:01,419][434396] Updated weights for policy 0, policy_version 67200 (0.0005) [2023-03-08 13:49:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 34410496. Throughput: 0: 10284.8. Samples: 34391832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:01,903][434108] Avg episode reward: [(0, '4525.385')] [2023-03-08 13:49:05,360][434396] Updated weights for policy 0, policy_version 67280 (0.0005) [2023-03-08 13:49:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 34459648. Throughput: 0: 10253.1. Samples: 34454104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:06,903][434108] Avg episode reward: [(0, '4503.485')] [2023-03-08 13:49:06,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000067304_34459648.pth... 
[2023-03-08 13:49:06,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000066704_34152448.pth [2023-03-08 13:49:09,341][434396] Updated weights for policy 0, policy_version 67360 (0.0005) [2023-03-08 13:49:11,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 34512896. Throughput: 0: 10259.1. Samples: 34484596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:11,893][434108] Avg episode reward: [(0, '4478.143')] [2023-03-08 13:49:13,219][434396] Updated weights for policy 0, policy_version 67440 (0.0005) [2023-03-08 13:49:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 34566144. Throughput: 0: 10334.5. Samples: 34548956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:16,893][434108] Avg episode reward: [(0, '4459.129')] [2023-03-08 13:49:16,939][434396] Updated weights for policy 0, policy_version 67520 (0.0005) [2023-03-08 13:49:20,653][434396] Updated weights for policy 0, policy_version 67600 (0.0004) [2023-03-08 13:49:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 34623488. Throughput: 0: 10466.1. Samples: 34615304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:21,893][434108] Avg episode reward: [(0, '4459.119')] [2023-03-08 13:49:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000067624_34623488.pth... [2023-03-08 13:49:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000067008_34308096.pth [2023-03-08 13:49:24,427][434396] Updated weights for policy 0, policy_version 67680 (0.0005) [2023-03-08 13:49:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 34676736. Throughput: 0: 10489.5. Samples: 34648060. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:26,893][434108] Avg episode reward: [(0, '4509.403')] [2023-03-08 13:49:28,148][434396] Updated weights for policy 0, policy_version 67760 (0.0004) [2023-03-08 13:49:31,858][434396] Updated weights for policy 0, policy_version 67840 (0.0004) [2023-03-08 13:49:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 34734080. Throughput: 0: 10603.2. Samples: 34713664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:31,893][434108] Avg episode reward: [(0, '4483.933')] [2023-03-08 13:49:35,633][434396] Updated weights for policy 0, policy_version 67920 (0.0005) [2023-03-08 13:49:36,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10510.7). Total num frames: 34787328. Throughput: 0: 10671.3. Samples: 34779200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:36,893][434108] Avg episode reward: [(0, '4576.827')] [2023-03-08 13:49:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000067944_34787328.pth... [2023-03-08 13:49:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000067304_34459648.pth [2023-03-08 13:49:39,321][434396] Updated weights for policy 0, policy_version 68000 (0.0005) [2023-03-08 13:49:41,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 34840576. Throughput: 0: 10728.8. Samples: 34812104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:41,903][434108] Avg episode reward: [(0, '4543.526')] [2023-03-08 13:49:43,253][434396] Updated weights for policy 0, policy_version 68080 (0.0005) [2023-03-08 13:49:46,892][434108] Fps is (10 sec: 10649.8, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 34893824. Throughput: 0: 10744.1. Samples: 34875316. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:46,903][434108] Avg episode reward: [(0, '4454.890')] [2023-03-08 13:49:47,194][434396] Updated weights for policy 0, policy_version 68160 (0.0005) [2023-03-08 13:49:51,081][434396] Updated weights for policy 0, policy_version 68240 (0.0005) [2023-03-08 13:49:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.8, 300 sec: 10524.6). Total num frames: 34947072. Throughput: 0: 10761.9. Samples: 34938388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:51,903][434108] Avg episode reward: [(0, '4425.280')] [2023-03-08 13:49:51,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000068256_34947072.pth... [2023-03-08 13:49:51,908][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000067624_34623488.pth [2023-03-08 13:49:54,789][434396] Updated weights for policy 0, policy_version 68320 (0.0004) [2023-03-08 13:49:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10510.8). Total num frames: 35000320. Throughput: 0: 10823.7. Samples: 34971660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:49:56,903][434108] Avg episode reward: [(0, '4447.953')] [2023-03-08 13:49:58,388][434396] Updated weights for policy 0, policy_version 68400 (0.0004) [2023-03-08 13:50:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10786.1, 300 sec: 10524.6). Total num frames: 35057664. Throughput: 0: 10886.1. Samples: 35038832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:01,893][434108] Avg episode reward: [(0, '4250.703')] [2023-03-08 13:50:02,049][434396] Updated weights for policy 0, policy_version 68480 (0.0004) [2023-03-08 13:50:05,741][434396] Updated weights for policy 0, policy_version 68560 (0.0004) [2023-03-08 13:50:06,892][434108] Fps is (10 sec: 11468.7, 60 sec: 10922.7, 300 sec: 10538.5). Total num frames: 35115008. Throughput: 0: 10899.1. 
Samples: 35105764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:06,893][434108] Avg episode reward: [(0, '4339.073')] [2023-03-08 13:50:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000068584_35115008.pth... [2023-03-08 13:50:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000067944_34787328.pth [2023-03-08 13:50:09,353][434396] Updated weights for policy 0, policy_version 68640 (0.0004) [2023-03-08 13:50:11,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10991.0, 300 sec: 10552.4). Total num frames: 35172352. Throughput: 0: 10924.2. Samples: 35139648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:11,903][434108] Avg episode reward: [(0, '4548.965')] [2023-03-08 13:50:13,039][434396] Updated weights for policy 0, policy_version 68720 (0.0004) [2023-03-08 13:50:16,820][434396] Updated weights for policy 0, policy_version 68800 (0.0004) [2023-03-08 13:50:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10552.4). Total num frames: 35225600. Throughput: 0: 10962.8. Samples: 35206988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:16,903][434108] Avg episode reward: [(0, '4408.597')] [2023-03-08 13:50:20,759][434396] Updated weights for policy 0, policy_version 68880 (0.0005) [2023-03-08 13:50:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10538.5). Total num frames: 35274752. Throughput: 0: 10892.6. Samples: 35269368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:21,893][434108] Avg episode reward: [(0, '4512.550')] [2023-03-08 13:50:21,959][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000068904_35278848.pth... 
[2023-03-08 13:50:21,961][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000068256_34947072.pth [2023-03-08 13:50:24,818][434396] Updated weights for policy 0, policy_version 68960 (0.0005) [2023-03-08 13:50:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10538.5). Total num frames: 35328000. Throughput: 0: 10828.6. Samples: 35299392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:26,893][434108] Avg episode reward: [(0, '4485.399')] [2023-03-08 13:50:28,775][434396] Updated weights for policy 0, policy_version 69040 (0.0006) [2023-03-08 13:50:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 35377152. Throughput: 0: 10804.3. Samples: 35361508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:31,893][434108] Avg episode reward: [(0, '4384.164')] [2023-03-08 13:50:32,710][434396] Updated weights for policy 0, policy_version 69120 (0.0004) [2023-03-08 13:50:36,586][434396] Updated weights for policy 0, policy_version 69200 (0.0005) [2023-03-08 13:50:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 35430400. Throughput: 0: 10805.1. Samples: 35424616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:36,893][434108] Avg episode reward: [(0, '4409.302')] [2023-03-08 13:50:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000069200_35430400.pth... [2023-03-08 13:50:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000068584_35115008.pth [2023-03-08 13:50:40,669][434396] Updated weights for policy 0, policy_version 69280 (0.0005) [2023-03-08 13:50:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 35483648. Throughput: 0: 10728.8. Samples: 35454456. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:41,893][434108] Avg episode reward: [(0, '4014.821')] [2023-03-08 13:50:44,738][434396] Updated weights for policy 0, policy_version 69360 (0.0005) [2023-03-08 13:50:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 35532800. Throughput: 0: 10596.3. Samples: 35515664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:50:46,893][434108] Avg episode reward: [(0, '4116.671')] [2023-03-08 13:50:48,775][434396] Updated weights for policy 0, policy_version 69440 (0.0005) [2023-03-08 13:50:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 35586048. Throughput: 0: 10456.7. Samples: 35576316. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:50:51,893][434108] Avg episode reward: [(0, '3941.307')] [2023-03-08 13:50:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000069504_35586048.pth... [2023-03-08 13:50:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000068904_35278848.pth [2023-03-08 13:50:52,598][434396] Updated weights for policy 0, policy_version 69520 (0.0004) [2023-03-08 13:50:56,254][434396] Updated weights for policy 0, policy_version 69600 (0.0004) [2023-03-08 13:50:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 35639296. Throughput: 0: 10457.9. Samples: 35610252. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:50:56,893][434108] Avg episode reward: [(0, '3981.948')] [2023-03-08 13:50:59,978][434396] Updated weights for policy 0, policy_version 69680 (0.0005) [2023-03-08 13:51:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10538.5). Total num frames: 35696640. Throughput: 0: 10427.6. Samples: 35676232. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:51:01,893][434108] Avg episode reward: [(0, '4068.013')] [2023-03-08 13:51:03,636][434396] Updated weights for policy 0, policy_version 69760 (0.0004) [2023-03-08 13:51:06,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 35749888. Throughput: 0: 10545.7. Samples: 35743924. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:51:06,893][434108] Avg episode reward: [(0, '4095.933')] [2023-03-08 13:51:06,914][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000069832_35753984.pth... [2023-03-08 13:51:06,916][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000069200_35430400.pth [2023-03-08 13:51:07,294][434396] Updated weights for policy 0, policy_version 69840 (0.0004) [2023-03-08 13:51:10,992][434396] Updated weights for policy 0, policy_version 69920 (0.0003) [2023-03-08 13:51:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 35807232. Throughput: 0: 10631.5. Samples: 35777808. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:51:11,893][434108] Avg episode reward: [(0, '4257.665')] [2023-03-08 13:51:14,748][434396] Updated weights for policy 0, policy_version 70000 (0.0004) [2023-03-08 13:51:16,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 35860480. Throughput: 0: 10705.1. Samples: 35843240. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:51:16,893][434108] Avg episode reward: [(0, '4391.026')] [2023-03-08 13:51:18,575][434396] Updated weights for policy 0, policy_version 70080 (0.0004) [2023-03-08 13:51:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 35913728. Throughput: 0: 10723.5. Samples: 35907172. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:51:21,893][434108] Avg episode reward: [(0, '4377.567')] [2023-03-08 13:51:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000070144_35913728.pth... [2023-03-08 13:51:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000069504_35586048.pth [2023-03-08 13:51:22,412][434396] Updated weights for policy 0, policy_version 70160 (0.0003) [2023-03-08 13:51:26,312][434396] Updated weights for policy 0, policy_version 70240 (0.0005) [2023-03-08 13:51:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 35966976. Throughput: 0: 10756.2. Samples: 35938484. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 13:51:26,893][434108] Avg episode reward: [(0, '4564.656')] [2023-03-08 13:51:30,020][434396] Updated weights for policy 0, policy_version 70320 (0.0003) [2023-03-08 13:51:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 36020224. Throughput: 0: 10848.7. Samples: 36003856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:51:31,893][434108] Avg episode reward: [(0, '4539.306')] [2023-03-08 13:51:33,898][434396] Updated weights for policy 0, policy_version 70400 (0.0005) [2023-03-08 13:51:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 36073472. Throughput: 0: 10887.5. Samples: 36066252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:51:36,893][434108] Avg episode reward: [(0, '4596.890')] [2023-03-08 13:51:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000070456_36073472.pth... 
[2023-03-08 13:51:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000069832_35753984.pth [2023-03-08 13:51:37,878][434396] Updated weights for policy 0, policy_version 70480 (0.0006) [2023-03-08 13:51:41,840][434396] Updated weights for policy 0, policy_version 70560 (0.0005) [2023-03-08 13:51:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 36126720. Throughput: 0: 10834.0. Samples: 36097780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:51:41,893][434108] Avg episode reward: [(0, '4617.902')] [2023-03-08 13:51:45,804][434396] Updated weights for policy 0, policy_version 70640 (0.0005) [2023-03-08 13:51:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 36175872. Throughput: 0: 10740.4. Samples: 36159552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:51:46,893][434108] Avg episode reward: [(0, '4554.909')] [2023-03-08 13:51:49,751][434396] Updated weights for policy 0, policy_version 70720 (0.0004) [2023-03-08 13:51:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 36229120. Throughput: 0: 10611.1. Samples: 36221424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:51:51,893][434108] Avg episode reward: [(0, '4537.794')] [2023-03-08 13:51:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000070760_36229120.pth... [2023-03-08 13:51:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000070144_35913728.pth [2023-03-08 13:51:53,561][434396] Updated weights for policy 0, policy_version 70800 (0.0005) [2023-03-08 13:51:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10552.4). Total num frames: 36282368. Throughput: 0: 10593.2. Samples: 36254500. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:51:56,893][434108] Avg episode reward: [(0, '4629.669')] [2023-03-08 13:51:57,283][434396] Updated weights for policy 0, policy_version 70880 (0.0005) [2023-03-08 13:52:01,017][434396] Updated weights for policy 0, policy_version 70960 (0.0005) [2023-03-08 13:52:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 36339712. Throughput: 0: 10595.2. Samples: 36320024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:01,893][434108] Avg episode reward: [(0, '4511.084')] [2023-03-08 13:52:04,703][434396] Updated weights for policy 0, policy_version 71040 (0.0004) [2023-03-08 13:52:06,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 36392960. Throughput: 0: 10660.3. Samples: 36386884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:06,893][434108] Avg episode reward: [(0, '4535.949')] [2023-03-08 13:52:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000071080_36392960.pth... [2023-03-08 13:52:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000070456_36073472.pth [2023-03-08 13:52:08,425][434396] Updated weights for policy 0, policy_version 71120 (0.0003) [2023-03-08 13:52:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 36450304. Throughput: 0: 10700.4. Samples: 36420000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:11,893][434108] Avg episode reward: [(0, '4495.147')] [2023-03-08 13:52:12,149][434396] Updated weights for policy 0, policy_version 71200 (0.0003) [2023-03-08 13:52:15,858][434396] Updated weights for policy 0, policy_version 71280 (0.0003) [2023-03-08 13:52:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 36503552. Throughput: 0: 10711.3. 
Samples: 36485864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:16,893][434108] Avg episode reward: [(0, '4417.298')] [2023-03-08 13:52:19,613][434396] Updated weights for policy 0, policy_version 71360 (0.0004) [2023-03-08 13:52:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10621.8). Total num frames: 36560896. Throughput: 0: 10788.3. Samples: 36551724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:21,893][434108] Avg episode reward: [(0, '4630.922')] [2023-03-08 13:52:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000071408_36560896.pth... [2023-03-08 13:52:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000070760_36229120.pth [2023-03-08 13:52:23,278][434396] Updated weights for policy 0, policy_version 71440 (0.0004) [2023-03-08 13:52:26,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10786.1, 300 sec: 10621.8). Total num frames: 36614144. Throughput: 0: 10839.0. Samples: 36585536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:26,893][434108] Avg episode reward: [(0, '4608.261')] [2023-03-08 13:52:26,957][434396] Updated weights for policy 0, policy_version 71520 (0.0004) [2023-03-08 13:52:30,875][434396] Updated weights for policy 0, policy_version 71600 (0.0005) [2023-03-08 13:52:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 36667392. Throughput: 0: 10921.2. Samples: 36651008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:31,893][434108] Avg episode reward: [(0, '4523.428')] [2023-03-08 13:52:34,959][434396] Updated weights for policy 0, policy_version 71680 (0.0005) [2023-03-08 13:52:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 36716544. Throughput: 0: 10888.0. Samples: 36711384. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:36,893][434108] Avg episode reward: [(0, '4478.330')] [2023-03-08 13:52:36,947][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000071720_36720640.pth... [2023-03-08 13:52:36,949][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000071080_36392960.pth [2023-03-08 13:52:38,927][434396] Updated weights for policy 0, policy_version 71760 (0.0005) [2023-03-08 13:52:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10635.7). Total num frames: 36769792. Throughput: 0: 10830.4. Samples: 36741868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:41,893][434108] Avg episode reward: [(0, '4547.084')] [2023-03-08 13:52:42,922][434396] Updated weights for policy 0, policy_version 71840 (0.0005) [2023-03-08 13:52:46,853][434396] Updated weights for policy 0, policy_version 71920 (0.0005) [2023-03-08 13:52:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 36823040. Throughput: 0: 10734.5. Samples: 36803076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:46,893][434108] Avg episode reward: [(0, '4564.984')] [2023-03-08 13:52:50,800][434396] Updated weights for policy 0, policy_version 72000 (0.0004) [2023-03-08 13:52:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10635.7). Total num frames: 36872192. Throughput: 0: 10658.7. Samples: 36866524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:51,893][434108] Avg episode reward: [(0, '4521.566')] [2023-03-08 13:52:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000072016_36872192.pth... 
[2023-03-08 13:52:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000071408_36560896.pth [2023-03-08 13:52:54,750][434396] Updated weights for policy 0, policy_version 72080 (0.0005) [2023-03-08 13:52:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.8, 300 sec: 10635.7). Total num frames: 36925440. Throughput: 0: 10601.6. Samples: 36897072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:52:56,893][434108] Avg episode reward: [(0, '4589.777')] [2023-03-08 13:52:58,680][434396] Updated weights for policy 0, policy_version 72160 (0.0005) [2023-03-08 13:53:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 36978688. Throughput: 0: 10536.8. Samples: 36960020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:01,893][434108] Avg episode reward: [(0, '4475.183')] [2023-03-08 13:53:02,679][434396] Updated weights for policy 0, policy_version 72240 (0.0005) [2023-03-08 13:53:06,697][434396] Updated weights for policy 0, policy_version 72320 (0.0005) [2023-03-08 13:53:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10621.8). Total num frames: 37027840. Throughput: 0: 10424.9. Samples: 37020844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:06,893][434108] Avg episode reward: [(0, '4476.967')] [2023-03-08 13:53:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000072320_37027840.pth... [2023-03-08 13:53:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000071720_36720640.pth [2023-03-08 13:53:10,667][434396] Updated weights for policy 0, policy_version 72400 (0.0005) [2023-03-08 13:53:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 37076992. Throughput: 0: 10371.2. Samples: 37052240. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:11,893][434108] Avg episode reward: [(0, '4502.102')] [2023-03-08 13:53:14,675][434396] Updated weights for policy 0, policy_version 72480 (0.0005) [2023-03-08 13:53:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 37130240. Throughput: 0: 10281.4. Samples: 37113668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:16,893][434108] Avg episode reward: [(0, '4401.004')] [2023-03-08 13:53:18,578][434396] Updated weights for policy 0, policy_version 72560 (0.0005) [2023-03-08 13:53:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10621.8). Total num frames: 37183488. Throughput: 0: 10314.0. Samples: 37175512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:21,893][434108] Avg episode reward: [(0, '4257.848')] [2023-03-08 13:53:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000072624_37183488.pth... [2023-03-08 13:53:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000072016_36872192.pth [2023-03-08 13:53:22,560][434396] Updated weights for policy 0, policy_version 72640 (0.0005) [2023-03-08 13:53:26,395][434396] Updated weights for policy 0, policy_version 72720 (0.0005) [2023-03-08 13:53:26,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10635.7). Total num frames: 37236736. Throughput: 0: 10359.5. Samples: 37208048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:26,893][434108] Avg episode reward: [(0, '4194.666')] [2023-03-08 13:53:30,384][434396] Updated weights for policy 0, policy_version 72800 (0.0005) [2023-03-08 13:53:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10635.7). Total num frames: 37289984. Throughput: 0: 10366.5. Samples: 37269568. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:31,893][434108] Avg episode reward: [(0, '4285.727')] [2023-03-08 13:53:34,185][434396] Updated weights for policy 0, policy_version 72880 (0.0005) [2023-03-08 13:53:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 37343232. Throughput: 0: 10408.2. Samples: 37334892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:36,893][434108] Avg episode reward: [(0, '4364.521')] [2023-03-08 13:53:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000072936_37343232.pth... [2023-03-08 13:53:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000072320_37027840.pth [2023-03-08 13:53:37,873][434396] Updated weights for policy 0, policy_version 72960 (0.0004) [2023-03-08 13:53:41,731][434396] Updated weights for policy 0, policy_version 73040 (0.0005) [2023-03-08 13:53:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 37396480. Throughput: 0: 10462.4. Samples: 37367880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:41,893][434108] Avg episode reward: [(0, '4593.581')] [2023-03-08 13:53:45,762][434396] Updated weights for policy 0, policy_version 73120 (0.0005) [2023-03-08 13:53:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10649.6). Total num frames: 37445632. Throughput: 0: 10428.7. Samples: 37429312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:46,893][434108] Avg episode reward: [(0, '4443.013')] [2023-03-08 13:53:49,747][434396] Updated weights for policy 0, policy_version 73200 (0.0006) [2023-03-08 13:53:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 37498880. Throughput: 0: 10458.9. Samples: 37491496. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:51,893][434108] Avg episode reward: [(0, '4584.107')] [2023-03-08 13:53:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000073240_37498880.pth... [2023-03-08 13:53:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000072624_37183488.pth [2023-03-08 13:53:53,618][434396] Updated weights for policy 0, policy_version 73280 (0.0005) [2023-03-08 13:53:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 37552128. Throughput: 0: 10472.9. Samples: 37523520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:53:56,893][434108] Avg episode reward: [(0, '4641.427')] [2023-03-08 13:53:57,338][434396] Updated weights for policy 0, policy_version 73360 (0.0004) [2023-03-08 13:54:01,226][434396] Updated weights for policy 0, policy_version 73440 (0.0005) [2023-03-08 13:54:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10663.5). Total num frames: 37605376. Throughput: 0: 10562.9. Samples: 37589000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:54:01,893][434108] Avg episode reward: [(0, '4587.579')] [2023-03-08 13:54:05,092][434396] Updated weights for policy 0, policy_version 73520 (0.0004) [2023-03-08 13:54:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 37658624. Throughput: 0: 10577.0. Samples: 37651476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:54:06,893][434108] Avg episode reward: [(0, '4648.535')] [2023-03-08 13:54:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000073552_37658624.pth... 
[2023-03-08 13:54:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000072936_37343232.pth [2023-03-08 13:54:06,900][434347] Saving new best policy, reward=4648.535! [2023-03-08 13:54:09,192][434396] Updated weights for policy 0, policy_version 73600 (0.0005) [2023-03-08 13:54:11,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.0, 300 sec: 10649.6). Total num frames: 37707776. Throughput: 0: 10516.0. Samples: 37681268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:54:11,893][434108] Avg episode reward: [(0, '4597.267')] [2023-03-08 13:54:13,204][434396] Updated weights for policy 0, policy_version 73680 (0.0005) [2023-03-08 13:54:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 37761024. Throughput: 0: 10529.9. Samples: 37743412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:54:16,893][434108] Avg episode reward: [(0, '4644.798')] [2023-03-08 13:54:17,147][434396] Updated weights for policy 0, policy_version 73760 (0.0004) [2023-03-08 13:54:21,213][434396] Updated weights for policy 0, policy_version 73840 (0.0005) [2023-03-08 13:54:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 37810176. Throughput: 0: 10430.8. Samples: 37804276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:54:21,893][434108] Avg episode reward: [(0, '4567.938')] [2023-03-08 13:54:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000073848_37810176.pth... [2023-03-08 13:54:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000073240_37498880.pth [2023-03-08 13:54:25,310][434396] Updated weights for policy 0, policy_version 73920 (0.0005) [2023-03-08 13:54:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10594.1). Total num frames: 37859328. 
Throughput: 0: 10360.5. Samples: 37834104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:54:26,893][434108] Avg episode reward: [(0, '4471.246')] [2023-03-08 13:54:29,352][434396] Updated weights for policy 0, policy_version 74000 (0.0005) [2023-03-08 13:54:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10594.1). Total num frames: 37912576. Throughput: 0: 10350.6. Samples: 37895088. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:54:31,893][434108] Avg episode reward: [(0, '4422.310')] [2023-03-08 13:54:33,463][434396] Updated weights for policy 0, policy_version 74080 (0.0005) [2023-03-08 13:54:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10580.2). Total num frames: 37961728. Throughput: 0: 10282.2. Samples: 37954196. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:54:36,893][434108] Avg episode reward: [(0, '4319.090')] [2023-03-08 13:54:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000074144_37961728.pth... [2023-03-08 13:54:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000073552_37658624.pth [2023-03-08 13:54:37,520][434396] Updated weights for policy 0, policy_version 74160 (0.0005) [2023-03-08 13:54:41,507][434396] Updated weights for policy 0, policy_version 74240 (0.0005) [2023-03-08 13:54:41,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10566.3). Total num frames: 38010880. Throughput: 0: 10270.8. Samples: 37985708. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:54:41,893][434108] Avg episode reward: [(0, '4560.525')] [2023-03-08 13:54:45,684][434396] Updated weights for policy 0, policy_version 74320 (0.0005) [2023-03-08 13:54:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10552.4). Total num frames: 38060032. Throughput: 0: 10145.2. Samples: 38045532. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:54:46,893][434108] Avg episode reward: [(0, '4531.081')] [2023-03-08 13:54:49,752][434396] Updated weights for policy 0, policy_version 74400 (0.0005) [2023-03-08 13:54:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10552.4). Total num frames: 38113280. Throughput: 0: 10093.1. Samples: 38105664. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:54:51,893][434108] Avg episode reward: [(0, '4518.920')] [2023-03-08 13:54:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000074440_38113280.pth... [2023-03-08 13:54:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000073848_37810176.pth [2023-03-08 13:54:53,748][434396] Updated weights for policy 0, policy_version 74480 (0.0005) [2023-03-08 13:54:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10524.6). Total num frames: 38162432. Throughput: 0: 10119.6. Samples: 38136648. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:54:56,893][434108] Avg episode reward: [(0, '4558.224')] [2023-03-08 13:54:57,779][434396] Updated weights for policy 0, policy_version 74560 (0.0005) [2023-03-08 13:55:01,665][434396] Updated weights for policy 0, policy_version 74640 (0.0004) [2023-03-08 13:55:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10510.8). Total num frames: 38215680. Throughput: 0: 10088.5. Samples: 38197396. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:55:01,893][434108] Avg episode reward: [(0, '4505.176')] [2023-03-08 13:55:05,715][434396] Updated weights for policy 0, policy_version 74720 (0.0004) [2023-03-08 13:55:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10483.0). Total num frames: 38264832. Throughput: 0: 10125.9. Samples: 38259940. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:55:06,893][434108] Avg episode reward: [(0, '4481.662')] [2023-03-08 13:55:06,934][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000074744_38268928.pth... [2023-03-08 13:55:06,936][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000074144_37961728.pth [2023-03-08 13:55:09,851][434396] Updated weights for policy 0, policy_version 74800 (0.0005) [2023-03-08 13:55:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10469.1). Total num frames: 38313984. Throughput: 0: 10118.1. Samples: 38289420. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:55:11,893][434108] Avg episode reward: [(0, '4394.859')] [2023-03-08 13:55:13,998][434396] Updated weights for policy 0, policy_version 74880 (0.0004) [2023-03-08 13:55:16,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10469.1). Total num frames: 38363136. Throughput: 0: 10072.5. Samples: 38348352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:55:16,893][434108] Avg episode reward: [(0, '3949.726')] [2023-03-08 13:55:18,199][434396] Updated weights for policy 0, policy_version 74960 (0.0005) [2023-03-08 13:55:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10469.1). Total num frames: 38416384. Throughput: 0: 10092.5. Samples: 38408360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:55:21,893][434108] Avg episode reward: [(0, '3388.002')] [2023-03-08 13:55:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000075032_38416384.pth... 
[2023-03-08 13:55:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000074440_38113280.pth [2023-03-08 13:55:22,187][434396] Updated weights for policy 0, policy_version 75040 (0.0005) [2023-03-08 13:55:26,263][434396] Updated weights for policy 0, policy_version 75120 (0.0005) [2023-03-08 13:55:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10469.1). Total num frames: 38465536. Throughput: 0: 10072.4. Samples: 38438964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:55:26,893][434108] Avg episode reward: [(0, '3872.490')] [2023-03-08 13:55:30,314][434396] Updated weights for policy 0, policy_version 75200 (0.0005) [2023-03-08 13:55:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10455.2). Total num frames: 38514688. Throughput: 0: 10081.9. Samples: 38499216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:55:31,893][434108] Avg episode reward: [(0, '4445.470')] [2023-03-08 13:55:34,385][434396] Updated weights for policy 0, policy_version 75280 (0.0005) [2023-03-08 13:55:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10455.2). Total num frames: 38567936. Throughput: 0: 10090.9. Samples: 38559752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:55:36,893][434108] Avg episode reward: [(0, '4496.557')] [2023-03-08 13:55:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000075328_38567936.pth... [2023-03-08 13:55:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000074744_38268928.pth [2023-03-08 13:55:38,490][434396] Updated weights for policy 0, policy_version 75360 (0.0004) [2023-03-08 13:55:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10455.2). Total num frames: 38617088. Throughput: 0: 10071.0. Samples: 38589844. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:55:41,893][434108] Avg episode reward: [(0, '4491.051')] [2023-03-08 13:55:42,522][434396] Updated weights for policy 0, policy_version 75440 (0.0005) [2023-03-08 13:55:46,623][434396] Updated weights for policy 0, policy_version 75520 (0.0005) [2023-03-08 13:55:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10441.3). Total num frames: 38666240. Throughput: 0: 10056.3. Samples: 38649928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:55:46,893][434108] Avg episode reward: [(0, '4588.785')] [2023-03-08 13:55:50,562][434396] Updated weights for policy 0, policy_version 75600 (0.0005) [2023-03-08 13:55:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10441.3). Total num frames: 38719488. Throughput: 0: 10036.5. Samples: 38711584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:55:51,893][434108] Avg episode reward: [(0, '4548.715')] [2023-03-08 13:55:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000075624_38719488.pth... [2023-03-08 13:55:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000075032_38416384.pth [2023-03-08 13:55:54,785][434396] Updated weights for policy 0, policy_version 75680 (0.0005) [2023-03-08 13:55:56,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10399.7). Total num frames: 38764544. Throughput: 0: 10019.5. Samples: 38740296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:55:56,893][434108] Avg episode reward: [(0, '4588.172')] [2023-03-08 13:55:58,930][434396] Updated weights for policy 0, policy_version 75760 (0.0004) [2023-03-08 13:56:01,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10399.7). Total num frames: 38817792. Throughput: 0: 10062.0. Samples: 38801144. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:01,893][434108] Avg episode reward: [(0, '4546.506')] [2023-03-08 13:56:02,761][434396] Updated weights for policy 0, policy_version 75840 (0.0004) [2023-03-08 13:56:06,582][434396] Updated weights for policy 0, policy_version 75920 (0.0004) [2023-03-08 13:56:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10385.8). Total num frames: 38871040. Throughput: 0: 10152.0. Samples: 38865200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:06,893][434108] Avg episode reward: [(0, '4525.143')] [2023-03-08 13:56:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000075920_38871040.pth... [2023-03-08 13:56:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000075328_38567936.pth [2023-03-08 13:56:10,375][434396] Updated weights for policy 0, policy_version 76000 (0.0005) [2023-03-08 13:56:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10240.0, 300 sec: 10399.7). Total num frames: 38928384. Throughput: 0: 10180.3. Samples: 38897076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:11,893][434108] Avg episode reward: [(0, '4563.827')] [2023-03-08 13:56:14,199][434396] Updated weights for policy 0, policy_version 76080 (0.0004) [2023-03-08 13:56:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10385.8). Total num frames: 38977536. Throughput: 0: 10267.7. Samples: 38961264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:16,893][434108] Avg episode reward: [(0, '4566.642')] [2023-03-08 13:56:18,306][434396] Updated weights for policy 0, policy_version 76160 (0.0005) [2023-03-08 13:56:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10371.9). Total num frames: 39026688. Throughput: 0: 10268.4. Samples: 39021832. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:21,893][434108] Avg episode reward: [(0, '4545.483')] [2023-03-08 13:56:21,932][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000076232_39030784.pth... [2023-03-08 13:56:21,933][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000075624_38719488.pth [2023-03-08 13:56:22,370][434396] Updated weights for policy 0, policy_version 76240 (0.0005) [2023-03-08 13:56:26,513][434396] Updated weights for policy 0, policy_version 76320 (0.0005) [2023-03-08 13:56:26,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10371.9). Total num frames: 39079936. Throughput: 0: 10254.1. Samples: 39051276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:26,893][434108] Avg episode reward: [(0, '4545.159')] [2023-03-08 13:56:30,677][434396] Updated weights for policy 0, policy_version 76400 (0.0005) [2023-03-08 13:56:31,893][434108] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 39124992. Throughput: 0: 10250.6. Samples: 39111204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:31,893][434108] Avg episode reward: [(0, '4500.923')] [2023-03-08 13:56:34,872][434396] Updated weights for policy 0, policy_version 76480 (0.0005) [2023-03-08 13:56:36,892][434108] Fps is (10 sec: 9420.7, 60 sec: 10103.5, 300 sec: 10330.2). Total num frames: 39174144. Throughput: 0: 10168.9. Samples: 39169184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:36,893][434108] Avg episode reward: [(0, '4480.088')] [2023-03-08 13:56:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000076512_39174144.pth... 
[2023-03-08 13:56:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000075920_38871040.pth [2023-03-08 13:56:39,055][434396] Updated weights for policy 0, policy_version 76560 (0.0005) [2023-03-08 13:56:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 39227392. Throughput: 0: 10187.1. Samples: 39198716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:41,893][434108] Avg episode reward: [(0, '4406.438')] [2023-03-08 13:56:43,059][434396] Updated weights for policy 0, policy_version 76640 (0.0005) [2023-03-08 13:56:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10330.3). Total num frames: 39276544. Throughput: 0: 10195.3. Samples: 39259932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:46,893][434108] Avg episode reward: [(0, '4573.930')] [2023-03-08 13:56:47,148][434396] Updated weights for policy 0, policy_version 76720 (0.0004) [2023-03-08 13:56:51,118][434396] Updated weights for policy 0, policy_version 76800 (0.0004) [2023-03-08 13:56:51,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10316.4). Total num frames: 39325696. Throughput: 0: 10121.8. Samples: 39320680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:51,893][434108] Avg episode reward: [(0, '4460.649')] [2023-03-08 13:56:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000076816_39329792.pth... [2023-03-08 13:56:51,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000076232_39030784.pth [2023-03-08 13:56:55,043][434396] Updated weights for policy 0, policy_version 76880 (0.0004) [2023-03-08 13:56:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 39378944. Throughput: 0: 10094.3. Samples: 39351320. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:56:56,893][434108] Avg episode reward: [(0, '4557.700')] [2023-03-08 13:56:58,962][434396] Updated weights for policy 0, policy_version 76960 (0.0004) [2023-03-08 13:57:01,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 39432192. Throughput: 0: 10098.0. Samples: 39415672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:01,893][434108] Avg episode reward: [(0, '4480.652')] [2023-03-08 13:57:02,795][434396] Updated weights for policy 0, policy_version 77040 (0.0004) [2023-03-08 13:57:06,585][434396] Updated weights for policy 0, policy_version 77120 (0.0003) [2023-03-08 13:57:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 39485440. Throughput: 0: 10168.5. Samples: 39479416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:06,893][434108] Avg episode reward: [(0, '4517.082')] [2023-03-08 13:57:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000077120_39485440.pth... [2023-03-08 13:57:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000076512_39174144.pth [2023-03-08 13:57:10,721][434396] Updated weights for policy 0, policy_version 77200 (0.0005) [2023-03-08 13:57:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10274.7). Total num frames: 39534592. Throughput: 0: 10193.9. Samples: 39510000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:11,893][434108] Avg episode reward: [(0, '4483.331')] [2023-03-08 13:57:14,663][434396] Updated weights for policy 0, policy_version 77280 (0.0004) [2023-03-08 13:57:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10260.8). Total num frames: 39587840. Throughput: 0: 10227.7. Samples: 39571448. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:16,893][434108] Avg episode reward: [(0, '4474.089')] [2023-03-08 13:57:18,836][434396] Updated weights for policy 0, policy_version 77360 (0.0005) [2023-03-08 13:57:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10246.9). Total num frames: 39636992. Throughput: 0: 10235.1. Samples: 39629764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:21,893][434108] Avg episode reward: [(0, '4498.793')] [2023-03-08 13:57:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000077416_39636992.pth... [2023-03-08 13:57:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000076816_39329792.pth [2023-03-08 13:57:23,107][434396] Updated weights for policy 0, policy_version 77440 (0.0005) [2023-03-08 13:57:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10233.1). Total num frames: 39686144. Throughput: 0: 10211.0. Samples: 39658208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:26,893][434108] Avg episode reward: [(0, '4610.239')] [2023-03-08 13:57:27,309][434396] Updated weights for policy 0, policy_version 77520 (0.0005) [2023-03-08 13:57:31,359][434396] Updated weights for policy 0, policy_version 77600 (0.0005) [2023-03-08 13:57:31,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10171.8, 300 sec: 10233.1). Total num frames: 39735296. Throughput: 0: 10174.7. Samples: 39717792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:31,893][434108] Avg episode reward: [(0, '4586.458')] [2023-03-08 13:57:35,673][434396] Updated weights for policy 0, policy_version 77680 (0.0005) [2023-03-08 13:57:36,892][434108] Fps is (10 sec: 9420.7, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 39780352. Throughput: 0: 10118.1. Samples: 39775996. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:36,893][434108] Avg episode reward: [(0, '4632.052')] [2023-03-08 13:57:36,898][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000077704_39784448.pth... [2023-03-08 13:57:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000077120_39485440.pth [2023-03-08 13:57:39,796][434396] Updated weights for policy 0, policy_version 77760 (0.0005) [2023-03-08 13:57:41,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 39833600. Throughput: 0: 10087.2. Samples: 39805244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:41,893][434108] Avg episode reward: [(0, '4561.606')] [2023-03-08 13:57:44,113][434396] Updated weights for policy 0, policy_version 77840 (0.0006) [2023-03-08 13:57:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 39878656. Throughput: 0: 9937.4. Samples: 39862856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:46,893][434108] Avg episode reward: [(0, '4616.860')] [2023-03-08 13:57:48,274][434396] Updated weights for policy 0, policy_version 77920 (0.0006) [2023-03-08 13:57:51,892][434108] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10177.5). Total num frames: 39927808. Throughput: 0: 9849.1. Samples: 39922624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:51,893][434108] Avg episode reward: [(0, '4634.793')] [2023-03-08 13:57:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000077984_39927808.pth... 
[2023-03-08 13:57:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000077416_39636992.pth [2023-03-08 13:57:52,379][434396] Updated weights for policy 0, policy_version 78000 (0.0005) [2023-03-08 13:57:56,536][434396] Updated weights for policy 0, policy_version 78080 (0.0005) [2023-03-08 13:57:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 39976960. Throughput: 0: 9832.4. Samples: 39952456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:57:56,893][434108] Avg episode reward: [(0, '4633.089')] [2023-03-08 13:58:00,761][434396] Updated weights for policy 0, policy_version 78160 (0.0005) [2023-03-08 13:58:01,892][434108] Fps is (10 sec: 9830.2, 60 sec: 9898.7, 300 sec: 10163.6). Total num frames: 40026112. Throughput: 0: 9758.7. Samples: 40010592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:01,893][434108] Avg episode reward: [(0, '4617.878')] [2023-03-08 13:58:05,115][434396] Updated weights for policy 0, policy_version 78240 (0.0006) [2023-03-08 13:58:06,892][434108] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10163.6). Total num frames: 40075264. Throughput: 0: 9724.8. Samples: 40067380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:06,893][434108] Avg episode reward: [(0, '4635.803')] [2023-03-08 13:58:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000078272_40075264.pth... [2023-03-08 13:58:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000077704_39784448.pth [2023-03-08 13:58:09,290][434396] Updated weights for policy 0, policy_version 78320 (0.0005) [2023-03-08 13:58:11,892][434108] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10149.7). Total num frames: 40124416. Throughput: 0: 9766.7. Samples: 40097712. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:11,893][434108] Avg episode reward: [(0, '4595.912')] [2023-03-08 13:58:13,193][434396] Updated weights for policy 0, policy_version 78400 (0.0004) [2023-03-08 13:58:16,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10135.9). Total num frames: 40173568. Throughput: 0: 9786.6. Samples: 40158192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:16,893][434108] Avg episode reward: [(0, '4537.017')] [2023-03-08 13:58:17,408][434396] Updated weights for policy 0, policy_version 78480 (0.0005) [2023-03-08 13:58:21,743][434396] Updated weights for policy 0, policy_version 78560 (0.0005) [2023-03-08 13:58:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10122.0). Total num frames: 40222720. Throughput: 0: 9774.8. Samples: 40215860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:21,893][434108] Avg episode reward: [(0, '4564.942')] [2023-03-08 13:58:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000078560_40222720.pth... [2023-03-08 13:58:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000077984_39927808.pth [2023-03-08 13:58:25,871][434396] Updated weights for policy 0, policy_version 78640 (0.0005) [2023-03-08 13:58:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10108.1). Total num frames: 40271872. Throughput: 0: 9786.3. Samples: 40245628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:26,893][434108] Avg episode reward: [(0, '4545.897')] [2023-03-08 13:58:29,910][434396] Updated weights for policy 0, policy_version 78720 (0.0005) [2023-03-08 13:58:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10108.1). Total num frames: 40325120. Throughput: 0: 9838.2. Samples: 40305576. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:31,893][434108] Avg episode reward: [(0, '4445.357')] [2023-03-08 13:58:33,947][434396] Updated weights for policy 0, policy_version 78800 (0.0005) [2023-03-08 13:58:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 40374272. Throughput: 0: 9856.3. Samples: 40366160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:36,893][434108] Avg episode reward: [(0, '4449.386')] [2023-03-08 13:58:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000078856_40374272.pth... [2023-03-08 13:58:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000078272_40075264.pth [2023-03-08 13:58:37,864][434396] Updated weights for policy 0, policy_version 78880 (0.0004) [2023-03-08 13:58:41,803][434396] Updated weights for policy 0, policy_version 78960 (0.0005) [2023-03-08 13:58:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 40427520. Throughput: 0: 9921.2. Samples: 40398912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:41,903][434108] Avg episode reward: [(0, '4517.461')] [2023-03-08 13:58:45,634][434396] Updated weights for policy 0, policy_version 79040 (0.0005) [2023-03-08 13:58:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 40480768. Throughput: 0: 10033.5. Samples: 40462100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:46,893][434108] Avg episode reward: [(0, '4478.053')] [2023-03-08 13:58:49,616][434396] Updated weights for policy 0, policy_version 79120 (0.0005) [2023-03-08 13:58:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10094.2). Total num frames: 40529920. Throughput: 0: 10143.8. Samples: 40523852. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:51,893][434108] Avg episode reward: [(0, '4479.661')] [2023-03-08 13:58:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000079160_40529920.pth... [2023-03-08 13:58:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000078560_40222720.pth [2023-03-08 13:58:53,737][434396] Updated weights for policy 0, policy_version 79200 (0.0005) [2023-03-08 13:58:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10094.2). Total num frames: 40583168. Throughput: 0: 10128.0. Samples: 40553472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:58:56,893][434108] Avg episode reward: [(0, '4538.073')] [2023-03-08 13:58:57,548][434396] Updated weights for policy 0, policy_version 79280 (0.0005) [2023-03-08 13:59:01,291][434396] Updated weights for policy 0, policy_version 79360 (0.0005) [2023-03-08 13:59:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 40636416. Throughput: 0: 10219.4. Samples: 40618064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:59:01,893][434108] Avg episode reward: [(0, '4529.308')] [2023-03-08 13:59:05,047][434396] Updated weights for policy 0, policy_version 79440 (0.0005) [2023-03-08 13:59:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 40689664. Throughput: 0: 10396.3. Samples: 40683696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 13:59:06,893][434108] Avg episode reward: [(0, '4566.162')] [2023-03-08 13:59:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000079472_40689664.pth... 
[2023-03-08 13:59:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000078856_40374272.pth [2023-03-08 13:59:08,868][434396] Updated weights for policy 0, policy_version 79520 (0.0004) [2023-03-08 13:59:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10108.1). Total num frames: 40742912. Throughput: 0: 10443.3. Samples: 40715576. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:11,893][434108] Avg episode reward: [(0, '4512.057')] [2023-03-08 13:59:12,958][434396] Updated weights for policy 0, policy_version 79600 (0.0005) [2023-03-08 13:59:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10108.1). Total num frames: 40792064. Throughput: 0: 10451.3. Samples: 40775884. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:16,893][434108] Avg episode reward: [(0, '4572.476')] [2023-03-08 13:59:16,970][434396] Updated weights for policy 0, policy_version 79680 (0.0005) [2023-03-08 13:59:20,897][434396] Updated weights for policy 0, policy_version 79760 (0.0005) [2023-03-08 13:59:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 40845312. Throughput: 0: 10490.7. Samples: 40838240. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:21,893][434108] Avg episode reward: [(0, '4601.561')] [2023-03-08 13:59:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000079776_40845312.pth... [2023-03-08 13:59:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000079160_40529920.pth [2023-03-08 13:59:24,873][434396] Updated weights for policy 0, policy_version 79840 (0.0005) [2023-03-08 13:59:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10122.0). Total num frames: 40898560. Throughput: 0: 10449.4. Samples: 40869136. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:26,893][434108] Avg episode reward: [(0, '4601.317')] [2023-03-08 13:59:28,822][434396] Updated weights for policy 0, policy_version 79920 (0.0005) [2023-03-08 13:59:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 40947712. Throughput: 0: 10427.3. Samples: 40931328. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:31,893][434108] Avg episode reward: [(0, '4457.243')] [2023-03-08 13:59:32,844][434396] Updated weights for policy 0, policy_version 80000 (0.0005) [2023-03-08 13:59:36,876][434396] Updated weights for policy 0, policy_version 80080 (0.0005) [2023-03-08 13:59:36,893][434108] Fps is (10 sec: 10239.8, 60 sec: 10444.8, 300 sec: 10135.9). Total num frames: 41000960. Throughput: 0: 10420.1. Samples: 40992760. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:36,893][434108] Avg episode reward: [(0, '4362.473')] [2023-03-08 13:59:36,898][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000080080_41000960.pth... [2023-03-08 13:59:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000079472_40689664.pth [2023-03-08 13:59:40,718][434396] Updated weights for policy 0, policy_version 80160 (0.0004) [2023-03-08 13:59:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10149.7). Total num frames: 41054208. Throughput: 0: 10453.7. Samples: 41023888. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:41,900][434108] Avg episode reward: [(0, '4475.316')] [2023-03-08 13:59:44,525][434396] Updated weights for policy 0, policy_version 80240 (0.0004) [2023-03-08 13:59:46,893][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10135.9). Total num frames: 41103360. Throughput: 0: 10440.1. Samples: 41087868. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:46,902][434108] Avg episode reward: [(0, '4489.132')] [2023-03-08 13:59:48,465][434396] Updated weights for policy 0, policy_version 80320 (0.0004) [2023-03-08 13:59:51,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10163.6). Total num frames: 41160704. Throughput: 0: 10407.3. Samples: 41152024. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:51,893][434108] Avg episode reward: [(0, '4529.505')] [2023-03-08 13:59:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000080392_41160704.pth... [2023-03-08 13:59:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000079776_40845312.pth [2023-03-08 13:59:52,190][434396] Updated weights for policy 0, policy_version 80400 (0.0004) [2023-03-08 13:59:55,842][434396] Updated weights for policy 0, policy_version 80480 (0.0003) [2023-03-08 13:59:56,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10513.1, 300 sec: 10163.6). Total num frames: 41213952. Throughput: 0: 10429.0. Samples: 41184880. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 13:59:56,893][434108] Avg episode reward: [(0, '4529.352')] [2023-03-08 13:59:59,707][434396] Updated weights for policy 0, policy_version 80560 (0.0004) [2023-03-08 14:00:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10177.5). Total num frames: 41267200. Throughput: 0: 10539.7. Samples: 41250168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:01,893][434108] Avg episode reward: [(0, '4602.239')] [2023-03-08 14:00:03,772][434396] Updated weights for policy 0, policy_version 80640 (0.0005) [2023-03-08 14:00:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10177.5). Total num frames: 41316352. Throughput: 0: 10472.4. Samples: 41309500. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:06,893][434108] Avg episode reward: [(0, '4619.288')] [2023-03-08 14:00:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000080696_41316352.pth... [2023-03-08 14:00:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000080080_41000960.pth [2023-03-08 14:00:07,833][434396] Updated weights for policy 0, policy_version 80720 (0.0005) [2023-03-08 14:00:11,771][434396] Updated weights for policy 0, policy_version 80800 (0.0004) [2023-03-08 14:00:11,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 41369600. Throughput: 0: 10477.5. Samples: 41340624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:11,893][434108] Avg episode reward: [(0, '4565.495')] [2023-03-08 14:00:15,875][434396] Updated weights for policy 0, policy_version 80880 (0.0005) [2023-03-08 14:00:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10177.5). Total num frames: 41418752. Throughput: 0: 10467.6. Samples: 41402368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:16,893][434108] Avg episode reward: [(0, '4578.256')] [2023-03-08 14:00:19,876][434396] Updated weights for policy 0, policy_version 80960 (0.0005) [2023-03-08 14:00:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 41472000. Throughput: 0: 10467.8. Samples: 41463808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:21,893][434108] Avg episode reward: [(0, '4524.179')] [2023-03-08 14:00:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000081000_41472000.pth... 
[2023-03-08 14:00:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000080392_41160704.pth [2023-03-08 14:00:23,948][434396] Updated weights for policy 0, policy_version 81040 (0.0005) [2023-03-08 14:00:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10191.4). Total num frames: 41521152. Throughput: 0: 10422.9. Samples: 41492920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:26,893][434108] Avg episode reward: [(0, '4328.406')] [2023-03-08 14:00:28,064][434396] Updated weights for policy 0, policy_version 81120 (0.0005) [2023-03-08 14:00:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 41570304. Throughput: 0: 10357.1. Samples: 41553936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:31,893][434108] Avg episode reward: [(0, '3988.501')] [2023-03-08 14:00:32,055][434396] Updated weights for policy 0, policy_version 81200 (0.0005) [2023-03-08 14:00:36,034][434396] Updated weights for policy 0, policy_version 81280 (0.0004) [2023-03-08 14:00:36,893][434108] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 41619456. Throughput: 0: 10296.5. Samples: 41615368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:36,893][434108] Avg episode reward: [(0, '4211.157')] [2023-03-08 14:00:36,908][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000081296_41623552.pth... [2023-03-08 14:00:36,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000080696_41316352.pth [2023-03-08 14:00:40,258][434396] Updated weights for policy 0, policy_version 81360 (0.0006) [2023-03-08 14:00:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10191.4). Total num frames: 41672704. Throughput: 0: 10205.0. Samples: 41644104. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:41,893][434108] Avg episode reward: [(0, '4318.211')] [2023-03-08 14:00:44,149][434396] Updated weights for policy 0, policy_version 81440 (0.0004) [2023-03-08 14:00:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10177.5). Total num frames: 41721856. Throughput: 0: 10121.7. Samples: 41705644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:46,893][434108] Avg episode reward: [(0, '4476.691')] [2023-03-08 14:00:48,277][434396] Updated weights for policy 0, policy_version 81520 (0.0005) [2023-03-08 14:00:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 41771008. Throughput: 0: 10110.3. Samples: 41764464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:51,893][434108] Avg episode reward: [(0, '4556.001')] [2023-03-08 14:00:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000081584_41771008.pth... [2023-03-08 14:00:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000081000_41472000.pth [2023-03-08 14:00:52,422][434396] Updated weights for policy 0, policy_version 81600 (0.0005) [2023-03-08 14:00:56,464][434396] Updated weights for policy 0, policy_version 81680 (0.0005) [2023-03-08 14:00:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10171.8, 300 sec: 10191.4). Total num frames: 41824256. Throughput: 0: 10104.8. Samples: 41795340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:00:56,893][434108] Avg episode reward: [(0, '4616.308')] [2023-03-08 14:01:00,686][434396] Updated weights for policy 0, policy_version 81760 (0.0005) [2023-03-08 14:01:01,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 41869312. Throughput: 0: 10062.6. Samples: 41855184. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:01,893][434108] Avg episode reward: [(0, '4614.526')] [2023-03-08 14:01:04,827][434396] Updated weights for policy 0, policy_version 81840 (0.0005) [2023-03-08 14:01:06,892][434108] Fps is (10 sec: 9420.7, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 41918464. Throughput: 0: 10004.0. Samples: 41913988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:06,893][434108] Avg episode reward: [(0, '4612.627')] [2023-03-08 14:01:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000081872_41918464.pth... [2023-03-08 14:01:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000081296_41623552.pth [2023-03-08 14:01:08,849][434396] Updated weights for policy 0, policy_version 81920 (0.0005) [2023-03-08 14:01:11,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10149.8). Total num frames: 41971712. Throughput: 0: 10035.3. Samples: 41944508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:11,893][434108] Avg episode reward: [(0, '4582.213')] [2023-03-08 14:01:12,678][434396] Updated weights for policy 0, policy_version 82000 (0.0005) [2023-03-08 14:01:16,629][434396] Updated weights for policy 0, policy_version 82080 (0.0005) [2023-03-08 14:01:16,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 42024960. Throughput: 0: 10104.7. Samples: 42008648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:16,893][434108] Avg episode reward: [(0, '4619.598')] [2023-03-08 14:01:20,700][434396] Updated weights for policy 0, policy_version 82160 (0.0005) [2023-03-08 14:01:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 42074112. Throughput: 0: 10089.8. Samples: 42069408. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:21,893][434108] Avg episode reward: [(0, '4527.465')] [2023-03-08 14:01:21,924][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000082184_42078208.pth... [2023-03-08 14:01:21,925][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000081584_41771008.pth [2023-03-08 14:01:24,768][434396] Updated weights for policy 0, policy_version 82240 (0.0005) [2023-03-08 14:01:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 42127360. Throughput: 0: 10103.6. Samples: 42098768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:26,893][434108] Avg episode reward: [(0, '4419.717')] [2023-03-08 14:01:28,916][434396] Updated weights for policy 0, policy_version 82320 (0.0005) [2023-03-08 14:01:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 42176512. Throughput: 0: 10079.4. Samples: 42159216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:31,893][434108] Avg episode reward: [(0, '4511.389')] [2023-03-08 14:01:33,087][434396] Updated weights for policy 0, policy_version 82400 (0.0005) [2023-03-08 14:01:36,893][434108] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 42225664. Throughput: 0: 10073.2. Samples: 42217760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:36,893][434108] Avg episode reward: [(0, '4574.626')] [2023-03-08 14:01:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000082472_42225664.pth... 
[2023-03-08 14:01:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000081872_41918464.pth [2023-03-08 14:01:37,121][434396] Updated weights for policy 0, policy_version 82480 (0.0005) [2023-03-08 14:01:40,985][434396] Updated weights for policy 0, policy_version 82560 (0.0004) [2023-03-08 14:01:41,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 42278912. Throughput: 0: 10106.7. Samples: 42250140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:41,893][434108] Avg episode reward: [(0, '4610.245')] [2023-03-08 14:01:44,781][434396] Updated weights for policy 0, policy_version 82640 (0.0004) [2023-03-08 14:01:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 42332160. Throughput: 0: 10203.7. Samples: 42314348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:46,893][434108] Avg episode reward: [(0, '4449.198')] [2023-03-08 14:01:48,545][434396] Updated weights for policy 0, policy_version 82720 (0.0005) [2023-03-08 14:01:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10191.4). Total num frames: 42385408. Throughput: 0: 10346.2. Samples: 42379568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:51,893][434108] Avg episode reward: [(0, '4488.790')] [2023-03-08 14:01:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000082784_42385408.pth... [2023-03-08 14:01:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000082184_42078208.pth [2023-03-08 14:01:52,308][434396] Updated weights for policy 0, policy_version 82800 (0.0004) [2023-03-08 14:01:56,034][434396] Updated weights for policy 0, policy_version 82880 (0.0004) [2023-03-08 14:01:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10308.2, 300 sec: 10205.3). Total num frames: 42442752. 
Throughput: 0: 10390.6. Samples: 42412088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:01:56,893][434108] Avg episode reward: [(0, '4597.919')] [2023-03-08 14:01:59,775][434396] Updated weights for policy 0, policy_version 82960 (0.0005) [2023-03-08 14:02:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10205.3). Total num frames: 42496000. Throughput: 0: 10437.3. Samples: 42478324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:01,893][434108] Avg episode reward: [(0, '4588.932')] [2023-03-08 14:02:03,521][434396] Updated weights for policy 0, policy_version 83040 (0.0005) [2023-03-08 14:02:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10219.2). Total num frames: 42549248. Throughput: 0: 10546.4. Samples: 42543996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:06,893][434108] Avg episode reward: [(0, '4603.269')] [2023-03-08 14:02:06,908][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000083112_42553344.pth... [2023-03-08 14:02:06,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000082472_42225664.pth [2023-03-08 14:02:07,268][434396] Updated weights for policy 0, policy_version 83120 (0.0004) [2023-03-08 14:02:11,117][434396] Updated weights for policy 0, policy_version 83200 (0.0005) [2023-03-08 14:02:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10233.1). Total num frames: 42606592. Throughput: 0: 10626.6. Samples: 42576964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:11,893][434108] Avg episode reward: [(0, '4608.618')] [2023-03-08 14:02:14,810][434396] Updated weights for policy 0, policy_version 83280 (0.0004) [2023-03-08 14:02:16,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10246.9). Total num frames: 42659840. Throughput: 0: 10715.7. Samples: 42641424. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:16,893][434108] Avg episode reward: [(0, '4621.155')] [2023-03-08 14:02:18,712][434396] Updated weights for policy 0, policy_version 83360 (0.0004) [2023-03-08 14:02:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10260.8). Total num frames: 42713088. Throughput: 0: 10851.8. Samples: 42706088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:21,893][434108] Avg episode reward: [(0, '4635.207')] [2023-03-08 14:02:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000083424_42713088.pth... [2023-03-08 14:02:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000082784_42385408.pth [2023-03-08 14:02:22,387][434396] Updated weights for policy 0, policy_version 83440 (0.0004) [2023-03-08 14:02:26,196][434396] Updated weights for policy 0, policy_version 83520 (0.0004) [2023-03-08 14:02:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10274.7). Total num frames: 42766336. Throughput: 0: 10852.6. Samples: 42738508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:26,893][434108] Avg episode reward: [(0, '4575.766')] [2023-03-08 14:02:29,894][434396] Updated weights for policy 0, policy_version 83600 (0.0003) [2023-03-08 14:02:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10316.4). Total num frames: 42823680. Throughput: 0: 10883.8. Samples: 42804120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:31,893][434108] Avg episode reward: [(0, '4640.090')] [2023-03-08 14:02:33,585][434396] Updated weights for policy 0, policy_version 83680 (0.0004) [2023-03-08 14:02:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 10316.4). Total num frames: 42876928. Throughput: 0: 10923.2. Samples: 42871112. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:36,893][434108] Avg episode reward: [(0, '4621.487')] [2023-03-08 14:02:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000083744_42876928.pth... [2023-03-08 14:02:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000083112_42553344.pth [2023-03-08 14:02:37,318][434396] Updated weights for policy 0, policy_version 83760 (0.0004) [2023-03-08 14:02:41,058][434396] Updated weights for policy 0, policy_version 83840 (0.0004) [2023-03-08 14:02:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10358.0). Total num frames: 42934272. Throughput: 0: 10928.4. Samples: 42903868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:41,893][434108] Avg episode reward: [(0, '4643.823')] [2023-03-08 14:02:44,953][434396] Updated weights for policy 0, policy_version 83920 (0.0004) [2023-03-08 14:02:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10922.7, 300 sec: 10371.9). Total num frames: 42987520. Throughput: 0: 10868.6. Samples: 42967412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:46,893][434108] Avg episode reward: [(0, '4636.593')] [2023-03-08 14:02:48,740][434396] Updated weights for policy 0, policy_version 84000 (0.0005) [2023-03-08 14:02:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10371.9). Total num frames: 43036672. Throughput: 0: 10826.5. Samples: 43031188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:51,893][434108] Avg episode reward: [(0, '4615.991')] [2023-03-08 14:02:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000084056_43036672.pth... 
[2023-03-08 14:02:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000083424_42713088.pth [2023-03-08 14:02:52,832][434396] Updated weights for policy 0, policy_version 84080 (0.0005) [2023-03-08 14:02:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10717.9, 300 sec: 10371.9). Total num frames: 43085824. Throughput: 0: 10761.9. Samples: 43061248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:02:56,893][434108] Avg episode reward: [(0, '4562.878')] [2023-03-08 14:02:56,942][434396] Updated weights for policy 0, policy_version 84160 (0.0005) [2023-03-08 14:03:00,709][434396] Updated weights for policy 0, policy_version 84240 (0.0004) [2023-03-08 14:03:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10399.7). Total num frames: 43143168. Throughput: 0: 10696.4. Samples: 43122760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:01,893][434108] Avg episode reward: [(0, '4533.803')] [2023-03-08 14:03:04,687][434396] Updated weights for policy 0, policy_version 84320 (0.0005) [2023-03-08 14:03:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10399.7). Total num frames: 43192320. Throughput: 0: 10644.9. Samples: 43185108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:06,893][434108] Avg episode reward: [(0, '4501.125')] [2023-03-08 14:03:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000084360_43192320.pth... [2023-03-08 14:03:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000083744_42876928.pth [2023-03-08 14:03:08,738][434396] Updated weights for policy 0, policy_version 84400 (0.0004) [2023-03-08 14:03:11,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10649.6, 300 sec: 10413.6). Total num frames: 43245568. Throughput: 0: 10607.5. Samples: 43215848. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:11,893][434108] Avg episode reward: [(0, '4599.541')] [2023-03-08 14:03:12,554][434396] Updated weights for policy 0, policy_version 84480 (0.0004) [2023-03-08 14:03:16,311][434396] Updated weights for policy 0, policy_version 84560 (0.0004) [2023-03-08 14:03:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10427.4). Total num frames: 43298816. Throughput: 0: 10594.8. Samples: 43280884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:16,893][434108] Avg episode reward: [(0, '4557.176')] [2023-03-08 14:03:20,142][434396] Updated weights for policy 0, policy_version 84640 (0.0005) [2023-03-08 14:03:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10441.3). Total num frames: 43352064. Throughput: 0: 10533.4. Samples: 43345116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:21,893][434108] Avg episode reward: [(0, '4531.591')] [2023-03-08 14:03:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000084672_43352064.pth... [2023-03-08 14:03:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000084056_43036672.pth [2023-03-08 14:03:23,856][434396] Updated weights for policy 0, policy_version 84720 (0.0005) [2023-03-08 14:03:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10441.3). Total num frames: 43405312. Throughput: 0: 10543.7. Samples: 43378336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:26,893][434108] Avg episode reward: [(0, '4608.594')] [2023-03-08 14:03:27,936][434396] Updated weights for policy 0, policy_version 84800 (0.0005) [2023-03-08 14:03:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 43454464. Throughput: 0: 10469.2. Samples: 43438528. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:31,893][434108] Avg episode reward: [(0, '4628.339')] [2023-03-08 14:03:31,923][434396] Updated weights for policy 0, policy_version 84880 (0.0005) [2023-03-08 14:03:36,013][434396] Updated weights for policy 0, policy_version 84960 (0.0005) [2023-03-08 14:03:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 43507712. Throughput: 0: 10408.8. Samples: 43499584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:36,893][434108] Avg episode reward: [(0, '4607.340')] [2023-03-08 14:03:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000084976_43507712.pth... [2023-03-08 14:03:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000084360_43192320.pth [2023-03-08 14:03:39,998][434396] Updated weights for policy 0, policy_version 85040 (0.0005) [2023-03-08 14:03:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 43556864. Throughput: 0: 10431.5. Samples: 43530664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:41,893][434108] Avg episode reward: [(0, '4540.215')] [2023-03-08 14:03:43,974][434396] Updated weights for policy 0, policy_version 85120 (0.0005) [2023-03-08 14:03:46,892][434108] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10441.3). Total num frames: 43610112. Throughput: 0: 10432.6. Samples: 43592228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:46,893][434108] Avg episode reward: [(0, '4534.542')] [2023-03-08 14:03:47,810][434396] Updated weights for policy 0, policy_version 85200 (0.0004) [2023-03-08 14:03:51,732][434396] Updated weights for policy 0, policy_version 85280 (0.0004) [2023-03-08 14:03:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 43663360. Throughput: 0: 10460.8. 
Samples: 43655844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:51,893][434108] Avg episode reward: [(0, '4494.936')] [2023-03-08 14:03:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000085280_43663360.pth... [2023-03-08 14:03:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000084672_43352064.pth [2023-03-08 14:03:55,773][434396] Updated weights for policy 0, policy_version 85360 (0.0005) [2023-03-08 14:03:56,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 43716608. Throughput: 0: 10459.8. Samples: 43686540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:03:56,893][434108] Avg episode reward: [(0, '4280.525')] [2023-03-08 14:03:59,708][434396] Updated weights for policy 0, policy_version 85440 (0.0005) [2023-03-08 14:04:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 43765760. Throughput: 0: 10396.5. Samples: 43748728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:01,893][434108] Avg episode reward: [(0, '4479.341')] [2023-03-08 14:04:03,878][434396] Updated weights for policy 0, policy_version 85520 (0.0005) [2023-03-08 14:04:06,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 43814912. Throughput: 0: 10286.8. Samples: 43808024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:06,893][434108] Avg episode reward: [(0, '4403.545')] [2023-03-08 14:04:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000085576_43814912.pth... 
[2023-03-08 14:04:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000084976_43507712.pth [2023-03-08 14:04:07,960][434396] Updated weights for policy 0, policy_version 85600 (0.0005) [2023-03-08 14:04:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10413.6). Total num frames: 43864064. Throughput: 0: 10213.2. Samples: 43837932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:11,904][434108] Avg episode reward: [(0, '4226.456')] [2023-03-08 14:04:12,015][434396] Updated weights for policy 0, policy_version 85680 (0.0005) [2023-03-08 14:04:16,200][434396] Updated weights for policy 0, policy_version 85760 (0.0005) [2023-03-08 14:04:16,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10399.7). Total num frames: 43913216. Throughput: 0: 10213.4. Samples: 43898132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:16,893][434108] Avg episode reward: [(0, '4329.705')] [2023-03-08 14:04:20,340][434396] Updated weights for policy 0, policy_version 85840 (0.0005) [2023-03-08 14:04:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10385.8). Total num frames: 43962368. Throughput: 0: 10179.1. Samples: 43957644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:21,893][434108] Avg episode reward: [(0, '4484.738')] [2023-03-08 14:04:21,904][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000085872_43966464.pth... [2023-03-08 14:04:21,906][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000085280_43663360.pth [2023-03-08 14:04:24,352][434396] Updated weights for policy 0, policy_version 85920 (0.0005) [2023-03-08 14:04:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10399.7). Total num frames: 44015616. Throughput: 0: 10147.0. Samples: 43987280. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:26,893][434108] Avg episode reward: [(0, '4553.101')] [2023-03-08 14:04:28,281][434396] Updated weights for policy 0, policy_version 86000 (0.0005) [2023-03-08 14:04:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10385.8). Total num frames: 44064768. Throughput: 0: 10149.9. Samples: 44048976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:31,893][434108] Avg episode reward: [(0, '4533.922')] [2023-03-08 14:04:32,325][434396] Updated weights for policy 0, policy_version 86080 (0.0005) [2023-03-08 14:04:36,184][434396] Updated weights for policy 0, policy_version 86160 (0.0005) [2023-03-08 14:04:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10385.8). Total num frames: 44118016. Throughput: 0: 10136.8. Samples: 44112000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:36,893][434108] Avg episode reward: [(0, '4542.322')] [2023-03-08 14:04:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000086168_44118016.pth... [2023-03-08 14:04:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000085576_43814912.pth [2023-03-08 14:04:40,013][434396] Updated weights for policy 0, policy_version 86240 (0.0004) [2023-03-08 14:04:41,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10308.3, 300 sec: 10413.6). Total num frames: 44175360. Throughput: 0: 10165.1. Samples: 44143968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:41,893][434108] Avg episode reward: [(0, '4539.399')] [2023-03-08 14:04:43,882][434396] Updated weights for policy 0, policy_version 86320 (0.0005) [2023-03-08 14:04:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10385.8). Total num frames: 44224512. Throughput: 0: 10199.9. Samples: 44207724. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:46,893][434108] Avg episode reward: [(0, '4507.366')] [2023-03-08 14:04:47,910][434396] Updated weights for policy 0, policy_version 86400 (0.0005) [2023-03-08 14:04:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10371.9). Total num frames: 44273664. Throughput: 0: 10218.4. Samples: 44267852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:51,893][434108] Avg episode reward: [(0, '4418.051')] [2023-03-08 14:04:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000086472_44273664.pth... [2023-03-08 14:04:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000085872_43966464.pth [2023-03-08 14:04:51,997][434396] Updated weights for policy 0, policy_version 86480 (0.0005) [2023-03-08 14:04:56,082][434396] Updated weights for policy 0, policy_version 86560 (0.0005) [2023-03-08 14:04:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10358.0). Total num frames: 44322816. Throughput: 0: 10229.1. Samples: 44298240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:04:56,893][434108] Avg episode reward: [(0, '4233.089')] [2023-03-08 14:05:00,159][434396] Updated weights for policy 0, policy_version 86640 (0.0005) [2023-03-08 14:05:01,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10371.9). Total num frames: 44376064. Throughput: 0: 10232.1. Samples: 44358576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:01,893][434108] Avg episode reward: [(0, '4343.518')] [2023-03-08 14:05:04,028][434396] Updated weights for policy 0, policy_version 86720 (0.0004) [2023-03-08 14:05:06,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10240.0, 300 sec: 10371.9). Total num frames: 44429312. Throughput: 0: 10300.9. Samples: 44421184. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:06,893][434108] Avg episode reward: [(0, '4482.394')] [2023-03-08 14:05:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000086776_44429312.pth... [2023-03-08 14:05:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000086168_44118016.pth [2023-03-08 14:05:07,977][434396] Updated weights for policy 0, policy_version 86800 (0.0005) [2023-03-08 14:05:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10371.9). Total num frames: 44478464. Throughput: 0: 10326.2. Samples: 44451960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:11,903][434108] Avg episode reward: [(0, '4337.959')] [2023-03-08 14:05:11,987][434396] Updated weights for policy 0, policy_version 86880 (0.0005) [2023-03-08 14:05:15,950][434396] Updated weights for policy 0, policy_version 86960 (0.0005) [2023-03-08 14:05:16,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10371.9). Total num frames: 44531712. Throughput: 0: 10338.0. Samples: 44514188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:16,903][434108] Avg episode reward: [(0, '4391.145')] [2023-03-08 14:05:19,750][434396] Updated weights for policy 0, policy_version 87040 (0.0005) [2023-03-08 14:05:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10385.8). Total num frames: 44584960. Throughput: 0: 10348.2. Samples: 44577668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:21,893][434108] Avg episode reward: [(0, '4223.634')] [2023-03-08 14:05:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000087080_44584960.pth... 
[2023-03-08 14:05:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000086472_44273664.pth [2023-03-08 14:05:23,744][434396] Updated weights for policy 0, policy_version 87120 (0.0005) [2023-03-08 14:05:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 44634112. Throughput: 0: 10321.1. Samples: 44608416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:26,893][434108] Avg episode reward: [(0, '4358.565')] [2023-03-08 14:05:27,837][434396] Updated weights for policy 0, policy_version 87200 (0.0005) [2023-03-08 14:05:31,663][434396] Updated weights for policy 0, policy_version 87280 (0.0004) [2023-03-08 14:05:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 44687360. Throughput: 0: 10255.5. Samples: 44669224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:31,893][434108] Avg episode reward: [(0, '4507.412')] [2023-03-08 14:05:35,382][434396] Updated weights for policy 0, policy_version 87360 (0.0004) [2023-03-08 14:05:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 44740608. Throughput: 0: 10397.4. Samples: 44735736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:36,893][434108] Avg episode reward: [(0, '4366.710')] [2023-03-08 14:05:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000087392_44744704.pth... [2023-03-08 14:05:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000086776_44429312.pth [2023-03-08 14:05:39,164][434396] Updated weights for policy 0, policy_version 87440 (0.0005) [2023-03-08 14:05:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 44797952. Throughput: 0: 10438.6. Samples: 44767976. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:41,893][434108] Avg episode reward: [(0, '4145.890')] [2023-03-08 14:05:43,011][434396] Updated weights for policy 0, policy_version 87520 (0.0004) [2023-03-08 14:05:46,709][434396] Updated weights for policy 0, policy_version 87600 (0.0004) [2023-03-08 14:05:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 44851200. Throughput: 0: 10524.9. Samples: 44832196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:46,893][434108] Avg episode reward: [(0, '4294.706')] [2023-03-08 14:05:50,681][434396] Updated weights for policy 0, policy_version 87680 (0.0005) [2023-03-08 14:05:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10441.3). Total num frames: 44904448. Throughput: 0: 10551.9. Samples: 44896020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:51,893][434108] Avg episode reward: [(0, '4377.081')] [2023-03-08 14:05:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000087704_44904448.pth... [2023-03-08 14:05:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000087080_44584960.pth [2023-03-08 14:05:54,575][434396] Updated weights for policy 0, policy_version 87760 (0.0005) [2023-03-08 14:05:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10455.2). Total num frames: 44953600. Throughput: 0: 10575.5. Samples: 44927856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:05:56,893][434108] Avg episode reward: [(0, '4232.567')] [2023-03-08 14:05:58,733][434396] Updated weights for policy 0, policy_version 87840 (0.0005) [2023-03-08 14:06:01,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 45002752. Throughput: 0: 10509.6. Samples: 44987120. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:01,893][434108] Avg episode reward: [(0, '4487.165')] [2023-03-08 14:06:02,824][434396] Updated weights for policy 0, policy_version 87920 (0.0005) [2023-03-08 14:06:06,893][434108] Fps is (10 sec: 9830.3, 60 sec: 10376.5, 300 sec: 10441.3). Total num frames: 45051904. Throughput: 0: 10417.6. Samples: 45046460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:06,893][434108] Avg episode reward: [(0, '4583.157')] [2023-03-08 14:06:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000087992_45051904.pth... [2023-03-08 14:06:06,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000087392_44744704.pth [2023-03-08 14:06:06,976][434396] Updated weights for policy 0, policy_version 88000 (0.0005) [2023-03-08 14:06:11,092][434396] Updated weights for policy 0, policy_version 88080 (0.0005) [2023-03-08 14:06:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 45101056. Throughput: 0: 10398.0. Samples: 45076328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:11,893][434108] Avg episode reward: [(0, '4585.448')] [2023-03-08 14:06:15,161][434396] Updated weights for policy 0, policy_version 88160 (0.0005) [2023-03-08 14:06:16,892][434108] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10441.3). Total num frames: 45154304. Throughput: 0: 10374.4. Samples: 45136072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:16,893][434108] Avg episode reward: [(0, '4637.035')] [2023-03-08 14:06:19,131][434396] Updated weights for policy 0, policy_version 88240 (0.0004) [2023-03-08 14:06:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 45203456. Throughput: 0: 10271.3. Samples: 45197944. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:21,893][434108] Avg episode reward: [(0, '4558.470')] [2023-03-08 14:06:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000088288_45203456.pth... [2023-03-08 14:06:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000087704_44904448.pth [2023-03-08 14:06:23,246][434396] Updated weights for policy 0, policy_version 88320 (0.0005) [2023-03-08 14:06:26,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 45252608. Throughput: 0: 10223.5. Samples: 45228032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:26,893][434108] Avg episode reward: [(0, '4607.982')] [2023-03-08 14:06:27,324][434396] Updated weights for policy 0, policy_version 88400 (0.0005) [2023-03-08 14:06:31,375][434396] Updated weights for policy 0, policy_version 88480 (0.0005) [2023-03-08 14:06:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10441.3). Total num frames: 45305856. Throughput: 0: 10123.1. Samples: 45287736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:31,893][434108] Avg episode reward: [(0, '4546.765')] [2023-03-08 14:06:35,287][434396] Updated weights for policy 0, policy_version 88560 (0.0004) [2023-03-08 14:06:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10441.3). Total num frames: 45359104. Throughput: 0: 10107.6. Samples: 45350864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:36,893][434108] Avg episode reward: [(0, '4336.996')] [2023-03-08 14:06:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000088592_45359104.pth... 
[2023-03-08 14:06:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000087992_45051904.pth [2023-03-08 14:06:39,165][434396] Updated weights for policy 0, policy_version 88640 (0.0004) [2023-03-08 14:06:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10427.4). Total num frames: 45408256. Throughput: 0: 10094.6. Samples: 45382112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:41,893][434108] Avg episode reward: [(0, '4219.643')] [2023-03-08 14:06:43,229][434396] Updated weights for policy 0, policy_version 88720 (0.0005) [2023-03-08 14:06:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10427.4). Total num frames: 45461504. Throughput: 0: 10131.2. Samples: 45443024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:46,893][434108] Avg episode reward: [(0, '4429.190')] [2023-03-08 14:06:47,275][434396] Updated weights for policy 0, policy_version 88800 (0.0005) [2023-03-08 14:06:51,169][434396] Updated weights for policy 0, policy_version 88880 (0.0004) [2023-03-08 14:06:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10399.7). Total num frames: 45510656. Throughput: 0: 10183.7. Samples: 45504724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:06:51,893][434108] Avg episode reward: [(0, '4256.366')] [2023-03-08 14:06:51,917][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000088896_45514752.pth... [2023-03-08 14:06:51,918][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000088288_45203456.pth [2023-03-08 14:06:55,211][434396] Updated weights for policy 0, policy_version 88960 (0.0005) [2023-03-08 14:06:56,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10399.7). Total num frames: 45563904. Throughput: 0: 10202.3. Samples: 45535432. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:06:56,893][434108] Avg episode reward: [(0, '4134.202')] [2023-03-08 14:06:59,414][434396] Updated weights for policy 0, policy_version 89040 (0.0005) [2023-03-08 14:07:01,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10371.9). Total num frames: 45608960. Throughput: 0: 10173.3. Samples: 45593872. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:01,893][434108] Avg episode reward: [(0, '4152.647')] [2023-03-08 14:07:03,655][434396] Updated weights for policy 0, policy_version 89120 (0.0005) [2023-03-08 14:07:06,893][434108] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10344.1). Total num frames: 45658112. Throughput: 0: 10103.9. Samples: 45652620. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:06,893][434108] Avg episode reward: [(0, '3878.288')] [2023-03-08 14:07:06,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000089184_45662208.pth... [2023-03-08 14:07:06,908][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000088592_45359104.pth [2023-03-08 14:07:07,818][434396] Updated weights for policy 0, policy_version 89200 (0.0005) [2023-03-08 14:07:11,755][434396] Updated weights for policy 0, policy_version 89280 (0.0005) [2023-03-08 14:07:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 45711360. Throughput: 0: 10104.9. Samples: 45682752. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:11,893][434108] Avg episode reward: [(0, '3508.606')] [2023-03-08 14:07:15,561][434396] Updated weights for policy 0, policy_version 89360 (0.0004) [2023-03-08 14:07:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10344.1). Total num frames: 45764608. Throughput: 0: 10194.6. Samples: 45746492. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:16,893][434108] Avg episode reward: [(0, '3770.374')] [2023-03-08 14:07:19,420][434396] Updated weights for policy 0, policy_version 89440 (0.0005) [2023-03-08 14:07:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10344.1). Total num frames: 45817856. Throughput: 0: 10197.8. Samples: 45809764. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:21,893][434108] Avg episode reward: [(0, '3618.140')] [2023-03-08 14:07:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000089488_45817856.pth... [2023-03-08 14:07:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000088896_45514752.pth [2023-03-08 14:07:23,507][434396] Updated weights for policy 0, policy_version 89520 (0.0004) [2023-03-08 14:07:26,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10316.4). Total num frames: 45867008. Throughput: 0: 10149.6. Samples: 45838844. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:26,893][434108] Avg episode reward: [(0, '3878.655')] [2023-03-08 14:07:27,583][434396] Updated weights for policy 0, policy_version 89600 (0.0004) [2023-03-08 14:07:31,420][434396] Updated weights for policy 0, policy_version 89680 (0.0004) [2023-03-08 14:07:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10316.4). Total num frames: 45920256. Throughput: 0: 10170.7. Samples: 45900704. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:31,893][434108] Avg episode reward: [(0, '3619.364')] [2023-03-08 14:07:35,237][434396] Updated weights for policy 0, policy_version 89760 (0.0005) [2023-03-08 14:07:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10302.5). Total num frames: 45973504. Throughput: 0: 10235.6. Samples: 45965324. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:36,893][434108] Avg episode reward: [(0, '3996.968')] [2023-03-08 14:07:36,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000089792_45973504.pth... [2023-03-08 14:07:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000089184_45662208.pth [2023-03-08 14:07:39,121][434396] Updated weights for policy 0, policy_version 89840 (0.0005) [2023-03-08 14:07:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 46026752. Throughput: 0: 10260.8. Samples: 45997168. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:41,893][434108] Avg episode reward: [(0, '3653.722')] [2023-03-08 14:07:42,927][434396] Updated weights for policy 0, policy_version 89920 (0.0005) [2023-03-08 14:07:46,729][434396] Updated weights for policy 0, policy_version 90000 (0.0005) [2023-03-08 14:07:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10316.4). Total num frames: 46080000. Throughput: 0: 10377.4. Samples: 46060856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:07:46,893][434108] Avg episode reward: [(0, '3267.090')] [2023-03-08 14:07:50,410][434396] Updated weights for policy 0, policy_version 90080 (0.0004) [2023-03-08 14:07:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10330.2). Total num frames: 46133248. Throughput: 0: 10556.7. Samples: 46127672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:07:51,893][434108] Avg episode reward: [(0, '3921.930')] [2023-03-08 14:07:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000090104_46133248.pth... 
[2023-03-08 14:07:51,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000089488_45817856.pth [2023-03-08 14:07:54,447][434396] Updated weights for policy 0, policy_version 90160 (0.0005) [2023-03-08 14:07:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10316.4). Total num frames: 46186496. Throughput: 0: 10557.2. Samples: 46157824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:07:56,893][434108] Avg episode reward: [(0, '3650.247')] [2023-03-08 14:07:58,481][434396] Updated weights for policy 0, policy_version 90240 (0.0005) [2023-03-08 14:08:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10316.4). Total num frames: 46235648. Throughput: 0: 10496.1. Samples: 46218816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:08:01,893][434108] Avg episode reward: [(0, '3630.582')] [2023-03-08 14:08:02,511][434396] Updated weights for policy 0, policy_version 90320 (0.0005) [2023-03-08 14:08:06,330][434396] Updated weights for policy 0, policy_version 90400 (0.0005) [2023-03-08 14:08:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10316.4). Total num frames: 46288896. Throughput: 0: 10476.9. Samples: 46281224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:08:06,893][434108] Avg episode reward: [(0, '4117.160')] [2023-03-08 14:08:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000090408_46288896.pth... [2023-03-08 14:08:06,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000089792_45973504.pth [2023-03-08 14:08:10,129][434396] Updated weights for policy 0, policy_version 90480 (0.0003) [2023-03-08 14:08:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10316.4). Total num frames: 46342144. Throughput: 0: 10548.7. Samples: 46313536. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:08:11,893][434108] Avg episode reward: [(0, '4216.917')] [2023-03-08 14:08:13,895][434396] Updated weights for policy 0, policy_version 90560 (0.0004) [2023-03-08 14:08:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10316.4). Total num frames: 46395392. Throughput: 0: 10606.9. Samples: 46378012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:08:16,893][434108] Avg episode reward: [(0, '4235.827')] [2023-03-08 14:08:17,993][434396] Updated weights for policy 0, policy_version 90640 (0.0005) [2023-03-08 14:08:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 46444544. Throughput: 0: 10533.5. Samples: 46439332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:08:21,893][434108] Avg episode reward: [(0, '4192.044')] [2023-03-08 14:08:21,923][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000090720_46448640.pth... [2023-03-08 14:08:21,923][434396] Updated weights for policy 0, policy_version 90720 (0.0005) [2023-03-08 14:08:21,924][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000090104_46133248.pth [2023-03-08 14:08:26,106][434396] Updated weights for policy 0, policy_version 90800 (0.0005) [2023-03-08 14:08:26,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10316.4). Total num frames: 46497792. Throughput: 0: 10489.2. Samples: 46469184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:08:26,893][434108] Avg episode reward: [(0, '4141.210')] [2023-03-08 14:08:30,206][434396] Updated weights for policy 0, policy_version 90880 (0.0005) [2023-03-08 14:08:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 46546944. Throughput: 0: 10391.3. Samples: 46528464. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:08:31,893][434108] Avg episode reward: [(0, '3663.758')] [2023-03-08 14:08:34,174][434396] Updated weights for policy 0, policy_version 90960 (0.0004) [2023-03-08 14:08:36,893][434108] Fps is (10 sec: 9830.3, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 46596096. Throughput: 0: 10307.5. Samples: 46591512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:08:36,893][434108] Avg episode reward: [(0, '3850.932')] [2023-03-08 14:08:36,947][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000091016_46600192.pth... [2023-03-08 14:08:36,949][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000090408_46288896.pth [2023-03-08 14:08:38,183][434396] Updated weights for policy 0, policy_version 91040 (0.0004) [2023-03-08 14:08:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 46649344. Throughput: 0: 10298.7. Samples: 46621264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:08:41,903][434108] Avg episode reward: [(0, '3913.150')] [2023-03-08 14:08:41,983][434396] Updated weights for policy 0, policy_version 91120 (0.0004) [2023-03-08 14:08:45,801][434396] Updated weights for policy 0, policy_version 91200 (0.0004) [2023-03-08 14:08:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 46702592. Throughput: 0: 10382.5. Samples: 46686028. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:08:46,903][434108] Avg episode reward: [(0, '3767.907')] [2023-03-08 14:08:49,764][434396] Updated weights for policy 0, policy_version 91280 (0.0005) [2023-03-08 14:08:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10302.5). Total num frames: 46755840. Throughput: 0: 10375.3. Samples: 46748112. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:08:51,903][434108] Avg episode reward: [(0, '3901.440')] [2023-03-08 14:08:51,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000091320_46755840.pth... [2023-03-08 14:08:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000090720_46448640.pth [2023-03-08 14:08:53,666][434396] Updated weights for policy 0, policy_version 91360 (0.0005) [2023-03-08 14:08:56,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10316.4). Total num frames: 46809088. Throughput: 0: 10371.6. Samples: 46780260. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:08:56,903][434108] Avg episode reward: [(0, '4077.703')] [2023-03-08 14:08:57,540][434396] Updated weights for policy 0, policy_version 91440 (0.0005) [2023-03-08 14:09:01,463][434396] Updated weights for policy 0, policy_version 91520 (0.0004) [2023-03-08 14:09:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10330.3). Total num frames: 46862336. Throughput: 0: 10315.8. Samples: 46842224. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:09:01,893][434108] Avg episode reward: [(0, '4064.336')] [2023-03-08 14:09:05,324][434396] Updated weights for policy 0, policy_version 91600 (0.0005) [2023-03-08 14:09:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10344.1). Total num frames: 46915584. Throughput: 0: 10383.6. Samples: 46906596. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:09:06,893][434108] Avg episode reward: [(0, '4436.873')] [2023-03-08 14:09:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000091632_46915584.pth... 
[2023-03-08 14:09:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000091016_46600192.pth [2023-03-08 14:09:09,132][434396] Updated weights for policy 0, policy_version 91680 (0.0004) [2023-03-08 14:09:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 46964736. Throughput: 0: 10438.9. Samples: 46938936. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:09:11,893][434108] Avg episode reward: [(0, '4309.383')] [2023-03-08 14:09:13,091][434396] Updated weights for policy 0, policy_version 91760 (0.0004) [2023-03-08 14:09:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 47017984. Throughput: 0: 10513.3. Samples: 47001564. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:09:16,893][434108] Avg episode reward: [(0, '4356.447')] [2023-03-08 14:09:16,960][434396] Updated weights for policy 0, policy_version 91840 (0.0004) [2023-03-08 14:09:20,847][434396] Updated weights for policy 0, policy_version 91920 (0.0004) [2023-03-08 14:09:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 47071232. Throughput: 0: 10513.2. Samples: 47064604. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:09:21,893][434108] Avg episode reward: [(0, '4396.122')] [2023-03-08 14:09:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000091936_47071232.pth... [2023-03-08 14:09:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000091320_46755840.pth [2023-03-08 14:09:24,632][434396] Updated weights for policy 0, policy_version 92000 (0.0005) [2023-03-08 14:09:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 47124480. Throughput: 0: 10575.1. Samples: 47097144. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:09:26,893][434108] Avg episode reward: [(0, '4432.346')] [2023-03-08 14:09:28,567][434396] Updated weights for policy 0, policy_version 92080 (0.0005) [2023-03-08 14:09:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10371.9). Total num frames: 47177728. Throughput: 0: 10522.8. Samples: 47159552. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:09:31,893][434108] Avg episode reward: [(0, '4462.224')] [2023-03-08 14:09:32,493][434396] Updated weights for policy 0, policy_version 92160 (0.0005) [2023-03-08 14:09:36,367][434396] Updated weights for policy 0, policy_version 92240 (0.0004) [2023-03-08 14:09:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10358.0). Total num frames: 47230976. Throughput: 0: 10562.5. Samples: 47223424. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:09:36,893][434108] Avg episode reward: [(0, '4324.403')] [2023-03-08 14:09:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000092248_47230976.pth... [2023-03-08 14:09:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000091632_46915584.pth [2023-03-08 14:09:40,478][434396] Updated weights for policy 0, policy_version 92320 (0.0005) [2023-03-08 14:09:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.0, 300 sec: 10358.0). Total num frames: 47280128. Throughput: 0: 10515.9. Samples: 47253476. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:09:41,893][434108] Avg episode reward: [(0, '4348.192')] [2023-03-08 14:09:44,629][434396] Updated weights for policy 0, policy_version 92400 (0.0005) [2023-03-08 14:09:46,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 47329280. Throughput: 0: 10459.4. Samples: 47312896. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:09:46,893][434108] Avg episode reward: [(0, '4056.107')] [2023-03-08 14:09:48,747][434396] Updated weights for policy 0, policy_version 92480 (0.0005) [2023-03-08 14:09:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 47378432. Throughput: 0: 10354.9. Samples: 47372568. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:09:51,893][434108] Avg episode reward: [(0, '4026.704')] [2023-03-08 14:09:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000092536_47378432.pth... [2023-03-08 14:09:51,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000091936_47071232.pth [2023-03-08 14:09:52,786][434396] Updated weights for policy 0, policy_version 92560 (0.0005) [2023-03-08 14:09:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10344.1). Total num frames: 47427584. Throughput: 0: 10314.1. Samples: 47403072. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:09:56,893][434108] Avg episode reward: [(0, '3900.859')] [2023-03-08 14:09:56,908][434396] Updated weights for policy 0, policy_version 92640 (0.0005) [2023-03-08 14:10:00,931][434396] Updated weights for policy 0, policy_version 92720 (0.0005) [2023-03-08 14:10:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10344.1). Total num frames: 47480832. Throughput: 0: 10267.2. Samples: 47463588. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:10:01,893][434108] Avg episode reward: [(0, '3868.400')] [2023-03-08 14:10:04,986][434396] Updated weights for policy 0, policy_version 92800 (0.0005) [2023-03-08 14:10:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10344.1). Total num frames: 47529984. Throughput: 0: 10209.1. Samples: 47524012. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:10:06,893][434108] Avg episode reward: [(0, '4301.764')] [2023-03-08 14:10:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000092832_47529984.pth... [2023-03-08 14:10:06,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000092248_47230976.pth [2023-03-08 14:10:09,040][434396] Updated weights for policy 0, policy_version 92880 (0.0004) [2023-03-08 14:10:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10344.1). Total num frames: 47583232. Throughput: 0: 10165.0. Samples: 47554572. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:10:11,893][434108] Avg episode reward: [(0, '4353.399')] [2023-03-08 14:10:13,092][434396] Updated weights for policy 0, policy_version 92960 (0.0005) [2023-03-08 14:10:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10330.3). Total num frames: 47632384. Throughput: 0: 10135.6. Samples: 47615656. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:10:16,893][434108] Avg episode reward: [(0, '4192.734')] [2023-03-08 14:10:17,048][434396] Updated weights for policy 0, policy_version 93040 (0.0005) [2023-03-08 14:10:20,888][434396] Updated weights for policy 0, policy_version 93120 (0.0005) [2023-03-08 14:10:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10344.1). Total num frames: 47685632. Throughput: 0: 10117.5. Samples: 47678712. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:10:21,893][434108] Avg episode reward: [(0, '4159.034')] [2023-03-08 14:10:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000093136_47685632.pth... 
[2023-03-08 14:10:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000092536_47378432.pth [2023-03-08 14:10:24,667][434396] Updated weights for policy 0, policy_version 93200 (0.0004) [2023-03-08 14:10:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10344.1). Total num frames: 47738880. Throughput: 0: 10174.0. Samples: 47711308. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:10:26,893][434108] Avg episode reward: [(0, '3980.718')] [2023-03-08 14:10:28,433][434396] Updated weights for policy 0, policy_version 93280 (0.0004) [2023-03-08 14:10:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 47796224. Throughput: 0: 10292.0. Samples: 47776036. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:10:31,893][434108] Avg episode reward: [(0, '4005.988')] [2023-03-08 14:10:32,243][434396] Updated weights for policy 0, policy_version 93360 (0.0005) [2023-03-08 14:10:35,962][434396] Updated weights for policy 0, policy_version 93440 (0.0004) [2023-03-08 14:10:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10308.3, 300 sec: 10344.1). Total num frames: 47849472. Throughput: 0: 10421.6. Samples: 47841540. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:10:36,893][434108] Avg episode reward: [(0, '4192.076')] [2023-03-08 14:10:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000093456_47849472.pth... [2023-03-08 14:10:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000092832_47529984.pth [2023-03-08 14:10:39,878][434396] Updated weights for policy 0, policy_version 93520 (0.0005) [2023-03-08 14:10:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 47902720. Throughput: 0: 10465.1. Samples: 47874000. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:10:41,893][434108] Avg episode reward: [(0, '4203.745')] [2023-03-08 14:10:43,787][434396] Updated weights for policy 0, policy_version 93600 (0.0005) [2023-03-08 14:10:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10330.3). Total num frames: 47951872. Throughput: 0: 10488.3. Samples: 47935560. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:10:46,893][434108] Avg episode reward: [(0, '4279.547')] [2023-03-08 14:10:47,662][434396] Updated weights for policy 0, policy_version 93680 (0.0004) [2023-03-08 14:10:51,385][434396] Updated weights for policy 0, policy_version 93760 (0.0004) [2023-03-08 14:10:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10358.0). Total num frames: 48009216. Throughput: 0: 10601.7. Samples: 48001088. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:10:51,893][434108] Avg episode reward: [(0, '4150.384')] [2023-03-08 14:10:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000093768_48009216.pth... [2023-03-08 14:10:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000093136_47685632.pth [2023-03-08 14:10:55,025][434396] Updated weights for policy 0, policy_version 93840 (0.0004) [2023-03-08 14:10:56,892][434108] Fps is (10 sec: 11468.9, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 48066560. Throughput: 0: 10667.0. Samples: 48034588. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:10:56,893][434108] Avg episode reward: [(0, '4292.766')] [2023-03-08 14:10:58,839][434396] Updated weights for policy 0, policy_version 93920 (0.0004) [2023-03-08 14:11:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10385.8). Total num frames: 48115712. Throughput: 0: 10748.5. Samples: 48099336. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:11:01,893][434108] Avg episode reward: [(0, '4378.792')] [2023-03-08 14:11:02,817][434396] Updated weights for policy 0, policy_version 94000 (0.0005) [2023-03-08 14:11:06,830][434396] Updated weights for policy 0, policy_version 94080 (0.0005) [2023-03-08 14:11:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10399.7). Total num frames: 48168960. Throughput: 0: 10712.4. Samples: 48160768. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:11:06,893][434108] Avg episode reward: [(0, '4507.679')] [2023-03-08 14:11:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000094080_48168960.pth... [2023-03-08 14:11:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000093456_47849472.pth [2023-03-08 14:11:10,752][434396] Updated weights for policy 0, policy_version 94160 (0.0005) [2023-03-08 14:11:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10385.8). Total num frames: 48218112. Throughput: 0: 10674.9. Samples: 48191680. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:11:11,893][434108] Avg episode reward: [(0, '4290.434')] [2023-03-08 14:11:14,739][434396] Updated weights for policy 0, policy_version 94240 (0.0005) [2023-03-08 14:11:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10399.7). Total num frames: 48271360. Throughput: 0: 10620.1. Samples: 48253940. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:11:16,893][434108] Avg episode reward: [(0, '4475.358')] [2023-03-08 14:11:18,677][434396] Updated weights for policy 0, policy_version 94320 (0.0004) [2023-03-08 14:11:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10399.7). Total num frames: 48320512. Throughput: 0: 10538.6. Samples: 48315776. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:11:21,893][434108] Avg episode reward: [(0, '4346.462')] [2023-03-08 14:11:21,926][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000094384_48324608.pth... [2023-03-08 14:11:21,928][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000093768_48009216.pth [2023-03-08 14:11:22,754][434396] Updated weights for policy 0, policy_version 94400 (0.0005) [2023-03-08 14:11:26,843][434396] Updated weights for policy 0, policy_version 94480 (0.0005) [2023-03-08 14:11:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10399.7). Total num frames: 48373760. Throughput: 0: 10470.0. Samples: 48345152. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:11:26,893][434108] Avg episode reward: [(0, '4436.738')] [2023-03-08 14:11:30,847][434396] Updated weights for policy 0, policy_version 94560 (0.0005) [2023-03-08 14:11:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10385.8). Total num frames: 48422912. Throughput: 0: 10460.4. Samples: 48406276. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:11:31,893][434108] Avg episode reward: [(0, '4443.397')] [2023-03-08 14:11:34,931][434396] Updated weights for policy 0, policy_version 94640 (0.0005) [2023-03-08 14:11:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10399.7). Total num frames: 48476160. Throughput: 0: 10370.0. Samples: 48467736. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:11:36,893][434108] Avg episode reward: [(0, '4535.649')] [2023-03-08 14:11:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000094680_48476160.pth... 
[2023-03-08 14:11:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000094080_48168960.pth [2023-03-08 14:11:38,831][434396] Updated weights for policy 0, policy_version 94720 (0.0005) [2023-03-08 14:11:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10385.8). Total num frames: 48525312. Throughput: 0: 10314.5. Samples: 48498740. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:11:41,893][434108] Avg episode reward: [(0, '4472.747')] [2023-03-08 14:11:42,856][434396] Updated weights for policy 0, policy_version 94800 (0.0005) [2023-03-08 14:11:46,739][434396] Updated weights for policy 0, policy_version 94880 (0.0005) [2023-03-08 14:11:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10399.7). Total num frames: 48578560. Throughput: 0: 10251.2. Samples: 48560640. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:11:46,893][434108] Avg episode reward: [(0, '4539.031')] [2023-03-08 14:11:50,659][434396] Updated weights for policy 0, policy_version 94960 (0.0004) [2023-03-08 14:11:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 48631808. Throughput: 0: 10285.5. Samples: 48623616. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:11:51,893][434108] Avg episode reward: [(0, '4480.405')] [2023-03-08 14:11:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000094984_48631808.pth... [2023-03-08 14:11:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000094384_48324608.pth [2023-03-08 14:11:54,550][434396] Updated weights for policy 0, policy_version 95040 (0.0005) [2023-03-08 14:11:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10413.6). Total num frames: 48680960. Throughput: 0: 10310.1. Samples: 48655632. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:11:56,903][434108] Avg episode reward: [(0, '4547.358')] [2023-03-08 14:11:58,508][434396] Updated weights for policy 0, policy_version 95120 (0.0005) [2023-03-08 14:12:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 48734208. Throughput: 0: 10308.8. Samples: 48717836. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:12:01,903][434108] Avg episode reward: [(0, '4411.430')] [2023-03-08 14:12:02,387][434396] Updated weights for policy 0, policy_version 95200 (0.0005) [2023-03-08 14:12:06,258][434396] Updated weights for policy 0, policy_version 95280 (0.0005) [2023-03-08 14:12:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 48787456. Throughput: 0: 10327.3. Samples: 48780504. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:12:06,903][434108] Avg episode reward: [(0, '4468.364')] [2023-03-08 14:12:06,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000095288_48787456.pth... [2023-03-08 14:12:06,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000094680_48476160.pth [2023-03-08 14:12:10,189][434396] Updated weights for policy 0, policy_version 95360 (0.0005) [2023-03-08 14:12:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 48840704. Throughput: 0: 10375.4. Samples: 48812044. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:12:11,903][434108] Avg episode reward: [(0, '4408.909')] [2023-03-08 14:12:14,074][434396] Updated weights for policy 0, policy_version 95440 (0.0005) [2023-03-08 14:12:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 48893952. Throughput: 0: 10420.4. Samples: 48875196. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:12:16,903][434108] Avg episode reward: [(0, '4511.971')] [2023-03-08 14:12:17,939][434396] Updated weights for policy 0, policy_version 95520 (0.0005) [2023-03-08 14:12:21,655][434396] Updated weights for policy 0, policy_version 95600 (0.0004) [2023-03-08 14:12:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 48947200. Throughput: 0: 10501.7. Samples: 48940312. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:12:21,903][434108] Avg episode reward: [(0, '4579.026')] [2023-03-08 14:12:21,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000095600_48947200.pth... [2023-03-08 14:12:21,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000094984_48631808.pth [2023-03-08 14:12:25,444][434396] Updated weights for policy 0, policy_version 95680 (0.0005) [2023-03-08 14:12:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 49000448. Throughput: 0: 10513.3. Samples: 48971840. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:12:26,903][434108] Avg episode reward: [(0, '4626.362')] [2023-03-08 14:12:29,095][434396] Updated weights for policy 0, policy_version 95760 (0.0004) [2023-03-08 14:12:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10581.3, 300 sec: 10455.2). Total num frames: 49057792. Throughput: 0: 10646.0. Samples: 49039708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:12:31,906][434108] Avg episode reward: [(0, '4605.902')] [2023-03-08 14:12:32,713][434396] Updated weights for policy 0, policy_version 95840 (0.0004) [2023-03-08 14:12:36,535][434396] Updated weights for policy 0, policy_version 95920 (0.0004) [2023-03-08 14:12:36,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10455.2). Total num frames: 49111040. Throughput: 0: 10710.7. 
Samples: 49105600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:12:36,893][434108] Avg episode reward: [(0, '4603.113')] [2023-03-08 14:12:36,941][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000095928_49115136.pth... [2023-03-08 14:12:36,943][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000095288_48787456.pth [2023-03-08 14:12:40,249][434396] Updated weights for policy 0, policy_version 96000 (0.0004) [2023-03-08 14:12:41,893][434108] Fps is (10 sec: 10649.4, 60 sec: 10649.6, 300 sec: 10455.2). Total num frames: 49164288. Throughput: 0: 10718.8. Samples: 49137980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:12:41,893][434108] Avg episode reward: [(0, '4598.948')] [2023-03-08 14:12:44,400][434396] Updated weights for policy 0, policy_version 96080 (0.0005) [2023-03-08 14:12:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10455.2). Total num frames: 49217536. Throughput: 0: 10701.6. Samples: 49199408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:12:46,893][434108] Avg episode reward: [(0, '4593.626')] [2023-03-08 14:12:48,360][434396] Updated weights for policy 0, policy_version 96160 (0.0005) [2023-03-08 14:12:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10455.2). Total num frames: 49270784. Throughput: 0: 10706.3. Samples: 49262288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:12:51,893][434108] Avg episode reward: [(0, '4569.136')] [2023-03-08 14:12:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000096232_49270784.pth... 
[2023-03-08 14:12:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000095600_48947200.pth [2023-03-08 14:12:52,251][434396] Updated weights for policy 0, policy_version 96240 (0.0005) [2023-03-08 14:12:56,230][434396] Updated weights for policy 0, policy_version 96320 (0.0005) [2023-03-08 14:12:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10455.2). Total num frames: 49319936. Throughput: 0: 10692.1. Samples: 49293188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:12:56,904][434108] Avg episode reward: [(0, '4554.253')] [2023-03-08 14:13:00,159][434396] Updated weights for policy 0, policy_version 96400 (0.0005) [2023-03-08 14:13:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10455.2). Total num frames: 49373184. Throughput: 0: 10665.3. Samples: 49355136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:13:01,901][434108] Avg episode reward: [(0, '4588.496')] [2023-03-08 14:13:04,066][434396] Updated weights for policy 0, policy_version 96480 (0.0005) [2023-03-08 14:13:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10455.2). Total num frames: 49426432. Throughput: 0: 10617.9. Samples: 49418120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:13:06,893][434108] Avg episode reward: [(0, '4563.894')] [2023-03-08 14:13:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000096536_49426432.pth... [2023-03-08 14:13:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000095928_49115136.pth [2023-03-08 14:13:08,030][434396] Updated weights for policy 0, policy_version 96560 (0.0005) [2023-03-08 14:13:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10441.3). Total num frames: 49475584. Throughput: 0: 10595.1. Samples: 49448620. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:13:11,893][434108] Avg episode reward: [(0, '4485.534')] [2023-03-08 14:13:12,031][434396] Updated weights for policy 0, policy_version 96640 (0.0005) [2023-03-08 14:13:16,015][434396] Updated weights for policy 0, policy_version 96720 (0.0005) [2023-03-08 14:13:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10455.2). Total num frames: 49528832. Throughput: 0: 10466.2. Samples: 49510688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:13:16,893][434108] Avg episode reward: [(0, '4403.015')] [2023-03-08 14:13:19,936][434396] Updated weights for policy 0, policy_version 96800 (0.0005) [2023-03-08 14:13:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 49577984. Throughput: 0: 10381.0. Samples: 49572744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:13:21,893][434108] Avg episode reward: [(0, '4479.776')] [2023-03-08 14:13:21,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000096840_49582080.pth... [2023-03-08 14:13:21,908][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000096232_49270784.pth [2023-03-08 14:13:23,868][434396] Updated weights for policy 0, policy_version 96880 (0.0005) [2023-03-08 14:13:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10455.2). Total num frames: 49631232. Throughput: 0: 10351.7. Samples: 49603804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:13:26,893][434108] Avg episode reward: [(0, '4485.900')] [2023-03-08 14:13:27,668][434396] Updated weights for policy 0, policy_version 96960 (0.0005) [2023-03-08 14:13:31,524][434396] Updated weights for policy 0, policy_version 97040 (0.0005) [2023-03-08 14:13:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 49684480. Throughput: 0: 10422.4. 
Samples: 49668416. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:13:31,893][434108] Avg episode reward: [(0, '4488.069')] [2023-03-08 14:13:35,276][434396] Updated weights for policy 0, policy_version 97120 (0.0004) [2023-03-08 14:13:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49741824. Throughput: 0: 10474.3. Samples: 49733632. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:13:36,893][434108] Avg episode reward: [(0, '4529.047')] [2023-03-08 14:13:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000097152_49741824.pth... [2023-03-08 14:13:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000096536_49426432.pth [2023-03-08 14:13:39,116][434396] Updated weights for policy 0, policy_version 97200 (0.0005) [2023-03-08 14:13:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49795072. Throughput: 0: 10488.9. Samples: 49765188. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:13:41,893][434108] Avg episode reward: [(0, '4464.209')] [2023-03-08 14:13:42,846][434396] Updated weights for policy 0, policy_version 97280 (0.0004) [2023-03-08 14:13:46,720][434396] Updated weights for policy 0, policy_version 97360 (0.0004) [2023-03-08 14:13:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49848320. Throughput: 0: 10582.5. Samples: 49831348. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:13:46,893][434108] Avg episode reward: [(0, '4620.818')] [2023-03-08 14:13:50,766][434396] Updated weights for policy 0, policy_version 97440 (0.0004) [2023-03-08 14:13:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 49897472. Throughput: 0: 10530.8. Samples: 49892004. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:13:51,893][434108] Avg episode reward: [(0, '4580.614')] [2023-03-08 14:13:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000097456_49897472.pth... [2023-03-08 14:13:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000096840_49582080.pth [2023-03-08 14:13:54,732][434396] Updated weights for policy 0, policy_version 97520 (0.0005) [2023-03-08 14:13:55,970][434347] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000001 [2023-03-08 14:13:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 49950720. Throughput: 0: 10544.4. Samples: 49923116. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:13:56,893][434108] Avg episode reward: [(0, '4563.953')] [2023-03-08 14:13:58,641][434396] Updated weights for policy 0, policy_version 97600 (0.0004) [2023-03-08 14:14:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 50003968. Throughput: 0: 10562.7. Samples: 49986008. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:14:01,893][434108] Avg episode reward: [(0, '4410.112')] [2023-03-08 14:14:02,419][434396] Updated weights for policy 0, policy_version 97680 (0.0005) [2023-03-08 14:14:06,167][434396] Updated weights for policy 0, policy_version 97760 (0.0004) [2023-03-08 14:14:06,892][434108] Fps is (10 sec: 11059.0, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 50061312. Throughput: 0: 10636.4. Samples: 50051384. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:14:06,893][434108] Avg episode reward: [(0, '4534.018')] [2023-03-08 14:14:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000097776_50061312.pth... 
[2023-03-08 14:14:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000097152_49741824.pth [2023-03-08 14:14:10,089][434396] Updated weights for policy 0, policy_version 97840 (0.0005) [2023-03-08 14:14:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 50110464. Throughput: 0: 10657.3. Samples: 50083384. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:14:11,893][434108] Avg episode reward: [(0, '4496.429')] [2023-03-08 14:14:13,999][434396] Updated weights for policy 0, policy_version 97920 (0.0005) [2023-03-08 14:14:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 50163712. Throughput: 0: 10592.4. Samples: 50145072. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:14:16,893][434108] Avg episode reward: [(0, '4525.630')] [2023-03-08 14:14:18,016][434396] Updated weights for policy 0, policy_version 98000 (0.0005) [2023-03-08 14:14:21,664][434396] Updated weights for policy 0, policy_version 98080 (0.0004) [2023-03-08 14:14:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 50216960. Throughput: 0: 10569.3. Samples: 50209252. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:14:21,893][434108] Avg episode reward: [(0, '4548.508')] [2023-03-08 14:14:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000098080_50216960.pth... [2023-03-08 14:14:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000097456_49897472.pth [2023-03-08 14:14:25,338][434396] Updated weights for policy 0, policy_version 98160 (0.0004) [2023-03-08 14:14:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10496.9). Total num frames: 50274304. Throughput: 0: 10618.2. Samples: 50243008. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:14:26,893][434108] Avg episode reward: [(0, '4539.893')] [2023-03-08 14:14:29,109][434396] Updated weights for policy 0, policy_version 98240 (0.0004) [2023-03-08 14:14:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10496.9). Total num frames: 50327552. Throughput: 0: 10598.0. Samples: 50308260. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:14:31,894][434108] Avg episode reward: [(0, '4457.941')] [2023-03-08 14:14:32,859][434396] Updated weights for policy 0, policy_version 98320 (0.0004) [2023-03-08 14:14:36,655][434396] Updated weights for policy 0, policy_version 98400 (0.0004) [2023-03-08 14:14:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 50380800. Throughput: 0: 10707.6. Samples: 50373848. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:14:36,893][434108] Avg episode reward: [(0, '4548.844')] [2023-03-08 14:14:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000098400_50380800.pth... [2023-03-08 14:14:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000097776_50061312.pth [2023-03-08 14:14:40,394][434396] Updated weights for policy 0, policy_version 98480 (0.0005) [2023-03-08 14:14:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 50438144. Throughput: 0: 10744.4. Samples: 50406616. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:14:41,893][434108] Avg episode reward: [(0, '4596.110')] [2023-03-08 14:14:44,104][434396] Updated weights for policy 0, policy_version 98560 (0.0004) [2023-03-08 14:14:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10552.4). Total num frames: 50491392. Throughput: 0: 10806.8. Samples: 50472316. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:14:46,893][434108] Avg episode reward: [(0, '4607.233')] [2023-03-08 14:14:47,889][434396] Updated weights for policy 0, policy_version 98640 (0.0004) [2023-03-08 14:14:51,620][434396] Updated weights for policy 0, policy_version 98720 (0.0004) [2023-03-08 14:14:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10566.3). Total num frames: 50544640. Throughput: 0: 10808.4. Samples: 50537764. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:14:51,893][434108] Avg episode reward: [(0, '4582.934')] [2023-03-08 14:14:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000098720_50544640.pth... [2023-03-08 14:14:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000098080_50216960.pth [2023-03-08 14:14:55,574][434396] Updated weights for policy 0, policy_version 98800 (0.0005) [2023-03-08 14:14:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10566.3). Total num frames: 50597888. Throughput: 0: 10800.1. Samples: 50569388. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:14:56,893][434108] Avg episode reward: [(0, '4576.547')] [2023-03-08 14:14:59,482][434396] Updated weights for policy 0, policy_version 98880 (0.0005) [2023-03-08 14:15:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10580.2). Total num frames: 50651136. Throughput: 0: 10820.3. Samples: 50631984. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:15:01,893][434108] Avg episode reward: [(0, '4498.819')] [2023-03-08 14:15:03,364][434396] Updated weights for policy 0, policy_version 98960 (0.0005) [2023-03-08 14:15:06,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 50700288. Throughput: 0: 10805.5. Samples: 50695500. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:15:06,893][434108] Avg episode reward: [(0, '4446.766')] [2023-03-08 14:15:06,948][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000099032_50704384.pth... [2023-03-08 14:15:06,950][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000098400_50380800.pth [2023-03-08 14:15:07,374][434396] Updated weights for policy 0, policy_version 99040 (0.0005) [2023-03-08 14:15:11,370][434396] Updated weights for policy 0, policy_version 99120 (0.0005) [2023-03-08 14:15:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 50753536. Throughput: 0: 10717.3. Samples: 50725288. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:15:11,893][434108] Avg episode reward: [(0, '4456.430')] [2023-03-08 14:15:15,388][434396] Updated weights for policy 0, policy_version 99200 (0.0005) [2023-03-08 14:15:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 50802688. Throughput: 0: 10624.8. Samples: 50786376. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:15:16,893][434108] Avg episode reward: [(0, '4316.874')] [2023-03-08 14:15:19,356][434396] Updated weights for policy 0, policy_version 99280 (0.0005) [2023-03-08 14:15:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 50855936. Throughput: 0: 10537.5. Samples: 50848036. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:15:21,893][434108] Avg episode reward: [(0, '4313.052')] [2023-03-08 14:15:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000099328_50855936.pth... 
[2023-03-08 14:15:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000098720_50544640.pth [2023-03-08 14:15:23,343][434396] Updated weights for policy 0, policy_version 99360 (0.0005) [2023-03-08 14:15:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 50905088. Throughput: 0: 10511.6. Samples: 50879636. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:15:26,893][434108] Avg episode reward: [(0, '4520.356')] [2023-03-08 14:15:27,383][434396] Updated weights for policy 0, policy_version 99440 (0.0005) [2023-03-08 14:15:31,387][434396] Updated weights for policy 0, policy_version 99520 (0.0005) [2023-03-08 14:15:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 50958336. Throughput: 0: 10401.9. Samples: 50940400. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:15:31,893][434108] Avg episode reward: [(0, '4358.479')] [2023-03-08 14:15:35,326][434396] Updated weights for policy 0, policy_version 99600 (0.0005) [2023-03-08 14:15:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 51007488. Throughput: 0: 10322.4. Samples: 51002272. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:15:36,893][434108] Avg episode reward: [(0, '4418.369')] [2023-03-08 14:15:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000099624_51007488.pth... [2023-03-08 14:15:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000099032_50704384.pth [2023-03-08 14:15:39,279][434396] Updated weights for policy 0, policy_version 99680 (0.0005) [2023-03-08 14:15:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10538.5). Total num frames: 51060736. Throughput: 0: 10308.8. Samples: 51033284. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:15:41,893][434108] Avg episode reward: [(0, '4416.196')] [2023-03-08 14:15:43,094][434396] Updated weights for policy 0, policy_version 99760 (0.0004) [2023-03-08 14:15:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51113984. Throughput: 0: 10367.6. Samples: 51098528. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:15:46,893][434108] Avg episode reward: [(0, '4503.348')] [2023-03-08 14:15:46,896][434396] Updated weights for policy 0, policy_version 99840 (0.0004) [2023-03-08 14:15:50,692][434396] Updated weights for policy 0, policy_version 99920 (0.0004) [2023-03-08 14:15:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10510.8). Total num frames: 51167232. Throughput: 0: 10383.2. Samples: 51162744. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:15:51,893][434108] Avg episode reward: [(0, '4599.140')] [2023-03-08 14:15:51,927][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000099944_51171328.pth... [2023-03-08 14:15:51,928][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000099328_50855936.pth [2023-03-08 14:15:54,712][434396] Updated weights for policy 0, policy_version 100000 (0.0005) [2023-03-08 14:15:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51220480. Throughput: 0: 10382.9. Samples: 51192520. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:15:56,893][434108] Avg episode reward: [(0, '4525.866')] [2023-03-08 14:15:58,682][434396] Updated weights for policy 0, policy_version 100080 (0.0005) [2023-03-08 14:16:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51273728. Throughput: 0: 10410.0. Samples: 51254824. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:16:01,893][434108] Avg episode reward: [(0, '4593.980')] [2023-03-08 14:16:02,640][434396] Updated weights for policy 0, policy_version 100160 (0.0005) [2023-03-08 14:16:06,613][434396] Updated weights for policy 0, policy_version 100240 (0.0005) [2023-03-08 14:16:06,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.6, 300 sec: 10524.6). Total num frames: 51322880. Throughput: 0: 10414.9. Samples: 51316704. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:16:06,893][434108] Avg episode reward: [(0, '4472.475')] [2023-03-08 14:16:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000100240_51322880.pth... [2023-03-08 14:16:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000099624_51007488.pth [2023-03-08 14:16:10,646][434396] Updated weights for policy 0, policy_version 100320 (0.0005) [2023-03-08 14:16:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51376128. Throughput: 0: 10397.6. Samples: 51347528. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:16:11,893][434108] Avg episode reward: [(0, '4617.022')] [2023-03-08 14:16:14,639][434396] Updated weights for policy 0, policy_version 100400 (0.0005) [2023-03-08 14:16:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51425280. Throughput: 0: 10411.3. Samples: 51408908. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:16:16,893][434108] Avg episode reward: [(0, '4353.092')] [2023-03-08 14:16:18,595][434396] Updated weights for policy 0, policy_version 100480 (0.0005) [2023-03-08 14:16:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51478528. Throughput: 0: 10402.9. Samples: 51470400. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:16:21,893][434108] Avg episode reward: [(0, '4122.675')] [2023-03-08 14:16:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000100544_51478528.pth... [2023-03-08 14:16:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000099944_51171328.pth [2023-03-08 14:16:22,607][434396] Updated weights for policy 0, policy_version 100560 (0.0005) [2023-03-08 14:16:26,616][434396] Updated weights for policy 0, policy_version 100640 (0.0005) [2023-03-08 14:16:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51527680. Throughput: 0: 10386.2. Samples: 51500664. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:16:26,893][434108] Avg episode reward: [(0, '4447.284')] [2023-03-08 14:16:30,616][434396] Updated weights for policy 0, policy_version 100720 (0.0005) [2023-03-08 14:16:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51580928. Throughput: 0: 10322.7. Samples: 51563048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:16:31,893][434108] Avg episode reward: [(0, '4487.633')] [2023-03-08 14:16:34,582][434396] Updated weights for policy 0, policy_version 100800 (0.0005) [2023-03-08 14:16:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51630080. Throughput: 0: 10262.9. Samples: 51624576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:16:36,893][434108] Avg episode reward: [(0, '4521.865')] [2023-03-08 14:16:36,952][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000100848_51634176.pth... 
[2023-03-08 14:16:36,953][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000100240_51322880.pth [2023-03-08 14:16:38,551][434396] Updated weights for policy 0, policy_version 100880 (0.0005) [2023-03-08 14:16:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 51683328. Throughput: 0: 10274.9. Samples: 51654892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:16:41,893][434108] Avg episode reward: [(0, '4332.261')] [2023-03-08 14:16:42,583][434396] Updated weights for policy 0, policy_version 100960 (0.0005) [2023-03-08 14:16:46,508][434396] Updated weights for policy 0, policy_version 101040 (0.0005) [2023-03-08 14:16:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10510.8). Total num frames: 51732480. Throughput: 0: 10264.9. Samples: 51716744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:16:46,893][434108] Avg episode reward: [(0, '4335.904')] [2023-03-08 14:16:50,528][434396] Updated weights for policy 0, policy_version 101120 (0.0005) [2023-03-08 14:16:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10524.6). Total num frames: 51785728. Throughput: 0: 10260.1. Samples: 51778408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:16:51,893][434108] Avg episode reward: [(0, '4351.599')] [2023-03-08 14:16:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000101144_51785728.pth... [2023-03-08 14:16:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000100544_51478528.pth [2023-03-08 14:16:54,527][434396] Updated weights for policy 0, policy_version 101200 (0.0005) [2023-03-08 14:16:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10510.8). Total num frames: 51834880. Throughput: 0: 10266.2. Samples: 51809508. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:16:56,893][434108] Avg episode reward: [(0, '4386.610')] [2023-03-08 14:16:58,536][434396] Updated weights for policy 0, policy_version 101280 (0.0005) [2023-03-08 14:17:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10510.8). Total num frames: 51888128. Throughput: 0: 10257.3. Samples: 51870488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:17:01,893][434108] Avg episode reward: [(0, '4400.691')] [2023-03-08 14:17:02,562][434396] Updated weights for policy 0, policy_version 101360 (0.0005) [2023-03-08 14:17:06,547][434396] Updated weights for policy 0, policy_version 101440 (0.0005) [2023-03-08 14:17:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10496.9). Total num frames: 51937280. Throughput: 0: 10260.2. Samples: 51932108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:17:06,893][434108] Avg episode reward: [(0, '4527.651')] [2023-03-08 14:17:06,955][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000101448_51941376.pth... [2023-03-08 14:17:06,956][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000100848_51634176.pth [2023-03-08 14:17:10,547][434396] Updated weights for policy 0, policy_version 101520 (0.0005) [2023-03-08 14:17:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10496.9). Total num frames: 51990528. Throughput: 0: 10256.6. Samples: 51962212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:17:11,893][434108] Avg episode reward: [(0, '4272.632')] [2023-03-08 14:17:14,563][434396] Updated weights for policy 0, policy_version 101600 (0.0005) [2023-03-08 14:17:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10496.9). Total num frames: 52043776. Throughput: 0: 10237.0. Samples: 52023716. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:17:16,893][434108] Avg episode reward: [(0, '4460.204')] [2023-03-08 14:17:18,308][434396] Updated weights for policy 0, policy_version 101680 (0.0004) [2023-03-08 14:17:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10496.9). Total num frames: 52097024. Throughput: 0: 10338.1. Samples: 52089792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:17:21,903][434108] Avg episode reward: [(0, '4479.732')] [2023-03-08 14:17:21,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000101752_52097024.pth... [2023-03-08 14:17:21,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000101144_51785728.pth [2023-03-08 14:17:22,040][434396] Updated weights for policy 0, policy_version 101760 (0.0004) [2023-03-08 14:17:25,833][434396] Updated weights for policy 0, policy_version 101840 (0.0004) [2023-03-08 14:17:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 52150272. Throughput: 0: 10382.9. Samples: 52122124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:17:26,893][434108] Avg episode reward: [(0, '4417.622')] [2023-03-08 14:17:29,566][434396] Updated weights for policy 0, policy_version 101920 (0.0004) [2023-03-08 14:17:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 52207616. Throughput: 0: 10461.3. Samples: 52187504. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:17:31,893][434108] Avg episode reward: [(0, '4490.321')] [2023-03-08 14:17:33,277][434396] Updated weights for policy 0, policy_version 102000 (0.0004) [2023-03-08 14:17:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 52260864. Throughput: 0: 10575.8. Samples: 52254320. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:17:36,893][434108] Avg episode reward: [(0, '4510.682')] [2023-03-08 14:17:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000102072_52260864.pth... [2023-03-08 14:17:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000101448_51941376.pth [2023-03-08 14:17:36,995][434396] Updated weights for policy 0, policy_version 102080 (0.0004) [2023-03-08 14:17:40,979][434396] Updated weights for policy 0, policy_version 102160 (0.0005) [2023-03-08 14:17:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 52314112. Throughput: 0: 10579.3. Samples: 52285576. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:17:41,893][434108] Avg episode reward: [(0, '4045.655')] [2023-03-08 14:17:45,019][434396] Updated weights for policy 0, policy_version 102240 (0.0005) [2023-03-08 14:17:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 52363264. Throughput: 0: 10587.9. Samples: 52346944. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:17:46,893][434108] Avg episode reward: [(0, '4105.301')] [2023-03-08 14:17:49,104][434396] Updated weights for policy 0, policy_version 102320 (0.0005) [2023-03-08 14:17:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 52412416. Throughput: 0: 10560.6. Samples: 52407332. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:17:51,893][434108] Avg episode reward: [(0, '4107.555')] [2023-03-08 14:17:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000102368_52412416.pth... 
[2023-03-08 14:17:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000101752_52097024.pth [2023-03-08 14:17:53,186][434396] Updated weights for policy 0, policy_version 102400 (0.0005) [2023-03-08 14:17:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 52465664. Throughput: 0: 10552.5. Samples: 52437072. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:17:56,903][434108] Avg episode reward: [(0, '4323.001')] [2023-03-08 14:17:57,036][434396] Updated weights for policy 0, policy_version 102480 (0.0004) [2023-03-08 14:18:00,825][434396] Updated weights for policy 0, policy_version 102560 (0.0004) [2023-03-08 14:18:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 52518912. Throughput: 0: 10631.6. Samples: 52502136. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:18:01,903][434108] Avg episode reward: [(0, '4607.128')] [2023-03-08 14:18:04,655][434396] Updated weights for policy 0, policy_version 102640 (0.0004) [2023-03-08 14:18:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.4, 300 sec: 10496.9). Total num frames: 52572160. Throughput: 0: 10601.6. Samples: 52566864. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:18:06,903][434108] Avg episode reward: [(0, '4568.099')] [2023-03-08 14:18:06,916][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000102688_52576256.pth... [2023-03-08 14:18:06,917][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000102072_52260864.pth [2023-03-08 14:18:08,550][434396] Updated weights for policy 0, policy_version 102720 (0.0005) [2023-03-08 14:18:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 52625408. Throughput: 0: 10551.9. Samples: 52596960. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:18:11,893][434108] Avg episode reward: [(0, '4593.542')] [2023-03-08 14:18:12,360][434396] Updated weights for policy 0, policy_version 102800 (0.0004) [2023-03-08 14:18:16,031][434396] Updated weights for policy 0, policy_version 102880 (0.0004) [2023-03-08 14:18:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 52682752. Throughput: 0: 10561.7. Samples: 52662780. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:18:16,893][434108] Avg episode reward: [(0, '4635.290')] [2023-03-08 14:18:19,764][434396] Updated weights for policy 0, policy_version 102960 (0.0004) [2023-03-08 14:18:21,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 52736000. Throughput: 0: 10556.6. Samples: 52729368. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:18:21,893][434108] Avg episode reward: [(0, '4567.052')] [2023-03-08 14:18:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000103000_52736000.pth... [2023-03-08 14:18:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000102368_52412416.pth [2023-03-08 14:18:23,542][434396] Updated weights for policy 0, policy_version 103040 (0.0005) [2023-03-08 14:18:26,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 52789248. Throughput: 0: 10570.5. Samples: 52761248. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:18:26,893][434108] Avg episode reward: [(0, '4540.914')] [2023-03-08 14:18:27,519][434396] Updated weights for policy 0, policy_version 103120 (0.0005) [2023-03-08 14:18:31,439][434396] Updated weights for policy 0, policy_version 103200 (0.0005) [2023-03-08 14:18:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 52842496. Throughput: 0: 10589.0. 
Samples: 52823448. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:18:31,893][434108] Avg episode reward: [(0, '4529.948')] [2023-03-08 14:18:35,371][434396] Updated weights for policy 0, policy_version 103280 (0.0005) [2023-03-08 14:18:36,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 52891648. Throughput: 0: 10637.8. Samples: 52886036. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:18:36,893][434108] Avg episode reward: [(0, '4579.830')] [2023-03-08 14:18:36,927][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000103312_52895744.pth... [2023-03-08 14:18:36,928][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000102688_52576256.pth [2023-03-08 14:18:39,315][434396] Updated weights for policy 0, policy_version 103360 (0.0005) [2023-03-08 14:18:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 52944896. Throughput: 0: 10663.1. Samples: 52916912. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:18:41,893][434108] Avg episode reward: [(0, '4525.115')] [2023-03-08 14:18:43,294][434396] Updated weights for policy 0, policy_version 103440 (0.0004) [2023-03-08 14:18:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 52994048. Throughput: 0: 10588.3. Samples: 52978612. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:18:46,893][434108] Avg episode reward: [(0, '4576.824')] [2023-03-08 14:18:47,314][434396] Updated weights for policy 0, policy_version 103520 (0.0005) [2023-03-08 14:18:51,279][434396] Updated weights for policy 0, policy_version 103600 (0.0005) [2023-03-08 14:18:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 53047296. Throughput: 0: 10523.5. Samples: 53040424. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:18:51,903][434108] Avg episode reward: [(0, '4520.183')] [2023-03-08 14:18:51,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000103608_53047296.pth... [2023-03-08 14:18:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000103000_52736000.pth [2023-03-08 14:18:55,315][434396] Updated weights for policy 0, policy_version 103680 (0.0004) [2023-03-08 14:18:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 53096448. Throughput: 0: 10545.1. Samples: 53071488. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:18:56,893][434108] Avg episode reward: [(0, '4512.921')] [2023-03-08 14:18:59,223][434396] Updated weights for policy 0, policy_version 103760 (0.0005) [2023-03-08 14:19:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 53149696. Throughput: 0: 10457.7. Samples: 53133376. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:19:01,893][434108] Avg episode reward: [(0, '4565.053')] [2023-03-08 14:19:03,155][434396] Updated weights for policy 0, policy_version 103840 (0.0005) [2023-03-08 14:19:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 53202944. Throughput: 0: 10351.4. Samples: 53195180. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:19:06,893][434108] Avg episode reward: [(0, '4579.984')] [2023-03-08 14:19:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000103912_53202944.pth... 
[2023-03-08 14:19:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000103312_52895744.pth [2023-03-08 14:19:07,146][434396] Updated weights for policy 0, policy_version 103920 (0.0005) [2023-03-08 14:19:11,112][434396] Updated weights for policy 0, policy_version 104000 (0.0005) [2023-03-08 14:19:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 53252096. Throughput: 0: 10343.2. Samples: 53226692. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:19:11,893][434108] Avg episode reward: [(0, '4565.571')] [2023-03-08 14:19:15,039][434396] Updated weights for policy 0, policy_version 104080 (0.0005) [2023-03-08 14:19:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 53305344. Throughput: 0: 10345.0. Samples: 53288972. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:19:16,893][434108] Avg episode reward: [(0, '4504.848')] [2023-03-08 14:19:19,028][434396] Updated weights for policy 0, policy_version 104160 (0.0005) [2023-03-08 14:19:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10455.2). Total num frames: 53358592. Throughput: 0: 10320.6. Samples: 53350464. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:19:21,893][434108] Avg episode reward: [(0, '4539.972')] [2023-03-08 14:19:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000104216_53358592.pth... [2023-03-08 14:19:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000103608_53047296.pth [2023-03-08 14:19:22,971][434396] Updated weights for policy 0, policy_version 104240 (0.0005) [2023-03-08 14:19:26,800][434396] Updated weights for policy 0, policy_version 104320 (0.0005) [2023-03-08 14:19:26,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10455.2). 
Total num frames: 53411840. Throughput: 0: 10332.9. Samples: 53381892. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:19:26,893][434108] Avg episode reward: [(0, '4416.287')] [2023-03-08 14:19:30,647][434396] Updated weights for policy 0, policy_version 104400 (0.0006) [2023-03-08 14:19:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10455.2). Total num frames: 53465088. Throughput: 0: 10397.7. Samples: 53446508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:19:31,903][434108] Avg episode reward: [(0, '4274.510')] [2023-03-08 14:19:34,678][434396] Updated weights for policy 0, policy_version 104480 (0.0005) [2023-03-08 14:19:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 53514240. Throughput: 0: 10376.8. Samples: 53507380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:19:36,903][434108] Avg episode reward: [(0, '4368.006')] [2023-03-08 14:19:36,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000104520_53514240.pth... [2023-03-08 14:19:36,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000103912_53202944.pth [2023-03-08 14:19:38,727][434396] Updated weights for policy 0, policy_version 104560 (0.0005) [2023-03-08 14:19:41,893][434108] Fps is (10 sec: 9830.3, 60 sec: 10308.2, 300 sec: 10413.6). Total num frames: 53563392. Throughput: 0: 10365.4. Samples: 53537932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:19:41,893][434108] Avg episode reward: [(0, '4221.110')] [2023-03-08 14:19:42,739][434396] Updated weights for policy 0, policy_version 104640 (0.0005) [2023-03-08 14:19:46,541][434396] Updated weights for policy 0, policy_version 104720 (0.0005) [2023-03-08 14:19:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 53616640. Throughput: 0: 10372.5. Samples: 53600140. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:19:46,893][434108] Avg episode reward: [(0, '4413.415')] [2023-03-08 14:19:50,269][434396] Updated weights for policy 0, policy_version 104800 (0.0005) [2023-03-08 14:19:51,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 53673984. Throughput: 0: 10459.5. Samples: 53665856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:19:51,893][434108] Avg episode reward: [(0, '4379.893')] [2023-03-08 14:19:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000104832_53673984.pth... [2023-03-08 14:19:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000104216_53358592.pth [2023-03-08 14:19:54,246][434396] Updated weights for policy 0, policy_version 104880 (0.0005) [2023-03-08 14:19:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10413.6). Total num frames: 53723136. Throughput: 0: 10431.7. Samples: 53696120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:19:56,893][434108] Avg episode reward: [(0, '4340.529')] [2023-03-08 14:19:58,214][434396] Updated weights for policy 0, policy_version 104960 (0.0006) [2023-03-08 14:20:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 53776384. Throughput: 0: 10441.0. Samples: 53758816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:20:01,893][434108] Avg episode reward: [(0, '4354.694')] [2023-03-08 14:20:02,108][434396] Updated weights for policy 0, policy_version 105040 (0.0006) [2023-03-08 14:20:05,946][434396] Updated weights for policy 0, policy_version 105120 (0.0005) [2023-03-08 14:20:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 53829632. Throughput: 0: 10476.4. Samples: 53821904. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:20:06,893][434108] Avg episode reward: [(0, '4515.992')] [2023-03-08 14:20:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000105136_53829632.pth... [2023-03-08 14:20:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000104520_53514240.pth [2023-03-08 14:20:09,749][434396] Updated weights for policy 0, policy_version 105200 (0.0005) [2023-03-08 14:20:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 53882880. Throughput: 0: 10497.5. Samples: 53854280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:20:11,893][434108] Avg episode reward: [(0, '4197.137')] [2023-03-08 14:20:13,752][434396] Updated weights for policy 0, policy_version 105280 (0.0005) [2023-03-08 14:20:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 53932032. Throughput: 0: 10426.9. Samples: 53915720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:20:16,893][434108] Avg episode reward: [(0, '4459.163')] [2023-03-08 14:20:17,822][434396] Updated weights for policy 0, policy_version 105360 (0.0006) [2023-03-08 14:20:21,872][434396] Updated weights for policy 0, policy_version 105440 (0.0005) [2023-03-08 14:20:21,893][434108] Fps is (10 sec: 10239.8, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 53985280. Throughput: 0: 10437.2. Samples: 53977056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:20:21,893][434108] Avg episode reward: [(0, '4354.149')] [2023-03-08 14:20:21,898][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000105440_53985280.pth... 
[2023-03-08 14:20:21,901][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000104832_53673984.pth [2023-03-08 14:20:25,924][434396] Updated weights for policy 0, policy_version 105520 (0.0006) [2023-03-08 14:20:26,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 54034432. Throughput: 0: 10422.9. Samples: 54006960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:20:26,893][434108] Avg episode reward: [(0, '4324.633')] [2023-03-08 14:20:29,942][434396] Updated weights for policy 0, policy_version 105600 (0.0005) [2023-03-08 14:20:31,892][434108] Fps is (10 sec: 9830.6, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 54083584. Throughput: 0: 10389.8. Samples: 54067680. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:20:31,893][434108] Avg episode reward: [(0, '4073.741')] [2023-03-08 14:20:33,948][434396] Updated weights for policy 0, policy_version 105680 (0.0006) [2023-03-08 14:20:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 54136832. Throughput: 0: 10294.1. Samples: 54129092. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:20:36,893][434108] Avg episode reward: [(0, '4520.485')] [2023-03-08 14:20:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000105736_54136832.pth... [2023-03-08 14:20:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000105136_53829632.pth [2023-03-08 14:20:37,974][434396] Updated weights for policy 0, policy_version 105760 (0.0006) [2023-03-08 14:20:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 54185984. Throughput: 0: 10310.3. Samples: 54160084. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:20:41,893][434108] Avg episode reward: [(0, '4186.532')] [2023-03-08 14:20:42,011][434396] Updated weights for policy 0, policy_version 105840 (0.0005) [2023-03-08 14:20:46,056][434396] Updated weights for policy 0, policy_version 105920 (0.0005) [2023-03-08 14:20:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 54239232. Throughput: 0: 10262.8. Samples: 54220644. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:20:46,893][434108] Avg episode reward: [(0, '4311.067')] [2023-03-08 14:20:50,032][434396] Updated weights for policy 0, policy_version 106000 (0.0006) [2023-03-08 14:20:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10399.7). Total num frames: 54288384. Throughput: 0: 10232.9. Samples: 54282384. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:20:51,893][434108] Avg episode reward: [(0, '4225.629')] [2023-03-08 14:20:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000106032_54288384.pth... [2023-03-08 14:20:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000105440_53985280.pth [2023-03-08 14:20:54,098][434396] Updated weights for policy 0, policy_version 106080 (0.0006) [2023-03-08 14:20:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10399.7). Total num frames: 54341632. Throughput: 0: 10186.1. Samples: 54312656. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:20:56,893][434108] Avg episode reward: [(0, '4462.088')] [2023-03-08 14:20:57,969][434396] Updated weights for policy 0, policy_version 106160 (0.0004) [2023-03-08 14:21:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10399.7). Total num frames: 54390784. Throughput: 0: 10217.1. Samples: 54375492. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:21:01,893][434108] Avg episode reward: [(0, '4451.527')] [2023-03-08 14:21:01,903][434396] Updated weights for policy 0, policy_version 106240 (0.0005) [2023-03-08 14:21:05,997][434396] Updated weights for policy 0, policy_version 106320 (0.0006) [2023-03-08 14:21:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10399.7). Total num frames: 54444032. Throughput: 0: 10196.7. Samples: 54435904. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:21:06,893][434108] Avg episode reward: [(0, '4323.244')] [2023-03-08 14:21:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000106336_54444032.pth... [2023-03-08 14:21:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000105736_54136832.pth [2023-03-08 14:21:09,982][434396] Updated weights for policy 0, policy_version 106400 (0.0005) [2023-03-08 14:21:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10413.6). Total num frames: 54497280. Throughput: 0: 10220.3. Samples: 54466872. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:21:11,893][434108] Avg episode reward: [(0, '4440.147')] [2023-03-08 14:21:13,761][434396] Updated weights for policy 0, policy_version 106480 (0.0003) [2023-03-08 14:21:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10413.6). Total num frames: 54550528. Throughput: 0: 10300.3. Samples: 54531196. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:21:16,893][434108] Avg episode reward: [(0, '4499.202')] [2023-03-08 14:21:17,504][434396] Updated weights for policy 0, policy_version 106560 (0.0004) [2023-03-08 14:21:21,262][434396] Updated weights for policy 0, policy_version 106640 (0.0005) [2023-03-08 14:21:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 54603776. Throughput: 0: 10397.8. 
Samples: 54596992. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:21:21,893][434108] Avg episode reward: [(0, '4455.475')] [2023-03-08 14:21:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000106648_54603776.pth... [2023-03-08 14:21:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000106032_54288384.pth [2023-03-08 14:21:24,976][434396] Updated weights for policy 0, policy_version 106720 (0.0005) [2023-03-08 14:21:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 54661120. Throughput: 0: 10437.8. Samples: 54629784. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:21:26,893][434108] Avg episode reward: [(0, '4528.416')] [2023-03-08 14:21:28,744][434396] Updated weights for policy 0, policy_version 106800 (0.0005) [2023-03-08 14:21:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10455.2). Total num frames: 54714368. Throughput: 0: 10553.2. Samples: 54695540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:21:31,893][434108] Avg episode reward: [(0, '4553.293')] [2023-03-08 14:21:32,482][434396] Updated weights for policy 0, policy_version 106880 (0.0004) [2023-03-08 14:21:36,468][434396] Updated weights for policy 0, policy_version 106960 (0.0005) [2023-03-08 14:21:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10455.2). Total num frames: 54767616. Throughput: 0: 10600.9. Samples: 54759424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:21:36,893][434108] Avg episode reward: [(0, '4530.190')] [2023-03-08 14:21:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000106968_54767616.pth... 
[2023-03-08 14:21:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000106336_54444032.pth [2023-03-08 14:21:40,085][434396] Updated weights for policy 0, policy_version 107040 (0.0004) [2023-03-08 14:21:41,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 54824960. Throughput: 0: 10657.8. Samples: 54792256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:21:41,893][434108] Avg episode reward: [(0, '4604.802')] [2023-03-08 14:21:43,747][434396] Updated weights for policy 0, policy_version 107120 (0.0004) [2023-03-08 14:21:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 54878208. Throughput: 0: 10746.1. Samples: 54859064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:21:46,893][434108] Avg episode reward: [(0, '4546.759')] [2023-03-08 14:21:47,435][434396] Updated weights for policy 0, policy_version 107200 (0.0004) [2023-03-08 14:21:51,064][434396] Updated weights for policy 0, policy_version 107280 (0.0004) [2023-03-08 14:21:51,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10510.7). Total num frames: 54935552. Throughput: 0: 10908.6. Samples: 54926792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:21:51,893][434108] Avg episode reward: [(0, '4564.879')] [2023-03-08 14:21:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000107296_54935552.pth... [2023-03-08 14:21:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000106648_54603776.pth [2023-03-08 14:21:54,748][434396] Updated weights for policy 0, policy_version 107360 (0.0004) [2023-03-08 14:21:56,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10854.4, 300 sec: 10524.6). Total num frames: 54992896. Throughput: 0: 10962.7. Samples: 54960192. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:21:56,893][434108] Avg episode reward: [(0, '4576.010')] [2023-03-08 14:21:58,397][434396] Updated weights for policy 0, policy_version 107440 (0.0004) [2023-03-08 14:22:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10922.7, 300 sec: 10538.5). Total num frames: 55046144. Throughput: 0: 11032.8. Samples: 55027672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:01,893][434108] Avg episode reward: [(0, '4596.570')] [2023-03-08 14:22:02,012][434396] Updated weights for policy 0, policy_version 107520 (0.0004) [2023-03-08 14:22:05,946][434396] Updated weights for policy 0, policy_version 107600 (0.0005) [2023-03-08 14:22:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10538.5). Total num frames: 55099392. Throughput: 0: 10993.2. Samples: 55091684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:06,893][434108] Avg episode reward: [(0, '4566.954')] [2023-03-08 14:22:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000107616_55099392.pth... [2023-03-08 14:22:06,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000106968_54767616.pth [2023-03-08 14:22:09,648][434396] Updated weights for policy 0, policy_version 107680 (0.0004) [2023-03-08 14:22:11,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 10552.4). Total num frames: 55156736. Throughput: 0: 10993.3. Samples: 55124484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:11,893][434108] Avg episode reward: [(0, '4524.232')] [2023-03-08 14:22:13,299][434396] Updated weights for policy 0, policy_version 107760 (0.0004) [2023-03-08 14:22:16,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10552.4). Total num frames: 55209984. Throughput: 0: 11039.5. Samples: 55192316. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:16,893][434108] Avg episode reward: [(0, '4597.718')] [2023-03-08 14:22:16,953][434396] Updated weights for policy 0, policy_version 107840 (0.0004) [2023-03-08 14:22:20,641][434396] Updated weights for policy 0, policy_version 107920 (0.0004) [2023-03-08 14:22:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10566.3). Total num frames: 55267328. Throughput: 0: 11105.0. Samples: 55259148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:21,893][434108] Avg episode reward: [(0, '4626.263')] [2023-03-08 14:22:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000107944_55267328.pth... [2023-03-08 14:22:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000107296_54935552.pth [2023-03-08 14:22:24,326][434396] Updated weights for policy 0, policy_version 108000 (0.0005) [2023-03-08 14:22:26,892][434108] Fps is (10 sec: 11468.8, 60 sec: 11059.2, 300 sec: 10566.3). Total num frames: 55324672. Throughput: 0: 11105.9. Samples: 55292020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:26,893][434108] Avg episode reward: [(0, '4605.926')] [2023-03-08 14:22:27,972][434396] Updated weights for policy 0, policy_version 108080 (0.0004) [2023-03-08 14:22:31,692][434396] Updated weights for policy 0, policy_version 108160 (0.0004) [2023-03-08 14:22:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 11059.2, 300 sec: 10566.3). Total num frames: 55377920. Throughput: 0: 11120.8. Samples: 55359500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:31,893][434108] Avg episode reward: [(0, '4608.490')] [2023-03-08 14:22:35,376][434396] Updated weights for policy 0, policy_version 108240 (0.0004) [2023-03-08 14:22:36,892][434108] Fps is (10 sec: 11059.0, 60 sec: 11127.5, 300 sec: 10580.2). Total num frames: 55435264. Throughput: 0: 11092.2. 
Samples: 55425940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:36,893][434108] Avg episode reward: [(0, '4583.638')] [2023-03-08 14:22:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000108272_55435264.pth... [2023-03-08 14:22:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000107616_55099392.pth [2023-03-08 14:22:38,999][434396] Updated weights for policy 0, policy_version 108320 (0.0004) [2023-03-08 14:22:41,892][434108] Fps is (10 sec: 11468.7, 60 sec: 11127.5, 300 sec: 10607.9). Total num frames: 55492608. Throughput: 0: 11104.7. Samples: 55459904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:41,893][434108] Avg episode reward: [(0, '4623.531')] [2023-03-08 14:22:42,586][434396] Updated weights for policy 0, policy_version 108400 (0.0004) [2023-03-08 14:22:46,243][434396] Updated weights for policy 0, policy_version 108480 (0.0004) [2023-03-08 14:22:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10621.8). Total num frames: 55545856. Throughput: 0: 11122.6. Samples: 55528192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:46,893][434108] Avg episode reward: [(0, '4624.985')] [2023-03-08 14:22:50,210][434396] Updated weights for policy 0, policy_version 108560 (0.0005) [2023-03-08 14:22:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 11059.2, 300 sec: 10621.8). Total num frames: 55599104. Throughput: 0: 11094.2. Samples: 55590924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:51,893][434108] Avg episode reward: [(0, '4529.132')] [2023-03-08 14:22:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000108592_55599104.pth... 
[2023-03-08 14:22:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000107944_55267328.pth [2023-03-08 14:22:54,188][434396] Updated weights for policy 0, policy_version 108640 (0.0005) [2023-03-08 14:22:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10922.7, 300 sec: 10607.9). Total num frames: 55648256. Throughput: 0: 11056.6. Samples: 55622032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:22:56,893][434108] Avg episode reward: [(0, '4579.915')] [2023-03-08 14:22:58,119][434396] Updated weights for policy 0, policy_version 108720 (0.0005) [2023-03-08 14:23:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10922.7, 300 sec: 10607.9). Total num frames: 55701504. Throughput: 0: 10935.1. Samples: 55684396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:01,893][434108] Avg episode reward: [(0, '4645.277')] [2023-03-08 14:23:02,120][434396] Updated weights for policy 0, policy_version 108800 (0.0005) [2023-03-08 14:23:06,150][434396] Updated weights for policy 0, policy_version 108880 (0.0005) [2023-03-08 14:23:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10854.4, 300 sec: 10594.1). Total num frames: 55750656. Throughput: 0: 10808.3. Samples: 55745520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:06,893][434108] Avg episode reward: [(0, '4614.284')] [2023-03-08 14:23:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000108888_55750656.pth... [2023-03-08 14:23:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000108272_55435264.pth [2023-03-08 14:23:10,163][434396] Updated weights for policy 0, policy_version 108960 (0.0005) [2023-03-08 14:23:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10786.1, 300 sec: 10580.2). Total num frames: 55803904. Throughput: 0: 10743.9. Samples: 55775496. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:11,893][434108] Avg episode reward: [(0, '4428.370')] [2023-03-08 14:23:14,028][434396] Updated weights for policy 0, policy_version 109040 (0.0005) [2023-03-08 14:23:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10580.2). Total num frames: 55857152. Throughput: 0: 10668.1. Samples: 55839564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:16,893][434108] Avg episode reward: [(0, '4466.927')] [2023-03-08 14:23:17,837][434396] Updated weights for policy 0, policy_version 109120 (0.0004) [2023-03-08 14:23:21,488][434396] Updated weights for policy 0, policy_version 109200 (0.0003) [2023-03-08 14:23:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10594.1). Total num frames: 55914496. Throughput: 0: 10640.4. Samples: 55904756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:21,893][434108] Avg episode reward: [(0, '4494.355')] [2023-03-08 14:23:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000109208_55914496.pth... [2023-03-08 14:23:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000108592_55599104.pth [2023-03-08 14:23:25,163][434396] Updated weights for policy 0, policy_version 109280 (0.0003) [2023-03-08 14:23:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.8, 300 sec: 10594.1). Total num frames: 55967744. Throughput: 0: 10635.0. Samples: 55938480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:26,893][434108] Avg episode reward: [(0, '4462.527')] [2023-03-08 14:23:28,935][434396] Updated weights for policy 0, policy_version 109360 (0.0003) [2023-03-08 14:23:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10607.9). Total num frames: 56020992. Throughput: 0: 10580.4. Samples: 56004308. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:31,893][434108] Avg episode reward: [(0, '4569.889')] [2023-03-08 14:23:32,776][434396] Updated weights for policy 0, policy_version 109440 (0.0004) [2023-03-08 14:23:36,635][434396] Updated weights for policy 0, policy_version 109520 (0.0004) [2023-03-08 14:23:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 56074240. Throughput: 0: 10588.3. Samples: 56067396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:36,893][434108] Avg episode reward: [(0, '4564.526')] [2023-03-08 14:23:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000109520_56074240.pth... [2023-03-08 14:23:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000108888_55750656.pth [2023-03-08 14:23:40,366][434396] Updated weights for policy 0, policy_version 109600 (0.0004) [2023-03-08 14:23:41,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 56131584. Throughput: 0: 10619.7. Samples: 56099920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:41,893][434108] Avg episode reward: [(0, '4525.060')] [2023-03-08 14:23:44,057][434396] Updated weights for policy 0, policy_version 109680 (0.0005) [2023-03-08 14:23:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 56184832. Throughput: 0: 10725.1. Samples: 56167024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:46,893][434108] Avg episode reward: [(0, '4582.913')] [2023-03-08 14:23:47,716][434396] Updated weights for policy 0, policy_version 109760 (0.0004) [2023-03-08 14:23:51,373][434396] Updated weights for policy 0, policy_version 109840 (0.0004) [2023-03-08 14:23:51,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10663.5). Total num frames: 56242176. Throughput: 0: 10856.2. 
Samples: 56234048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:51,893][434108] Avg episode reward: [(0, '4594.459')] [2023-03-08 14:23:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000109848_56242176.pth... [2023-03-08 14:23:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000109208_55914496.pth [2023-03-08 14:23:55,131][434396] Updated weights for policy 0, policy_version 109920 (0.0004) [2023-03-08 14:23:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10663.5). Total num frames: 56295424. Throughput: 0: 10918.2. Samples: 56266816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:23:56,903][434108] Avg episode reward: [(0, '4580.736')] [2023-03-08 14:23:58,760][434396] Updated weights for policy 0, policy_version 110000 (0.0005) [2023-03-08 14:24:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10677.4). Total num frames: 56352768. Throughput: 0: 10985.7. Samples: 56333920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:01,903][434108] Avg episode reward: [(0, '4583.279')] [2023-03-08 14:24:02,406][434396] Updated weights for policy 0, policy_version 110080 (0.0004) [2023-03-08 14:24:06,019][434396] Updated weights for policy 0, policy_version 110160 (0.0004) [2023-03-08 14:24:06,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10990.9, 300 sec: 10705.1). Total num frames: 56410112. Throughput: 0: 11048.4. Samples: 56401932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:06,903][434108] Avg episode reward: [(0, '4519.700')] [2023-03-08 14:24:06,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000110176_56410112.pth... 
[2023-03-08 14:24:06,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000109520_56074240.pth [2023-03-08 14:24:09,609][434396] Updated weights for policy 0, policy_version 110240 (0.0004) [2023-03-08 14:24:11,892][434108] Fps is (10 sec: 11468.8, 60 sec: 11059.2, 300 sec: 10719.0). Total num frames: 56467456. Throughput: 0: 11058.9. Samples: 56436132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:11,903][434108] Avg episode reward: [(0, '4539.758')] [2023-03-08 14:24:13,250][434396] Updated weights for policy 0, policy_version 110320 (0.0004) [2023-03-08 14:24:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10719.0). Total num frames: 56520704. Throughput: 0: 11100.5. Samples: 56503832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:16,897][434396] Updated weights for policy 0, policy_version 110400 (0.0004) [2023-03-08 14:24:16,903][434108] Avg episode reward: [(0, '4322.749')] [2023-03-08 14:24:20,567][434396] Updated weights for policy 0, policy_version 110480 (0.0004) [2023-03-08 14:24:21,893][434108] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10732.9). Total num frames: 56578048. Throughput: 0: 11177.2. Samples: 56570372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:21,904][434108] Avg episode reward: [(0, '4529.421')] [2023-03-08 14:24:21,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000110504_56578048.pth... [2023-03-08 14:24:21,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000109848_56242176.pth [2023-03-08 14:24:24,276][434396] Updated weights for policy 0, policy_version 110560 (0.0004) [2023-03-08 14:24:26,892][434108] Fps is (10 sec: 11468.8, 60 sec: 11127.5, 300 sec: 10746.8). Total num frames: 56635392. Throughput: 0: 11195.5. Samples: 56603720. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:26,903][434108] Avg episode reward: [(0, '4542.919')] [2023-03-08 14:24:27,908][434396] Updated weights for policy 0, policy_version 110640 (0.0004) [2023-03-08 14:24:31,552][434396] Updated weights for policy 0, policy_version 110720 (0.0004) [2023-03-08 14:24:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 11127.5, 300 sec: 10760.7). Total num frames: 56688640. Throughput: 0: 11209.9. Samples: 56671468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:31,903][434108] Avg episode reward: [(0, '4549.161')] [2023-03-08 14:24:35,184][434396] Updated weights for policy 0, policy_version 110800 (0.0004) [2023-03-08 14:24:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 10788.5). Total num frames: 56745984. Throughput: 0: 11207.2. Samples: 56738372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:36,893][434108] Avg episode reward: [(0, '4513.321')] [2023-03-08 14:24:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000110832_56745984.pth... [2023-03-08 14:24:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000110176_56410112.pth [2023-03-08 14:24:39,008][434396] Updated weights for policy 0, policy_version 110880 (0.0005) [2023-03-08 14:24:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10788.4). Total num frames: 56799232. Throughput: 0: 11195.7. Samples: 56770624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:41,893][434108] Avg episode reward: [(0, '4540.712')] [2023-03-08 14:24:43,014][434396] Updated weights for policy 0, policy_version 110960 (0.0005) [2023-03-08 14:24:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 11059.2, 300 sec: 10760.7). Total num frames: 56848384. Throughput: 0: 11073.8. Samples: 56832240. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:46,893][434108] Avg episode reward: [(0, '4496.105')] [2023-03-08 14:24:46,973][434396] Updated weights for policy 0, policy_version 111040 (0.0005) [2023-03-08 14:24:51,078][434396] Updated weights for policy 0, policy_version 111120 (0.0005) [2023-03-08 14:24:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10990.9, 300 sec: 10774.6). Total num frames: 56901632. Throughput: 0: 10921.5. Samples: 56893400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:51,893][434108] Avg episode reward: [(0, '4522.509')] [2023-03-08 14:24:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000111136_56901632.pth... [2023-03-08 14:24:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000110504_56578048.pth [2023-03-08 14:24:55,022][434396] Updated weights for policy 0, policy_version 111200 (0.0005) [2023-03-08 14:24:56,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10922.7, 300 sec: 10760.7). Total num frames: 56950784. Throughput: 0: 10840.4. Samples: 56923952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:24:56,893][434108] Avg episode reward: [(0, '4435.433')] [2023-03-08 14:24:58,923][434396] Updated weights for policy 0, policy_version 111280 (0.0005) [2023-03-08 14:25:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10760.7). Total num frames: 57004032. Throughput: 0: 10737.3. Samples: 56987012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:01,893][434108] Avg episode reward: [(0, '4505.901')] [2023-03-08 14:25:02,756][434396] Updated weights for policy 0, policy_version 111360 (0.0005) [2023-03-08 14:25:06,486][434396] Updated weights for policy 0, policy_version 111440 (0.0004) [2023-03-08 14:25:06,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10774.6). Total num frames: 57061376. Throughput: 0: 10699.7. 
Samples: 57051856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:06,893][434108] Avg episode reward: [(0, '4454.399')] [2023-03-08 14:25:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000111448_57061376.pth... [2023-03-08 14:25:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000110832_56745984.pth [2023-03-08 14:25:10,149][434396] Updated weights for policy 0, policy_version 111520 (0.0004) [2023-03-08 14:25:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10788.4). Total num frames: 57114624. Throughput: 0: 10711.8. Samples: 57085752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:11,894][434108] Avg episode reward: [(0, '4237.604')] [2023-03-08 14:25:13,824][434396] Updated weights for policy 0, policy_version 111600 (0.0004) [2023-03-08 14:25:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10802.3). Total num frames: 57171968. Throughput: 0: 10679.4. Samples: 57152040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:16,903][434108] Avg episode reward: [(0, '4240.828')] [2023-03-08 14:25:17,554][434396] Updated weights for policy 0, policy_version 111680 (0.0005) [2023-03-08 14:25:21,256][434396] Updated weights for policy 0, policy_version 111760 (0.0004) [2023-03-08 14:25:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10816.2). Total num frames: 57225216. Throughput: 0: 10668.3. Samples: 57218448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:21,893][434108] Avg episode reward: [(0, '4438.891')] [2023-03-08 14:25:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000111768_57225216.pth... 
[2023-03-08 14:25:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000111136_56901632.pth [2023-03-08 14:25:24,972][434396] Updated weights for policy 0, policy_version 111840 (0.0004) [2023-03-08 14:25:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10830.1). Total num frames: 57278464. Throughput: 0: 10695.9. Samples: 57251940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:26,893][434108] Avg episode reward: [(0, '4520.218')] [2023-03-08 14:25:28,882][434396] Updated weights for policy 0, policy_version 111920 (0.0005) [2023-03-08 14:25:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10830.1). Total num frames: 57331712. Throughput: 0: 10734.2. Samples: 57315280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:31,893][434108] Avg episode reward: [(0, '4546.303')] [2023-03-08 14:25:32,657][434396] Updated weights for policy 0, policy_version 112000 (0.0005) [2023-03-08 14:25:36,346][434396] Updated weights for policy 0, policy_version 112080 (0.0004) [2023-03-08 14:25:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10857.9). Total num frames: 57389056. Throughput: 0: 10834.0. Samples: 57380928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:36,893][434108] Avg episode reward: [(0, '4509.758')] [2023-03-08 14:25:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000112088_57389056.pth... [2023-03-08 14:25:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000111448_57061376.pth [2023-03-08 14:25:40,048][434396] Updated weights for policy 0, policy_version 112160 (0.0004) [2023-03-08 14:25:41,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 10857.9). Total num frames: 57442304. Throughput: 0: 10895.9. Samples: 57414268. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:41,893][434108] Avg episode reward: [(0, '4350.654')] [2023-03-08 14:25:43,792][434396] Updated weights for policy 0, policy_version 112240 (0.0004) [2023-03-08 14:25:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10885.6). Total num frames: 57499648. Throughput: 0: 10956.2. Samples: 57480040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:46,893][434108] Avg episode reward: [(0, '4476.924')] [2023-03-08 14:25:47,474][434396] Updated weights for policy 0, policy_version 112320 (0.0004) [2023-03-08 14:25:51,171][434396] Updated weights for policy 0, policy_version 112400 (0.0005) [2023-03-08 14:25:51,892][434108] Fps is (10 sec: 11468.7, 60 sec: 10922.7, 300 sec: 10899.5). Total num frames: 57556992. Throughput: 0: 11010.1. Samples: 57547312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:51,893][434108] Avg episode reward: [(0, '4490.622')] [2023-03-08 14:25:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000112416_57556992.pth... [2023-03-08 14:25:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000111768_57225216.pth [2023-03-08 14:25:54,712][434396] Updated weights for policy 0, policy_version 112480 (0.0003) [2023-03-08 14:25:56,892][434108] Fps is (10 sec: 11468.8, 60 sec: 11059.2, 300 sec: 10927.3). Total num frames: 57614336. Throughput: 0: 11019.6. Samples: 57581632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:25:56,893][434108] Avg episode reward: [(0, '4355.227')] [2023-03-08 14:25:58,393][434396] Updated weights for policy 0, policy_version 112560 (0.0004) [2023-03-08 14:26:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 11059.2, 300 sec: 10927.3). Total num frames: 57667584. Throughput: 0: 11003.0. Samples: 57647176. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:26:01,893][434108] Avg episode reward: [(0, '4365.994')] [2023-03-08 14:26:02,209][434396] Updated weights for policy 0, policy_version 112640 (0.0004) [2023-03-08 14:26:05,869][434396] Updated weights for policy 0, policy_version 112720 (0.0004) [2023-03-08 14:26:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10927.3). Total num frames: 57720832. Throughput: 0: 11016.7. Samples: 57714200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:26:06,893][434108] Avg episode reward: [(0, '4512.844')] [2023-03-08 14:26:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000112736_57720832.pth... [2023-03-08 14:26:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000112088_57389056.pth [2023-03-08 14:26:09,630][434396] Updated weights for policy 0, policy_version 112800 (0.0004) [2023-03-08 14:26:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10941.2). Total num frames: 57778176. Throughput: 0: 10991.6. Samples: 57746564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:26:11,893][434108] Avg episode reward: [(0, '4497.942')] [2023-03-08 14:26:13,326][434396] Updated weights for policy 0, policy_version 112880 (0.0004) [2023-03-08 14:26:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10941.2). Total num frames: 57831424. Throughput: 0: 11068.4. Samples: 57813360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:26:16,893][434108] Avg episode reward: [(0, '4347.989')] [2023-03-08 14:26:16,980][434396] Updated weights for policy 0, policy_version 112960 (0.0004) [2023-03-08 14:26:20,664][434396] Updated weights for policy 0, policy_version 113040 (0.0005) [2023-03-08 14:26:21,892][434108] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10941.2). Total num frames: 57888768. Throughput: 0: 11103.4. 
Samples: 57880584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:26:21,893][434108] Avg episode reward: [(0, '4391.337')] [2023-03-08 14:26:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000113064_57888768.pth... [2023-03-08 14:26:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000112416_57556992.pth [2023-03-08 14:26:24,473][434396] Updated weights for policy 0, policy_version 113120 (0.0005) [2023-03-08 14:26:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10941.2). Total num frames: 57942016. Throughput: 0: 11075.3. Samples: 57912656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:26:26,893][434108] Avg episode reward: [(0, '4436.084')] [2023-03-08 14:26:28,423][434396] Updated weights for policy 0, policy_version 113200 (0.0005) [2023-03-08 14:26:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10990.9, 300 sec: 10927.3). Total num frames: 57991168. Throughput: 0: 10995.7. Samples: 57974848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:26:31,893][434108] Avg episode reward: [(0, '4459.910')] [2023-03-08 14:26:32,418][434396] Updated weights for policy 0, policy_version 113280 (0.0005) [2023-03-08 14:26:36,402][434396] Updated weights for policy 0, policy_version 113360 (0.0005) [2023-03-08 14:26:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10922.7, 300 sec: 10913.4). Total num frames: 58044416. Throughput: 0: 10866.1. Samples: 58036288. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:26:36,893][434108] Avg episode reward: [(0, '4377.626')] [2023-03-08 14:26:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000113368_58044416.pth... 
[2023-03-08 14:26:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000112736_57720832.pth [2023-03-08 14:26:40,323][434396] Updated weights for policy 0, policy_version 113440 (0.0005) [2023-03-08 14:26:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10913.4). Total num frames: 58097664. Throughput: 0: 10802.3. Samples: 58067736. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:26:41,893][434108] Avg episode reward: [(0, '4377.163')] [2023-03-08 14:26:44,230][434396] Updated weights for policy 0, policy_version 113520 (0.0005) [2023-03-08 14:26:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10786.1, 300 sec: 10885.6). Total num frames: 58146816. Throughput: 0: 10740.4. Samples: 58130496. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:26:46,893][434108] Avg episode reward: [(0, '4462.660')] [2023-03-08 14:26:48,202][434396] Updated weights for policy 0, policy_version 113600 (0.0005) [2023-03-08 14:26:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10871.8). Total num frames: 58200064. Throughput: 0: 10620.1. Samples: 58192104. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:26:51,893][434108] Avg episode reward: [(0, '4462.493')] [2023-03-08 14:26:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000113672_58200064.pth... [2023-03-08 14:26:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000113064_57888768.pth [2023-03-08 14:26:52,170][434396] Updated weights for policy 0, policy_version 113680 (0.0005) [2023-03-08 14:26:56,034][434396] Updated weights for policy 0, policy_version 113760 (0.0004) [2023-03-08 14:26:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10871.8). Total num frames: 58253312. Throughput: 0: 10622.9. Samples: 58224596. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:26:56,893][434108] Avg episode reward: [(0, '4531.423')] [2023-03-08 14:26:59,954][434396] Updated weights for policy 0, policy_version 113840 (0.0005) [2023-03-08 14:27:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10857.9). Total num frames: 58302464. Throughput: 0: 10513.0. Samples: 58286444. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:27:01,893][434108] Avg episode reward: [(0, '4555.088')] [2023-03-08 14:27:03,915][434396] Updated weights for policy 0, policy_version 113920 (0.0005) [2023-03-08 14:27:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10844.0). Total num frames: 58355712. Throughput: 0: 10409.8. Samples: 58349024. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:27:06,893][434108] Avg episode reward: [(0, '4568.033')] [2023-03-08 14:27:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000113976_58355712.pth... [2023-03-08 14:27:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000113368_58044416.pth [2023-03-08 14:27:07,800][434396] Updated weights for policy 0, policy_version 114000 (0.0005) [2023-03-08 14:27:11,722][434396] Updated weights for policy 0, policy_version 114080 (0.0005) [2023-03-08 14:27:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10844.0). Total num frames: 58408960. Throughput: 0: 10393.4. Samples: 58380360. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:27:11,893][434108] Avg episode reward: [(0, '4529.656')] [2023-03-08 14:27:15,646][434396] Updated weights for policy 0, policy_version 114160 (0.0005) [2023-03-08 14:27:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10830.1). Total num frames: 58462208. Throughput: 0: 10405.2. Samples: 58443080. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:27:16,893][434108] Avg episode reward: [(0, '4570.782')] [2023-03-08 14:27:19,532][434396] Updated weights for policy 0, policy_version 114240 (0.0005) [2023-03-08 14:27:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10816.2). Total num frames: 58515456. Throughput: 0: 10454.3. Samples: 58506732. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:27:21,893][434108] Avg episode reward: [(0, '4588.233')] [2023-03-08 14:27:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000114288_58515456.pth... [2023-03-08 14:27:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000113672_58200064.pth [2023-03-08 14:27:23,436][434396] Updated weights for policy 0, policy_version 114320 (0.0005) [2023-03-08 14:27:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10802.3). Total num frames: 58564608. Throughput: 0: 10454.2. Samples: 58538176. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:27:26,893][434108] Avg episode reward: [(0, '4398.130')] [2023-03-08 14:27:27,488][434396] Updated weights for policy 0, policy_version 114400 (0.0005) [2023-03-08 14:27:31,376][434396] Updated weights for policy 0, policy_version 114480 (0.0005) [2023-03-08 14:27:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10788.4). Total num frames: 58617856. Throughput: 0: 10404.4. Samples: 58598696. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:27:31,893][434108] Avg episode reward: [(0, '4493.452')] [2023-03-08 14:27:35,058][434396] Updated weights for policy 0, policy_version 114560 (0.0004) [2023-03-08 14:27:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10774.6). Total num frames: 58671104. Throughput: 0: 10497.7. Samples: 58664500. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:27:36,893][434108] Avg episode reward: [(0, '4607.671')] [2023-03-08 14:27:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000114592_58671104.pth... [2023-03-08 14:27:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000113976_58355712.pth [2023-03-08 14:27:38,802][434396] Updated weights for policy 0, policy_version 114640 (0.0004) [2023-03-08 14:27:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10788.4). Total num frames: 58728448. Throughput: 0: 10524.2. Samples: 58698184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:27:41,893][434108] Avg episode reward: [(0, '4577.355')] [2023-03-08 14:27:42,449][434396] Updated weights for policy 0, policy_version 114720 (0.0004) [2023-03-08 14:27:46,167][434396] Updated weights for policy 0, policy_version 114800 (0.0004) [2023-03-08 14:27:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10581.3, 300 sec: 10788.4). Total num frames: 58781696. Throughput: 0: 10636.4. Samples: 58765080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:27:46,893][434108] Avg episode reward: [(0, '4556.194')] [2023-03-08 14:27:49,939][434396] Updated weights for policy 0, policy_version 114880 (0.0005) [2023-03-08 14:27:51,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10816.2). Total num frames: 58839040. Throughput: 0: 10705.7. Samples: 58830780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:27:51,893][434108] Avg episode reward: [(0, '4296.228')] [2023-03-08 14:27:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000114920_58839040.pth... 
[2023-03-08 14:27:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000114288_58515456.pth [2023-03-08 14:27:53,645][434396] Updated weights for policy 0, policy_version 114960 (0.0004) [2023-03-08 14:27:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10816.2). Total num frames: 58892288. Throughput: 0: 10739.2. Samples: 58863624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:27:56,893][434108] Avg episode reward: [(0, '4247.769')] [2023-03-08 14:27:57,359][434396] Updated weights for policy 0, policy_version 115040 (0.0005) [2023-03-08 14:28:01,250][434396] Updated weights for policy 0, policy_version 115120 (0.0004) [2023-03-08 14:28:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10830.1). Total num frames: 58945536. Throughput: 0: 10782.3. Samples: 58928284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:28:01,893][434108] Avg episode reward: [(0, '4372.540')] [2023-03-08 14:28:04,993][434396] Updated weights for policy 0, policy_version 115200 (0.0004) [2023-03-08 14:28:06,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10844.0). Total num frames: 59002880. Throughput: 0: 10826.7. Samples: 58993932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:28:06,893][434108] Avg episode reward: [(0, '4483.083')] [2023-03-08 14:28:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000115240_59002880.pth... [2023-03-08 14:28:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000114592_58671104.pth [2023-03-08 14:28:08,669][434396] Updated weights for policy 0, policy_version 115280 (0.0004) [2023-03-08 14:28:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10844.0). Total num frames: 59056128. Throughput: 0: 10871.9. Samples: 59027412. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:28:11,893][434108] Avg episode reward: [(0, '4490.874')] [2023-03-08 14:28:12,677][434396] Updated weights for policy 0, policy_version 115360 (0.0005) [2023-03-08 14:28:16,556][434396] Updated weights for policy 0, policy_version 115440 (0.0005) [2023-03-08 14:28:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10816.2). Total num frames: 59105280. Throughput: 0: 10889.9. Samples: 59088740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:28:16,893][434108] Avg episode reward: [(0, '4550.868')] [2023-03-08 14:28:20,230][434396] Updated weights for policy 0, policy_version 115520 (0.0004) [2023-03-08 14:28:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10830.1). Total num frames: 59162624. Throughput: 0: 10889.0. Samples: 59154504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:28:21,893][434108] Avg episode reward: [(0, '4361.250')] [2023-03-08 14:28:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000115552_59162624.pth... [2023-03-08 14:28:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000114920_58839040.pth [2023-03-08 14:28:23,951][434396] Updated weights for policy 0, policy_version 115600 (0.0004) [2023-03-08 14:28:26,892][434108] Fps is (10 sec: 11468.9, 60 sec: 10922.7, 300 sec: 10844.0). Total num frames: 59219968. Throughput: 0: 10876.1. Samples: 59187608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:28:26,893][434108] Avg episode reward: [(0, '4388.457')] [2023-03-08 14:28:27,659][434396] Updated weights for policy 0, policy_version 115680 (0.0005) [2023-03-08 14:28:31,579][434396] Updated weights for policy 0, policy_version 115760 (0.0005) [2023-03-08 14:28:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10854.4, 300 sec: 10830.1). Total num frames: 59269120. Throughput: 0: 10837.2. 
Samples: 59252752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:28:31,893][434108] Avg episode reward: [(0, '4515.545')] [2023-03-08 14:28:35,476][434396] Updated weights for policy 0, policy_version 115840 (0.0005) [2023-03-08 14:28:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10854.4, 300 sec: 10816.2). Total num frames: 59322368. Throughput: 0: 10773.0. Samples: 59315564. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:28:36,893][434108] Avg episode reward: [(0, '4573.847')] [2023-03-08 14:28:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000115864_59322368.pth... [2023-03-08 14:28:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000115240_59002880.pth [2023-03-08 14:28:39,399][434396] Updated weights for policy 0, policy_version 115920 (0.0005) [2023-03-08 14:28:41,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10816.2). Total num frames: 59375616. Throughput: 0: 10740.4. Samples: 59346944. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:28:41,893][434108] Avg episode reward: [(0, '4396.604')] [2023-03-08 14:28:43,081][434396] Updated weights for policy 0, policy_version 116000 (0.0005) [2023-03-08 14:28:46,756][434396] Updated weights for policy 0, policy_version 116080 (0.0004) [2023-03-08 14:28:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10816.2). Total num frames: 59432960. Throughput: 0: 10782.6. Samples: 59413500. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:28:46,893][434108] Avg episode reward: [(0, '4536.830')] [2023-03-08 14:28:50,384][434396] Updated weights for policy 0, policy_version 116160 (0.0004) [2023-03-08 14:28:51,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10854.4, 300 sec: 10830.1). Total num frames: 59490304. Throughput: 0: 10827.4. Samples: 59481164. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:28:51,893][434108] Avg episode reward: [(0, '4590.564')] [2023-03-08 14:28:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000116192_59490304.pth... [2023-03-08 14:28:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000115552_59162624.pth [2023-03-08 14:28:54,036][434396] Updated weights for policy 0, policy_version 116240 (0.0004) [2023-03-08 14:28:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10816.2). Total num frames: 59543552. Throughput: 0: 10828.7. Samples: 59514704. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:28:56,893][434108] Avg episode reward: [(0, '4580.105')] [2023-03-08 14:28:57,751][434396] Updated weights for policy 0, policy_version 116320 (0.0004) [2023-03-08 14:29:01,418][434396] Updated weights for policy 0, policy_version 116400 (0.0004) [2023-03-08 14:29:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10816.2). Total num frames: 59600896. Throughput: 0: 10946.2. Samples: 59581320. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:29:01,893][434108] Avg episode reward: [(0, '4631.526')] [2023-03-08 14:29:05,092][434396] Updated weights for policy 0, policy_version 116480 (0.0004) [2023-03-08 14:29:06,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10854.4, 300 sec: 10802.3). Total num frames: 59654144. Throughput: 0: 10967.2. Samples: 59648028. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:29:06,893][434108] Avg episode reward: [(0, '4521.292')] [2023-03-08 14:29:06,918][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000116520_59658240.pth... 
[2023-03-08 14:29:06,919][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000115864_59322368.pth [2023-03-08 14:29:08,839][434396] Updated weights for policy 0, policy_version 116560 (0.0005) [2023-03-08 14:29:11,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10854.4, 300 sec: 10802.3). Total num frames: 59707392. Throughput: 0: 10951.3. Samples: 59680416. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:29:11,893][434108] Avg episode reward: [(0, '4591.685')] [2023-03-08 14:29:12,661][434396] Updated weights for policy 0, policy_version 116640 (0.0004) [2023-03-08 14:29:16,271][434396] Updated weights for policy 0, policy_version 116720 (0.0004) [2023-03-08 14:29:16,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 10802.3). Total num frames: 59764736. Throughput: 0: 10964.1. Samples: 59746136. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:29:16,893][434108] Avg episode reward: [(0, '4561.408')] [2023-03-08 14:29:19,934][434396] Updated weights for policy 0, policy_version 116800 (0.0004) [2023-03-08 14:29:21,893][434108] Fps is (10 sec: 11468.6, 60 sec: 10990.9, 300 sec: 10802.3). Total num frames: 59822080. Throughput: 0: 11068.8. Samples: 59813660. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:29:21,893][434108] Avg episode reward: [(0, '4575.680')] [2023-03-08 14:29:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000116840_59822080.pth... [2023-03-08 14:29:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000116192_59490304.pth [2023-03-08 14:29:23,573][434396] Updated weights for policy 0, policy_version 116880 (0.0004) [2023-03-08 14:29:26,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10990.9, 300 sec: 10816.2). Total num frames: 59879424. Throughput: 0: 11122.1. Samples: 59847436. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:29:26,893][434108] Avg episode reward: [(0, '4559.239')] [2023-03-08 14:29:27,219][434396] Updated weights for policy 0, policy_version 116960 (0.0003) [2023-03-08 14:29:30,842][434396] Updated weights for policy 0, policy_version 117040 (0.0004) [2023-03-08 14:29:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 11059.2, 300 sec: 10802.3). Total num frames: 59932672. Throughput: 0: 11152.2. Samples: 59915348. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:29:31,893][434108] Avg episode reward: [(0, '4586.163')] [2023-03-08 14:29:34,534][434396] Updated weights for policy 0, policy_version 117120 (0.0004) [2023-03-08 14:29:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 11127.5, 300 sec: 10816.2). Total num frames: 59990016. Throughput: 0: 11130.0. Samples: 59982016. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:29:36,893][434108] Avg episode reward: [(0, '4545.477')] [2023-03-08 14:29:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000117168_59990016.pth... [2023-03-08 14:29:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000116520_59658240.pth [2023-03-08 14:29:38,211][434396] Updated weights for policy 0, policy_version 117200 (0.0004) [2023-03-08 14:29:41,887][434396] Updated weights for policy 0, policy_version 117280 (0.0004) [2023-03-08 14:29:41,892][434108] Fps is (10 sec: 11468.8, 60 sec: 11195.7, 300 sec: 10844.0). Total num frames: 60047360. Throughput: 0: 11121.2. Samples: 60015156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:29:41,893][434108] Avg episode reward: [(0, '4562.354')] [2023-03-08 14:29:45,528][434396] Updated weights for policy 0, policy_version 117360 (0.0003) [2023-03-08 14:29:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 11127.5, 300 sec: 10844.0). Total num frames: 60100608. Throughput: 0: 11141.1. 
Samples: 60082668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:29:46,893][434108] Avg episode reward: [(0, '4601.746')] [2023-03-08 14:29:49,207][434396] Updated weights for policy 0, policy_version 117440 (0.0004) [2023-03-08 14:29:51,892][434108] Fps is (10 sec: 11059.1, 60 sec: 11127.5, 300 sec: 10871.8). Total num frames: 60157952. Throughput: 0: 11139.5. Samples: 60149308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:29:51,893][434108] Avg episode reward: [(0, '4577.543')] [2023-03-08 14:29:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000117496_60157952.pth... [2023-03-08 14:29:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000116840_59822080.pth [2023-03-08 14:29:52,981][434396] Updated weights for policy 0, policy_version 117520 (0.0004) [2023-03-08 14:29:56,622][434396] Updated weights for policy 0, policy_version 117600 (0.0004) [2023-03-08 14:29:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10871.8). Total num frames: 60211200. Throughput: 0: 11141.1. Samples: 60181768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:29:56,893][434108] Avg episode reward: [(0, '4604.048')] [2023-03-08 14:30:00,242][434396] Updated weights for policy 0, policy_version 117680 (0.0004) [2023-03-08 14:30:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 11127.5, 300 sec: 10871.8). Total num frames: 60268544. Throughput: 0: 11189.6. Samples: 60249668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:30:01,893][434108] Avg episode reward: [(0, '4644.241')] [2023-03-08 14:30:04,030][434396] Updated weights for policy 0, policy_version 117760 (0.0005) [2023-03-08 14:30:06,892][434108] Fps is (10 sec: 11059.1, 60 sec: 11127.4, 300 sec: 10871.8). Total num frames: 60321792. Throughput: 0: 11111.2. Samples: 60313664. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:30:06,893][434108] Avg episode reward: [(0, '4608.304')] [2023-03-08 14:30:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000117816_60321792.pth... [2023-03-08 14:30:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000117168_59990016.pth [2023-03-08 14:30:07,999][434396] Updated weights for policy 0, policy_version 117840 (0.0005) [2023-03-08 14:30:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 11059.2, 300 sec: 10844.0). Total num frames: 60370944. Throughput: 0: 11057.3. Samples: 60345016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:30:11,893][434108] Avg episode reward: [(0, '4554.600')] [2023-03-08 14:30:11,932][434396] Updated weights for policy 0, policy_version 117920 (0.0005) [2023-03-08 14:30:15,860][434396] Updated weights for policy 0, policy_version 118000 (0.0005) [2023-03-08 14:30:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10990.9, 300 sec: 10844.0). Total num frames: 60424192. Throughput: 0: 10938.2. Samples: 60407564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:30:16,893][434108] Avg episode reward: [(0, '4549.065')] [2023-03-08 14:30:19,855][434396] Updated weights for policy 0, policy_version 118080 (0.0005) [2023-03-08 14:30:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10922.7, 300 sec: 10844.0). Total num frames: 60477440. Throughput: 0: 10828.8. Samples: 60469312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:30:21,893][434108] Avg episode reward: [(0, '4485.661')] [2023-03-08 14:30:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000118120_60477440.pth... 
[2023-03-08 14:30:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000117496_60157952.pth [2023-03-08 14:30:23,756][434396] Updated weights for policy 0, policy_version 118160 (0.0005) [2023-03-08 14:30:26,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10854.4, 300 sec: 10844.0). Total num frames: 60530688. Throughput: 0: 10792.3. Samples: 60500808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:30:26,893][434108] Avg episode reward: [(0, '4474.294')] [2023-03-08 14:30:27,608][434396] Updated weights for policy 0, policy_version 118240 (0.0005) [2023-03-08 14:30:31,574][434396] Updated weights for policy 0, policy_version 118320 (0.0005) [2023-03-08 14:30:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10786.1, 300 sec: 10816.2). Total num frames: 60579840. Throughput: 0: 10685.8. Samples: 60563528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:30:31,893][434108] Avg episode reward: [(0, '4533.533')] [2023-03-08 14:30:35,461][434396] Updated weights for policy 0, policy_version 118400 (0.0005) [2023-03-08 14:30:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10816.2). Total num frames: 60633088. Throughput: 0: 10611.9. Samples: 60626844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:30:36,893][434108] Avg episode reward: [(0, '4417.192')] [2023-03-08 14:30:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000118424_60633088.pth... [2023-03-08 14:30:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000117816_60321792.pth [2023-03-08 14:30:39,371][434396] Updated weights for policy 0, policy_version 118480 (0.0005) [2023-03-08 14:30:41,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10802.3). Total num frames: 60686336. Throughput: 0: 10580.7. Samples: 60657900. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:30:41,893][434108] Avg episode reward: [(0, '4166.442')] [2023-03-08 14:30:43,262][434396] Updated weights for policy 0, policy_version 118560 (0.0005) [2023-03-08 14:30:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10788.5). Total num frames: 60739584. Throughput: 0: 10468.9. Samples: 60720768. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:30:46,893][434108] Avg episode reward: [(0, '4580.882')] [2023-03-08 14:30:47,179][434396] Updated weights for policy 0, policy_version 118640 (0.0004) [2023-03-08 14:30:51,043][434396] Updated weights for policy 0, policy_version 118720 (0.0005) [2023-03-08 14:30:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10774.6). Total num frames: 60792832. Throughput: 0: 10466.4. Samples: 60784652. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:30:51,893][434108] Avg episode reward: [(0, '4437.738')] [2023-03-08 14:30:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000118736_60792832.pth... [2023-03-08 14:30:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000118120_60477440.pth [2023-03-08 14:30:54,883][434396] Updated weights for policy 0, policy_version 118800 (0.0005) [2023-03-08 14:30:56,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10774.6). Total num frames: 60846080. Throughput: 0: 10479.2. Samples: 60816580. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:30:56,893][434108] Avg episode reward: [(0, '4579.031')] [2023-03-08 14:30:58,528][434396] Updated weights for policy 0, policy_version 118880 (0.0004) [2023-03-08 14:31:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10774.6). Total num frames: 60899328. Throughput: 0: 10564.0. Samples: 60882944. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:31:01,893][434108] Avg episode reward: [(0, '4534.122')] [2023-03-08 14:31:02,454][434396] Updated weights for policy 0, policy_version 118960 (0.0005) [2023-03-08 14:31:06,370][434396] Updated weights for policy 0, policy_version 119040 (0.0005) [2023-03-08 14:31:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10760.7). Total num frames: 60952576. Throughput: 0: 10561.6. Samples: 60944584. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:31:06,893][434108] Avg episode reward: [(0, '4489.699')] [2023-03-08 14:31:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000119048_60952576.pth... [2023-03-08 14:31:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000118424_60633088.pth [2023-03-08 14:31:10,044][434396] Updated weights for policy 0, policy_version 119120 (0.0004) [2023-03-08 14:31:11,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10774.6). Total num frames: 61009920. Throughput: 0: 10588.4. Samples: 60977284. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:31:11,893][434108] Avg episode reward: [(0, '4569.517')] [2023-03-08 14:31:13,601][434396] Updated weights for policy 0, policy_version 119200 (0.0003) [2023-03-08 14:31:16,892][434108] Fps is (10 sec: 11468.7, 60 sec: 10717.8, 300 sec: 10774.6). Total num frames: 61067264. Throughput: 0: 10730.9. Samples: 61046420. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:31:16,893][434108] Avg episode reward: [(0, '4589.379')] [2023-03-08 14:31:17,245][434396] Updated weights for policy 0, policy_version 119280 (0.0004) [2023-03-08 14:31:20,924][434396] Updated weights for policy 0, policy_version 119360 (0.0004) [2023-03-08 14:31:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10774.6). Total num frames: 61120512. Throughput: 0: 10806.0. 
Samples: 61113112. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:31:21,893][434108] Avg episode reward: [(0, '4510.319')] [2023-03-08 14:31:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000119376_61120512.pth... [2023-03-08 14:31:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000118736_60792832.pth [2023-03-08 14:31:24,816][434396] Updated weights for policy 0, policy_version 119440 (0.0005) [2023-03-08 14:31:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10788.4). Total num frames: 61173760. Throughput: 0: 10827.8. Samples: 61145152. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:31:26,893][434108] Avg episode reward: [(0, '4594.393')] [2023-03-08 14:31:28,788][434396] Updated weights for policy 0, policy_version 119520 (0.0005) [2023-03-08 14:31:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10788.4). Total num frames: 61227008. Throughput: 0: 10804.6. Samples: 61206976. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:31:31,893][434108] Avg episode reward: [(0, '4625.031')] [2023-03-08 14:31:32,548][434396] Updated weights for policy 0, policy_version 119600 (0.0004) [2023-03-08 14:31:36,470][434396] Updated weights for policy 0, policy_version 119680 (0.0005) [2023-03-08 14:31:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.2, 300 sec: 10788.4). Total num frames: 61280256. Throughput: 0: 10825.9. Samples: 61271816. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:31:36,893][434108] Avg episode reward: [(0, '4588.136')] [2023-03-08 14:31:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000119688_61280256.pth... 
[2023-03-08 14:31:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000119048_60952576.pth [2023-03-08 14:31:40,412][434396] Updated weights for policy 0, policy_version 119760 (0.0005) [2023-03-08 14:31:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10788.4). Total num frames: 61329408. Throughput: 0: 10809.4. Samples: 61303004. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:31:41,893][434108] Avg episode reward: [(0, '4447.283')] [2023-03-08 14:31:44,272][434396] Updated weights for policy 0, policy_version 119840 (0.0004) [2023-03-08 14:31:46,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10717.9, 300 sec: 10788.4). Total num frames: 61382656. Throughput: 0: 10737.7. Samples: 61366140. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:31:46,893][434108] Avg episode reward: [(0, '4376.403')] [2023-03-08 14:31:48,270][434396] Updated weights for policy 0, policy_version 119920 (0.0005) [2023-03-08 14:31:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10788.4). Total num frames: 61435904. Throughput: 0: 10729.1. Samples: 61427392. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:31:51,903][434108] Avg episode reward: [(0, '4164.805')] [2023-03-08 14:31:51,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000119992_61435904.pth... [2023-03-08 14:31:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000119376_61120512.pth [2023-03-08 14:31:52,256][434396] Updated weights for policy 0, policy_version 120000 (0.0005) [2023-03-08 14:31:56,156][434396] Updated weights for policy 0, policy_version 120080 (0.0005) [2023-03-08 14:31:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10788.4). Total num frames: 61485056. Throughput: 0: 10699.8. Samples: 61458776. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:31:56,893][434108] Avg episode reward: [(0, '4509.123')] [2023-03-08 14:32:00,135][434396] Updated weights for policy 0, policy_version 120160 (0.0005) [2023-03-08 14:32:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10788.4). Total num frames: 61538304. Throughput: 0: 10543.0. Samples: 61520856. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:32:01,893][434108] Avg episode reward: [(0, '4485.370')] [2023-03-08 14:32:04,091][434396] Updated weights for policy 0, policy_version 120240 (0.0005) [2023-03-08 14:32:06,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10788.4). Total num frames: 61591552. Throughput: 0: 10450.2. Samples: 61583372. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:32:06,893][434108] Avg episode reward: [(0, '4454.185')] [2023-03-08 14:32:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000120296_61591552.pth... [2023-03-08 14:32:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000119688_61280256.pth [2023-03-08 14:32:07,989][434396] Updated weights for policy 0, policy_version 120320 (0.0005) [2023-03-08 14:32:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10774.6). Total num frames: 61640704. Throughput: 0: 10433.2. Samples: 61614644. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:32:11,893][434108] Avg episode reward: [(0, '4442.828')] [2023-03-08 14:32:11,944][434396] Updated weights for policy 0, policy_version 120400 (0.0005) [2023-03-08 14:32:15,884][434396] Updated weights for policy 0, policy_version 120480 (0.0005) [2023-03-08 14:32:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10774.6). Total num frames: 61693952. Throughput: 0: 10451.8. Samples: 61677308. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:32:16,893][434108] Avg episode reward: [(0, '4518.931')] [2023-03-08 14:32:19,850][434396] Updated weights for policy 0, policy_version 120560 (0.0005) [2023-03-08 14:32:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10788.4). Total num frames: 61747200. Throughput: 0: 10382.0. Samples: 61739008. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:32:21,893][434108] Avg episode reward: [(0, '4406.771')] [2023-03-08 14:32:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000120600_61747200.pth... [2023-03-08 14:32:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000119992_61435904.pth [2023-03-08 14:32:23,807][434396] Updated weights for policy 0, policy_version 120640 (0.0005) [2023-03-08 14:32:26,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10774.6). Total num frames: 61796352. Throughput: 0: 10381.1. Samples: 61770152. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:32:26,893][434108] Avg episode reward: [(0, '4471.322')] [2023-03-08 14:32:27,764][434396] Updated weights for policy 0, policy_version 120720 (0.0005) [2023-03-08 14:32:31,764][434396] Updated weights for policy 0, policy_version 120800 (0.0005) [2023-03-08 14:32:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10774.6). Total num frames: 61849600. Throughput: 0: 10353.2. Samples: 61832032. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:32:31,893][434108] Avg episode reward: [(0, '4459.436')] [2023-03-08 14:32:35,715][434396] Updated weights for policy 0, policy_version 120880 (0.0005) [2023-03-08 14:32:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10746.8). Total num frames: 61898752. Throughput: 0: 10369.4. Samples: 61894016. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:32:36,893][434108] Avg episode reward: [(0, '4501.409')] [2023-03-08 14:32:36,912][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000120904_61902848.pth... [2023-03-08 14:32:36,914][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000120296_61591552.pth [2023-03-08 14:32:39,759][434396] Updated weights for policy 0, policy_version 120960 (0.0005) [2023-03-08 14:32:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10746.8). Total num frames: 61952000. Throughput: 0: 10329.3. Samples: 61923592. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:32:41,903][434108] Avg episode reward: [(0, '4292.487')] [2023-03-08 14:32:43,623][434396] Updated weights for policy 0, policy_version 121040 (0.0005) [2023-03-08 14:32:46,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10732.9). Total num frames: 62005248. Throughput: 0: 10367.0. Samples: 61987372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:32:46,903][434108] Avg episode reward: [(0, '4560.905')] [2023-03-08 14:32:47,622][434396] Updated weights for policy 0, policy_version 121120 (0.0005) [2023-03-08 14:32:51,516][434396] Updated weights for policy 0, policy_version 121200 (0.0005) [2023-03-08 14:32:51,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10719.0). Total num frames: 62054400. Throughput: 0: 10357.1. Samples: 62049444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:32:51,903][434108] Avg episode reward: [(0, '4382.595')] [2023-03-08 14:32:51,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000121208_62058496.pth... 
[2023-03-08 14:32:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000120600_61747200.pth [2023-03-08 14:32:55,455][434396] Updated weights for policy 0, policy_version 121280 (0.0005) [2023-03-08 14:32:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.6, 300 sec: 10719.0). Total num frames: 62107648. Throughput: 0: 10351.3. Samples: 62080452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:32:56,903][434108] Avg episode reward: [(0, '4547.406')] [2023-03-08 14:32:59,472][434396] Updated weights for policy 0, policy_version 121360 (0.0005) [2023-03-08 14:33:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10705.1). Total num frames: 62160896. Throughput: 0: 10323.7. Samples: 62141876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:01,893][434108] Avg episode reward: [(0, '4421.130')] [2023-03-08 14:33:03,448][434396] Updated weights for policy 0, policy_version 121440 (0.0005) [2023-03-08 14:33:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10691.3). Total num frames: 62210048. Throughput: 0: 10320.3. Samples: 62203424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:06,893][434108] Avg episode reward: [(0, '4433.378')] [2023-03-08 14:33:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000121504_62210048.pth... [2023-03-08 14:33:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000120904_61902848.pth [2023-03-08 14:33:07,415][434396] Updated weights for policy 0, policy_version 121520 (0.0005) [2023-03-08 14:33:11,335][434396] Updated weights for policy 0, policy_version 121600 (0.0005) [2023-03-08 14:33:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10705.1). Total num frames: 62263296. Throughput: 0: 10324.1. Samples: 62234736. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:11,893][434108] Avg episode reward: [(0, '4516.254')] [2023-03-08 14:33:15,318][434396] Updated weights for policy 0, policy_version 121680 (0.0005) [2023-03-08 14:33:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10691.3). Total num frames: 62316544. Throughput: 0: 10326.8. Samples: 62296736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:16,893][434108] Avg episode reward: [(0, '4628.635')] [2023-03-08 14:33:19,107][434396] Updated weights for policy 0, policy_version 121760 (0.0005) [2023-03-08 14:33:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10677.4). Total num frames: 62369792. Throughput: 0: 10407.2. Samples: 62362340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:21,893][434108] Avg episode reward: [(0, '4568.499')] [2023-03-08 14:33:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000121816_62369792.pth... [2023-03-08 14:33:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000121208_62058496.pth [2023-03-08 14:33:22,756][434396] Updated weights for policy 0, policy_version 121840 (0.0004) [2023-03-08 14:33:26,395][434396] Updated weights for policy 0, policy_version 121920 (0.0004) [2023-03-08 14:33:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.0, 300 sec: 10705.1). Total num frames: 62427136. Throughput: 0: 10496.4. Samples: 62395928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:26,893][434108] Avg episode reward: [(0, '4490.784')] [2023-03-08 14:33:28,362][434347] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000001 [2023-03-08 14:33:30,285][434396] Updated weights for policy 0, policy_version 122000 (0.0005) [2023-03-08 14:33:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10705.1). Total num frames: 62480384. Throughput: 0: 10526.0. 
Samples: 62461040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:31,893][434108] Avg episode reward: [(0, '4578.867')] [2023-03-08 14:33:34,212][434396] Updated weights for policy 0, policy_version 122080 (0.0005) [2023-03-08 14:33:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.0, 300 sec: 10691.3). Total num frames: 62529536. Throughput: 0: 10538.3. Samples: 62523668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:36,893][434108] Avg episode reward: [(0, '4638.661')] [2023-03-08 14:33:36,962][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000122136_62533632.pth... [2023-03-08 14:33:36,964][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000121504_62210048.pth [2023-03-08 14:33:38,107][434396] Updated weights for policy 0, policy_version 122160 (0.0005) [2023-03-08 14:33:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10677.4). Total num frames: 62582784. Throughput: 0: 10560.3. Samples: 62555664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:41,893][434108] Avg episode reward: [(0, '4539.167')] [2023-03-08 14:33:41,969][434396] Updated weights for policy 0, policy_version 122240 (0.0005) [2023-03-08 14:33:45,959][434396] Updated weights for policy 0, policy_version 122320 (0.0005) [2023-03-08 14:33:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 62636032. Throughput: 0: 10587.2. Samples: 62618300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:46,893][434108] Avg episode reward: [(0, '4632.012')] [2023-03-08 14:33:49,860][434396] Updated weights for policy 0, policy_version 122400 (0.0005) [2023-03-08 14:33:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10663.5). Total num frames: 62689280. Throughput: 0: 10614.8. Samples: 62681088. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:51,893][434108] Avg episode reward: [(0, '4599.002')] [2023-03-08 14:33:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000122440_62689280.pth... [2023-03-08 14:33:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000121816_62369792.pth [2023-03-08 14:33:53,739][434396] Updated weights for policy 0, policy_version 122480 (0.0005) [2023-03-08 14:33:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 62742528. Throughput: 0: 10625.9. Samples: 62712900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:33:56,893][434108] Avg episode reward: [(0, '4644.873')] [2023-03-08 14:33:57,644][434396] Updated weights for policy 0, policy_version 122560 (0.0005) [2023-03-08 14:34:01,577][434396] Updated weights for policy 0, policy_version 122640 (0.0005) [2023-03-08 14:34:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 62791680. Throughput: 0: 10634.9. Samples: 62775308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:01,893][434108] Avg episode reward: [(0, '4642.405')] [2023-03-08 14:34:05,508][434396] Updated weights for policy 0, policy_version 122720 (0.0005) [2023-03-08 14:34:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 62844928. Throughput: 0: 10571.0. Samples: 62838036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:06,893][434108] Avg episode reward: [(0, '4607.485')] [2023-03-08 14:34:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000122744_62844928.pth... 
[2023-03-08 14:34:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000122136_62533632.pth [2023-03-08 14:34:09,388][434396] Updated weights for policy 0, policy_version 122800 (0.0005) [2023-03-08 14:34:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10621.8). Total num frames: 62898176. Throughput: 0: 10525.5. Samples: 62869576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:11,893][434108] Avg episode reward: [(0, '4615.165')] [2023-03-08 14:34:13,362][434396] Updated weights for policy 0, policy_version 122880 (0.0005) [2023-03-08 14:34:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 62951424. Throughput: 0: 10448.5. Samples: 62931224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:16,893][434108] Avg episode reward: [(0, '4463.736')] [2023-03-08 14:34:17,274][434396] Updated weights for policy 0, policy_version 122960 (0.0005) [2023-03-08 14:34:21,167][434396] Updated weights for policy 0, policy_version 123040 (0.0006) [2023-03-08 14:34:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10580.2). Total num frames: 63000576. Throughput: 0: 10484.3. Samples: 62995460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:21,893][434108] Avg episode reward: [(0, '4578.277')] [2023-03-08 14:34:21,956][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000123056_63004672.pth... [2023-03-08 14:34:21,958][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000122440_62689280.pth [2023-03-08 14:34:25,100][434396] Updated weights for policy 0, policy_version 123120 (0.0006) [2023-03-08 14:34:26,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10580.2). Total num frames: 63053824. Throughput: 0: 10443.7. Samples: 63025628. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:26,902][434108] Avg episode reward: [(0, '4626.099')] [2023-03-08 14:34:28,870][434396] Updated weights for policy 0, policy_version 123200 (0.0004) [2023-03-08 14:34:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 63107072. Throughput: 0: 10497.7. Samples: 63090696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:31,893][434108] Avg episode reward: [(0, '4583.460')] [2023-03-08 14:34:32,759][434396] Updated weights for policy 0, policy_version 123280 (0.0005) [2023-03-08 14:34:36,655][434396] Updated weights for policy 0, policy_version 123360 (0.0005) [2023-03-08 14:34:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 63160320. Throughput: 0: 10502.7. Samples: 63153708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:36,903][434108] Avg episode reward: [(0, '4639.485')] [2023-03-08 14:34:36,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000123360_63160320.pth... [2023-03-08 14:34:36,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000122744_62844928.pth [2023-03-08 14:34:40,629][434396] Updated weights for policy 0, policy_version 123440 (0.0005) [2023-03-08 14:34:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 63213568. Throughput: 0: 10490.2. Samples: 63184960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:41,903][434108] Avg episode reward: [(0, '4638.983')] [2023-03-08 14:34:44,562][434396] Updated weights for policy 0, policy_version 123520 (0.0005) [2023-03-08 14:34:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 63262720. Throughput: 0: 10472.6. Samples: 63246576. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:46,893][434108] Avg episode reward: [(0, '4628.772')] [2023-03-08 14:34:48,531][434396] Updated weights for policy 0, policy_version 123600 (0.0004) [2023-03-08 14:34:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 63315968. Throughput: 0: 10452.9. Samples: 63308416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:51,893][434108] Avg episode reward: [(0, '4571.029')] [2023-03-08 14:34:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000123664_63315968.pth... [2023-03-08 14:34:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000123056_63004672.pth [2023-03-08 14:34:52,515][434396] Updated weights for policy 0, policy_version 123680 (0.0005) [2023-03-08 14:34:56,483][434396] Updated weights for policy 0, policy_version 123760 (0.0005) [2023-03-08 14:34:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 63369216. Throughput: 0: 10444.7. Samples: 63339588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:34:56,893][434108] Avg episode reward: [(0, '4546.467')] [2023-03-08 14:35:00,524][434396] Updated weights for policy 0, policy_version 123840 (0.0005) [2023-03-08 14:35:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 63418368. Throughput: 0: 10444.4. Samples: 63401224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:01,893][434108] Avg episode reward: [(0, '4485.970')] [2023-03-08 14:35:04,507][434396] Updated weights for policy 0, policy_version 123920 (0.0005) [2023-03-08 14:35:06,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10496.9). Total num frames: 63467520. Throughput: 0: 10387.1. Samples: 63462880. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:06,893][434108] Avg episode reward: [(0, '4535.447')] [2023-03-08 14:35:06,920][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000123968_63471616.pth... [2023-03-08 14:35:06,922][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000123360_63160320.pth [2023-03-08 14:35:08,380][434396] Updated weights for policy 0, policy_version 124000 (0.0004) [2023-03-08 14:35:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 63524864. Throughput: 0: 10431.1. Samples: 63495028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:11,893][434108] Avg episode reward: [(0, '4524.660')] [2023-03-08 14:35:12,227][434396] Updated weights for policy 0, policy_version 124080 (0.0003) [2023-03-08 14:35:15,855][434396] Updated weights for policy 0, policy_version 124160 (0.0003) [2023-03-08 14:35:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 63578112. Throughput: 0: 10437.2. Samples: 63560372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:16,893][434108] Avg episode reward: [(0, '4481.174')] [2023-03-08 14:35:19,703][434396] Updated weights for policy 0, policy_version 124240 (0.0004) [2023-03-08 14:35:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 63635456. Throughput: 0: 10492.7. Samples: 63625880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:21,893][434108] Avg episode reward: [(0, '4502.991')] [2023-03-08 14:35:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000124288_63635456.pth... 
[2023-03-08 14:35:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000123664_63315968.pth [2023-03-08 14:35:23,383][434396] Updated weights for policy 0, policy_version 124320 (0.0003) [2023-03-08 14:35:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 63688704. Throughput: 0: 10540.5. Samples: 63659284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:26,893][434108] Avg episode reward: [(0, '4554.028')] [2023-03-08 14:35:27,018][434396] Updated weights for policy 0, policy_version 124400 (0.0003) [2023-03-08 14:35:30,910][434396] Updated weights for policy 0, policy_version 124480 (0.0005) [2023-03-08 14:35:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 63741952. Throughput: 0: 10622.5. Samples: 63724588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:31,893][434108] Avg episode reward: [(0, '4319.388')] [2023-03-08 14:35:34,838][434396] Updated weights for policy 0, policy_version 124560 (0.0005) [2023-03-08 14:35:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 63795200. Throughput: 0: 10635.6. Samples: 63787016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:36,893][434108] Avg episode reward: [(0, '4295.393')] [2023-03-08 14:35:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000124600_63795200.pth... [2023-03-08 14:35:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000123968_63471616.pth [2023-03-08 14:35:38,803][434396] Updated weights for policy 0, policy_version 124640 (0.0005) [2023-03-08 14:35:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 63844352. Throughput: 0: 10635.6. Samples: 63818192. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:41,893][434108] Avg episode reward: [(0, '4378.451')] [2023-03-08 14:35:42,676][434396] Updated weights for policy 0, policy_version 124720 (0.0005) [2023-03-08 14:35:46,622][434396] Updated weights for policy 0, policy_version 124800 (0.0005) [2023-03-08 14:35:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 63897600. Throughput: 0: 10666.7. Samples: 63881224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:35:46,893][434108] Avg episode reward: [(0, '4157.486')] [2023-03-08 14:35:50,473][434396] Updated weights for policy 0, policy_version 124880 (0.0004) [2023-03-08 14:35:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 63950848. Throughput: 0: 10694.0. Samples: 63944108. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:35:51,893][434108] Avg episode reward: [(0, '4461.186')] [2023-03-08 14:35:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000124904_63950848.pth... [2023-03-08 14:35:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000124288_63635456.pth [2023-03-08 14:35:54,377][434396] Updated weights for policy 0, policy_version 124960 (0.0005) [2023-03-08 14:35:56,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 64004096. Throughput: 0: 10676.9. Samples: 63975488. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:35:56,893][434108] Avg episode reward: [(0, '4457.253')] [2023-03-08 14:35:58,029][434396] Updated weights for policy 0, policy_version 125040 (0.0005) [2023-03-08 14:36:01,742][434396] Updated weights for policy 0, policy_version 125120 (0.0004) [2023-03-08 14:36:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 64061440. Throughput: 0: 10714.2. 
Samples: 64042512. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:01,893][434108] Avg episode reward: [(0, '4204.376')] [2023-03-08 14:36:05,384][434396] Updated weights for policy 0, policy_version 125200 (0.0004) [2023-03-08 14:36:06,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10854.4, 300 sec: 10538.5). Total num frames: 64118784. Throughput: 0: 10749.7. Samples: 64109616. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:06,893][434108] Avg episode reward: [(0, '4391.700')] [2023-03-08 14:36:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000125232_64118784.pth... [2023-03-08 14:36:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000124600_63795200.pth [2023-03-08 14:36:09,016][434396] Updated weights for policy 0, policy_version 125280 (0.0004) [2023-03-08 14:36:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10524.6). Total num frames: 64172032. Throughput: 0: 10758.7. Samples: 64143424. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:11,893][434108] Avg episode reward: [(0, '4415.059')] [2023-03-08 14:36:12,729][434396] Updated weights for policy 0, policy_version 125360 (0.0004) [2023-03-08 14:36:16,510][434396] Updated weights for policy 0, policy_version 125440 (0.0004) [2023-03-08 14:36:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10538.5). Total num frames: 64229376. Throughput: 0: 10775.2. Samples: 64209472. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:16,893][434108] Avg episode reward: [(0, '4481.519')] [2023-03-08 14:36:20,331][434396] Updated weights for policy 0, policy_version 125520 (0.0005) [2023-03-08 14:36:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 64278528. Throughput: 0: 10811.2. Samples: 64273520. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:21,893][434108] Avg episode reward: [(0, '4275.473')] [2023-03-08 14:36:21,899][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000125552_64282624.pth... [2023-03-08 14:36:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000124904_63950848.pth [2023-03-08 14:36:24,210][434396] Updated weights for policy 0, policy_version 125600 (0.0005) [2023-03-08 14:36:26,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 64331776. Throughput: 0: 10808.2. Samples: 64304560. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:26,893][434108] Avg episode reward: [(0, '4426.669')] [2023-03-08 14:36:28,188][434396] Updated weights for policy 0, policy_version 125680 (0.0006) [2023-03-08 14:36:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 64385024. Throughput: 0: 10810.0. Samples: 64367676. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:31,893][434108] Avg episode reward: [(0, '4478.342')] [2023-03-08 14:36:32,007][434396] Updated weights for policy 0, policy_version 125760 (0.0004) [2023-03-08 14:36:35,654][434396] Updated weights for policy 0, policy_version 125840 (0.0004) [2023-03-08 14:36:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10552.4). Total num frames: 64442368. Throughput: 0: 10890.6. Samples: 64434184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:36,893][434108] Avg episode reward: [(0, '4509.895')] [2023-03-08 14:36:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000125864_64442368.pth... 
[2023-03-08 14:36:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000125232_64118784.pth [2023-03-08 14:36:39,570][434396] Updated weights for policy 0, policy_version 125920 (0.0006) [2023-03-08 14:36:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10552.4). Total num frames: 64495616. Throughput: 0: 10877.9. Samples: 64464992. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:41,893][434108] Avg episode reward: [(0, '4587.094')] [2023-03-08 14:36:43,161][434396] Updated weights for policy 0, policy_version 126000 (0.0005) [2023-03-08 14:36:46,743][434396] Updated weights for policy 0, policy_version 126080 (0.0004) [2023-03-08 14:36:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10566.3). Total num frames: 64552960. Throughput: 0: 10901.4. Samples: 64533076. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:36:46,893][434108] Avg episode reward: [(0, '4562.899')] [2023-03-08 14:36:50,573][434396] Updated weights for policy 0, policy_version 126160 (0.0005) [2023-03-08 14:36:51,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 10580.2). Total num frames: 64606208. Throughput: 0: 10864.2. Samples: 64598504. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:36:51,893][434108] Avg episode reward: [(0, '4526.102')] [2023-03-08 14:36:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000126184_64606208.pth... [2023-03-08 14:36:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000125552_64282624.pth [2023-03-08 14:36:54,505][434396] Updated weights for policy 0, policy_version 126240 (0.0005) [2023-03-08 14:36:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10566.3). Total num frames: 64655360. Throughput: 0: 10824.9. Samples: 64630544. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:36:56,893][434108] Avg episode reward: [(0, '4613.025')] [2023-03-08 14:36:58,541][434396] Updated weights for policy 0, policy_version 126320 (0.0005) [2023-03-08 14:37:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10786.1, 300 sec: 10566.3). Total num frames: 64708608. Throughput: 0: 10715.1. Samples: 64691652. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:01,893][434108] Avg episode reward: [(0, '4572.566')] [2023-03-08 14:37:02,435][434396] Updated weights for policy 0, policy_version 126400 (0.0005) [2023-03-08 14:37:06,262][434396] Updated weights for policy 0, policy_version 126480 (0.0005) [2023-03-08 14:37:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 64761856. Throughput: 0: 10700.5. Samples: 64755044. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:06,893][434108] Avg episode reward: [(0, '4612.177')] [2023-03-08 14:37:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000126488_64761856.pth... [2023-03-08 14:37:06,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000125864_64442368.pth [2023-03-08 14:37:10,119][434396] Updated weights for policy 0, policy_version 126560 (0.0005) [2023-03-08 14:37:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 64815104. Throughput: 0: 10709.9. Samples: 64786504. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:11,893][434108] Avg episode reward: [(0, '4561.602')] [2023-03-08 14:37:14,014][434396] Updated weights for policy 0, policy_version 126640 (0.0005) [2023-03-08 14:37:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 64868352. Throughput: 0: 10729.8. Samples: 64850516. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:16,893][434108] Avg episode reward: [(0, '4622.551')] [2023-03-08 14:37:17,890][434396] Updated weights for policy 0, policy_version 126720 (0.0005) [2023-03-08 14:37:21,710][434396] Updated weights for policy 0, policy_version 126800 (0.0005) [2023-03-08 14:37:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 64921600. Throughput: 0: 10656.0. Samples: 64913704. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:21,893][434108] Avg episode reward: [(0, '4575.846')] [2023-03-08 14:37:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000126800_64921600.pth... [2023-03-08 14:37:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000126184_64606208.pth [2023-03-08 14:37:25,321][434396] Updated weights for policy 0, policy_version 126880 (0.0004) [2023-03-08 14:37:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 64978944. Throughput: 0: 10727.7. Samples: 64947740. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:26,893][434108] Avg episode reward: [(0, '4633.701')] [2023-03-08 14:37:29,032][434396] Updated weights for policy 0, policy_version 126960 (0.0005) [2023-03-08 14:37:31,892][434108] Fps is (10 sec: 11468.9, 60 sec: 10854.4, 300 sec: 10635.7). Total num frames: 65036288. Throughput: 0: 10714.1. Samples: 65015212. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:31,893][434108] Avg episode reward: [(0, '4642.963')] [2023-03-08 14:37:32,614][434396] Updated weights for policy 0, policy_version 127040 (0.0005) [2023-03-08 14:37:36,257][434396] Updated weights for policy 0, policy_version 127120 (0.0004) [2023-03-08 14:37:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 65089536. Throughput: 0: 10753.3. 
Samples: 65082404. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:36,893][434108] Avg episode reward: [(0, '4572.350')] [2023-03-08 14:37:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000127128_65089536.pth... [2023-03-08 14:37:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000126488_64761856.pth [2023-03-08 14:37:39,864][434396] Updated weights for policy 0, policy_version 127200 (0.0004) [2023-03-08 14:37:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10649.6). Total num frames: 65146880. Throughput: 0: 10807.9. Samples: 65116900. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:41,893][434108] Avg episode reward: [(0, '4604.269')] [2023-03-08 14:37:43,446][434396] Updated weights for policy 0, policy_version 127280 (0.0004) [2023-03-08 14:37:46,892][434108] Fps is (10 sec: 11468.9, 60 sec: 10854.4, 300 sec: 10677.4). Total num frames: 65204224. Throughput: 0: 10946.2. Samples: 65184232. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:46,893][434108] Avg episode reward: [(0, '4619.640')] [2023-03-08 14:37:47,099][434396] Updated weights for policy 0, policy_version 127360 (0.0005) [2023-03-08 14:37:50,719][434396] Updated weights for policy 0, policy_version 127440 (0.0004) [2023-03-08 14:37:51,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10922.7, 300 sec: 10691.3). Total num frames: 65261568. Throughput: 0: 11058.6. Samples: 65252680. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 14:37:51,893][434108] Avg episode reward: [(0, '4606.550')] [2023-03-08 14:37:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000127464_65261568.pth... 
[2023-03-08 14:37:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000126800_64921600.pth [2023-03-08 14:37:54,386][434396] Updated weights for policy 0, policy_version 127520 (0.0004) [2023-03-08 14:37:56,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10691.3). Total num frames: 65314816. Throughput: 0: 11104.7. Samples: 65286216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:37:56,893][434108] Avg episode reward: [(0, '4438.496')] [2023-03-08 14:37:58,256][434396] Updated weights for policy 0, policy_version 127600 (0.0005) [2023-03-08 14:38:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 10705.1). Total num frames: 65368064. Throughput: 0: 11090.3. Samples: 65349580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:01,893][434108] Avg episode reward: [(0, '4438.055')] [2023-03-08 14:38:02,191][434396] Updated weights for policy 0, policy_version 127680 (0.0005) [2023-03-08 14:38:06,047][434396] Updated weights for policy 0, policy_version 127760 (0.0005) [2023-03-08 14:38:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10705.1). Total num frames: 65421312. Throughput: 0: 11098.4. Samples: 65413132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:06,893][434108] Avg episode reward: [(0, '4548.254')] [2023-03-08 14:38:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000127776_65421312.pth... [2023-03-08 14:38:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000127128_65089536.pth [2023-03-08 14:38:10,019][434396] Updated weights for policy 0, policy_version 127840 (0.0004) [2023-03-08 14:38:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10922.7, 300 sec: 10691.3). Total num frames: 65470464. Throughput: 0: 11029.9. Samples: 65444084. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:11,893][434108] Avg episode reward: [(0, '4522.838')] [2023-03-08 14:38:14,004][434396] Updated weights for policy 0, policy_version 127920 (0.0005) [2023-03-08 14:38:16,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10922.7, 300 sec: 10691.3). Total num frames: 65523712. Throughput: 0: 10902.9. Samples: 65505844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:16,893][434108] Avg episode reward: [(0, '4526.428')] [2023-03-08 14:38:17,936][434396] Updated weights for policy 0, policy_version 128000 (0.0005) [2023-03-08 14:38:21,764][434396] Updated weights for policy 0, policy_version 128080 (0.0005) [2023-03-08 14:38:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10922.7, 300 sec: 10677.4). Total num frames: 65576960. Throughput: 0: 10809.7. Samples: 65568840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:21,893][434108] Avg episode reward: [(0, '4579.165')] [2023-03-08 14:38:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000128080_65576960.pth... [2023-03-08 14:38:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000127464_65261568.pth [2023-03-08 14:38:25,612][434396] Updated weights for policy 0, policy_version 128160 (0.0005) [2023-03-08 14:38:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10854.4, 300 sec: 10677.4). Total num frames: 65630208. Throughput: 0: 10769.9. Samples: 65601544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:26,893][434108] Avg episode reward: [(0, '4511.327')] [2023-03-08 14:38:29,556][434396] Updated weights for policy 0, policy_version 128240 (0.0005) [2023-03-08 14:38:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10691.3). Total num frames: 65683456. Throughput: 0: 10644.6. Samples: 65663240. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:31,893][434108] Avg episode reward: [(0, '4579.692')] [2023-03-08 14:38:33,458][434396] Updated weights for policy 0, policy_version 128320 (0.0005) [2023-03-08 14:38:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10717.9, 300 sec: 10677.4). Total num frames: 65732608. Throughput: 0: 10534.1. Samples: 65726716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:36,893][434108] Avg episode reward: [(0, '4567.851')] [2023-03-08 14:38:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000128384_65732608.pth... [2023-03-08 14:38:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000127776_65421312.pth [2023-03-08 14:38:37,416][434396] Updated weights for policy 0, policy_version 128400 (0.0005) [2023-03-08 14:38:41,286][434396] Updated weights for policy 0, policy_version 128480 (0.0005) [2023-03-08 14:38:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 65785856. Throughput: 0: 10467.4. Samples: 65757248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:41,893][434108] Avg episode reward: [(0, '4534.340')] [2023-03-08 14:38:44,939][434396] Updated weights for policy 0, policy_version 128560 (0.0004) [2023-03-08 14:38:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10691.3). Total num frames: 65843200. Throughput: 0: 10527.1. Samples: 65823300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:46,893][434108] Avg episode reward: [(0, '4576.957')] [2023-03-08 14:38:48,588][434396] Updated weights for policy 0, policy_version 128640 (0.0004) [2023-03-08 14:38:51,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10649.6, 300 sec: 10705.1). Total num frames: 65900544. Throughput: 0: 10627.9. Samples: 65891388. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:38:51,893][434108] Avg episode reward: [(0, '4541.238')] [2023-03-08 14:38:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000128712_65900544.pth... [2023-03-08 14:38:51,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000128080_65576960.pth [2023-03-08 14:38:52,223][434396] Updated weights for policy 0, policy_version 128720 (0.0004) [2023-03-08 14:38:56,130][434396] Updated weights for policy 0, policy_version 128800 (0.0005) [2023-03-08 14:38:56,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10719.0). Total num frames: 65953792. Throughput: 0: 10667.8. Samples: 65924136. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:38:56,893][434108] Avg episode reward: [(0, '4600.315')] [2023-03-08 14:39:00,027][434396] Updated weights for policy 0, policy_version 128880 (0.0005) [2023-03-08 14:39:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 66002944. Throughput: 0: 10683.0. Samples: 65986580. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:01,893][434108] Avg episode reward: [(0, '4588.674')] [2023-03-08 14:39:03,971][434396] Updated weights for policy 0, policy_version 128960 (0.0004) [2023-03-08 14:39:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 66056192. Throughput: 0: 10669.7. Samples: 66048976. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:06,893][434108] Avg episode reward: [(0, '4551.585')] [2023-03-08 14:39:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000129016_66056192.pth... 
[2023-03-08 14:39:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000128384_65732608.pth [2023-03-08 14:39:07,875][434396] Updated weights for policy 0, policy_version 129040 (0.0005) [2023-03-08 14:39:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10691.3). Total num frames: 66105344. Throughput: 0: 10644.1. Samples: 66080528. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:11,893][434108] Avg episode reward: [(0, '4561.176')] [2023-03-08 14:39:11,895][434396] Updated weights for policy 0, policy_version 129120 (0.0005) [2023-03-08 14:39:15,816][434396] Updated weights for policy 0, policy_version 129200 (0.0005) [2023-03-08 14:39:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 66158592. Throughput: 0: 10645.2. Samples: 66142272. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:16,895][434108] Avg episode reward: [(0, '4612.557')] [2023-03-08 14:39:19,664][434396] Updated weights for policy 0, policy_version 129280 (0.0005) [2023-03-08 14:39:21,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 66211840. Throughput: 0: 10631.0. Samples: 66205112. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:21,904][434108] Avg episode reward: [(0, '4614.823')] [2023-03-08 14:39:21,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000129320_66211840.pth... [2023-03-08 14:39:21,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000128712_65900544.pth [2023-03-08 14:39:23,626][434396] Updated weights for policy 0, policy_version 129360 (0.0005) [2023-03-08 14:39:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 66265088. Throughput: 0: 10648.4. Samples: 66236424. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:26,903][434108] Avg episode reward: [(0, '4617.262')] [2023-03-08 14:39:27,541][434396] Updated weights for policy 0, policy_version 129440 (0.0005) [2023-03-08 14:39:31,445][434396] Updated weights for policy 0, policy_version 129520 (0.0005) [2023-03-08 14:39:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 66318336. Throughput: 0: 10578.1. Samples: 66299316. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:31,903][434108] Avg episode reward: [(0, '4636.234')] [2023-03-08 14:39:35,154][434396] Updated weights for policy 0, policy_version 129600 (0.0004) [2023-03-08 14:39:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10705.1). Total num frames: 66371584. Throughput: 0: 10516.2. Samples: 66364616. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:36,903][434108] Avg episode reward: [(0, '4610.242')] [2023-03-08 14:39:36,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000129632_66371584.pth... [2023-03-08 14:39:36,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000129016_66056192.pth [2023-03-08 14:39:38,803][434396] Updated weights for policy 0, policy_version 129680 (0.0004) [2023-03-08 14:39:41,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10732.9). Total num frames: 66428928. Throughput: 0: 10545.1. Samples: 66398664. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:41,893][434108] Avg episode reward: [(0, '4618.309')] [2023-03-08 14:39:42,388][434396] Updated weights for policy 0, policy_version 129760 (0.0003) [2023-03-08 14:39:46,000][434396] Updated weights for policy 0, policy_version 129840 (0.0004) [2023-03-08 14:39:46,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10717.9, 300 sec: 10746.8). Total num frames: 66486272. Throughput: 0: 10665.5. 
Samples: 66466528. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:46,893][434108] Avg episode reward: [(0, '4513.149')] [2023-03-08 14:39:49,729][434396] Updated weights for policy 0, policy_version 129920 (0.0004) [2023-03-08 14:39:51,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10746.8). Total num frames: 66539520. Throughput: 0: 10775.6. Samples: 66533880. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:39:51,893][434108] Avg episode reward: [(0, '4569.093')] [2023-03-08 14:39:51,927][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000129968_66543616.pth... [2023-03-08 14:39:51,928][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000129320_66211840.pth [2023-03-08 14:39:53,455][434396] Updated weights for policy 0, policy_version 130000 (0.0005) [2023-03-08 14:39:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10760.7). Total num frames: 66592768. Throughput: 0: 10782.9. Samples: 66565760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:39:56,903][434108] Avg episode reward: [(0, '4598.262')] [2023-03-08 14:39:57,366][434396] Updated weights for policy 0, policy_version 130080 (0.0005) [2023-03-08 14:40:01,322][434396] Updated weights for policy 0, policy_version 130160 (0.0005) [2023-03-08 14:40:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10774.6). Total num frames: 66646016. Throughput: 0: 10795.9. Samples: 66628088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:01,893][434108] Avg episode reward: [(0, '4556.209')] [2023-03-08 14:40:05,184][434396] Updated weights for policy 0, policy_version 130240 (0.0005) [2023-03-08 14:40:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10760.7). Total num frames: 66699264. Throughput: 0: 10800.6. Samples: 66691136. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:06,893][434108] Avg episode reward: [(0, '4438.918')] [2023-03-08 14:40:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000130272_66699264.pth... [2023-03-08 14:40:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000129632_66371584.pth [2023-03-08 14:40:09,175][434396] Updated weights for policy 0, policy_version 130320 (0.0005) [2023-03-08 14:40:11,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10717.9, 300 sec: 10746.8). Total num frames: 66748416. Throughput: 0: 10807.0. Samples: 66722740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:11,893][434108] Avg episode reward: [(0, '4225.366')] [2023-03-08 14:40:13,084][434396] Updated weights for policy 0, policy_version 130400 (0.0005) [2023-03-08 14:40:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10732.9). Total num frames: 66801664. Throughput: 0: 10791.7. Samples: 66784944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:16,893][434108] Avg episode reward: [(0, '4519.117')] [2023-03-08 14:40:17,079][434396] Updated weights for policy 0, policy_version 130480 (0.0005) [2023-03-08 14:40:20,961][434396] Updated weights for policy 0, policy_version 130560 (0.0005) [2023-03-08 14:40:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10732.9). Total num frames: 66854912. Throughput: 0: 10719.7. Samples: 66847004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:21,893][434108] Avg episode reward: [(0, '4579.636')] [2023-03-08 14:40:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000130576_66854912.pth... 
[2023-03-08 14:40:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000129968_66543616.pth [2023-03-08 14:40:24,828][434396] Updated weights for policy 0, policy_version 130640 (0.0005) [2023-03-08 14:40:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10732.9). Total num frames: 66908160. Throughput: 0: 10685.0. Samples: 66879488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:26,893][434108] Avg episode reward: [(0, '4609.622')] [2023-03-08 14:40:28,738][434396] Updated weights for policy 0, policy_version 130720 (0.0005) [2023-03-08 14:40:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10719.0). Total num frames: 66957312. Throughput: 0: 10571.2. Samples: 66942232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:31,893][434108] Avg episode reward: [(0, '4545.384')] [2023-03-08 14:40:32,715][434396] Updated weights for policy 0, policy_version 130800 (0.0005) [2023-03-08 14:40:36,618][434396] Updated weights for policy 0, policy_version 130880 (0.0005) [2023-03-08 14:40:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10649.6, 300 sec: 10732.9). Total num frames: 67010560. Throughput: 0: 10452.4. Samples: 67004236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:36,893][434108] Avg episode reward: [(0, '4593.438')] [2023-03-08 14:40:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000130880_67010560.pth... [2023-03-08 14:40:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000130272_66699264.pth [2023-03-08 14:40:40,603][434396] Updated weights for policy 0, policy_version 130960 (0.0005) [2023-03-08 14:40:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10732.9). Total num frames: 67063808. Throughput: 0: 10432.0. Samples: 67035200. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:41,893][434108] Avg episode reward: [(0, '4564.807')] [2023-03-08 14:40:44,483][434396] Updated weights for policy 0, policy_version 131040 (0.0005) [2023-03-08 14:40:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10732.9). Total num frames: 67117056. Throughput: 0: 10455.8. Samples: 67098600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:46,893][434108] Avg episode reward: [(0, '4587.861')] [2023-03-08 14:40:48,214][434396] Updated weights for policy 0, policy_version 131120 (0.0005) [2023-03-08 14:40:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10732.9). Total num frames: 67170304. Throughput: 0: 10487.5. Samples: 67163072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:51,893][434108] Avg episode reward: [(0, '4583.197')] [2023-03-08 14:40:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000131192_67170304.pth... [2023-03-08 14:40:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000130576_66854912.pth [2023-03-08 14:40:52,054][434396] Updated weights for policy 0, policy_version 131200 (0.0005) [2023-03-08 14:40:55,977][434396] Updated weights for policy 0, policy_version 131280 (0.0004) [2023-03-08 14:40:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10719.0). Total num frames: 67223552. Throughput: 0: 10492.4. Samples: 67194900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:40:56,893][434108] Avg episode reward: [(0, '4638.542')] [2023-03-08 14:40:59,954][434396] Updated weights for policy 0, policy_version 131360 (0.0005) [2023-03-08 14:41:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10691.3). Total num frames: 67272704. Throughput: 0: 10483.2. Samples: 67256688. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:01,893][434108] Avg episode reward: [(0, '4533.660')] [2023-03-08 14:41:03,895][434396] Updated weights for policy 0, policy_version 131440 (0.0005) [2023-03-08 14:41:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10691.3). Total num frames: 67325952. Throughput: 0: 10484.3. Samples: 67318796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:06,893][434108] Avg episode reward: [(0, '4255.233')] [2023-03-08 14:41:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000131496_67325952.pth... [2023-03-08 14:41:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000130880_67010560.pth [2023-03-08 14:41:07,888][434396] Updated weights for policy 0, policy_version 131520 (0.0005) [2023-03-08 14:41:11,839][434396] Updated weights for policy 0, policy_version 131600 (0.0005) [2023-03-08 14:41:11,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10677.4). Total num frames: 67379200. Throughput: 0: 10459.6. Samples: 67350172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:11,893][434108] Avg episode reward: [(0, '4318.865')] [2023-03-08 14:41:15,695][434396] Updated weights for policy 0, policy_version 131680 (0.0005) [2023-03-08 14:41:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10691.3). Total num frames: 67432448. Throughput: 0: 10460.2. Samples: 67412940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:16,893][434108] Avg episode reward: [(0, '4509.742')] [2023-03-08 14:41:19,603][434396] Updated weights for policy 0, policy_version 131760 (0.0005) [2023-03-08 14:41:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 67481600. Throughput: 0: 10477.7. Samples: 67475732. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:21,893][434108] Avg episode reward: [(0, '4535.098')] [2023-03-08 14:41:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000131800_67481600.pth... [2023-03-08 14:41:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000131192_67170304.pth [2023-03-08 14:41:23,611][434396] Updated weights for policy 0, policy_version 131840 (0.0005) [2023-03-08 14:41:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 67534848. Throughput: 0: 10467.7. Samples: 67506248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:26,893][434108] Avg episode reward: [(0, '4529.070')] [2023-03-08 14:41:27,550][434396] Updated weights for policy 0, policy_version 131920 (0.0005) [2023-03-08 14:41:31,390][434396] Updated weights for policy 0, policy_version 132000 (0.0005) [2023-03-08 14:41:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 67588096. Throughput: 0: 10452.9. Samples: 67568980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:31,893][434108] Avg episode reward: [(0, '4554.471')] [2023-03-08 14:41:35,315][434396] Updated weights for policy 0, policy_version 132080 (0.0005) [2023-03-08 14:41:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 67641344. Throughput: 0: 10435.9. Samples: 67632688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:36,893][434108] Avg episode reward: [(0, '4593.678')] [2023-03-08 14:41:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000132112_67641344.pth... 
[2023-03-08 14:41:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000131496_67325952.pth [2023-03-08 14:41:39,211][434396] Updated weights for policy 0, policy_version 132160 (0.0004) [2023-03-08 14:41:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 67690496. Throughput: 0: 10424.6. Samples: 67664008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:41,893][434108] Avg episode reward: [(0, '4509.718')] [2023-03-08 14:41:43,078][434396] Updated weights for policy 0, policy_version 132240 (0.0005) [2023-03-08 14:41:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 67743744. Throughput: 0: 10459.4. Samples: 67727360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:46,893][434108] Avg episode reward: [(0, '4627.987')] [2023-03-08 14:41:47,062][434396] Updated weights for policy 0, policy_version 132320 (0.0004) [2023-03-08 14:41:50,992][434396] Updated weights for policy 0, policy_version 132400 (0.0005) [2023-03-08 14:41:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 67796992. Throughput: 0: 10445.9. Samples: 67788864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:51,893][434108] Avg episode reward: [(0, '4661.005')] [2023-03-08 14:41:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000132416_67796992.pth... [2023-03-08 14:41:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000131800_67481600.pth [2023-03-08 14:41:51,900][434347] Saving new best policy, reward=4661.005! [2023-03-08 14:41:54,912][434396] Updated weights for policy 0, policy_version 132480 (0.0005) [2023-03-08 14:41:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 67850240. 
Throughput: 0: 10451.2. Samples: 67820476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:41:56,903][434108] Avg episode reward: [(0, '4659.237')] [2023-03-08 14:41:58,769][434396] Updated weights for policy 0, policy_version 132560 (0.0005) [2023-03-08 14:42:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10649.6). Total num frames: 67903488. Throughput: 0: 10477.4. Samples: 67884424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:01,903][434108] Avg episode reward: [(0, '4535.668')] [2023-03-08 14:42:02,550][434396] Updated weights for policy 0, policy_version 132640 (0.0004) [2023-03-08 14:42:06,457][434396] Updated weights for policy 0, policy_version 132720 (0.0005) [2023-03-08 14:42:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10649.6). Total num frames: 67956736. Throughput: 0: 10500.4. Samples: 67948252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:06,903][434108] Avg episode reward: [(0, '4626.399')] [2023-03-08 14:42:06,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000132728_67956736.pth... [2023-03-08 14:42:06,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000132112_67641344.pth [2023-03-08 14:42:10,142][434396] Updated weights for policy 0, policy_version 132800 (0.0004) [2023-03-08 14:42:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10649.6). Total num frames: 68009984. Throughput: 0: 10553.1. Samples: 67981136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:11,893][434108] Avg episode reward: [(0, '4518.004')] [2023-03-08 14:42:14,017][434396] Updated weights for policy 0, policy_version 132880 (0.0004) [2023-03-08 14:42:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10649.6). Total num frames: 68063232. Throughput: 0: 10575.8. Samples: 68044892. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:16,893][434108] Avg episode reward: [(0, '4560.476')] [2023-03-08 14:42:17,884][434396] Updated weights for policy 0, policy_version 132960 (0.0005) [2023-03-08 14:42:21,507][434396] Updated weights for policy 0, policy_version 133040 (0.0004) [2023-03-08 14:42:21,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 68120576. Throughput: 0: 10631.8. Samples: 68111120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:21,893][434108] Avg episode reward: [(0, '4615.760')] [2023-03-08 14:42:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000133048_68120576.pth... [2023-03-08 14:42:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000132416_67796992.pth [2023-03-08 14:42:25,146][434396] Updated weights for policy 0, policy_version 133120 (0.0004) [2023-03-08 14:42:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 68173824. Throughput: 0: 10686.2. Samples: 68144888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:26,893][434108] Avg episode reward: [(0, '4522.869')] [2023-03-08 14:42:29,041][434396] Updated weights for policy 0, policy_version 133200 (0.0005) [2023-03-08 14:42:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 68227072. Throughput: 0: 10692.0. Samples: 68208500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:31,893][434108] Avg episode reward: [(0, '4570.974')] [2023-03-08 14:42:33,030][434396] Updated weights for policy 0, policy_version 133280 (0.0005) [2023-03-08 14:42:36,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10581.4, 300 sec: 10607.9). Total num frames: 68276224. Throughput: 0: 10697.6. Samples: 68270256. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:36,893][434108] Avg episode reward: [(0, '4626.834')] [2023-03-08 14:42:36,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000133352_68276224.pth... [2023-03-08 14:42:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000132728_67956736.pth [2023-03-08 14:42:36,988][434396] Updated weights for policy 0, policy_version 133360 (0.0005) [2023-03-08 14:42:40,659][434396] Updated weights for policy 0, policy_version 133440 (0.0004) [2023-03-08 14:42:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10607.9). Total num frames: 68333568. Throughput: 0: 10704.0. Samples: 68302156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:41,893][434108] Avg episode reward: [(0, '4627.585')] [2023-03-08 14:42:44,230][434396] Updated weights for policy 0, policy_version 133520 (0.0003) [2023-03-08 14:42:46,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 68386816. Throughput: 0: 10800.3. Samples: 68370440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:46,893][434108] Avg episode reward: [(0, '4462.257')] [2023-03-08 14:42:48,200][434396] Updated weights for policy 0, policy_version 133600 (0.0005) [2023-03-08 14:42:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 68440064. Throughput: 0: 10776.6. Samples: 68433200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:51,893][434108] Avg episode reward: [(0, '4519.579')] [2023-03-08 14:42:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000133672_68440064.pth... 
[2023-03-08 14:42:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000133048_68120576.pth [2023-03-08 14:42:52,064][434396] Updated weights for policy 0, policy_version 133680 (0.0005) [2023-03-08 14:42:55,735][434396] Updated weights for policy 0, policy_version 133760 (0.0004) [2023-03-08 14:42:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 68497408. Throughput: 0: 10765.1. Samples: 68465564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:42:56,893][434108] Avg episode reward: [(0, '4658.208')] [2023-03-08 14:42:59,373][434396] Updated weights for policy 0, policy_version 133840 (0.0004) [2023-03-08 14:43:01,892][434108] Fps is (10 sec: 11468.9, 60 sec: 10854.4, 300 sec: 10621.8). Total num frames: 68554752. Throughput: 0: 10849.3. Samples: 68533108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:01,893][434108] Avg episode reward: [(0, '4607.019')] [2023-03-08 14:43:02,958][434396] Updated weights for policy 0, policy_version 133920 (0.0004) [2023-03-08 14:43:06,571][434396] Updated weights for policy 0, policy_version 134000 (0.0004) [2023-03-08 14:43:06,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10635.7). Total num frames: 68608000. Throughput: 0: 10897.5. Samples: 68601508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:06,893][434108] Avg episode reward: [(0, '4575.084')] [2023-03-08 14:43:06,922][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000134008_68612096.pth... [2023-03-08 14:43:06,923][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000133352_68276224.pth [2023-03-08 14:43:10,179][434396] Updated weights for policy 0, policy_version 134080 (0.0003) [2023-03-08 14:43:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10649.6). 
Total num frames: 68665344. Throughput: 0: 10912.4. Samples: 68635944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:11,893][434108] Avg episode reward: [(0, '4635.960')] [2023-03-08 14:43:13,912][434396] Updated weights for policy 0, policy_version 134160 (0.0004) [2023-03-08 14:43:16,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10990.9, 300 sec: 10663.5). Total num frames: 68722688. Throughput: 0: 10972.7. Samples: 68702272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:16,893][434108] Avg episode reward: [(0, '4494.167')] [2023-03-08 14:43:17,550][434396] Updated weights for policy 0, policy_version 134240 (0.0004) [2023-03-08 14:43:21,271][434396] Updated weights for policy 0, policy_version 134320 (0.0004) [2023-03-08 14:43:21,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 10663.5). Total num frames: 68775936. Throughput: 0: 11086.0. Samples: 68769128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:21,893][434108] Avg episode reward: [(0, '4517.879')] [2023-03-08 14:43:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000134328_68775936.pth... [2023-03-08 14:43:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000133672_68440064.pth [2023-03-08 14:43:24,920][434396] Updated weights for policy 0, policy_version 134400 (0.0004) [2023-03-08 14:43:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10677.4). Total num frames: 68833280. Throughput: 0: 11118.9. Samples: 68802508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:26,893][434108] Avg episode reward: [(0, '4611.985')] [2023-03-08 14:43:28,670][434396] Updated weights for policy 0, policy_version 134480 (0.0004) [2023-03-08 14:43:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10691.3). Total num frames: 68886528. Throughput: 0: 11044.8. Samples: 68867456. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:31,893][434108] Avg episode reward: [(0, '4612.675')] [2023-03-08 14:43:32,551][434396] Updated weights for policy 0, policy_version 134560 (0.0005) [2023-03-08 14:43:36,309][434396] Updated weights for policy 0, policy_version 134640 (0.0004) [2023-03-08 14:43:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 11059.2, 300 sec: 10691.3). Total num frames: 68939776. Throughput: 0: 11102.1. Samples: 68932796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:36,893][434108] Avg episode reward: [(0, '4568.988')] [2023-03-08 14:43:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000134648_68939776.pth... [2023-03-08 14:43:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000134008_68612096.pth [2023-03-08 14:43:40,050][434396] Updated weights for policy 0, policy_version 134720 (0.0004) [2023-03-08 14:43:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10677.4). Total num frames: 68993024. Throughput: 0: 11102.6. Samples: 68965180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:41,893][434108] Avg episode reward: [(0, '4485.518')] [2023-03-08 14:43:43,928][434396] Updated weights for policy 0, policy_version 134800 (0.0005) [2023-03-08 14:43:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 10663.5). Total num frames: 69046272. Throughput: 0: 11017.6. Samples: 69028900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:46,893][434108] Avg episode reward: [(0, '4560.813')] [2023-03-08 14:43:47,922][434396] Updated weights for policy 0, policy_version 134880 (0.0005) [2023-03-08 14:43:51,888][434396] Updated weights for policy 0, policy_version 134960 (0.0005) [2023-03-08 14:43:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10663.5). Total num frames: 69099520. Throughput: 0: 10878.4. 
Samples: 69091036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:51,893][434108] Avg episode reward: [(0, '4481.026')] [2023-03-08 14:43:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000134960_69099520.pth... [2023-03-08 14:43:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000134328_68775936.pth [2023-03-08 14:43:55,863][434396] Updated weights for policy 0, policy_version 135040 (0.0005) [2023-03-08 14:43:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10663.5). Total num frames: 69148672. Throughput: 0: 10791.7. Samples: 69121568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:43:56,893][434108] Avg episode reward: [(0, '4531.238')] [2023-03-08 14:43:59,786][434396] Updated weights for policy 0, policy_version 135120 (0.0005) [2023-03-08 14:44:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10786.1, 300 sec: 10663.5). Total num frames: 69201920. Throughput: 0: 10705.4. Samples: 69184016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:01,893][434108] Avg episode reward: [(0, '4427.583')] [2023-03-08 14:44:03,776][434396] Updated weights for policy 0, policy_version 135200 (0.0005) [2023-03-08 14:44:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10677.4). Total num frames: 69255168. Throughput: 0: 10605.7. Samples: 69246384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:06,893][434108] Avg episode reward: [(0, '4451.751')] [2023-03-08 14:44:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000135264_69255168.pth... 
[2023-03-08 14:44:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000134648_68939776.pth [2023-03-08 14:44:07,705][434396] Updated weights for policy 0, policy_version 135280 (0.0005) [2023-03-08 14:44:11,672][434396] Updated weights for policy 0, policy_version 135360 (0.0005) [2023-03-08 14:44:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 69304320. Throughput: 0: 10541.0. Samples: 69276852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:11,893][434108] Avg episode reward: [(0, '4524.913')] [2023-03-08 14:44:15,618][434396] Updated weights for policy 0, policy_version 135440 (0.0005) [2023-03-08 14:44:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10663.5). Total num frames: 69357568. Throughput: 0: 10465.6. Samples: 69338408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:16,893][434108] Avg episode reward: [(0, '4592.071')] [2023-03-08 14:44:19,531][434396] Updated weights for policy 0, policy_version 135520 (0.0005) [2023-03-08 14:44:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.4, 300 sec: 10663.5). Total num frames: 69410816. Throughput: 0: 10429.3. Samples: 69402112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:21,893][434108] Avg episode reward: [(0, '4636.848')] [2023-03-08 14:44:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000135568_69410816.pth... [2023-03-08 14:44:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000134960_69099520.pth [2023-03-08 14:44:23,395][434396] Updated weights for policy 0, policy_version 135600 (0.0005) [2023-03-08 14:44:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 69464064. Throughput: 0: 10407.0. Samples: 69433496. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:26,893][434108] Avg episode reward: [(0, '4644.033')] [2023-03-08 14:44:27,207][434396] Updated weights for policy 0, policy_version 135680 (0.0005) [2023-03-08 14:44:30,888][434396] Updated weights for policy 0, policy_version 135760 (0.0004) [2023-03-08 14:44:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 69517312. Throughput: 0: 10459.5. Samples: 69499580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:31,893][434108] Avg episode reward: [(0, '4577.562')] [2023-03-08 14:44:34,525][434396] Updated weights for policy 0, policy_version 135840 (0.0004) [2023-03-08 14:44:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10663.5). Total num frames: 69574656. Throughput: 0: 10565.1. Samples: 69566464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:36,893][434108] Avg episode reward: [(0, '4582.781')] [2023-03-08 14:44:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000135888_69574656.pth... [2023-03-08 14:44:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000135264_69255168.pth [2023-03-08 14:44:38,415][434396] Updated weights for policy 0, policy_version 135920 (0.0005) [2023-03-08 14:44:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 69627904. Throughput: 0: 10567.9. Samples: 69597124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:41,893][434108] Avg episode reward: [(0, '4614.352')] [2023-03-08 14:44:42,273][434396] Updated weights for policy 0, policy_version 136000 (0.0005) [2023-03-08 14:44:46,250][434396] Updated weights for policy 0, policy_version 136080 (0.0005) [2023-03-08 14:44:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 69677056. Throughput: 0: 10592.6. 
Samples: 69660684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:46,893][434108] Avg episode reward: [(0, '4551.577')] [2023-03-08 14:44:50,115][434396] Updated weights for policy 0, policy_version 136160 (0.0005) [2023-03-08 14:44:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 69730304. Throughput: 0: 10594.4. Samples: 69723132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:51,893][434108] Avg episode reward: [(0, '4498.144')] [2023-03-08 14:44:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000136192_69730304.pth... [2023-03-08 14:44:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000135568_69410816.pth [2023-03-08 14:44:54,044][434396] Updated weights for policy 0, policy_version 136240 (0.0005) [2023-03-08 14:44:56,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 69783552. Throughput: 0: 10623.0. Samples: 69754888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:44:56,893][434108] Avg episode reward: [(0, '4497.495')] [2023-03-08 14:44:57,995][434396] Updated weights for policy 0, policy_version 136320 (0.0005) [2023-03-08 14:45:01,888][434396] Updated weights for policy 0, policy_version 136400 (0.0005) [2023-03-08 14:45:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 69836800. Throughput: 0: 10637.7. Samples: 69817104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:01,893][434108] Avg episode reward: [(0, '4633.102')] [2023-03-08 14:45:05,763][434396] Updated weights for policy 0, policy_version 136480 (0.0005) [2023-03-08 14:45:06,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.0, 300 sec: 10635.7). Total num frames: 69885952. Throughput: 0: 10632.0. Samples: 69880552. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:06,893][434108] Avg episode reward: [(0, '4634.478')] [2023-03-08 14:45:06,911][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000136504_69890048.pth... [2023-03-08 14:45:06,912][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000135888_69574656.pth [2023-03-08 14:45:09,727][434396] Updated weights for policy 0, policy_version 136560 (0.0004) [2023-03-08 14:45:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 69939200. Throughput: 0: 10620.9. Samples: 69911436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:11,893][434108] Avg episode reward: [(0, '4441.048')] [2023-03-08 14:45:13,756][434396] Updated weights for policy 0, policy_version 136640 (0.0005) [2023-03-08 14:45:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 69988352. Throughput: 0: 10503.6. Samples: 69972240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:16,893][434108] Avg episode reward: [(0, '4559.211')] [2023-03-08 14:45:17,710][434396] Updated weights for policy 0, policy_version 136720 (0.0005) [2023-03-08 14:45:21,560][434396] Updated weights for policy 0, policy_version 136800 (0.0005) [2023-03-08 14:45:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 70041600. Throughput: 0: 10422.2. Samples: 70035464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:21,893][434108] Avg episode reward: [(0, '4408.820')] [2023-03-08 14:45:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000136800_70041600.pth... 
[2023-03-08 14:45:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000136192_69730304.pth [2023-03-08 14:45:25,426][434396] Updated weights for policy 0, policy_version 136880 (0.0004) [2023-03-08 14:45:26,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 70094848. Throughput: 0: 10461.4. Samples: 70067888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:26,893][434108] Avg episode reward: [(0, '4436.755')] [2023-03-08 14:45:29,371][434396] Updated weights for policy 0, policy_version 136960 (0.0005) [2023-03-08 14:45:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 70148096. Throughput: 0: 10447.6. Samples: 70130824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:31,893][434108] Avg episode reward: [(0, '4443.503')] [2023-03-08 14:45:33,206][434396] Updated weights for policy 0, policy_version 137040 (0.0004) [2023-03-08 14:45:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 70201344. Throughput: 0: 10469.4. Samples: 70194256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:36,893][434108] Avg episode reward: [(0, '4474.516')] [2023-03-08 14:45:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000137112_70201344.pth... [2023-03-08 14:45:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000136504_69890048.pth [2023-03-08 14:45:37,048][434396] Updated weights for policy 0, policy_version 137120 (0.0005) [2023-03-08 14:45:40,948][434396] Updated weights for policy 0, policy_version 137200 (0.0005) [2023-03-08 14:45:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 70254592. Throughput: 0: 10468.8. Samples: 70225984. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:41,893][434108] Avg episode reward: [(0, '4440.229')] [2023-03-08 14:45:44,950][434396] Updated weights for policy 0, policy_version 137280 (0.0005) [2023-03-08 14:45:46,892][434108] Fps is (10 sec: 10649.8, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 70307840. Throughput: 0: 10457.5. Samples: 70287688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:46,893][434108] Avg episode reward: [(0, '4471.996')] [2023-03-08 14:45:48,911][434396] Updated weights for policy 0, policy_version 137360 (0.0005) [2023-03-08 14:45:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 70356992. Throughput: 0: 10417.7. Samples: 70349348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:51,893][434108] Avg episode reward: [(0, '4548.450')] [2023-03-08 14:45:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000137416_70356992.pth... [2023-03-08 14:45:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000136800_70041600.pth [2023-03-08 14:45:52,874][434396] Updated weights for policy 0, policy_version 137440 (0.0005) [2023-03-08 14:45:56,578][434396] Updated weights for policy 0, policy_version 137520 (0.0004) [2023-03-08 14:45:56,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 70410240. Throughput: 0: 10448.8. Samples: 70381632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:45:56,893][434108] Avg episode reward: [(0, '4517.229')] [2023-03-08 14:46:00,508][434396] Updated weights for policy 0, policy_version 137600 (0.0005) [2023-03-08 14:46:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 70463488. Throughput: 0: 10550.0. Samples: 70446992. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:01,893][434108] Avg episode reward: [(0, '4585.870')] [2023-03-08 14:46:04,494][434396] Updated weights for policy 0, policy_version 137680 (0.0004) [2023-03-08 14:46:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 70516736. Throughput: 0: 10512.7. Samples: 70508536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:06,893][434108] Avg episode reward: [(0, '4612.660')] [2023-03-08 14:46:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000137728_70516736.pth... [2023-03-08 14:46:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000137112_70201344.pth [2023-03-08 14:46:08,395][434396] Updated weights for policy 0, policy_version 137760 (0.0004) [2023-03-08 14:46:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 70565888. Throughput: 0: 10481.9. Samples: 70539572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:11,893][434108] Avg episode reward: [(0, '4470.073')] [2023-03-08 14:46:12,378][434396] Updated weights for policy 0, policy_version 137840 (0.0005) [2023-03-08 14:46:16,324][434396] Updated weights for policy 0, policy_version 137920 (0.0005) [2023-03-08 14:46:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 70619136. Throughput: 0: 10479.3. Samples: 70602392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:16,893][434108] Avg episode reward: [(0, '4570.267')] [2023-03-08 14:46:20,376][434396] Updated weights for policy 0, policy_version 138000 (0.0005) [2023-03-08 14:46:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 70668288. Throughput: 0: 10423.1. Samples: 70663296. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:21,893][434108] Avg episode reward: [(0, '4617.238')] [2023-03-08 14:46:21,899][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000138032_70672384.pth... [2023-03-08 14:46:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000137416_70356992.pth [2023-03-08 14:46:24,218][434396] Updated weights for policy 0, policy_version 138080 (0.0004) [2023-03-08 14:46:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 70721536. Throughput: 0: 10424.9. Samples: 70695104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:26,893][434108] Avg episode reward: [(0, '4561.246')] [2023-03-08 14:46:28,094][434396] Updated weights for policy 0, policy_version 138160 (0.0005) [2023-03-08 14:46:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 70774784. Throughput: 0: 10461.6. Samples: 70758464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:31,893][434108] Avg episode reward: [(0, '4579.823')] [2023-03-08 14:46:31,907][434396] Updated weights for policy 0, policy_version 138240 (0.0005) [2023-03-08 14:46:35,855][434396] Updated weights for policy 0, policy_version 138320 (0.0005) [2023-03-08 14:46:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 70828032. Throughput: 0: 10494.4. Samples: 70821596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:36,893][434108] Avg episode reward: [(0, '4632.341')] [2023-03-08 14:46:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000138336_70828032.pth... 
[2023-03-08 14:46:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000137728_70516736.pth [2023-03-08 14:46:39,600][434396] Updated weights for policy 0, policy_version 138400 (0.0005) [2023-03-08 14:46:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10649.6). Total num frames: 70885376. Throughput: 0: 10510.3. Samples: 70854596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:41,893][434108] Avg episode reward: [(0, '4530.552')] [2023-03-08 14:46:43,244][434396] Updated weights for policy 0, policy_version 138480 (0.0004) [2023-03-08 14:46:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.0, 300 sec: 10649.6). Total num frames: 70938624. Throughput: 0: 10538.3. Samples: 70921216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:46,893][434108] Avg episode reward: [(0, '4634.280')] [2023-03-08 14:46:46,978][434396] Updated weights for policy 0, policy_version 138560 (0.0005) [2023-03-08 14:46:50,987][434396] Updated weights for policy 0, policy_version 138640 (0.0005) [2023-03-08 14:46:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 70991872. Throughput: 0: 10560.4. Samples: 70983752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:51,893][434108] Avg episode reward: [(0, '4611.551')] [2023-03-08 14:46:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000138656_70991872.pth... [2023-03-08 14:46:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000138032_70672384.pth [2023-03-08 14:46:55,011][434396] Updated weights for policy 0, policy_version 138720 (0.0006) [2023-03-08 14:46:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.0, 300 sec: 10635.7). Total num frames: 71041024. Throughput: 0: 10560.6. Samples: 71014800. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:46:56,893][434108] Avg episode reward: [(0, '4616.078')] [2023-03-08 14:46:58,879][434396] Updated weights for policy 0, policy_version 138800 (0.0005) [2023-03-08 14:47:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 71094272. Throughput: 0: 10555.7. Samples: 71077396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:47:01,893][434108] Avg episode reward: [(0, '4562.144')] [2023-03-08 14:47:02,869][434396] Updated weights for policy 0, policy_version 138880 (0.0005) [2023-03-08 14:47:06,763][434396] Updated weights for policy 0, policy_version 138960 (0.0005) [2023-03-08 14:47:06,893][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 71147520. Throughput: 0: 10578.9. Samples: 71139348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:47:06,893][434108] Avg episode reward: [(0, '4606.513')] [2023-03-08 14:47:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000138960_71147520.pth... [2023-03-08 14:47:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000138336_70828032.pth [2023-03-08 14:47:10,699][434396] Updated weights for policy 0, policy_version 139040 (0.0005) [2023-03-08 14:47:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 71196672. Throughput: 0: 10583.8. Samples: 71171376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:47:11,893][434108] Avg episode reward: [(0, '4526.946')] [2023-03-08 14:47:14,666][434396] Updated weights for policy 0, policy_version 139120 (0.0005) [2023-03-08 14:47:16,892][434108] Fps is (10 sec: 10649.8, 60 sec: 10581.4, 300 sec: 10621.8). Total num frames: 71254016. Throughput: 0: 10558.6. Samples: 71233600. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:47:16,893][434108] Avg episode reward: [(0, '4586.009')] [2023-03-08 14:47:18,514][434396] Updated weights for policy 0, policy_version 139200 (0.0005) [2023-03-08 14:47:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 71303168. Throughput: 0: 10522.1. Samples: 71295088. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:47:21,893][434108] Avg episode reward: [(0, '4629.150')] [2023-03-08 14:47:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000139264_71303168.pth... [2023-03-08 14:47:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000138656_70991872.pth [2023-03-08 14:47:22,621][434396] Updated weights for policy 0, policy_version 139280 (0.0005) [2023-03-08 14:47:26,570][434396] Updated weights for policy 0, policy_version 139360 (0.0005) [2023-03-08 14:47:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 71352320. Throughput: 0: 10451.0. Samples: 71324892. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:47:26,893][434108] Avg episode reward: [(0, '4591.393')] [2023-03-08 14:47:30,469][434396] Updated weights for policy 0, policy_version 139440 (0.0005) [2023-03-08 14:47:31,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 71405568. Throughput: 0: 10389.8. Samples: 71388756. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:47:31,893][434108] Avg episode reward: [(0, '4573.625')] [2023-03-08 14:47:34,168][434396] Updated weights for policy 0, policy_version 139520 (0.0005) [2023-03-08 14:47:36,892][434108] Fps is (10 sec: 11059.0, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 71462912. Throughput: 0: 10460.1. Samples: 71454456. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:47:36,893][434108] Avg episode reward: [(0, '4587.138')] [2023-03-08 14:47:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000139576_71462912.pth... [2023-03-08 14:47:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000138960_71147520.pth [2023-03-08 14:47:38,079][434396] Updated weights for policy 0, policy_version 139600 (0.0005) [2023-03-08 14:47:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 71512064. Throughput: 0: 10439.8. Samples: 71484592. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:47:41,893][434108] Avg episode reward: [(0, '4574.948')] [2023-03-08 14:47:42,101][434396] Updated weights for policy 0, policy_version 139680 (0.0006) [2023-03-08 14:47:46,137][434396] Updated weights for policy 0, policy_version 139760 (0.0005) [2023-03-08 14:47:46,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10376.6, 300 sec: 10580.2). Total num frames: 71561216. Throughput: 0: 10389.1. Samples: 71544904. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:47:46,893][434108] Avg episode reward: [(0, '4573.132')] [2023-03-08 14:47:50,075][434396] Updated weights for policy 0, policy_version 139840 (0.0005) [2023-03-08 14:47:51,893][434108] Fps is (10 sec: 10239.8, 60 sec: 10376.5, 300 sec: 10566.3). Total num frames: 71614464. Throughput: 0: 10400.8. Samples: 71607384. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:47:51,893][434108] Avg episode reward: [(0, '4483.629')] [2023-03-08 14:47:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000139872_71614464.pth... 
[2023-03-08 14:47:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000139264_71303168.pth [2023-03-08 14:47:54,147][434396] Updated weights for policy 0, policy_version 139920 (0.0005) [2023-03-08 14:47:56,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10538.5). Total num frames: 71663616. Throughput: 0: 10365.6. Samples: 71637828. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:47:56,893][434108] Avg episode reward: [(0, '4593.820')] [2023-03-08 14:47:58,094][434396] Updated weights for policy 0, policy_version 140000 (0.0005) [2023-03-08 14:48:01,892][434108] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10538.5). Total num frames: 71716864. Throughput: 0: 10375.1. Samples: 71700480. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:48:01,893][434108] Avg episode reward: [(0, '4437.517')] [2023-03-08 14:48:02,001][434396] Updated weights for policy 0, policy_version 140080 (0.0005) [2023-03-08 14:48:06,171][434396] Updated weights for policy 0, policy_version 140160 (0.0005) [2023-03-08 14:48:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10510.8). Total num frames: 71766016. Throughput: 0: 10347.1. Samples: 71760708. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:48:06,893][434108] Avg episode reward: [(0, '4485.297')] [2023-03-08 14:48:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000140168_71766016.pth... [2023-03-08 14:48:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000139576_71462912.pth [2023-03-08 14:48:10,332][434396] Updated weights for policy 0, policy_version 140240 (0.0005) [2023-03-08 14:48:11,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10483.0). Total num frames: 71815168. Throughput: 0: 10329.0. Samples: 71789700. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:48:11,893][434108] Avg episode reward: [(0, '4644.689')] [2023-03-08 14:48:14,473][434396] Updated weights for policy 0, policy_version 140320 (0.0005) [2023-03-08 14:48:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10483.0). Total num frames: 71868416. Throughput: 0: 10228.5. Samples: 71849036. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 14:48:16,893][434108] Avg episode reward: [(0, '4617.277')] [2023-03-08 14:48:18,292][434396] Updated weights for policy 0, policy_version 140400 (0.0005) [2023-03-08 14:48:21,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10455.2). Total num frames: 71917568. Throughput: 0: 10177.1. Samples: 71912424. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:48:21,893][434108] Avg episode reward: [(0, '4649.415')] [2023-03-08 14:48:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000140472_71921664.pth... [2023-03-08 14:48:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000139872_71614464.pth [2023-03-08 14:48:22,253][434396] Updated weights for policy 0, policy_version 140480 (0.0005) [2023-03-08 14:48:26,185][434396] Updated weights for policy 0, policy_version 140560 (0.0005) [2023-03-08 14:48:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.2, 300 sec: 10455.2). Total num frames: 71970816. Throughput: 0: 10206.3. Samples: 71943876. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:48:26,893][434108] Avg episode reward: [(0, '4644.012')] [2023-03-08 14:48:29,965][434396] Updated weights for policy 0, policy_version 140640 (0.0004) [2023-03-08 14:48:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 72024064. Throughput: 0: 10288.5. Samples: 72007888. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:48:31,893][434108] Avg episode reward: [(0, '4644.703')] [2023-03-08 14:48:33,745][434396] Updated weights for policy 0, policy_version 140720 (0.0005) [2023-03-08 14:48:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 72081408. Throughput: 0: 10351.8. Samples: 72073212. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:48:36,893][434108] Avg episode reward: [(0, '4649.136')] [2023-03-08 14:48:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000140784_72081408.pth... [2023-03-08 14:48:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000140168_71766016.pth [2023-03-08 14:48:37,508][434396] Updated weights for policy 0, policy_version 140800 (0.0004) [2023-03-08 14:48:41,526][434396] Updated weights for policy 0, policy_version 140880 (0.0005) [2023-03-08 14:48:41,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 72134656. Throughput: 0: 10400.3. Samples: 72105840. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:48:41,893][434108] Avg episode reward: [(0, '4620.020')] [2023-03-08 14:48:45,446][434396] Updated weights for policy 0, policy_version 140960 (0.0005) [2023-03-08 14:48:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10455.2). Total num frames: 72183808. Throughput: 0: 10376.5. Samples: 72167424. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:48:46,893][434108] Avg episode reward: [(0, '4539.310')] [2023-03-08 14:48:49,640][434396] Updated weights for policy 0, policy_version 141040 (0.0005) [2023-03-08 14:48:51,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 72232960. Throughput: 0: 10316.3. Samples: 72224944. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:48:51,893][434108] Avg episode reward: [(0, '4459.811')] [2023-03-08 14:48:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000141080_72232960.pth... [2023-03-08 14:48:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000140472_71921664.pth [2023-03-08 14:48:53,821][434396] Updated weights for policy 0, policy_version 141120 (0.0005) [2023-03-08 14:48:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10441.3). Total num frames: 72282112. Throughput: 0: 10353.2. Samples: 72255596. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:48:56,893][434108] Avg episode reward: [(0, '4049.791')] [2023-03-08 14:48:57,955][434396] Updated weights for policy 0, policy_version 141200 (0.0005) [2023-03-08 14:49:01,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 72331264. Throughput: 0: 10338.7. Samples: 72314276. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:01,893][434108] Avg episode reward: [(0, '4479.862')] [2023-03-08 14:49:02,067][434396] Updated weights for policy 0, policy_version 141280 (0.0006) [2023-03-08 14:49:05,992][434396] Updated weights for policy 0, policy_version 141360 (0.0005) [2023-03-08 14:49:06,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 72380416. Throughput: 0: 10310.5. Samples: 72376396. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:06,893][434108] Avg episode reward: [(0, '4450.838')] [2023-03-08 14:49:06,924][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000141376_72384512.pth... 
[2023-03-08 14:49:06,925][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000140784_72081408.pth [2023-03-08 14:49:10,221][434396] Updated weights for policy 0, policy_version 141440 (0.0005) [2023-03-08 14:49:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 72433664. Throughput: 0: 10248.4. Samples: 72405056. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:11,893][434108] Avg episode reward: [(0, '4485.352')] [2023-03-08 14:49:14,327][434396] Updated weights for policy 0, policy_version 141520 (0.0005) [2023-03-08 14:49:16,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10413.6). Total num frames: 72482816. Throughput: 0: 10172.0. Samples: 72465628. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:16,893][434108] Avg episode reward: [(0, '4520.189')] [2023-03-08 14:49:18,410][434396] Updated weights for policy 0, policy_version 141600 (0.0005) [2023-03-08 14:49:21,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10399.7). Total num frames: 72531968. Throughput: 0: 10059.9. Samples: 72525908. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:21,893][434108] Avg episode reward: [(0, '4634.891')] [2023-03-08 14:49:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000141664_72531968.pth... [2023-03-08 14:49:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000141080_72232960.pth [2023-03-08 14:49:22,527][434396] Updated weights for policy 0, policy_version 141680 (0.0004) [2023-03-08 14:49:26,584][434396] Updated weights for policy 0, policy_version 141760 (0.0006) [2023-03-08 14:49:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10385.8). Total num frames: 72581120. Throughput: 0: 10000.7. Samples: 72555872. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:26,893][434108] Avg episode reward: [(0, '4541.974')] [2023-03-08 14:49:30,952][434396] Updated weights for policy 0, policy_version 141840 (0.0006) [2023-03-08 14:49:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10358.0). Total num frames: 72630272. Throughput: 0: 9905.0. Samples: 72613148. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:31,893][434108] Avg episode reward: [(0, '4590.082')] [2023-03-08 14:49:35,070][434396] Updated weights for policy 0, policy_version 141920 (0.0006) [2023-03-08 14:49:36,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10344.1). Total num frames: 72679424. Throughput: 0: 9938.0. Samples: 72672156. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:36,893][434108] Avg episode reward: [(0, '4583.538')] [2023-03-08 14:49:36,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000141952_72679424.pth... [2023-03-08 14:49:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000141376_72384512.pth [2023-03-08 14:49:39,029][434396] Updated weights for policy 0, policy_version 142000 (0.0006) [2023-03-08 14:49:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10358.0). Total num frames: 72732672. Throughput: 0: 9964.7. Samples: 72704008. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:41,893][434108] Avg episode reward: [(0, '4532.235')] [2023-03-08 14:49:43,058][434396] Updated weights for policy 0, policy_version 142080 (0.0004) [2023-03-08 14:49:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10344.1). Total num frames: 72781824. Throughput: 0: 10025.9. Samples: 72765440. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:46,893][434108] Avg episode reward: [(0, '4463.747')] [2023-03-08 14:49:46,968][434396] Updated weights for policy 0, policy_version 142160 (0.0004) [2023-03-08 14:49:50,712][434396] Updated weights for policy 0, policy_version 142240 (0.0005) [2023-03-08 14:49:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10344.1). Total num frames: 72835072. Throughput: 0: 10079.7. Samples: 72829984. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:51,893][434108] Avg episode reward: [(0, '4571.954')] [2023-03-08 14:49:51,902][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000142264_72839168.pth... [2023-03-08 14:49:51,904][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000141664_72531968.pth [2023-03-08 14:49:54,589][434396] Updated weights for policy 0, policy_version 142320 (0.0004) [2023-03-08 14:49:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10171.7, 300 sec: 10358.0). Total num frames: 72892416. Throughput: 0: 10143.4. Samples: 72861508. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:49:56,893][434108] Avg episode reward: [(0, '4465.900')] [2023-03-08 14:49:58,255][434396] Updated weights for policy 0, policy_version 142400 (0.0003) [2023-03-08 14:50:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10240.0, 300 sec: 10371.9). Total num frames: 72945664. Throughput: 0: 10290.4. Samples: 72928696. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:50:01,893][434108] Avg episode reward: [(0, '4470.953')] [2023-03-08 14:50:01,960][434396] Updated weights for policy 0, policy_version 142480 (0.0004) [2023-03-08 14:50:06,190][434396] Updated weights for policy 0, policy_version 142560 (0.0005) [2023-03-08 14:50:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 72994816. Throughput: 0: 10301.1. 
Samples: 72989456. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:50:06,893][434108] Avg episode reward: [(0, '4554.513')] [2023-03-08 14:50:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000142568_72994816.pth... [2023-03-08 14:50:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000141952_72679424.pth [2023-03-08 14:50:10,202][434396] Updated weights for policy 0, policy_version 142640 (0.0005) [2023-03-08 14:50:11,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10371.9). Total num frames: 73048064. Throughput: 0: 10301.9. Samples: 73019456. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:50:11,893][434108] Avg episode reward: [(0, '4439.068')] [2023-03-08 14:50:14,176][434396] Updated weights for policy 0, policy_version 142720 (0.0005) [2023-03-08 14:50:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 73097216. Throughput: 0: 10403.3. Samples: 73081296. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:50:16,893][434108] Avg episode reward: [(0, '4494.574')] [2023-03-08 14:50:18,182][434396] Updated weights for policy 0, policy_version 142800 (0.0005) [2023-03-08 14:50:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 73150464. Throughput: 0: 10447.2. Samples: 73142280. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:50:21,893][434108] Avg episode reward: [(0, '4378.367')] [2023-03-08 14:50:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000142872_73150464.pth... 
[2023-03-08 14:50:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000142264_72839168.pth [2023-03-08 14:50:22,215][434396] Updated weights for policy 0, policy_version 142880 (0.0005) [2023-03-08 14:50:26,004][434396] Updated weights for policy 0, policy_version 142960 (0.0005) [2023-03-08 14:50:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10358.0). Total num frames: 73203712. Throughput: 0: 10458.7. Samples: 73174648. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:50:26,893][434108] Avg episode reward: [(0, '4397.141')] [2023-03-08 14:50:30,067][434396] Updated weights for policy 0, policy_version 143040 (0.0006) [2023-03-08 14:50:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 73252864. Throughput: 0: 10467.6. Samples: 73236480. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:50:31,893][434108] Avg episode reward: [(0, '4602.208')] [2023-03-08 14:50:33,992][434396] Updated weights for policy 0, policy_version 143120 (0.0005) [2023-03-08 14:50:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10344.1). Total num frames: 73306112. Throughput: 0: 10419.5. Samples: 73298860. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:50:36,893][434108] Avg episode reward: [(0, '4668.429')] [2023-03-08 14:50:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000143176_73306112.pth... [2023-03-08 14:50:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000142568_72994816.pth [2023-03-08 14:50:36,899][434347] Saving new best policy, reward=4668.429! 
[2023-03-08 14:50:37,826][434396] Updated weights for policy 0, policy_version 143200 (0.0004) [2023-03-08 14:50:41,802][434396] Updated weights for policy 0, policy_version 143280 (0.0004) [2023-03-08 14:50:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10344.1). Total num frames: 73359360. Throughput: 0: 10427.8. Samples: 73330760. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:50:41,893][434108] Avg episode reward: [(0, '4500.589')] [2023-03-08 14:50:45,884][434396] Updated weights for policy 0, policy_version 143360 (0.0005) [2023-03-08 14:50:46,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10344.1). Total num frames: 73408512. Throughput: 0: 10298.5. Samples: 73392128. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:50:46,893][434108] Avg episode reward: [(0, '4648.021')] [2023-03-08 14:50:50,004][434396] Updated weights for policy 0, policy_version 143440 (0.0005) [2023-03-08 14:50:51,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10376.5, 300 sec: 10330.2). Total num frames: 73457664. Throughput: 0: 10273.2. Samples: 73451748. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:50:51,893][434108] Avg episode reward: [(0, '4495.527')] [2023-03-08 14:50:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000143472_73457664.pth... [2023-03-08 14:50:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000142872_73150464.pth [2023-03-08 14:50:54,091][434396] Updated weights for policy 0, policy_version 143520 (0.0006) [2023-03-08 14:50:56,893][434108] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10316.4). Total num frames: 73506816. Throughput: 0: 10266.9. Samples: 73481468. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:50:56,893][434108] Avg episode reward: [(0, '4579.824')] [2023-03-08 14:50:58,336][434396] Updated weights for policy 0, policy_version 143600 (0.0005) [2023-03-08 14:51:01,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10302.5). Total num frames: 73555968. Throughput: 0: 10184.2. Samples: 73539584. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:51:01,893][434108] Avg episode reward: [(0, '4537.083')] [2023-03-08 14:51:02,550][434396] Updated weights for policy 0, policy_version 143680 (0.0005) [2023-03-08 14:51:06,663][434396] Updated weights for policy 0, policy_version 143760 (0.0006) [2023-03-08 14:51:06,892][434108] Fps is (10 sec: 9830.6, 60 sec: 10171.7, 300 sec: 10302.5). Total num frames: 73605120. Throughput: 0: 10142.3. Samples: 73598684. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:51:06,893][434108] Avg episode reward: [(0, '4611.227')] [2023-03-08 14:51:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000143760_73605120.pth... [2023-03-08 14:51:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000143176_73306112.pth [2023-03-08 14:51:10,587][434396] Updated weights for policy 0, policy_version 143840 (0.0004) [2023-03-08 14:51:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10302.5). Total num frames: 73658368. Throughput: 0: 10099.1. Samples: 73629108. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:51:11,893][434108] Avg episode reward: [(0, '4654.653')] [2023-03-08 14:51:14,321][434396] Updated weights for policy 0, policy_version 143920 (0.0004) [2023-03-08 14:51:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10316.4). Total num frames: 73711616. Throughput: 0: 10158.6. Samples: 73693616. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:51:16,893][434108] Avg episode reward: [(0, '4638.035')] [2023-03-08 14:51:18,348][434396] Updated weights for policy 0, policy_version 144000 (0.0006) [2023-03-08 14:51:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10302.5). Total num frames: 73760768. Throughput: 0: 10137.0. Samples: 73755024. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 14:51:21,893][434108] Avg episode reward: [(0, '4636.358')] [2023-03-08 14:51:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000144064_73760768.pth... [2023-03-08 14:51:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000143472_73457664.pth [2023-03-08 14:51:22,432][434396] Updated weights for policy 0, policy_version 144080 (0.0005) [2023-03-08 14:51:26,627][434396] Updated weights for policy 0, policy_version 144160 (0.0005) [2023-03-08 14:51:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10288.6). Total num frames: 73809920. Throughput: 0: 10101.4. Samples: 73785324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:51:26,893][434108] Avg episode reward: [(0, '4588.341')] [2023-03-08 14:51:30,751][434396] Updated weights for policy 0, policy_version 144240 (0.0006) [2023-03-08 14:51:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10274.7). Total num frames: 73859072. Throughput: 0: 10020.2. Samples: 73843036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:51:31,893][434108] Avg episode reward: [(0, '4504.873')] [2023-03-08 14:51:34,899][434396] Updated weights for policy 0, policy_version 144320 (0.0005) [2023-03-08 14:51:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10260.8). Total num frames: 73912320. Throughput: 0: 10054.3. Samples: 73904192. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:51:36,893][434108] Avg episode reward: [(0, '4408.118')] [2023-03-08 14:51:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000144360_73912320.pth... [2023-03-08 14:51:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000143760_73605120.pth [2023-03-08 14:51:38,571][434396] Updated weights for policy 0, policy_version 144400 (0.0004) [2023-03-08 14:51:41,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10103.5, 300 sec: 10260.8). Total num frames: 73965568. Throughput: 0: 10130.0. Samples: 73937316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:51:41,893][434108] Avg episode reward: [(0, '4581.814')] [2023-03-08 14:51:42,347][434396] Updated weights for policy 0, policy_version 144480 (0.0004) [2023-03-08 14:51:46,067][434396] Updated weights for policy 0, policy_version 144560 (0.0003) [2023-03-08 14:51:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10240.0, 300 sec: 10274.7). Total num frames: 74022912. Throughput: 0: 10295.0. Samples: 74002860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:51:46,893][434108] Avg episode reward: [(0, '4635.562')] [2023-03-08 14:51:49,689][434396] Updated weights for policy 0, policy_version 144640 (0.0004) [2023-03-08 14:51:51,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 74076160. Throughput: 0: 10463.6. Samples: 74069548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:51:51,893][434108] Avg episode reward: [(0, '4555.538')] [2023-03-08 14:51:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000144680_74076160.pth... 
[2023-03-08 14:51:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000144064_73760768.pth [2023-03-08 14:51:53,559][434396] Updated weights for policy 0, policy_version 144720 (0.0005) [2023-03-08 14:51:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 74133504. Throughput: 0: 10501.0. Samples: 74101652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:51:56,893][434108] Avg episode reward: [(0, '4642.970')] [2023-03-08 14:51:57,231][434396] Updated weights for policy 0, policy_version 144800 (0.0004) [2023-03-08 14:52:01,175][434396] Updated weights for policy 0, policy_version 144880 (0.0005) [2023-03-08 14:52:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10288.6). Total num frames: 74182656. Throughput: 0: 10505.3. Samples: 74166356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:52:01,893][434108] Avg episode reward: [(0, '4630.704')] [2023-03-08 14:52:05,212][434396] Updated weights for policy 0, policy_version 144960 (0.0006) [2023-03-08 14:52:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10302.5). Total num frames: 74235904. Throughput: 0: 10505.6. Samples: 74227776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:52:06,893][434108] Avg episode reward: [(0, '4650.678')] [2023-03-08 14:52:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000144992_74235904.pth... [2023-03-08 14:52:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000144360_73912320.pth [2023-03-08 14:52:09,114][434396] Updated weights for policy 0, policy_version 145040 (0.0005) [2023-03-08 14:52:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10288.6). Total num frames: 74289152. Throughput: 0: 10543.2. Samples: 74259768. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:52:11,893][434108] Avg episode reward: [(0, '4618.416')] [2023-03-08 14:52:13,029][434396] Updated weights for policy 0, policy_version 145120 (0.0005) [2023-03-08 14:52:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10288.6). Total num frames: 74338304. Throughput: 0: 10653.4. Samples: 74322440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:52:16,893][434108] Avg episode reward: [(0, '4651.820')] [2023-03-08 14:52:16,942][434396] Updated weights for policy 0, policy_version 145200 (0.0005) [2023-03-08 14:52:20,953][434396] Updated weights for policy 0, policy_version 145280 (0.0005) [2023-03-08 14:52:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10302.5). Total num frames: 74391552. Throughput: 0: 10655.4. Samples: 74383684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:52:21,893][434108] Avg episode reward: [(0, '4654.933')] [2023-03-08 14:52:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000145296_74391552.pth... [2023-03-08 14:52:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000144680_74076160.pth [2023-03-08 14:52:24,881][434396] Updated weights for policy 0, policy_version 145360 (0.0005) [2023-03-08 14:52:26,892][434108] Fps is (10 sec: 10649.8, 60 sec: 10581.4, 300 sec: 10302.5). Total num frames: 74444800. Throughput: 0: 10633.0. Samples: 74415800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:52:26,893][434108] Avg episode reward: [(0, '4630.152')] [2023-03-08 14:52:28,908][434396] Updated weights for policy 0, policy_version 145440 (0.0005) [2023-03-08 14:52:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10274.7). Total num frames: 74493952. Throughput: 0: 10511.4. Samples: 74475872. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:52:31,893][434108] Avg episode reward: [(0, '4643.857')] [2023-03-08 14:52:32,984][434396] Updated weights for policy 0, policy_version 145520 (0.0004) [2023-03-08 14:52:36,893][434108] Fps is (10 sec: 9830.2, 60 sec: 10513.1, 300 sec: 10274.7). Total num frames: 74543104. Throughput: 0: 10377.2. Samples: 74536524. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:52:36,893][434108] Avg episode reward: [(0, '4646.389')] [2023-03-08 14:52:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000145592_74543104.pth... [2023-03-08 14:52:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000144992_74235904.pth [2023-03-08 14:52:36,981][434396] Updated weights for policy 0, policy_version 145600 (0.0005) [2023-03-08 14:52:41,129][434396] Updated weights for policy 0, policy_version 145680 (0.0005) [2023-03-08 14:52:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10288.6). Total num frames: 74596352. Throughput: 0: 10356.4. Samples: 74567688. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:52:41,893][434108] Avg episode reward: [(0, '4581.318')] [2023-03-08 14:52:45,139][434396] Updated weights for policy 0, policy_version 145760 (0.0005) [2023-03-08 14:52:46,892][434108] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10274.7). Total num frames: 74645504. Throughput: 0: 10256.8. Samples: 74627912. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:52:46,893][434108] Avg episode reward: [(0, '4568.379')] [2023-03-08 14:52:49,236][434396] Updated weights for policy 0, policy_version 145840 (0.0006) [2023-03-08 14:52:51,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10274.7). Total num frames: 74694656. Throughput: 0: 10230.9. Samples: 74688168. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:52:51,893][434108] Avg episode reward: [(0, '4650.767')] [2023-03-08 14:52:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000145888_74694656.pth... [2023-03-08 14:52:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000145296_74391552.pth [2023-03-08 14:52:53,156][434396] Updated weights for policy 0, policy_version 145920 (0.0005) [2023-03-08 14:52:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10274.7). Total num frames: 74747904. Throughput: 0: 10229.4. Samples: 74720092. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:52:56,893][434108] Avg episode reward: [(0, '4566.814')] [2023-03-08 14:52:56,938][434396] Updated weights for policy 0, policy_version 146000 (0.0005) [2023-03-08 14:53:01,034][434396] Updated weights for policy 0, policy_version 146080 (0.0006) [2023-03-08 14:53:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10288.6). Total num frames: 74801152. Throughput: 0: 10240.8. Samples: 74783276. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:53:01,893][434108] Avg episode reward: [(0, '4309.711')] [2023-03-08 14:53:04,766][434396] Updated weights for policy 0, policy_version 146160 (0.0004) [2023-03-08 14:53:06,893][434108] Fps is (10 sec: 10649.4, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 74854400. Throughput: 0: 10300.9. Samples: 74847224. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:53:06,893][434108] Avg episode reward: [(0, '4420.449')] [2023-03-08 14:53:06,898][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000146200_74854400.pth... 
[2023-03-08 14:53:06,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000145592_74543104.pth [2023-03-08 14:53:08,675][434396] Updated weights for policy 0, policy_version 146240 (0.0005) [2023-03-08 14:53:11,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 74907648. Throughput: 0: 10283.7. Samples: 74878568. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:53:11,893][434108] Avg episode reward: [(0, '4441.849')] [2023-03-08 14:53:12,733][434396] Updated weights for policy 0, policy_version 146320 (0.0005) [2023-03-08 14:53:16,707][434396] Updated weights for policy 0, policy_version 146400 (0.0005) [2023-03-08 14:53:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10302.5). Total num frames: 74956800. Throughput: 0: 10312.2. Samples: 74939920. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:53:16,893][434108] Avg episode reward: [(0, '4622.666')] [2023-03-08 14:53:20,764][434396] Updated weights for policy 0, policy_version 146480 (0.0005) [2023-03-08 14:53:21,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 75005952. Throughput: 0: 10309.4. Samples: 75000448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:53:21,893][434108] Avg episode reward: [(0, '4424.411')] [2023-03-08 14:53:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000146496_75005952.pth... [2023-03-08 14:53:21,896][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000145888_74694656.pth [2023-03-08 14:53:24,880][434396] Updated weights for policy 0, policy_version 146560 (0.0005) [2023-03-08 14:53:26,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10288.6). Total num frames: 75059200. Throughput: 0: 10285.5. Samples: 75030536. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:53:26,893][434108] Avg episode reward: [(0, '4478.539')] [2023-03-08 14:53:28,924][434396] Updated weights for policy 0, policy_version 146640 (0.0005) [2023-03-08 14:53:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10260.8). Total num frames: 75108352. Throughput: 0: 10309.7. Samples: 75091852. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 14:53:31,893][434108] Avg episode reward: [(0, '4458.880')] [2023-03-08 14:53:32,955][434396] Updated weights for policy 0, policy_version 146720 (0.0005) [2023-03-08 14:53:36,875][434396] Updated weights for policy 0, policy_version 146800 (0.0004) [2023-03-08 14:53:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10260.8). Total num frames: 75161600. Throughput: 0: 10318.8. Samples: 75152516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:53:36,893][434108] Avg episode reward: [(0, '4613.340')] [2023-03-08 14:53:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000146800_75161600.pth... [2023-03-08 14:53:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000146200_74854400.pth [2023-03-08 14:53:40,869][434396] Updated weights for policy 0, policy_version 146880 (0.0004) [2023-03-08 14:53:41,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10260.8). Total num frames: 75210752. Throughput: 0: 10308.8. Samples: 75183988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:53:41,893][434108] Avg episode reward: [(0, '4478.316')] [2023-03-08 14:53:45,005][434396] Updated weights for policy 0, policy_version 146960 (0.0005) [2023-03-08 14:53:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10260.8). Total num frames: 75259904. Throughput: 0: 10229.1. Samples: 75243584. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:53:46,893][434108] Avg episode reward: [(0, '4402.812')] [2023-03-08 14:53:49,231][434396] Updated weights for policy 0, policy_version 147040 (0.0005) [2023-03-08 14:53:51,893][434108] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10260.8). Total num frames: 75309056. Throughput: 0: 10083.2. Samples: 75300968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:53:51,893][434108] Avg episode reward: [(0, '4198.669')] [2023-03-08 14:53:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000147088_75309056.pth... [2023-03-08 14:53:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000146496_75005952.pth [2023-03-08 14:53:53,443][434396] Updated weights for policy 0, policy_version 147120 (0.0005) [2023-03-08 14:53:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10260.8). Total num frames: 75358208. Throughput: 0: 10049.6. Samples: 75330800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:53:56,893][434108] Avg episode reward: [(0, '3815.982')] [2023-03-08 14:53:57,659][434396] Updated weights for policy 0, policy_version 147200 (0.0005) [2023-03-08 14:54:01,577][434396] Updated weights for policy 0, policy_version 147280 (0.0005) [2023-03-08 14:54:01,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10260.8). Total num frames: 75407360. Throughput: 0: 10025.1. Samples: 75391048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:01,893][434108] Avg episode reward: [(0, '3174.513')] [2023-03-08 14:54:06,004][434396] Updated weights for policy 0, policy_version 147360 (0.0004) [2023-03-08 14:54:06,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10246.9). Total num frames: 75456512. Throughput: 0: 9954.1. Samples: 75448384. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:06,893][434108] Avg episode reward: [(0, '3417.530')] [2023-03-08 14:54:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000147376_75456512.pth... [2023-03-08 14:54:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000146800_75161600.pth [2023-03-08 14:54:10,208][434396] Updated weights for policy 0, policy_version 147440 (0.0005) [2023-03-08 14:54:11,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10233.1). Total num frames: 75501568. Throughput: 0: 9938.9. Samples: 75477788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:11,893][434108] Avg episode reward: [(0, '3854.282')] [2023-03-08 14:54:14,587][434396] Updated weights for policy 0, policy_version 147520 (0.0005) [2023-03-08 14:54:16,892][434108] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 10233.1). Total num frames: 75550720. Throughput: 0: 9834.4. Samples: 75534400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:16,893][434108] Avg episode reward: [(0, '4425.015')] [2023-03-08 14:54:18,626][434396] Updated weights for policy 0, policy_version 147600 (0.0005) [2023-03-08 14:54:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10246.9). Total num frames: 75603968. Throughput: 0: 9850.2. Samples: 75595776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:21,893][434108] Avg episode reward: [(0, '4372.272')] [2023-03-08 14:54:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000147664_75603968.pth... 
[2023-03-08 14:54:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000147088_75309056.pth [2023-03-08 14:54:22,646][434396] Updated weights for policy 0, policy_version 147680 (0.0005) [2023-03-08 14:54:26,793][434396] Updated weights for policy 0, policy_version 147760 (0.0006) [2023-03-08 14:54:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10246.9). Total num frames: 75653120. Throughput: 0: 9804.7. Samples: 75625200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:26,893][434108] Avg episode reward: [(0, '4567.306')] [2023-03-08 14:54:30,917][434396] Updated weights for policy 0, policy_version 147840 (0.0005) [2023-03-08 14:54:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10246.9). Total num frames: 75702272. Throughput: 0: 9819.1. Samples: 75685444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:31,893][434108] Avg episode reward: [(0, '4610.775')] [2023-03-08 14:54:34,899][434396] Updated weights for policy 0, policy_version 147920 (0.0005) [2023-03-08 14:54:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10246.9). Total num frames: 75755520. Throughput: 0: 9918.9. Samples: 75747316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:36,893][434108] Avg episode reward: [(0, '4610.036')] [2023-03-08 14:54:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000147960_75755520.pth... [2023-03-08 14:54:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000147376_75456512.pth [2023-03-08 14:54:38,791][434396] Updated weights for policy 0, policy_version 148000 (0.0005) [2023-03-08 14:54:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10246.9). Total num frames: 75804672. Throughput: 0: 9949.8. Samples: 75778540. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:41,893][434108] Avg episode reward: [(0, '4587.820')] [2023-03-08 14:54:42,689][434396] Updated weights for policy 0, policy_version 148080 (0.0005) [2023-03-08 14:54:46,760][434396] Updated weights for policy 0, policy_version 148160 (0.0006) [2023-03-08 14:54:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10246.9). Total num frames: 75857920. Throughput: 0: 9994.9. Samples: 75840820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:46,893][434108] Avg episode reward: [(0, '4454.904')] [2023-03-08 14:54:50,780][434396] Updated weights for policy 0, policy_version 148240 (0.0005) [2023-03-08 14:54:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10219.2). Total num frames: 75907072. Throughput: 0: 10065.3. Samples: 75901324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:51,893][434108] Avg episode reward: [(0, '4607.249')] [2023-03-08 14:54:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000148256_75907072.pth... [2023-03-08 14:54:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000147664_75603968.pth [2023-03-08 14:54:54,636][434396] Updated weights for policy 0, policy_version 148320 (0.0005) [2023-03-08 14:54:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10219.2). Total num frames: 75960320. Throughput: 0: 10117.2. Samples: 75933064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:54:56,893][434108] Avg episode reward: [(0, '4519.719')] [2023-03-08 14:54:58,535][434396] Updated weights for policy 0, policy_version 148400 (0.0005) [2023-03-08 14:55:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10103.5, 300 sec: 10233.1). Total num frames: 76013568. Throughput: 0: 10268.5. Samples: 75996484. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:55:01,893][434108] Avg episode reward: [(0, '4501.789')] [2023-03-08 14:55:02,622][434396] Updated weights for policy 0, policy_version 148480 (0.0005) [2023-03-08 14:55:06,521][434396] Updated weights for policy 0, policy_version 148560 (0.0004) [2023-03-08 14:55:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10219.2). Total num frames: 76062720. Throughput: 0: 10261.9. Samples: 76057564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:55:06,893][434108] Avg episode reward: [(0, '4439.491')] [2023-03-08 14:55:06,952][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000148568_76066816.pth... [2023-03-08 14:55:06,953][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000147960_75755520.pth [2023-03-08 14:55:10,473][434396] Updated weights for policy 0, policy_version 148640 (0.0005) [2023-03-08 14:55:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10233.1). Total num frames: 76115968. Throughput: 0: 10278.7. Samples: 76087740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:55:11,893][434108] Avg episode reward: [(0, '4501.153')] [2023-03-08 14:55:14,521][434396] Updated weights for policy 0, policy_version 148720 (0.0005) [2023-03-08 14:55:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 76169216. Throughput: 0: 10328.0. Samples: 76150204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:55:16,893][434108] Avg episode reward: [(0, '4543.165')] [2023-03-08 14:55:18,171][434396] Updated weights for policy 0, policy_version 148800 (0.0004) [2023-03-08 14:55:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10233.1). Total num frames: 76222464. Throughput: 0: 10387.0. Samples: 76214732. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:55:21,893][434108] Avg episode reward: [(0, '4532.796')] [2023-03-08 14:55:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000148872_76222464.pth... [2023-03-08 14:55:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000148256_75907072.pth [2023-03-08 14:55:22,160][434396] Updated weights for policy 0, policy_version 148880 (0.0005) [2023-03-08 14:55:26,101][434396] Updated weights for policy 0, policy_version 148960 (0.0005) [2023-03-08 14:55:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10246.9). Total num frames: 76275712. Throughput: 0: 10378.8. Samples: 76245584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:55:26,893][434108] Avg episode reward: [(0, '4326.429')] [2023-03-08 14:55:29,926][434396] Updated weights for policy 0, policy_version 149040 (0.0005) [2023-03-08 14:55:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 76324864. Throughput: 0: 10409.2. Samples: 76309232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:55:31,893][434108] Avg episode reward: [(0, '4474.645')] [2023-03-08 14:55:33,856][434396] Updated weights for policy 0, policy_version 149120 (0.0005) [2023-03-08 14:55:36,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10233.1). Total num frames: 76378112. Throughput: 0: 10462.1. Samples: 76372120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:55:36,893][434108] Avg episode reward: [(0, '4430.390')] [2023-03-08 14:55:36,938][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000149184_76382208.pth... 
[2023-03-08 14:55:36,939][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000148568_76066816.pth [2023-03-08 14:55:37,707][434396] Updated weights for policy 0, policy_version 149200 (0.0005) [2023-03-08 14:55:41,668][434396] Updated weights for policy 0, policy_version 149280 (0.0005) [2023-03-08 14:55:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10246.9). Total num frames: 76431360. Throughput: 0: 10459.4. Samples: 76403736. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:55:41,893][434108] Avg episode reward: [(0, '4566.771')] [2023-03-08 14:55:45,587][434396] Updated weights for policy 0, policy_version 149360 (0.0005) [2023-03-08 14:55:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 76484608. Throughput: 0: 10450.0. Samples: 76466736. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:55:46,893][434108] Avg episode reward: [(0, '4392.639')] [2023-03-08 14:55:49,490][434396] Updated weights for policy 0, policy_version 149440 (0.0005) [2023-03-08 14:55:51,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10274.7). Total num frames: 76537856. Throughput: 0: 10485.5. Samples: 76529412. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:55:51,893][434108] Avg episode reward: [(0, '4419.028')] [2023-03-08 14:55:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000149488_76537856.pth... [2023-03-08 14:55:51,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000148872_76222464.pth [2023-03-08 14:55:53,394][434396] Updated weights for policy 0, policy_version 149520 (0.0005) [2023-03-08 14:55:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10274.7). Total num frames: 76587008. Throughput: 0: 10515.3. Samples: 76560928. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:55:56,893][434108] Avg episode reward: [(0, '4466.503')] [2023-03-08 14:55:57,375][434396] Updated weights for policy 0, policy_version 149600 (0.0005) [2023-03-08 14:56:01,312][434396] Updated weights for policy 0, policy_version 149680 (0.0005) [2023-03-08 14:56:01,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10288.6). Total num frames: 76640256. Throughput: 0: 10501.8. Samples: 76622784. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:56:01,893][434108] Avg episode reward: [(0, '4442.664')] [2023-03-08 14:56:05,261][434396] Updated weights for policy 0, policy_version 149760 (0.0005) [2023-03-08 14:56:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10288.6). Total num frames: 76693504. Throughput: 0: 10457.5. Samples: 76685320. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:56:06,893][434108] Avg episode reward: [(0, '4530.563')] [2023-03-08 14:56:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000149792_76693504.pth... [2023-03-08 14:56:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000149184_76382208.pth [2023-03-08 14:56:09,140][434396] Updated weights for policy 0, policy_version 149840 (0.0005) [2023-03-08 14:56:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10274.7). Total num frames: 76742656. Throughput: 0: 10472.4. Samples: 76716844. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:56:11,903][434108] Avg episode reward: [(0, '4460.835')] [2023-03-08 14:56:13,124][434396] Updated weights for policy 0, policy_version 149920 (0.0005) [2023-03-08 14:56:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10288.6). Total num frames: 76795904. Throughput: 0: 10438.5. Samples: 76778964. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:56:16,903][434108] Avg episode reward: [(0, '4550.430')] [2023-03-08 14:56:17,031][434396] Updated weights for policy 0, policy_version 150000 (0.0005) [2023-03-08 14:56:20,893][434396] Updated weights for policy 0, policy_version 150080 (0.0005) [2023-03-08 14:56:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10302.5). Total num frames: 76849152. Throughput: 0: 10445.5. Samples: 76842168. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:56:21,903][434108] Avg episode reward: [(0, '4539.454')] [2023-03-08 14:56:21,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000150096_76849152.pth... [2023-03-08 14:56:21,910][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000149488_76537856.pth [2023-03-08 14:56:24,558][434396] Updated weights for policy 0, policy_version 150160 (0.0004) [2023-03-08 14:56:26,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10513.1, 300 sec: 10330.3). Total num frames: 76906496. Throughput: 0: 10486.5. Samples: 76875628. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:56:26,903][434108] Avg episode reward: [(0, '4479.459')] [2023-03-08 14:56:28,207][434396] Updated weights for policy 0, policy_version 150240 (0.0004) [2023-03-08 14:56:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10330.3). Total num frames: 76959744. Throughput: 0: 10589.1. Samples: 76943248. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:56:31,893][434108] Avg episode reward: [(0, '4427.709')] [2023-03-08 14:56:31,909][434396] Updated weights for policy 0, policy_version 150320 (0.0004) [2023-03-08 14:56:35,622][434396] Updated weights for policy 0, policy_version 150400 (0.0005) [2023-03-08 14:56:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10344.1). Total num frames: 77017088. Throughput: 0: 10656.6. 
Samples: 77008960. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:56:36,893][434108] Avg episode reward: [(0, '4349.774')] [2023-03-08 14:56:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000150424_77017088.pth... [2023-03-08 14:56:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000149792_76693504.pth [2023-03-08 14:56:39,348][434396] Updated weights for policy 0, policy_version 150480 (0.0005) [2023-03-08 14:56:41,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10330.3). Total num frames: 77070336. Throughput: 0: 10686.9. Samples: 77041836. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 14:56:41,893][434108] Avg episode reward: [(0, '4480.012')] [2023-03-08 14:56:43,104][434396] Updated weights for policy 0, policy_version 150560 (0.0004) [2023-03-08 14:56:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10330.3). Total num frames: 77123584. Throughput: 0: 10767.1. Samples: 77107304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:56:46,893][434108] Avg episode reward: [(0, '4357.258')] [2023-03-08 14:56:46,914][434396] Updated weights for policy 0, policy_version 150640 (0.0005) [2023-03-08 14:56:50,808][434396] Updated weights for policy 0, policy_version 150720 (0.0004) [2023-03-08 14:56:51,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10316.4). Total num frames: 77176832. Throughput: 0: 10794.3. Samples: 77171064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:56:51,893][434108] Avg episode reward: [(0, '4560.736')] [2023-03-08 14:56:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000150736_77176832.pth... 
[2023-03-08 14:56:51,896][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000150096_76849152.pth [2023-03-08 14:56:54,509][434396] Updated weights for policy 0, policy_version 150800 (0.0004) [2023-03-08 14:56:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10344.1). Total num frames: 77234176. Throughput: 0: 10832.1. Samples: 77204288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:56:56,903][434108] Avg episode reward: [(0, '4582.900')] [2023-03-08 14:56:58,134][434396] Updated weights for policy 0, policy_version 150880 (0.0004) [2023-03-08 14:57:01,773][434396] Updated weights for policy 0, policy_version 150960 (0.0004) [2023-03-08 14:57:01,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10854.4, 300 sec: 10358.0). Total num frames: 77291520. Throughput: 0: 10945.7. Samples: 77271520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:01,893][434108] Avg episode reward: [(0, '4569.332')] [2023-03-08 14:57:05,427][434396] Updated weights for policy 0, policy_version 151040 (0.0004) [2023-03-08 14:57:06,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10358.0). Total num frames: 77344768. Throughput: 0: 11040.4. Samples: 77338984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:06,903][434108] Avg episode reward: [(0, '4415.145')] [2023-03-08 14:57:06,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000151064_77344768.pth... [2023-03-08 14:57:06,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000150424_77017088.pth [2023-03-08 14:57:09,378][434396] Updated weights for policy 0, policy_version 151120 (0.0005) [2023-03-08 14:57:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10371.9). Total num frames: 77398016. Throughput: 0: 10973.1. Samples: 77369416. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:11,903][434108] Avg episode reward: [(0, '4508.794')] [2023-03-08 14:57:13,267][434396] Updated weights for policy 0, policy_version 151200 (0.0005) [2023-03-08 14:57:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10371.9). Total num frames: 77451264. Throughput: 0: 10880.7. Samples: 77432880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:16,893][434108] Avg episode reward: [(0, '4612.534')] [2023-03-08 14:57:17,178][434396] Updated weights for policy 0, policy_version 151280 (0.0004) [2023-03-08 14:57:20,862][434396] Updated weights for policy 0, policy_version 151360 (0.0005) [2023-03-08 14:57:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10371.9). Total num frames: 77504512. Throughput: 0: 10866.7. Samples: 77497960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:21,893][434108] Avg episode reward: [(0, '4518.426')] [2023-03-08 14:57:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000151376_77504512.pth... [2023-03-08 14:57:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000150736_77176832.pth [2023-03-08 14:57:24,549][434396] Updated weights for policy 0, policy_version 151440 (0.0005) [2023-03-08 14:57:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10399.7). Total num frames: 77561856. Throughput: 0: 10878.0. Samples: 77531344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:26,893][434108] Avg episode reward: [(0, '4231.591')] [2023-03-08 14:57:28,213][434396] Updated weights for policy 0, policy_version 151520 (0.0004) [2023-03-08 14:57:31,879][434396] Updated weights for policy 0, policy_version 151600 (0.0004) [2023-03-08 14:57:31,892][434108] Fps is (10 sec: 11468.9, 60 sec: 10991.0, 300 sec: 10427.4). Total num frames: 77619200. Throughput: 0: 10921.8. 
Samples: 77598784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:31,893][434108] Avg episode reward: [(0, '4518.820')] [2023-03-08 14:57:35,643][434396] Updated weights for policy 0, policy_version 151680 (0.0005) [2023-03-08 14:57:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 10427.4). Total num frames: 77672448. Throughput: 0: 10961.2. Samples: 77664320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:36,903][434108] Avg episode reward: [(0, '4341.266')] [2023-03-08 14:57:36,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000151704_77672448.pth... [2023-03-08 14:57:36,908][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000151064_77344768.pth [2023-03-08 14:57:39,261][434396] Updated weights for policy 0, policy_version 151760 (0.0004) [2023-03-08 14:57:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 10455.2). Total num frames: 77729792. Throughput: 0: 10981.6. Samples: 77698460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:41,903][434108] Avg episode reward: [(0, '4469.043')] [2023-03-08 14:57:42,896][434396] Updated weights for policy 0, policy_version 151840 (0.0003) [2023-03-08 14:57:46,547][434396] Updated weights for policy 0, policy_version 151920 (0.0003) [2023-03-08 14:57:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10990.9, 300 sec: 10469.1). Total num frames: 77783040. Throughput: 0: 10971.8. Samples: 77765252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:57:46,903][434108] Avg episode reward: [(0, '4357.881')] [2023-03-08 14:57:50,137][434396] Updated weights for policy 0, policy_version 152000 (0.0005) [2023-03-08 14:57:51,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10483.0). Total num frames: 77840384. Throughput: 0: 10987.6. Samples: 77833424. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:57:51,903][434108] Avg episode reward: [(0, '4458.638')] [2023-03-08 14:57:51,906][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000152032_77840384.pth... [2023-03-08 14:57:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000151376_77504512.pth [2023-03-08 14:57:53,805][434396] Updated weights for policy 0, policy_version 152080 (0.0004) [2023-03-08 14:57:56,892][434108] Fps is (10 sec: 11468.7, 60 sec: 11059.2, 300 sec: 10496.9). Total num frames: 77897728. Throughput: 0: 11066.7. Samples: 77867416. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:57:56,903][434108] Avg episode reward: [(0, '4533.336')] [2023-03-08 14:57:57,465][434396] Updated weights for policy 0, policy_version 152160 (0.0004) [2023-03-08 14:58:01,171][434396] Updated weights for policy 0, policy_version 152240 (0.0004) [2023-03-08 14:58:01,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10990.9, 300 sec: 10496.9). Total num frames: 77950976. Throughput: 0: 11143.6. Samples: 77934344. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:01,903][434108] Avg episode reward: [(0, '4507.529')] [2023-03-08 14:58:04,863][434396] Updated weights for policy 0, policy_version 152320 (0.0004) [2023-03-08 14:58:06,892][434108] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10510.8). Total num frames: 78008320. Throughput: 0: 11172.0. Samples: 78000700. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:06,893][434108] Avg episode reward: [(0, '4491.477')] [2023-03-08 14:58:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000152360_78008320.pth... 
[2023-03-08 14:58:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000151704_77672448.pth [2023-03-08 14:58:08,498][434396] Updated weights for policy 0, policy_version 152400 (0.0003) [2023-03-08 14:58:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10524.6). Total num frames: 78061568. Throughput: 0: 11168.5. Samples: 78033928. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:11,893][434108] Avg episode reward: [(0, '4477.801')] [2023-03-08 14:58:12,401][434396] Updated weights for policy 0, policy_version 152480 (0.0005) [2023-03-08 14:58:16,009][434396] Updated weights for policy 0, policy_version 152560 (0.0003) [2023-03-08 14:58:16,892][434108] Fps is (10 sec: 11059.3, 60 sec: 11127.5, 300 sec: 10552.4). Total num frames: 78118912. Throughput: 0: 11113.3. Samples: 78098884. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:16,893][434108] Avg episode reward: [(0, '4532.191')] [2023-03-08 14:58:19,747][434396] Updated weights for policy 0, policy_version 152640 (0.0004) [2023-03-08 14:58:21,892][434108] Fps is (10 sec: 11468.7, 60 sec: 11195.7, 300 sec: 10566.3). Total num frames: 78176256. Throughput: 0: 11151.1. Samples: 78166120. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:21,893][434108] Avg episode reward: [(0, '4543.563')] [2023-03-08 14:58:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000152688_78176256.pth... [2023-03-08 14:58:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000152032_77840384.pth [2023-03-08 14:58:23,341][434396] Updated weights for policy 0, policy_version 152720 (0.0004) [2023-03-08 14:58:26,892][434108] Fps is (10 sec: 11059.1, 60 sec: 11127.5, 300 sec: 10580.2). Total num frames: 78229504. Throughput: 0: 11144.7. Samples: 78199972. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:26,893][434108] Avg episode reward: [(0, '4530.235')] [2023-03-08 14:58:27,152][434396] Updated weights for policy 0, policy_version 152800 (0.0005) [2023-03-08 14:58:30,767][434396] Updated weights for policy 0, policy_version 152880 (0.0004) [2023-03-08 14:58:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10594.1). Total num frames: 78286848. Throughput: 0: 11130.3. Samples: 78266116. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:31,893][434108] Avg episode reward: [(0, '4490.117')] [2023-03-08 14:58:34,475][434396] Updated weights for policy 0, policy_version 152960 (0.0004) [2023-03-08 14:58:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 11127.5, 300 sec: 10607.9). Total num frames: 78340096. Throughput: 0: 11081.2. Samples: 78332080. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:36,893][434108] Avg episode reward: [(0, '4557.646')] [2023-03-08 14:58:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000153008_78340096.pth... [2023-03-08 14:58:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000152360_78008320.pth [2023-03-08 14:58:38,150][434396] Updated weights for policy 0, policy_version 153040 (0.0004) [2023-03-08 14:58:41,864][434396] Updated weights for policy 0, policy_version 153120 (0.0004) [2023-03-08 14:58:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10635.7). Total num frames: 78397440. Throughput: 0: 11081.4. Samples: 78366080. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:41,893][434108] Avg episode reward: [(0, '4569.023')] [2023-03-08 14:58:45,700][434396] Updated weights for policy 0, policy_version 153200 (0.0005) [2023-03-08 14:58:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 11127.5, 300 sec: 10649.6). Total num frames: 78450688. Throughput: 0: 11040.7. 
Samples: 78431176. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 14:58:46,893][434108] Avg episode reward: [(0, '4480.407')] [2023-03-08 14:58:49,651][434396] Updated weights for policy 0, policy_version 153280 (0.0005) [2023-03-08 14:58:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10990.9, 300 sec: 10649.6). Total num frames: 78499840. Throughput: 0: 10959.7. Samples: 78493888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:58:51,893][434108] Avg episode reward: [(0, '4600.692')] [2023-03-08 14:58:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000153320_78499840.pth... [2023-03-08 14:58:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000152688_78176256.pth [2023-03-08 14:58:53,602][434396] Updated weights for policy 0, policy_version 153360 (0.0005) [2023-03-08 14:58:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10922.7, 300 sec: 10663.5). Total num frames: 78553088. Throughput: 0: 10901.3. Samples: 78524488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:58:56,893][434108] Avg episode reward: [(0, '4611.899')] [2023-03-08 14:58:57,576][434396] Updated weights for policy 0, policy_version 153440 (0.0005) [2023-03-08 14:59:01,514][434396] Updated weights for policy 0, policy_version 153520 (0.0005) [2023-03-08 14:59:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10854.4, 300 sec: 10663.5). Total num frames: 78602240. Throughput: 0: 10823.2. Samples: 78585928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:01,893][434108] Avg episode reward: [(0, '4460.780')] [2023-03-08 14:59:05,463][434396] Updated weights for policy 0, policy_version 153600 (0.0005) [2023-03-08 14:59:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10786.1, 300 sec: 10691.3). Total num frames: 78655488. Throughput: 0: 10722.3. Samples: 78648624. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:06,893][434108] Avg episode reward: [(0, '4594.551')] [2023-03-08 14:59:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000153624_78655488.pth... [2023-03-08 14:59:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000153008_78340096.pth [2023-03-08 14:59:09,443][434396] Updated weights for policy 0, policy_version 153680 (0.0005) [2023-03-08 14:59:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10705.1). Total num frames: 78708736. Throughput: 0: 10668.7. Samples: 78680064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:11,893][434108] Avg episode reward: [(0, '4532.718')] [2023-03-08 14:59:13,399][434396] Updated weights for policy 0, policy_version 153760 (0.0005) [2023-03-08 14:59:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10691.3). Total num frames: 78757888. Throughput: 0: 10565.8. Samples: 78741576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:16,893][434108] Avg episode reward: [(0, '4569.906')] [2023-03-08 14:59:17,332][434396] Updated weights for policy 0, policy_version 153840 (0.0005) [2023-03-08 14:59:21,374][434396] Updated weights for policy 0, policy_version 153920 (0.0005) [2023-03-08 14:59:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 78811136. Throughput: 0: 10466.8. Samples: 78803088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:21,893][434108] Avg episode reward: [(0, '4565.084')] [2023-03-08 14:59:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000153928_78811136.pth... 
[2023-03-08 14:59:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000153320_78499840.pth [2023-03-08 14:59:25,336][434396] Updated weights for policy 0, policy_version 154000 (0.0005) [2023-03-08 14:59:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10719.0). Total num frames: 78864384. Throughput: 0: 10404.6. Samples: 78834288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:26,893][434108] Avg episode reward: [(0, '4598.107')] [2023-03-08 14:59:29,016][434396] Updated weights for policy 0, policy_version 154080 (0.0004) [2023-03-08 14:59:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10581.3, 300 sec: 10732.9). Total num frames: 78921728. Throughput: 0: 10430.5. Samples: 78900548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:31,893][434108] Avg episode reward: [(0, '4551.842')] [2023-03-08 14:59:32,632][434396] Updated weights for policy 0, policy_version 154160 (0.0004) [2023-03-08 14:59:36,243][434396] Updated weights for policy 0, policy_version 154240 (0.0004) [2023-03-08 14:59:36,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10746.8). Total num frames: 78974976. Throughput: 0: 10532.0. Samples: 78967828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:36,893][434108] Avg episode reward: [(0, '4466.617')] [2023-03-08 14:59:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000154248_78974976.pth... [2023-03-08 14:59:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000153624_78655488.pth [2023-03-08 14:59:39,902][434396] Updated weights for policy 0, policy_version 154320 (0.0004) [2023-03-08 14:59:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10760.7). Total num frames: 79032320. Throughput: 0: 10606.8. Samples: 79001796. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:41,893][434108] Avg episode reward: [(0, '4453.931')] [2023-03-08 14:59:43,563][434396] Updated weights for policy 0, policy_version 154400 (0.0004) [2023-03-08 14:59:46,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10649.6, 300 sec: 10788.5). Total num frames: 79089664. Throughput: 0: 10738.8. Samples: 79069176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:46,893][434108] Avg episode reward: [(0, '4430.425')] [2023-03-08 14:59:47,235][434396] Updated weights for policy 0, policy_version 154480 (0.0004) [2023-03-08 14:59:50,950][434396] Updated weights for policy 0, policy_version 154560 (0.0004) [2023-03-08 14:59:51,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10788.4). Total num frames: 79142912. Throughput: 0: 10811.3. Samples: 79135132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:51,893][434108] Avg episode reward: [(0, '4491.894')] [2023-03-08 14:59:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000154576_79142912.pth... [2023-03-08 14:59:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000153928_78811136.pth [2023-03-08 14:59:54,584][434396] Updated weights for policy 0, policy_version 154640 (0.0004) [2023-03-08 14:59:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10802.3). Total num frames: 79200256. Throughput: 0: 10872.6. Samples: 79169332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 14:59:56,893][434108] Avg episode reward: [(0, '4465.112')] [2023-03-08 14:59:58,235][434396] Updated weights for policy 0, policy_version 154720 (0.0004) [2023-03-08 15:00:01,869][434396] Updated weights for policy 0, policy_version 154800 (0.0004) [2023-03-08 15:00:01,892][434108] Fps is (10 sec: 11468.9, 60 sec: 10922.7, 300 sec: 10830.1). Total num frames: 79257600. Throughput: 0: 10999.9. 
Samples: 79236572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:01,893][434108] Avg episode reward: [(0, '4376.613')] [2023-03-08 15:00:05,784][434396] Updated weights for policy 0, policy_version 154880 (0.0005) [2023-03-08 15:00:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10816.2). Total num frames: 79306752. Throughput: 0: 11065.8. Samples: 79301048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:06,893][434108] Avg episode reward: [(0, '4410.559')] [2023-03-08 15:00:06,961][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000154904_79310848.pth... [2023-03-08 15:00:06,963][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000154248_78974976.pth [2023-03-08 15:00:09,579][434396] Updated weights for policy 0, policy_version 154960 (0.0005) [2023-03-08 15:00:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10830.1). Total num frames: 79364096. Throughput: 0: 11086.4. Samples: 79333176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:11,893][434108] Avg episode reward: [(0, '4482.223')] [2023-03-08 15:00:13,174][434396] Updated weights for policy 0, policy_version 155040 (0.0005) [2023-03-08 15:00:16,879][434396] Updated weights for policy 0, policy_version 155120 (0.0004) [2023-03-08 15:00:16,892][434108] Fps is (10 sec: 11468.9, 60 sec: 11059.2, 300 sec: 10844.0). Total num frames: 79421440. Throughput: 0: 11120.6. Samples: 79400976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:16,893][434108] Avg episode reward: [(0, '4409.632')] [2023-03-08 15:00:20,536][434396] Updated weights for policy 0, policy_version 155200 (0.0004) [2023-03-08 15:00:21,892][434108] Fps is (10 sec: 11059.1, 60 sec: 11059.2, 300 sec: 10844.0). Total num frames: 79474688. Throughput: 0: 11105.6. Samples: 79467580. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:21,893][434108] Avg episode reward: [(0, '4105.064')] [2023-03-08 15:00:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000155224_79474688.pth... [2023-03-08 15:00:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000154576_79142912.pth [2023-03-08 15:00:24,256][434396] Updated weights for policy 0, policy_version 155280 (0.0004) [2023-03-08 15:00:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 10871.8). Total num frames: 79532032. Throughput: 0: 11072.9. Samples: 79500076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:26,893][434108] Avg episode reward: [(0, '4292.611')] [2023-03-08 15:00:28,036][434396] Updated weights for policy 0, policy_version 155360 (0.0004) [2023-03-08 15:00:31,665][434396] Updated weights for policy 0, policy_version 155440 (0.0004) [2023-03-08 15:00:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 10871.8). Total num frames: 79585280. Throughput: 0: 11054.8. Samples: 79566640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:31,893][434108] Avg episode reward: [(0, '4198.029')] [2023-03-08 15:00:35,614][434396] Updated weights for policy 0, policy_version 155520 (0.0005) [2023-03-08 15:00:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 11059.2, 300 sec: 10871.8). Total num frames: 79638528. Throughput: 0: 11006.0. Samples: 79630400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:36,893][434108] Avg episode reward: [(0, '4272.940')] [2023-03-08 15:00:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000155544_79638528.pth... 
[2023-03-08 15:00:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000154904_79310848.pth [2023-03-08 15:00:39,472][434396] Updated weights for policy 0, policy_version 155600 (0.0005) [2023-03-08 15:00:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10990.9, 300 sec: 10871.8). Total num frames: 79691776. Throughput: 0: 10969.2. Samples: 79662944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:41,893][434108] Avg episode reward: [(0, '4160.729')] [2023-03-08 15:00:43,423][434396] Updated weights for policy 0, policy_version 155680 (0.0005) [2023-03-08 15:00:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10922.7, 300 sec: 10871.8). Total num frames: 79745024. Throughput: 0: 10845.4. Samples: 79724616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:46,893][434108] Avg episode reward: [(0, '4313.664')] [2023-03-08 15:00:47,232][434396] Updated weights for policy 0, policy_version 155760 (0.0005) [2023-03-08 15:00:51,036][434396] Updated weights for policy 0, policy_version 155840 (0.0004) [2023-03-08 15:00:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10885.6). Total num frames: 79798272. Throughput: 0: 10862.8. Samples: 79789872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:51,893][434108] Avg episode reward: [(0, '4438.075')] [2023-03-08 15:00:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000155856_79798272.pth... [2023-03-08 15:00:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000155224_79474688.pth [2023-03-08 15:00:54,618][434396] Updated weights for policy 0, policy_version 155920 (0.0003) [2023-03-08 15:00:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10899.5). Total num frames: 79855616. Throughput: 0: 10900.1. Samples: 79823680. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:00:56,893][434108] Avg episode reward: [(0, '4505.167')] [2023-03-08 15:00:58,258][434396] Updated weights for policy 0, policy_version 156000 (0.0005) [2023-03-08 15:01:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10854.4, 300 sec: 10899.5). Total num frames: 79908864. Throughput: 0: 10909.6. Samples: 79891908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:01,893][434108] Avg episode reward: [(0, '4218.165')] [2023-03-08 15:01:01,922][434396] Updated weights for policy 0, policy_version 156080 (0.0004) [2023-03-08 15:01:05,964][434396] Updated weights for policy 0, policy_version 156160 (0.0005) [2023-03-08 15:01:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10922.7, 300 sec: 10913.4). Total num frames: 79962112. Throughput: 0: 10812.8. Samples: 79954156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:06,893][434108] Avg episode reward: [(0, '4496.238')] [2023-03-08 15:01:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000156176_79962112.pth... [2023-03-08 15:01:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000155544_79638528.pth [2023-03-08 15:01:09,826][434396] Updated weights for policy 0, policy_version 156240 (0.0005) [2023-03-08 15:01:11,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10854.4, 300 sec: 10913.4). Total num frames: 80015360. Throughput: 0: 10809.8. Samples: 79986516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:11,893][434108] Avg episode reward: [(0, '4383.116')] [2023-03-08 15:01:13,762][434396] Updated weights for policy 0, policy_version 156320 (0.0005) [2023-03-08 15:01:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10717.9, 300 sec: 10899.5). Total num frames: 80064512. Throughput: 0: 10701.3. Samples: 80048200. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:16,893][434108] Avg episode reward: [(0, '4328.044')] [2023-03-08 15:01:17,853][434396] Updated weights for policy 0, policy_version 156400 (0.0005) [2023-03-08 15:01:21,810][434396] Updated weights for policy 0, policy_version 156480 (0.0005) [2023-03-08 15:01:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10885.6). Total num frames: 80117760. Throughput: 0: 10648.4. Samples: 80109576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:21,893][434108] Avg episode reward: [(0, '4433.003')] [2023-03-08 15:01:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000156480_80117760.pth... [2023-03-08 15:01:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000155856_79798272.pth [2023-03-08 15:01:25,807][434396] Updated weights for policy 0, policy_version 156560 (0.0005) [2023-03-08 15:01:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10871.8). Total num frames: 80166912. Throughput: 0: 10600.5. Samples: 80139964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:26,893][434108] Avg episode reward: [(0, '4532.260')] [2023-03-08 15:01:29,783][434396] Updated weights for policy 0, policy_version 156640 (0.0004) [2023-03-08 15:01:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10857.9). Total num frames: 80220160. Throughput: 0: 10619.5. Samples: 80202492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:31,893][434108] Avg episode reward: [(0, '4403.614')] [2023-03-08 15:01:33,714][434396] Updated weights for policy 0, policy_version 156720 (0.0004) [2023-03-08 15:01:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10857.9). Total num frames: 80273408. Throughput: 0: 10557.0. Samples: 80264936. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:36,893][434108] Avg episode reward: [(0, '4321.800')] [2023-03-08 15:01:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000156784_80273408.pth... [2023-03-08 15:01:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000156176_79962112.pth [2023-03-08 15:01:37,615][434396] Updated weights for policy 0, policy_version 156800 (0.0004) [2023-03-08 15:01:41,633][434396] Updated weights for policy 0, policy_version 156880 (0.0004) [2023-03-08 15:01:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10844.0). Total num frames: 80322560. Throughput: 0: 10503.1. Samples: 80296320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:41,893][434108] Avg episode reward: [(0, '4329.552')] [2023-03-08 15:01:45,612][434396] Updated weights for policy 0, policy_version 156960 (0.0003) [2023-03-08 15:01:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10844.0). Total num frames: 80375808. Throughput: 0: 10353.2. Samples: 80357804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:46,893][434108] Avg episode reward: [(0, '4292.707')] [2023-03-08 15:01:49,510][434396] Updated weights for policy 0, policy_version 157040 (0.0004) [2023-03-08 15:01:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10816.2). Total num frames: 80424960. Throughput: 0: 10355.1. Samples: 80420136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:51,893][434108] Avg episode reward: [(0, '4287.967')] [2023-03-08 15:01:51,914][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000157088_80429056.pth... 
[2023-03-08 15:01:51,916][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000156480_80117760.pth [2023-03-08 15:01:53,468][434396] Updated weights for policy 0, policy_version 157120 (0.0003) [2023-03-08 15:01:56,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10802.3). Total num frames: 80478208. Throughput: 0: 10328.8. Samples: 80451312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:01:56,893][434108] Avg episode reward: [(0, '4165.166')] [2023-03-08 15:01:57,298][434396] Updated weights for policy 0, policy_version 157200 (0.0003) [2023-03-08 15:02:01,282][434396] Updated weights for policy 0, policy_version 157280 (0.0004) [2023-03-08 15:02:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10802.3). Total num frames: 80531456. Throughput: 0: 10374.3. Samples: 80515044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:01,893][434108] Avg episode reward: [(0, '4332.937')] [2023-03-08 15:02:05,239][434396] Updated weights for policy 0, policy_version 157360 (0.0005) [2023-03-08 15:02:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10802.3). Total num frames: 80584704. Throughput: 0: 10377.8. Samples: 80576576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:06,893][434108] Avg episode reward: [(0, '4398.723')] [2023-03-08 15:02:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000157392_80584704.pth... [2023-03-08 15:02:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000156784_80273408.pth [2023-03-08 15:02:09,136][434396] Updated weights for policy 0, policy_version 157440 (0.0005) [2023-03-08 15:02:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10788.4). Total num frames: 80633856. Throughput: 0: 10402.6. Samples: 80608080. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:11,893][434108] Avg episode reward: [(0, '4474.011')] [2023-03-08 15:02:13,090][434396] Updated weights for policy 0, policy_version 157520 (0.0005) [2023-03-08 15:02:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10788.4). Total num frames: 80687104. Throughput: 0: 10395.5. Samples: 80670288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:16,893][434108] Avg episode reward: [(0, '4309.677')] [2023-03-08 15:02:17,058][434396] Updated weights for policy 0, policy_version 157600 (0.0005) [2023-03-08 15:02:21,032][434396] Updated weights for policy 0, policy_version 157680 (0.0005) [2023-03-08 15:02:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10774.6). Total num frames: 80740352. Throughput: 0: 10383.1. Samples: 80732176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:21,893][434108] Avg episode reward: [(0, '4462.945')] [2023-03-08 15:02:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000157696_80740352.pth... [2023-03-08 15:02:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000157088_80429056.pth [2023-03-08 15:02:24,896][434396] Updated weights for policy 0, policy_version 157760 (0.0005) [2023-03-08 15:02:26,893][434108] Fps is (10 sec: 10649.4, 60 sec: 10444.8, 300 sec: 10760.7). Total num frames: 80793600. Throughput: 0: 10395.3. Samples: 80764112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:26,893][434108] Avg episode reward: [(0, '4360.053')] [2023-03-08 15:02:28,828][434396] Updated weights for policy 0, policy_version 157840 (0.0005) [2023-03-08 15:02:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10746.8). Total num frames: 80842752. Throughput: 0: 10421.8. Samples: 80826784. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:31,893][434108] Avg episode reward: [(0, '4223.060')] [2023-03-08 15:02:32,768][434396] Updated weights for policy 0, policy_version 157920 (0.0005) [2023-03-08 15:02:36,668][434396] Updated weights for policy 0, policy_version 158000 (0.0006) [2023-03-08 15:02:36,892][434108] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10732.9). Total num frames: 80896000. Throughput: 0: 10415.0. Samples: 80888808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:36,893][434108] Avg episode reward: [(0, '4399.372')] [2023-03-08 15:02:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000158000_80896000.pth... [2023-03-08 15:02:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000157392_80584704.pth [2023-03-08 15:02:40,447][434396] Updated weights for policy 0, policy_version 158080 (0.0004) [2023-03-08 15:02:41,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10732.9). Total num frames: 80949248. Throughput: 0: 10461.2. Samples: 80922064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:41,893][434108] Avg episode reward: [(0, '4491.121')] [2023-03-08 15:02:44,464][434396] Updated weights for policy 0, policy_version 158160 (0.0005) [2023-03-08 15:02:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10719.0). Total num frames: 81002496. Throughput: 0: 10417.6. Samples: 80983836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:46,893][434108] Avg episode reward: [(0, '4341.416')] [2023-03-08 15:02:48,372][434396] Updated weights for policy 0, policy_version 158240 (0.0005) [2023-03-08 15:02:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10691.3). Total num frames: 81051648. Throughput: 0: 10421.6. Samples: 81045548. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:51,893][434108] Avg episode reward: [(0, '4354.988')] [2023-03-08 15:02:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000158304_81051648.pth... [2023-03-08 15:02:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000157696_80740352.pth [2023-03-08 15:02:52,374][434396] Updated weights for policy 0, policy_version 158320 (0.0005) [2023-03-08 15:02:56,364][434396] Updated weights for policy 0, policy_version 158400 (0.0005) [2023-03-08 15:02:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10691.3). Total num frames: 81104896. Throughput: 0: 10412.6. Samples: 81076648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:02:56,893][434108] Avg episode reward: [(0, '4210.321')] [2023-03-08 15:03:00,250][434396] Updated weights for policy 0, policy_version 158480 (0.0005) [2023-03-08 15:03:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 81158144. Throughput: 0: 10421.8. Samples: 81139268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:01,893][434108] Avg episode reward: [(0, '4234.640')] [2023-03-08 15:03:04,240][434396] Updated weights for policy 0, policy_version 158560 (0.0005) [2023-03-08 15:03:06,892][434108] Fps is (10 sec: 10239.8, 60 sec: 10376.5, 300 sec: 10663.5). Total num frames: 81207296. Throughput: 0: 10424.7. Samples: 81201288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:06,893][434108] Avg episode reward: [(0, '4262.778')] [2023-03-08 15:03:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000158608_81207296.pth... 
[2023-03-08 15:03:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000158000_80896000.pth [2023-03-08 15:03:08,203][434396] Updated weights for policy 0, policy_version 158640 (0.0005) [2023-03-08 15:03:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 81260544. Throughput: 0: 10399.9. Samples: 81232108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:11,893][434108] Avg episode reward: [(0, '4466.328')] [2023-03-08 15:03:12,107][434396] Updated weights for policy 0, policy_version 158720 (0.0005) [2023-03-08 15:03:16,044][434396] Updated weights for policy 0, policy_version 158800 (0.0005) [2023-03-08 15:03:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 81313792. Throughput: 0: 10403.7. Samples: 81294948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:16,893][434108] Avg episode reward: [(0, '4316.456')] [2023-03-08 15:03:19,913][434396] Updated weights for policy 0, policy_version 158880 (0.0005) [2023-03-08 15:03:21,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10635.7). Total num frames: 81367040. Throughput: 0: 10436.7. Samples: 81358460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:21,893][434108] Avg episode reward: [(0, '4181.515')] [2023-03-08 15:03:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000158920_81367040.pth... [2023-03-08 15:03:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000158304_81051648.pth [2023-03-08 15:03:23,837][434396] Updated weights for policy 0, policy_version 158960 (0.0005) [2023-03-08 15:03:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 81420288. Throughput: 0: 10388.4. Samples: 81389544. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:26,893][434108] Avg episode reward: [(0, '4192.393')] [2023-03-08 15:03:27,652][434396] Updated weights for policy 0, policy_version 159040 (0.0005) [2023-03-08 15:03:31,634][434396] Updated weights for policy 0, policy_version 159120 (0.0005) [2023-03-08 15:03:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 81469440. Throughput: 0: 10427.3. Samples: 81453064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:31,893][434108] Avg episode reward: [(0, '4156.606')] [2023-03-08 15:03:35,618][434396] Updated weights for policy 0, policy_version 159200 (0.0005) [2023-03-08 15:03:36,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 81522688. Throughput: 0: 10422.5. Samples: 81514560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:36,893][434108] Avg episode reward: [(0, '4174.055')] [2023-03-08 15:03:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000159224_81522688.pth... [2023-03-08 15:03:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000158608_81207296.pth [2023-03-08 15:03:39,496][434396] Updated weights for policy 0, policy_version 159280 (0.0005) [2023-03-08 15:03:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 81575936. Throughput: 0: 10433.0. Samples: 81546136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:41,893][434108] Avg episode reward: [(0, '4283.854')] [2023-03-08 15:03:43,190][434396] Updated weights for policy 0, policy_version 159360 (0.0004) [2023-03-08 15:03:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 81629184. Throughput: 0: 10501.0. Samples: 81611812. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:46,893][434108] Avg episode reward: [(0, '4244.019')] [2023-03-08 15:03:47,107][434396] Updated weights for policy 0, policy_version 159440 (0.0005) [2023-03-08 15:03:51,030][434396] Updated weights for policy 0, policy_version 159520 (0.0005) [2023-03-08 15:03:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 81682432. Throughput: 0: 10510.4. Samples: 81674256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:51,893][434108] Avg episode reward: [(0, '4127.115')] [2023-03-08 15:03:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000159536_81682432.pth... [2023-03-08 15:03:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000158920_81367040.pth [2023-03-08 15:03:54,921][434396] Updated weights for policy 0, policy_version 159600 (0.0005) [2023-03-08 15:03:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 81731584. Throughput: 0: 10527.1. Samples: 81705828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:03:56,893][434108] Avg episode reward: [(0, '4327.773')] [2023-03-08 15:03:58,765][434396] Updated weights for policy 0, policy_version 159680 (0.0005) [2023-03-08 15:04:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 81788928. Throughput: 0: 10559.3. Samples: 81770116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:04:01,893][434108] Avg episode reward: [(0, '4280.552')] [2023-03-08 15:04:02,437][434396] Updated weights for policy 0, policy_version 159760 (0.0004) [2023-03-08 15:04:06,085][434396] Updated weights for policy 0, policy_version 159840 (0.0004) [2023-03-08 15:04:06,892][434108] Fps is (10 sec: 11468.7, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 81846272. Throughput: 0: 10645.2. 
Samples: 81837492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:04:06,893][434108] Avg episode reward: [(0, '4267.639')] [2023-03-08 15:04:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000159856_81846272.pth... [2023-03-08 15:04:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000159224_81522688.pth [2023-03-08 15:04:09,721][434396] Updated weights for policy 0, policy_version 159920 (0.0003) [2023-03-08 15:04:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 81899520. Throughput: 0: 10697.1. Samples: 81870912. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:11,893][434108] Avg episode reward: [(0, '4031.311')] [2023-03-08 15:04:13,394][434396] Updated weights for policy 0, policy_version 160000 (0.0005) [2023-03-08 15:04:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 81952768. Throughput: 0: 10761.1. Samples: 81937312. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:16,893][434108] Avg episode reward: [(0, '4116.618')] [2023-03-08 15:04:17,334][434396] Updated weights for policy 0, policy_version 160080 (0.0005) [2023-03-08 15:04:21,292][434396] Updated weights for policy 0, policy_version 160160 (0.0005) [2023-03-08 15:04:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 82006016. Throughput: 0: 10769.2. Samples: 81999172. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:21,893][434108] Avg episode reward: [(0, '4277.983')] [2023-03-08 15:04:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000160168_82006016.pth... 
[2023-03-08 15:04:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000159536_81682432.pth [2023-03-08 15:04:25,209][434396] Updated weights for policy 0, policy_version 160240 (0.0005) [2023-03-08 15:04:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 82059264. Throughput: 0: 10765.9. Samples: 82030600. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:26,893][434108] Avg episode reward: [(0, '4115.684')] [2023-03-08 15:04:29,254][434396] Updated weights for policy 0, policy_version 160320 (0.0005) [2023-03-08 15:04:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10635.7). Total num frames: 82112512. Throughput: 0: 10673.0. Samples: 82092096. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:31,893][434108] Avg episode reward: [(0, '4251.337')] [2023-03-08 15:04:32,933][434396] Updated weights for policy 0, policy_version 160400 (0.0004) [2023-03-08 15:04:36,541][434396] Updated weights for policy 0, policy_version 160480 (0.0004) [2023-03-08 15:04:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 82165760. Throughput: 0: 10792.4. Samples: 82159912. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:36,893][434108] Avg episode reward: [(0, '4269.949')] [2023-03-08 15:04:36,917][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000160488_82169856.pth... [2023-03-08 15:04:36,918][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000159856_81846272.pth [2023-03-08 15:04:40,252][434396] Updated weights for policy 0, policy_version 160560 (0.0004) [2023-03-08 15:04:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10621.8). Total num frames: 82223104. Throughput: 0: 10835.2. Samples: 82193412. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:41,893][434108] Avg episode reward: [(0, '4213.296')] [2023-03-08 15:04:44,211][434396] Updated weights for policy 0, policy_version 160640 (0.0005) [2023-03-08 15:04:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10607.9). Total num frames: 82272256. Throughput: 0: 10796.2. Samples: 82255944. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:46,893][434108] Avg episode reward: [(0, '4216.135')] [2023-03-08 15:04:48,112][434396] Updated weights for policy 0, policy_version 160720 (0.0005) [2023-03-08 15:04:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 82325504. Throughput: 0: 10708.3. Samples: 82319368. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:51,893][434108] Avg episode reward: [(0, '4301.693')] [2023-03-08 15:04:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000160792_82325504.pth... [2023-03-08 15:04:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000160168_82006016.pth [2023-03-08 15:04:51,993][434396] Updated weights for policy 0, policy_version 160800 (0.0005) [2023-03-08 15:04:55,992][434396] Updated weights for policy 0, policy_version 160880 (0.0005) [2023-03-08 15:04:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10580.2). Total num frames: 82378752. Throughput: 0: 10649.6. Samples: 82350144. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:04:56,893][434108] Avg episode reward: [(0, '4193.363')] [2023-03-08 15:04:59,922][434396] Updated weights for policy 0, policy_version 160960 (0.0005) [2023-03-08 15:05:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 82432000. Throughput: 0: 10556.0. Samples: 82412332. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:05:01,893][434108] Avg episode reward: [(0, '4321.994')] [2023-03-08 15:05:03,787][434396] Updated weights for policy 0, policy_version 161040 (0.0004) [2023-03-08 15:05:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 82485248. Throughput: 0: 10605.4. Samples: 82476416. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:05:06,893][434108] Avg episode reward: [(0, '4306.770')] [2023-03-08 15:05:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000161104_82485248.pth... [2023-03-08 15:05:06,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000160488_82169856.pth [2023-03-08 15:05:07,540][434396] Updated weights for policy 0, policy_version 161120 (0.0004) [2023-03-08 15:05:11,207][434396] Updated weights for policy 0, policy_version 161200 (0.0004) [2023-03-08 15:05:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 82538496. Throughput: 0: 10651.0. Samples: 82509896. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:05:11,893][434108] Avg episode reward: [(0, '4189.607')] [2023-03-08 15:05:14,812][434396] Updated weights for policy 0, policy_version 161280 (0.0004) [2023-03-08 15:05:16,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 82595840. Throughput: 0: 10786.2. Samples: 82577476. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:05:16,893][434108] Avg episode reward: [(0, '4336.174')] [2023-03-08 15:05:18,544][434396] Updated weights for policy 0, policy_version 161360 (0.0004) [2023-03-08 15:05:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 82649088. Throughput: 0: 10734.1. Samples: 82642948. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:05:21,893][434108] Avg episode reward: [(0, '4240.411')] [2023-03-08 15:05:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000161424_82649088.pth... [2023-03-08 15:05:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000160792_82325504.pth [2023-03-08 15:05:22,387][434396] Updated weights for policy 0, policy_version 161440 (0.0005) [2023-03-08 15:05:26,321][434396] Updated weights for policy 0, policy_version 161520 (0.0005) [2023-03-08 15:05:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 82702336. Throughput: 0: 10673.9. Samples: 82673736. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:05:26,893][434108] Avg episode reward: [(0, '4304.291')] [2023-03-08 15:05:30,093][434396] Updated weights for policy 0, policy_version 161600 (0.0004) [2023-03-08 15:05:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 82755584. Throughput: 0: 10718.8. Samples: 82738292. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:05:31,893][434108] Avg episode reward: [(0, '4231.576')] [2023-03-08 15:05:34,103][434396] Updated weights for policy 0, policy_version 161680 (0.0005) [2023-03-08 15:05:36,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 82808832. Throughput: 0: 10693.5. Samples: 82800576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:05:36,893][434108] Avg episode reward: [(0, '4377.146')] [2023-03-08 15:05:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000161736_82808832.pth... 
[2023-03-08 15:05:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000161104_82485248.pth [2023-03-08 15:05:38,027][434396] Updated weights for policy 0, policy_version 161760 (0.0005) [2023-03-08 15:05:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 82857984. Throughput: 0: 10700.6. Samples: 82831672. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:05:41,893][434108] Avg episode reward: [(0, '4359.498')] [2023-03-08 15:05:41,987][434396] Updated weights for policy 0, policy_version 161840 (0.0005) [2023-03-08 15:05:45,836][434396] Updated weights for policy 0, policy_version 161920 (0.0005) [2023-03-08 15:05:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 82911232. Throughput: 0: 10718.4. Samples: 82894660. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:05:46,893][434108] Avg episode reward: [(0, '4398.570')] [2023-03-08 15:05:49,607][434396] Updated weights for policy 0, policy_version 162000 (0.0005) [2023-03-08 15:05:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10538.5). Total num frames: 82964480. Throughput: 0: 10738.3. Samples: 82959640. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:05:51,893][434108] Avg episode reward: [(0, '4319.723')] [2023-03-08 15:05:51,914][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000162048_82968576.pth... [2023-03-08 15:05:51,915][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000161424_82649088.pth [2023-03-08 15:05:53,391][434396] Updated weights for policy 0, policy_version 162080 (0.0004) [2023-03-08 15:05:56,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 10552.4). Total num frames: 83021824. Throughput: 0: 10708.8. Samples: 82991792. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:05:56,893][434108] Avg episode reward: [(0, '4276.581')] [2023-03-08 15:05:57,054][434396] Updated weights for policy 0, policy_version 162160 (0.0004) [2023-03-08 15:06:00,918][434396] Updated weights for policy 0, policy_version 162240 (0.0005) [2023-03-08 15:06:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10552.4). Total num frames: 83075072. Throughput: 0: 10677.3. Samples: 83057956. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:06:01,893][434108] Avg episode reward: [(0, '4309.624')] [2023-03-08 15:06:04,871][434396] Updated weights for policy 0, policy_version 162320 (0.0005) [2023-03-08 15:06:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10552.4). Total num frames: 83128320. Throughput: 0: 10603.2. Samples: 83120092. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:06:06,893][434108] Avg episode reward: [(0, '4367.808')] [2023-03-08 15:06:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000162360_83128320.pth... [2023-03-08 15:06:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000161736_82808832.pth [2023-03-08 15:06:08,890][434396] Updated weights for policy 0, policy_version 162400 (0.0005) [2023-03-08 15:06:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 83177472. Throughput: 0: 10585.2. Samples: 83150068. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:06:11,893][434108] Avg episode reward: [(0, '4329.968')] [2023-03-08 15:06:12,855][434396] Updated weights for policy 0, policy_version 162480 (0.0005) [2023-03-08 15:06:16,748][434396] Updated weights for policy 0, policy_version 162560 (0.0005) [2023-03-08 15:06:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 83230720. Throughput: 0: 10533.4. 
Samples: 83212296. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:06:16,893][434108] Avg episode reward: [(0, '4333.826')] [2023-03-08 15:06:20,672][434396] Updated weights for policy 0, policy_version 162640 (0.0005) [2023-03-08 15:06:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 83283968. Throughput: 0: 10559.4. Samples: 83275748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:06:21,893][434108] Avg episode reward: [(0, '4368.136')] [2023-03-08 15:06:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000162664_83283968.pth... [2023-03-08 15:06:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000162048_82968576.pth [2023-03-08 15:06:24,400][434396] Updated weights for policy 0, policy_version 162720 (0.0005) [2023-03-08 15:06:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 83337216. Throughput: 0: 10595.6. Samples: 83308472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:06:26,893][434108] Avg episode reward: [(0, '4428.458')] [2023-03-08 15:06:28,084][434396] Updated weights for policy 0, policy_version 162800 (0.0004) [2023-03-08 15:06:31,764][434396] Updated weights for policy 0, policy_version 162880 (0.0004) [2023-03-08 15:06:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 83394560. Throughput: 0: 10663.3. Samples: 83374508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:06:31,893][434108] Avg episode reward: [(0, '4431.871')] [2023-03-08 15:06:35,608][434396] Updated weights for policy 0, policy_version 162960 (0.0004) [2023-03-08 15:06:36,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 83447808. Throughput: 0: 10667.7. Samples: 83439688. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:06:36,893][434108] Avg episode reward: [(0, '4386.362')] [2023-03-08 15:06:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000162984_83447808.pth... [2023-03-08 15:06:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000162360_83128320.pth [2023-03-08 15:06:39,299][434396] Updated weights for policy 0, policy_version 163040 (0.0005) [2023-03-08 15:06:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 83501056. Throughput: 0: 10700.6. Samples: 83473316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:06:41,893][434108] Avg episode reward: [(0, '4514.607')] [2023-03-08 15:06:43,068][434396] Updated weights for policy 0, policy_version 163120 (0.0005) [2023-03-08 15:06:46,753][434396] Updated weights for policy 0, policy_version 163200 (0.0004) [2023-03-08 15:06:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10786.2, 300 sec: 10621.8). Total num frames: 83558400. Throughput: 0: 10691.9. Samples: 83539092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:06:46,893][434108] Avg episode reward: [(0, '4374.470')] [2023-03-08 15:06:50,549][434396] Updated weights for policy 0, policy_version 163280 (0.0004) [2023-03-08 15:06:51,892][434108] Fps is (10 sec: 11059.0, 60 sec: 10786.1, 300 sec: 10621.8). Total num frames: 83611648. Throughput: 0: 10756.9. Samples: 83604152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:06:51,893][434108] Avg episode reward: [(0, '4420.496')] [2023-03-08 15:06:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000163304_83611648.pth... 
[2023-03-08 15:06:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000162664_83283968.pth [2023-03-08 15:06:54,251][434396] Updated weights for policy 0, policy_version 163360 (0.0004) [2023-03-08 15:06:56,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 83668992. Throughput: 0: 10837.0. Samples: 83637732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:06:56,893][434108] Avg episode reward: [(0, '4413.395')] [2023-03-08 15:06:57,924][434396] Updated weights for policy 0, policy_version 163440 (0.0004) [2023-03-08 15:07:01,677][434396] Updated weights for policy 0, policy_version 163520 (0.0004) [2023-03-08 15:07:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 83722240. Throughput: 0: 10930.9. Samples: 83704188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:01,893][434108] Avg episode reward: [(0, '4373.578')] [2023-03-08 15:07:05,264][434396] Updated weights for policy 0, policy_version 163600 (0.0003) [2023-03-08 15:07:06,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10854.4, 300 sec: 10663.5). Total num frames: 83779584. Throughput: 0: 11015.7. Samples: 83771456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:06,893][434108] Avg episode reward: [(0, '4442.740')] [2023-03-08 15:07:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000163632_83779584.pth... [2023-03-08 15:07:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000162984_83447808.pth [2023-03-08 15:07:09,160][434396] Updated weights for policy 0, policy_version 163680 (0.0005) [2023-03-08 15:07:11,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 10663.5). Total num frames: 83832832. Throughput: 0: 10993.3. Samples: 83803172. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:11,893][434108] Avg episode reward: [(0, '4465.084')] [2023-03-08 15:07:12,809][434396] Updated weights for policy 0, policy_version 163760 (0.0004) [2023-03-08 15:07:16,774][434396] Updated weights for policy 0, policy_version 163840 (0.0005) [2023-03-08 15:07:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10663.5). Total num frames: 83886080. Throughput: 0: 10964.8. Samples: 83867924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:16,893][434108] Avg episode reward: [(0, '4479.077')] [2023-03-08 15:07:20,709][434396] Updated weights for policy 0, policy_version 163920 (0.0005) [2023-03-08 15:07:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10854.4, 300 sec: 10649.6). Total num frames: 83935232. Throughput: 0: 10909.1. Samples: 83930596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:21,893][434108] Avg episode reward: [(0, '4419.939')] [2023-03-08 15:07:21,899][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000163944_83939328.pth... [2023-03-08 15:07:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000163304_83611648.pth [2023-03-08 15:07:24,565][434396] Updated weights for policy 0, policy_version 164000 (0.0005) [2023-03-08 15:07:26,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10854.4, 300 sec: 10663.5). Total num frames: 83988480. Throughput: 0: 10852.0. Samples: 83961656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:26,893][434108] Avg episode reward: [(0, '4186.625')] [2023-03-08 15:07:28,495][434396] Updated weights for policy 0, policy_version 164080 (0.0005) [2023-03-08 15:07:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10663.5). Total num frames: 84041728. Throughput: 0: 10797.0. Samples: 84024960. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:31,893][434108] Avg episode reward: [(0, '4332.775')] [2023-03-08 15:07:32,466][434396] Updated weights for policy 0, policy_version 164160 (0.0005) [2023-03-08 15:07:36,317][434396] Updated weights for policy 0, policy_version 164240 (0.0005) [2023-03-08 15:07:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.2, 300 sec: 10663.5). Total num frames: 84094976. Throughput: 0: 10730.9. Samples: 84087040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:36,893][434108] Avg episode reward: [(0, '4235.885')] [2023-03-08 15:07:36,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000164248_84094976.pth... [2023-03-08 15:07:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000163632_83779584.pth [2023-03-08 15:07:40,309][434396] Updated weights for policy 0, policy_version 164320 (0.0005) [2023-03-08 15:07:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10663.5). Total num frames: 84148224. Throughput: 0: 10707.1. Samples: 84119552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:41,893][434108] Avg episode reward: [(0, '4338.710')] [2023-03-08 15:07:44,272][434396] Updated weights for policy 0, policy_version 164400 (0.0005) [2023-03-08 15:07:46,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 84197376. Throughput: 0: 10595.6. Samples: 84180992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:46,893][434108] Avg episode reward: [(0, '4302.585')] [2023-03-08 15:07:48,204][434396] Updated weights for policy 0, policy_version 164480 (0.0005) [2023-03-08 15:07:51,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 84250624. Throughput: 0: 10467.6. Samples: 84242496. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:51,893][434108] Avg episode reward: [(0, '4240.990')] [2023-03-08 15:07:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000164552_84250624.pth... [2023-03-08 15:07:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000163944_83939328.pth [2023-03-08 15:07:52,194][434396] Updated weights for policy 0, policy_version 164560 (0.0005) [2023-03-08 15:07:56,204][434396] Updated weights for policy 0, policy_version 164640 (0.0005) [2023-03-08 15:07:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10649.6). Total num frames: 84299776. Throughput: 0: 10450.1. Samples: 84273424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:07:56,893][434108] Avg episode reward: [(0, '4307.378')] [2023-03-08 15:08:00,173][434396] Updated weights for policy 0, policy_version 164720 (0.0004) [2023-03-08 15:08:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 84353024. Throughput: 0: 10391.0. Samples: 84335520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:08:01,893][434108] Avg episode reward: [(0, '4342.560')] [2023-03-08 15:08:04,096][434396] Updated weights for policy 0, policy_version 164800 (0.0005) [2023-03-08 15:08:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10663.5). Total num frames: 84406272. Throughput: 0: 10388.7. Samples: 84398088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:08:06,893][434108] Avg episode reward: [(0, '4372.555')] [2023-03-08 15:08:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000164856_84406272.pth... 
[2023-03-08 15:08:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000164248_84094976.pth [2023-03-08 15:08:07,973][434396] Updated weights for policy 0, policy_version 164880 (0.0005) [2023-03-08 15:08:11,784][434396] Updated weights for policy 0, policy_version 164960 (0.0005) [2023-03-08 15:08:11,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10663.5). Total num frames: 84459520. Throughput: 0: 10415.2. Samples: 84430340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:08:11,893][434108] Avg episode reward: [(0, '4386.279')] [2023-03-08 15:08:15,721][434396] Updated weights for policy 0, policy_version 165040 (0.0005) [2023-03-08 15:08:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10649.6). Total num frames: 84508672. Throughput: 0: 10402.8. Samples: 84493088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:08:16,893][434108] Avg episode reward: [(0, '4412.132')] [2023-03-08 15:08:19,671][434396] Updated weights for policy 0, policy_version 165120 (0.0005) [2023-03-08 15:08:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 84561920. Throughput: 0: 10402.5. Samples: 84555156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:08:21,893][434108] Avg episode reward: [(0, '4488.764')] [2023-03-08 15:08:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000165160_84561920.pth... [2023-03-08 15:08:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000164552_84250624.pth [2023-03-08 15:08:23,679][434396] Updated weights for policy 0, policy_version 165200 (0.0005) [2023-03-08 15:08:26,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10663.5). Total num frames: 84615168. Throughput: 0: 10371.5. Samples: 84586268. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:08:26,893][434108] Avg episode reward: [(0, '4441.857')] [2023-03-08 15:08:27,668][434396] Updated weights for policy 0, policy_version 165280 (0.0005) [2023-03-08 15:08:31,671][434396] Updated weights for policy 0, policy_version 165360 (0.0005) [2023-03-08 15:08:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10649.6). Total num frames: 84664320. Throughput: 0: 10366.4. Samples: 84647480. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:08:31,893][434108] Avg episode reward: [(0, '4436.232')] [2023-03-08 15:08:35,666][434396] Updated weights for policy 0, policy_version 165440 (0.0005) [2023-03-08 15:08:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10649.6). Total num frames: 84717568. Throughput: 0: 10374.1. Samples: 84709328. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:08:36,893][434108] Avg episode reward: [(0, '4451.000')] [2023-03-08 15:08:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000165464_84717568.pth... [2023-03-08 15:08:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000164856_84406272.pth [2023-03-08 15:08:39,606][434396] Updated weights for policy 0, policy_version 165520 (0.0005) [2023-03-08 15:08:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10635.7). Total num frames: 84766720. Throughput: 0: 10378.9. Samples: 84740476. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:08:41,893][434108] Avg episode reward: [(0, '4420.639')] [2023-03-08 15:08:43,743][434396] Updated weights for policy 0, policy_version 165600 (0.0005) [2023-03-08 15:08:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10621.8). Total num frames: 84815872. Throughput: 0: 10313.3. Samples: 84799620. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:08:46,893][434108] Avg episode reward: [(0, '4466.488')] [2023-03-08 15:08:47,656][434396] Updated weights for policy 0, policy_version 165680 (0.0004) [2023-03-08 15:08:51,559][434396] Updated weights for policy 0, policy_version 165760 (0.0004) [2023-03-08 15:08:51,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10635.7). Total num frames: 84869120. Throughput: 0: 10348.2. Samples: 84863756. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:08:51,893][434108] Avg episode reward: [(0, '4390.627')] [2023-03-08 15:08:51,956][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000165768_84873216.pth... [2023-03-08 15:08:51,959][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000165160_84561920.pth [2023-03-08 15:08:55,424][434396] Updated weights for policy 0, policy_version 165840 (0.0004) [2023-03-08 15:08:56,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10621.8). Total num frames: 84922368. Throughput: 0: 10307.4. Samples: 84894172. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:08:56,893][434108] Avg episode reward: [(0, '4441.428')] [2023-03-08 15:08:59,190][434396] Updated weights for policy 0, policy_version 165920 (0.0004) [2023-03-08 15:09:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10607.9). Total num frames: 84975616. Throughput: 0: 10360.4. Samples: 84959304. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:09:01,893][434108] Avg episode reward: [(0, '4475.427')] [2023-03-08 15:09:03,058][434396] Updated weights for policy 0, policy_version 166000 (0.0005) [2023-03-08 15:09:06,833][434396] Updated weights for policy 0, policy_version 166080 (0.0004) [2023-03-08 15:09:06,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 85032960. Throughput: 0: 10420.8. 
Samples: 85024092. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:09:06,893][434108] Avg episode reward: [(0, '4408.642')] [2023-03-08 15:09:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000166080_85032960.pth... [2023-03-08 15:09:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000165464_84717568.pth [2023-03-08 15:09:10,519][434396] Updated weights for policy 0, policy_version 166160 (0.0004) [2023-03-08 15:09:11,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10444.8, 300 sec: 10621.8). Total num frames: 85086208. Throughput: 0: 10474.1. Samples: 85057600. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:09:11,893][434108] Avg episode reward: [(0, '4406.271')] [2023-03-08 15:09:14,421][434396] Updated weights for policy 0, policy_version 166240 (0.0004) [2023-03-08 15:09:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 85139456. Throughput: 0: 10525.1. Samples: 85121108. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:09:16,893][434108] Avg episode reward: [(0, '4385.215')] [2023-03-08 15:09:18,369][434396] Updated weights for policy 0, policy_version 166320 (0.0005) [2023-03-08 15:09:21,892][434108] Fps is (10 sec: 10239.8, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 85188608. Throughput: 0: 10530.8. Samples: 85183216. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:09:21,893][434108] Avg episode reward: [(0, '4527.300')] [2023-03-08 15:09:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000166384_85188608.pth... 
[2023-03-08 15:09:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000165768_84873216.pth [2023-03-08 15:09:22,348][434396] Updated weights for policy 0, policy_version 166400 (0.0005) [2023-03-08 15:09:26,120][434396] Updated weights for policy 0, policy_version 166480 (0.0004) [2023-03-08 15:09:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 85241856. Throughput: 0: 10534.8. Samples: 85214540. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:09:26,893][434108] Avg episode reward: [(0, '4521.288')] [2023-03-08 15:09:30,068][434396] Updated weights for policy 0, policy_version 166560 (0.0005) [2023-03-08 15:09:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 85295104. Throughput: 0: 10640.8. Samples: 85278456. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 15:09:31,893][434108] Avg episode reward: [(0, '4393.010')] [2023-03-08 15:09:34,054][434396] Updated weights for policy 0, policy_version 166640 (0.0005) [2023-03-08 15:09:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 85348352. Throughput: 0: 10586.9. Samples: 85340168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:09:36,893][434108] Avg episode reward: [(0, '4501.460')] [2023-03-08 15:09:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000166696_85348352.pth... [2023-03-08 15:09:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000166080_85032960.pth [2023-03-08 15:09:37,958][434396] Updated weights for policy 0, policy_version 166720 (0.0005) [2023-03-08 15:09:41,697][434396] Updated weights for policy 0, policy_version 166800 (0.0004) [2023-03-08 15:09:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10607.9). 
Total num frames: 85401600. Throughput: 0: 10625.5. Samples: 85372320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:09:41,893][434108] Avg episode reward: [(0, '4516.850')] [2023-03-08 15:09:45,326][434396] Updated weights for policy 0, policy_version 166880 (0.0004) [2023-03-08 15:09:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 85458944. Throughput: 0: 10662.0. Samples: 85439092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:09:46,893][434108] Avg episode reward: [(0, '4551.068')] [2023-03-08 15:09:49,156][434396] Updated weights for policy 0, policy_version 166960 (0.0005) [2023-03-08 15:09:51,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10717.8, 300 sec: 10621.8). Total num frames: 85512192. Throughput: 0: 10664.8. Samples: 85504008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:09:51,893][434108] Avg episode reward: [(0, '4479.765')] [2023-03-08 15:09:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000167016_85512192.pth... [2023-03-08 15:09:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000166384_85188608.pth [2023-03-08 15:09:53,085][434396] Updated weights for policy 0, policy_version 167040 (0.0005) [2023-03-08 15:09:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 85561344. Throughput: 0: 10580.6. Samples: 85533728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:09:56,893][434108] Avg episode reward: [(0, '4392.252')] [2023-03-08 15:09:57,012][434396] Updated weights for policy 0, policy_version 167120 (0.0005) [2023-03-08 15:10:01,025][434396] Updated weights for policy 0, policy_version 167200 (0.0005) [2023-03-08 15:10:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 85614592. Throughput: 0: 10553.4. Samples: 85596012. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:01,893][434108] Avg episode reward: [(0, '4352.131')] [2023-03-08 15:10:04,862][434396] Updated weights for policy 0, policy_version 167280 (0.0005) [2023-03-08 15:10:06,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 85667840. Throughput: 0: 10594.0. Samples: 85659948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:06,893][434108] Avg episode reward: [(0, '4482.406')] [2023-03-08 15:10:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000167320_85667840.pth... [2023-03-08 15:10:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000166696_85348352.pth [2023-03-08 15:10:08,463][434396] Updated weights for policy 0, policy_version 167360 (0.0004) [2023-03-08 15:10:11,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 85725184. Throughput: 0: 10662.1. Samples: 85694332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:11,893][434108] Avg episode reward: [(0, '4462.312')] [2023-03-08 15:10:12,230][434396] Updated weights for policy 0, policy_version 167440 (0.0005) [2023-03-08 15:10:16,140][434396] Updated weights for policy 0, policy_version 167520 (0.0005) [2023-03-08 15:10:16,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 85778432. Throughput: 0: 10665.9. Samples: 85758420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:16,893][434108] Avg episode reward: [(0, '4391.276')] [2023-03-08 15:10:19,891][434396] Updated weights for policy 0, policy_version 167600 (0.0004) [2023-03-08 15:10:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10607.9). Total num frames: 85831680. Throughput: 0: 10740.6. Samples: 85823496. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:21,893][434108] Avg episode reward: [(0, '4271.214')] [2023-03-08 15:10:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000167640_85831680.pth... [2023-03-08 15:10:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000167016_85512192.pth [2023-03-08 15:10:23,823][434396] Updated weights for policy 0, policy_version 167680 (0.0005) [2023-03-08 15:10:26,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 85880832. Throughput: 0: 10712.4. Samples: 85854376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:26,893][434108] Avg episode reward: [(0, '4276.816')] [2023-03-08 15:10:27,773][434396] Updated weights for policy 0, policy_version 167760 (0.0005) [2023-03-08 15:10:31,804][434396] Updated weights for policy 0, policy_version 167840 (0.0005) [2023-03-08 15:10:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 85934080. Throughput: 0: 10596.3. Samples: 85915928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:31,893][434108] Avg episode reward: [(0, '4301.402')] [2023-03-08 15:10:35,812][434396] Updated weights for policy 0, policy_version 167920 (0.0005) [2023-03-08 15:10:36,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 85983232. Throughput: 0: 10514.0. Samples: 85977136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:36,893][434108] Avg episode reward: [(0, '4389.347')] [2023-03-08 15:10:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000167936_85983232.pth... 
[2023-03-08 15:10:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000167320_85667840.pth [2023-03-08 15:10:39,840][434396] Updated weights for policy 0, policy_version 168000 (0.0005) [2023-03-08 15:10:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 86036480. Throughput: 0: 10535.1. Samples: 86007808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:41,893][434108] Avg episode reward: [(0, '4317.668')] [2023-03-08 15:10:43,671][434396] Updated weights for policy 0, policy_version 168080 (0.0005) [2023-03-08 15:10:46,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10594.1). Total num frames: 86089728. Throughput: 0: 10549.3. Samples: 86070732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:46,893][434108] Avg episode reward: [(0, '4423.041')] [2023-03-08 15:10:47,693][434396] Updated weights for policy 0, policy_version 168160 (0.0005) [2023-03-08 15:10:51,696][434396] Updated weights for policy 0, policy_version 168240 (0.0005) [2023-03-08 15:10:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 86138880. Throughput: 0: 10477.1. Samples: 86131416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:51,893][434108] Avg episode reward: [(0, '4191.259')] [2023-03-08 15:10:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000168240_86138880.pth... [2023-03-08 15:10:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000167640_85831680.pth [2023-03-08 15:10:55,640][434396] Updated weights for policy 0, policy_version 168320 (0.0005) [2023-03-08 15:10:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10566.3). Total num frames: 86192128. Throughput: 0: 10426.4. Samples: 86163520. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:10:56,893][434108] Avg episode reward: [(0, '4377.557')] [2023-03-08 15:10:59,514][434396] Updated weights for policy 0, policy_version 168400 (0.0004) [2023-03-08 15:11:01,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10566.3). Total num frames: 86245376. Throughput: 0: 10400.9. Samples: 86226460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:01,893][434108] Avg episode reward: [(0, '4101.112')] [2023-03-08 15:11:03,303][434396] Updated weights for policy 0, policy_version 168480 (0.0004) [2023-03-08 15:11:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10580.2). Total num frames: 86298624. Throughput: 0: 10367.3. Samples: 86290024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:06,893][434108] Avg episode reward: [(0, '4074.222')] [2023-03-08 15:11:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000168552_86298624.pth... [2023-03-08 15:11:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000167936_85983232.pth [2023-03-08 15:11:07,262][434396] Updated weights for policy 0, policy_version 168560 (0.0005) [2023-03-08 15:11:11,383][434396] Updated weights for policy 0, policy_version 168640 (0.0005) [2023-03-08 15:11:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10566.3). Total num frames: 86347776. Throughput: 0: 10355.1. Samples: 86320356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:11,893][434108] Avg episode reward: [(0, '4042.597')] [2023-03-08 15:11:15,368][434396] Updated weights for policy 0, policy_version 168720 (0.0003) [2023-03-08 15:11:16,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10552.4). Total num frames: 86396928. Throughput: 0: 10326.2. Samples: 86380608. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:16,893][434108] Avg episode reward: [(0, '4029.601')] [2023-03-08 15:11:19,239][434396] Updated weights for policy 0, policy_version 168800 (0.0003) [2023-03-08 15:11:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10552.4). Total num frames: 86450176. Throughput: 0: 10380.4. Samples: 86444256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:21,893][434108] Avg episode reward: [(0, '4312.715')] [2023-03-08 15:11:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000168848_86450176.pth... [2023-03-08 15:11:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000168240_86138880.pth [2023-03-08 15:11:23,193][434396] Updated weights for policy 0, policy_version 168880 (0.0004) [2023-03-08 15:11:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10538.5). Total num frames: 86503424. Throughput: 0: 10378.1. Samples: 86474824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:26,893][434108] Avg episode reward: [(0, '4191.932')] [2023-03-08 15:11:27,126][434396] Updated weights for policy 0, policy_version 168960 (0.0005) [2023-03-08 15:11:31,118][434396] Updated weights for policy 0, policy_version 169040 (0.0005) [2023-03-08 15:11:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10524.6). Total num frames: 86552576. Throughput: 0: 10377.8. Samples: 86537732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:31,893][434108] Avg episode reward: [(0, '4298.296')] [2023-03-08 15:11:35,166][434396] Updated weights for policy 0, policy_version 169120 (0.0005) [2023-03-08 15:11:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 86605824. Throughput: 0: 10377.9. Samples: 86598424. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:36,893][434108] Avg episode reward: [(0, '4203.956')] [2023-03-08 15:11:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000169152_86605824.pth... [2023-03-08 15:11:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000168552_86298624.pth [2023-03-08 15:11:38,987][434396] Updated weights for policy 0, policy_version 169200 (0.0005) [2023-03-08 15:11:41,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10510.7). Total num frames: 86659072. Throughput: 0: 10376.5. Samples: 86630464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:41,893][434108] Avg episode reward: [(0, '4319.200')] [2023-03-08 15:11:42,774][434396] Updated weights for policy 0, policy_version 169280 (0.0004) [2023-03-08 15:11:46,728][434396] Updated weights for policy 0, policy_version 169360 (0.0005) [2023-03-08 15:11:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10510.8). Total num frames: 86712320. Throughput: 0: 10420.4. Samples: 86695376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:46,893][434108] Avg episode reward: [(0, '4381.615')] [2023-03-08 15:11:50,585][434396] Updated weights for policy 0, policy_version 169440 (0.0005) [2023-03-08 15:11:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 86765568. Throughput: 0: 10387.2. Samples: 86757448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:51,893][434108] Avg episode reward: [(0, '4430.136')] [2023-03-08 15:11:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000169464_86765568.pth... 
[2023-03-08 15:11:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000168848_86450176.pth [2023-03-08 15:11:54,500][434396] Updated weights for policy 0, policy_version 169520 (0.0004) [2023-03-08 15:11:56,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 86814720. Throughput: 0: 10428.2. Samples: 86789624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:11:56,893][434108] Avg episode reward: [(0, '4232.587')] [2023-03-08 15:11:58,412][434396] Updated weights for policy 0, policy_version 169600 (0.0004) [2023-03-08 15:12:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 86867968. Throughput: 0: 10478.9. Samples: 86852160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:01,893][434108] Avg episode reward: [(0, '4419.240')] [2023-03-08 15:12:02,333][434396] Updated weights for policy 0, policy_version 169680 (0.0004) [2023-03-08 15:12:06,309][434396] Updated weights for policy 0, policy_version 169760 (0.0004) [2023-03-08 15:12:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 86921216. Throughput: 0: 10439.1. Samples: 86914016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:06,893][434108] Avg episode reward: [(0, '4406.095')] [2023-03-08 15:12:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000169768_86921216.pth... [2023-03-08 15:12:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000169152_86605824.pth [2023-03-08 15:12:10,035][434396] Updated weights for policy 0, policy_version 169840 (0.0004) [2023-03-08 15:12:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 86978560. Throughput: 0: 10485.0. Samples: 86946648. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:11,893][434108] Avg episode reward: [(0, '4319.142')] [2023-03-08 15:12:13,624][434396] Updated weights for policy 0, policy_version 169920 (0.0004) [2023-03-08 15:12:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 87031808. Throughput: 0: 10599.1. Samples: 87014692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:16,893][434108] Avg episode reward: [(0, '4352.298')] [2023-03-08 15:12:17,432][434396] Updated weights for policy 0, policy_version 170000 (0.0005) [2023-03-08 15:12:21,342][434396] Updated weights for policy 0, policy_version 170080 (0.0005) [2023-03-08 15:12:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 87085056. Throughput: 0: 10638.0. Samples: 87077132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:21,893][434108] Avg episode reward: [(0, '4492.549')] [2023-03-08 15:12:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000170088_87085056.pth... [2023-03-08 15:12:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000169464_86765568.pth [2023-03-08 15:12:25,332][434396] Updated weights for policy 0, policy_version 170160 (0.0004) [2023-03-08 15:12:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 87138304. Throughput: 0: 10626.2. Samples: 87108644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:26,893][434108] Avg episode reward: [(0, '4439.211')] [2023-03-08 15:12:29,052][434396] Updated weights for policy 0, policy_version 170240 (0.0004) [2023-03-08 15:12:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 87191552. Throughput: 0: 10622.3. Samples: 87173380. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:31,893][434108] Avg episode reward: [(0, '4427.996')] [2023-03-08 15:12:33,000][434396] Updated weights for policy 0, policy_version 170320 (0.0005) [2023-03-08 15:12:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 87240704. Throughput: 0: 10613.5. Samples: 87235056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:36,893][434108] Avg episode reward: [(0, '4436.177')] [2023-03-08 15:12:36,931][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000170400_87244800.pth... [2023-03-08 15:12:36,932][434396] Updated weights for policy 0, policy_version 170400 (0.0005) [2023-03-08 15:12:36,932][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000169768_86921216.pth [2023-03-08 15:12:40,631][434396] Updated weights for policy 0, policy_version 170480 (0.0004) [2023-03-08 15:12:41,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 87298048. Throughput: 0: 10625.2. Samples: 87267760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:41,893][434108] Avg episode reward: [(0, '4383.386')] [2023-03-08 15:12:44,391][434396] Updated weights for policy 0, policy_version 170560 (0.0004) [2023-03-08 15:12:46,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 87351296. Throughput: 0: 10704.6. Samples: 87333868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:46,893][434108] Avg episode reward: [(0, '4439.769')] [2023-03-08 15:12:48,324][434396] Updated weights for policy 0, policy_version 170640 (0.0005) [2023-03-08 15:12:51,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 87404544. Throughput: 0: 10745.7. Samples: 87397572. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:51,893][434108] Avg episode reward: [(0, '4294.132')] [2023-03-08 15:12:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000170712_87404544.pth... [2023-03-08 15:12:51,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000170088_87085056.pth [2023-03-08 15:12:52,032][434396] Updated weights for policy 0, policy_version 170720 (0.0004) [2023-03-08 15:12:55,982][434396] Updated weights for policy 0, policy_version 170800 (0.0005) [2023-03-08 15:12:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 87457792. Throughput: 0: 10723.0. Samples: 87429184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:12:56,893][434108] Avg episode reward: [(0, '4458.871')] [2023-03-08 15:12:59,873][434396] Updated weights for policy 0, policy_version 170880 (0.0005) [2023-03-08 15:13:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 87511040. Throughput: 0: 10602.1. Samples: 87491788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:01,893][434108] Avg episode reward: [(0, '4395.306')] [2023-03-08 15:13:03,810][434396] Updated weights for policy 0, policy_version 170960 (0.0004) [2023-03-08 15:13:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10510.7). Total num frames: 87560192. Throughput: 0: 10609.8. Samples: 87554572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:06,893][434108] Avg episode reward: [(0, '4401.128')] [2023-03-08 15:13:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000171016_87560192.pth... 
[2023-03-08 15:13:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000170400_87244800.pth [2023-03-08 15:13:07,712][434396] Updated weights for policy 0, policy_version 171040 (0.0005) [2023-03-08 15:13:11,620][434396] Updated weights for policy 0, policy_version 171120 (0.0004) [2023-03-08 15:13:11,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 87613440. Throughput: 0: 10617.1. Samples: 87586412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:11,893][434108] Avg episode reward: [(0, '4460.271')] [2023-03-08 15:13:15,535][434396] Updated weights for policy 0, policy_version 171200 (0.0005) [2023-03-08 15:13:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 87666688. Throughput: 0: 10569.0. Samples: 87648984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:16,893][434108] Avg episode reward: [(0, '4473.756')] [2023-03-08 15:13:19,273][434396] Updated weights for policy 0, policy_version 171280 (0.0004) [2023-03-08 15:13:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10538.5). Total num frames: 87724032. Throughput: 0: 10673.9. Samples: 87715384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:21,893][434108] Avg episode reward: [(0, '4439.274')] [2023-03-08 15:13:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000171336_87724032.pth... [2023-03-08 15:13:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000170712_87404544.pth [2023-03-08 15:13:22,989][434396] Updated weights for policy 0, policy_version 171360 (0.0004) [2023-03-08 15:13:26,814][434396] Updated weights for policy 0, policy_version 171440 (0.0005) [2023-03-08 15:13:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10552.4). 
Total num frames: 87777280. Throughput: 0: 10656.3. Samples: 87747292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:26,893][434108] Avg episode reward: [(0, '4455.684')] [2023-03-08 15:13:30,714][434396] Updated weights for policy 0, policy_version 171520 (0.0005) [2023-03-08 15:13:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 87830528. Throughput: 0: 10594.5. Samples: 87810620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:31,893][434108] Avg episode reward: [(0, '4335.562')] [2023-03-08 15:13:34,446][434396] Updated weights for policy 0, policy_version 171600 (0.0004) [2023-03-08 15:13:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.8, 300 sec: 10566.3). Total num frames: 87883776. Throughput: 0: 10624.2. Samples: 87875660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:36,893][434108] Avg episode reward: [(0, '4445.777')] [2023-03-08 15:13:36,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000171648_87883776.pth... [2023-03-08 15:13:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000171016_87560192.pth [2023-03-08 15:13:38,286][434396] Updated weights for policy 0, policy_version 171680 (0.0005) [2023-03-08 15:13:41,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 87937024. Throughput: 0: 10643.7. Samples: 87908152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:41,893][434108] Avg episode reward: [(0, '4507.209')] [2023-03-08 15:13:42,267][434396] Updated weights for policy 0, policy_version 171760 (0.0005) [2023-03-08 15:13:46,129][434396] Updated weights for policy 0, policy_version 171840 (0.0005) [2023-03-08 15:13:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 87990272. Throughput: 0: 10642.5. Samples: 87970700. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:46,893][434108] Avg episode reward: [(0, '4483.645')] [2023-03-08 15:13:50,055][434396] Updated weights for policy 0, policy_version 171920 (0.0005) [2023-03-08 15:13:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 88039424. Throughput: 0: 10625.4. Samples: 88032716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:13:51,893][434108] Avg episode reward: [(0, '4494.071')] [2023-03-08 15:13:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000171952_88039424.pth... [2023-03-08 15:13:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000171336_87724032.pth [2023-03-08 15:13:53,812][434396] Updated weights for policy 0, policy_version 172000 (0.0005) [2023-03-08 15:13:56,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 88096768. Throughput: 0: 10664.5. Samples: 88066312. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:13:56,893][434108] Avg episode reward: [(0, '4431.109')] [2023-03-08 15:13:57,503][434396] Updated weights for policy 0, policy_version 172080 (0.0004) [2023-03-08 15:14:01,220][434396] Updated weights for policy 0, policy_version 172160 (0.0004) [2023-03-08 15:14:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 88150016. Throughput: 0: 10759.5. Samples: 88133160. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:01,893][434108] Avg episode reward: [(0, '4506.748')] [2023-03-08 15:14:04,898][434396] Updated weights for policy 0, policy_version 172240 (0.0004) [2023-03-08 15:14:06,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10580.2). Total num frames: 88207360. Throughput: 0: 10752.2. Samples: 88199232. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:06,893][434108] Avg episode reward: [(0, '4483.060')] [2023-03-08 15:14:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000172280_88207360.pth... [2023-03-08 15:14:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000171648_87883776.pth [2023-03-08 15:14:08,752][434396] Updated weights for policy 0, policy_version 172320 (0.0005) [2023-03-08 15:14:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10580.2). Total num frames: 88260608. Throughput: 0: 10747.6. Samples: 88230936. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:11,893][434108] Avg episode reward: [(0, '4535.843')] [2023-03-08 15:14:12,513][434396] Updated weights for policy 0, policy_version 172400 (0.0004) [2023-03-08 15:14:16,471][434396] Updated weights for policy 0, policy_version 172480 (0.0005) [2023-03-08 15:14:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10594.1). Total num frames: 88313856. Throughput: 0: 10768.0. Samples: 88295180. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:16,893][434108] Avg episode reward: [(0, '4498.566')] [2023-03-08 15:14:20,299][434396] Updated weights for policy 0, policy_version 172560 (0.0005) [2023-03-08 15:14:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 88367104. Throughput: 0: 10737.6. Samples: 88358852. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:21,893][434108] Avg episode reward: [(0, '4528.188')] [2023-03-08 15:14:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000172592_88367104.pth... 
[2023-03-08 15:14:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000171952_88039424.pth [2023-03-08 15:14:23,985][434396] Updated weights for policy 0, policy_version 172640 (0.0004) [2023-03-08 15:14:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 88420352. Throughput: 0: 10746.7. Samples: 88391752. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:26,893][434108] Avg episode reward: [(0, '4524.343')] [2023-03-08 15:14:27,778][434396] Updated weights for policy 0, policy_version 172720 (0.0005) [2023-03-08 15:14:31,679][434396] Updated weights for policy 0, policy_version 172800 (0.0005) [2023-03-08 15:14:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 88473600. Throughput: 0: 10784.9. Samples: 88456020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:31,893][434108] Avg episode reward: [(0, '4545.750')] [2023-03-08 15:14:35,630][434396] Updated weights for policy 0, policy_version 172880 (0.0005) [2023-03-08 15:14:36,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 88526848. Throughput: 0: 10800.1. Samples: 88518720. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:36,893][434108] Avg episode reward: [(0, '4521.827')] [2023-03-08 15:14:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000172904_88526848.pth... [2023-03-08 15:14:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000172280_88207360.pth [2023-03-08 15:14:39,559][434396] Updated weights for policy 0, policy_version 172960 (0.0005) [2023-03-08 15:14:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 88580096. Throughput: 0: 10762.4. Samples: 88550620. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:41,893][434108] Avg episode reward: [(0, '4510.425')] [2023-03-08 15:14:43,339][434396] Updated weights for policy 0, policy_version 173040 (0.0005) [2023-03-08 15:14:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 88633344. Throughput: 0: 10721.3. Samples: 88615620. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:46,899][434108] Avg episode reward: [(0, '4510.793')] [2023-03-08 15:14:46,928][434396] Updated weights for policy 0, policy_version 173120 (0.0004) [2023-03-08 15:14:50,708][434396] Updated weights for policy 0, policy_version 173200 (0.0005) [2023-03-08 15:14:51,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10607.9). Total num frames: 88690688. Throughput: 0: 10727.4. Samples: 88681964. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:51,903][434108] Avg episode reward: [(0, '4513.959')] [2023-03-08 15:14:51,907][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000173224_88690688.pth... [2023-03-08 15:14:51,909][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000172592_88367104.pth [2023-03-08 15:14:54,360][434396] Updated weights for policy 0, policy_version 173280 (0.0004) [2023-03-08 15:14:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 88743936. Throughput: 0: 10764.3. Samples: 88715328. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:14:56,893][434108] Avg episode reward: [(0, '4554.908')] [2023-03-08 15:14:58,040][434396] Updated weights for policy 0, policy_version 173360 (0.0005) [2023-03-08 15:15:01,814][434396] Updated weights for policy 0, policy_version 173440 (0.0005) [2023-03-08 15:15:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10621.8). Total num frames: 88801280. Throughput: 0: 10836.1. 
Samples: 88782804. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:01,893][434108] Avg episode reward: [(0, '4532.851')] [2023-03-08 15:15:05,674][434396] Updated weights for policy 0, policy_version 173520 (0.0005) [2023-03-08 15:15:06,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 88854528. Throughput: 0: 10832.5. Samples: 88846312. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:06,893][434108] Avg episode reward: [(0, '4471.049')] [2023-03-08 15:15:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000173544_88854528.pth... [2023-03-08 15:15:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000172904_88526848.pth [2023-03-08 15:15:09,443][434396] Updated weights for policy 0, policy_version 173600 (0.0005) [2023-03-08 15:15:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 88907776. Throughput: 0: 10822.4. Samples: 88878760. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:11,893][434108] Avg episode reward: [(0, '4471.320')] [2023-03-08 15:15:13,109][434396] Updated weights for policy 0, policy_version 173680 (0.0005) [2023-03-08 15:15:16,805][434396] Updated weights for policy 0, policy_version 173760 (0.0005) [2023-03-08 15:15:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10621.8). Total num frames: 88965120. Throughput: 0: 10864.8. Samples: 88944936. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:16,893][434108] Avg episode reward: [(0, '4488.789')] [2023-03-08 15:15:20,560][434396] Updated weights for policy 0, policy_version 173840 (0.0005) [2023-03-08 15:15:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10854.4, 300 sec: 10635.7). Total num frames: 89018368. Throughput: 0: 10924.2. Samples: 89010308. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:21,893][434108] Avg episode reward: [(0, '4499.796')] [2023-03-08 15:15:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000173864_89018368.pth... [2023-03-08 15:15:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000173224_88690688.pth [2023-03-08 15:15:24,575][434396] Updated weights for policy 0, policy_version 173920 (0.0005) [2023-03-08 15:15:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10786.1, 300 sec: 10621.8). Total num frames: 89067520. Throughput: 0: 10911.5. Samples: 89041640. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:26,893][434108] Avg episode reward: [(0, '4455.862')] [2023-03-08 15:15:28,569][434396] Updated weights for policy 0, policy_version 174000 (0.0005) [2023-03-08 15:15:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 89120768. Throughput: 0: 10845.5. Samples: 89103668. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:31,893][434108] Avg episode reward: [(0, '4438.245')] [2023-03-08 15:15:32,478][434396] Updated weights for policy 0, policy_version 174080 (0.0005) [2023-03-08 15:15:36,324][434396] Updated weights for policy 0, policy_version 174160 (0.0005) [2023-03-08 15:15:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 89174016. Throughput: 0: 10764.3. Samples: 89166360. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:36,893][434108] Avg episode reward: [(0, '4440.816')] [2023-03-08 15:15:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000174168_89174016.pth... 
[2023-03-08 15:15:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000173544_88854528.pth [2023-03-08 15:15:40,237][434396] Updated weights for policy 0, policy_version 174240 (0.0005) [2023-03-08 15:15:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 89227264. Throughput: 0: 10739.2. Samples: 89198592. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:41,893][434108] Avg episode reward: [(0, '4470.888')] [2023-03-08 15:15:44,111][434396] Updated weights for policy 0, policy_version 174320 (0.0004) [2023-03-08 15:15:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10635.7). Total num frames: 89276416. Throughput: 0: 10620.9. Samples: 89260744. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:46,893][434108] Avg episode reward: [(0, '4513.989')] [2023-03-08 15:15:48,028][434396] Updated weights for policy 0, policy_version 174400 (0.0005) [2023-03-08 15:15:51,828][434396] Updated weights for policy 0, policy_version 174480 (0.0005) [2023-03-08 15:15:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10649.6). Total num frames: 89333760. Throughput: 0: 10651.6. Samples: 89325632. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:51,893][434108] Avg episode reward: [(0, '4511.984')] [2023-03-08 15:15:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000174480_89333760.pth... [2023-03-08 15:15:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000173864_89018368.pth [2023-03-08 15:15:55,848][434396] Updated weights for policy 0, policy_version 174560 (0.0005) [2023-03-08 15:15:56,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 89382912. Throughput: 0: 10616.1. Samples: 89356484. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:15:56,893][434108] Avg episode reward: [(0, '4482.366')] [2023-03-08 15:15:59,889][434396] Updated weights for policy 0, policy_version 174640 (0.0005) [2023-03-08 15:16:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 89436160. Throughput: 0: 10487.4. Samples: 89416868. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 15:16:01,893][434108] Avg episode reward: [(0, '4442.165')] [2023-03-08 15:16:03,902][434396] Updated weights for policy 0, policy_version 174720 (0.0005) [2023-03-08 15:16:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 89485312. Throughput: 0: 10398.0. Samples: 89478220. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:06,893][434108] Avg episode reward: [(0, '4491.243')] [2023-03-08 15:16:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000174776_89485312.pth... [2023-03-08 15:16:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000174168_89174016.pth [2023-03-08 15:16:07,910][434396] Updated weights for policy 0, policy_version 174800 (0.0004) [2023-03-08 15:16:11,857][434396] Updated weights for policy 0, policy_version 174880 (0.0005) [2023-03-08 15:16:11,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10649.6). Total num frames: 89538560. Throughput: 0: 10389.0. Samples: 89509144. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:11,893][434108] Avg episode reward: [(0, '4424.472')] [2023-03-08 15:16:15,877][434396] Updated weights for policy 0, policy_version 174960 (0.0006) [2023-03-08 15:16:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10635.7). Total num frames: 89587712. Throughput: 0: 10392.4. Samples: 89571328. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:16,893][434108] Avg episode reward: [(0, '4426.761')] [2023-03-08 15:16:19,590][434396] Updated weights for policy 0, policy_version 175040 (0.0005) [2023-03-08 15:16:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10649.6). Total num frames: 89645056. Throughput: 0: 10441.7. Samples: 89636236. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:21,893][434108] Avg episode reward: [(0, '4471.251')] [2023-03-08 15:16:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000175088_89645056.pth... [2023-03-08 15:16:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000174480_89333760.pth [2023-03-08 15:16:23,307][434396] Updated weights for policy 0, policy_version 175120 (0.0005) [2023-03-08 15:16:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 89698304. Throughput: 0: 10467.7. Samples: 89669640. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:26,893][434108] Avg episode reward: [(0, '4370.556')] [2023-03-08 15:16:27,099][434396] Updated weights for policy 0, policy_version 175200 (0.0004) [2023-03-08 15:16:31,005][434396] Updated weights for policy 0, policy_version 175280 (0.0004) [2023-03-08 15:16:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 89751552. Throughput: 0: 10477.8. Samples: 89732244. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:31,893][434108] Avg episode reward: [(0, '4481.010')] [2023-03-08 15:16:34,675][434396] Updated weights for policy 0, policy_version 175360 (0.0004) [2023-03-08 15:16:36,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10663.5). Total num frames: 89804800. Throughput: 0: 10512.6. Samples: 89798700. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:36,893][434108] Avg episode reward: [(0, '4448.089')] [2023-03-08 15:16:36,924][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000175408_89808896.pth... [2023-03-08 15:16:36,926][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000174776_89485312.pth [2023-03-08 15:16:38,394][434396] Updated weights for policy 0, policy_version 175440 (0.0004) [2023-03-08 15:16:41,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10677.4). Total num frames: 89862144. Throughput: 0: 10577.2. Samples: 89832460. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:41,893][434108] Avg episode reward: [(0, '4481.892')] [2023-03-08 15:16:42,232][434396] Updated weights for policy 0, policy_version 175520 (0.0005) [2023-03-08 15:16:45,966][434396] Updated weights for policy 0, policy_version 175600 (0.0004) [2023-03-08 15:16:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 89915392. Throughput: 0: 10664.6. Samples: 89896776. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:46,893][434108] Avg episode reward: [(0, '4526.674')] [2023-03-08 15:16:49,683][434396] Updated weights for policy 0, policy_version 175680 (0.0004) [2023-03-08 15:16:51,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10691.3). Total num frames: 89968640. Throughput: 0: 10768.1. Samples: 89962784. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:51,893][434108] Avg episode reward: [(0, '4471.851')] [2023-03-08 15:16:51,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000175720_89968640.pth... 
[2023-03-08 15:16:51,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000175088_89645056.pth [2023-03-08 15:16:53,373][434396] Updated weights for policy 0, policy_version 175760 (0.0005) [2023-03-08 15:16:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10705.1). Total num frames: 90025984. Throughput: 0: 10823.5. Samples: 89996200. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:16:56,893][434108] Avg episode reward: [(0, '4514.076')] [2023-03-08 15:16:57,174][434396] Updated weights for policy 0, policy_version 175840 (0.0004) [2023-03-08 15:17:01,243][434396] Updated weights for policy 0, policy_version 175920 (0.0005) [2023-03-08 15:17:01,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10691.3). Total num frames: 90075136. Throughput: 0: 10818.1. Samples: 90058144. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:17:01,893][434108] Avg episode reward: [(0, '4499.053')] [2023-03-08 15:17:04,920][434396] Updated weights for policy 0, policy_version 176000 (0.0005) [2023-03-08 15:17:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10691.3). Total num frames: 90132480. Throughput: 0: 10845.8. Samples: 90124296. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:17:06,893][434108] Avg episode reward: [(0, '4405.755')] [2023-03-08 15:17:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000176040_90132480.pth... [2023-03-08 15:17:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000175408_89808896.pth [2023-03-08 15:17:08,611][434396] Updated weights for policy 0, policy_version 176080 (0.0004) [2023-03-08 15:17:11,892][434108] Fps is (10 sec: 11468.8, 60 sec: 10854.4, 300 sec: 10705.1). Total num frames: 90189824. Throughput: 0: 10833.1. Samples: 90157128. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:11,893][434108] Avg episode reward: [(0, '4470.552')] [2023-03-08 15:17:12,228][434396] Updated weights for policy 0, policy_version 176160 (0.0004) [2023-03-08 15:17:15,949][434396] Updated weights for policy 0, policy_version 176240 (0.0004) [2023-03-08 15:17:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10705.1). Total num frames: 90243072. Throughput: 0: 10931.2. Samples: 90224148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:16,893][434108] Avg episode reward: [(0, '4489.180')] [2023-03-08 15:17:19,541][434396] Updated weights for policy 0, policy_version 176320 (0.0003) [2023-03-08 15:17:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10922.7, 300 sec: 10719.0). Total num frames: 90300416. Throughput: 0: 10964.0. Samples: 90292080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:21,893][434108] Avg episode reward: [(0, '4465.877')] [2023-03-08 15:17:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000176368_90300416.pth... [2023-03-08 15:17:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000175720_89968640.pth [2023-03-08 15:17:23,299][434396] Updated weights for policy 0, policy_version 176400 (0.0004) [2023-03-08 15:17:26,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10922.7, 300 sec: 10719.0). Total num frames: 90353664. Throughput: 0: 10940.2. Samples: 90324768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:26,893][434108] Avg episode reward: [(0, '4395.281')] [2023-03-08 15:17:27,032][434396] Updated weights for policy 0, policy_version 176480 (0.0004) [2023-03-08 15:17:30,888][434396] Updated weights for policy 0, policy_version 176560 (0.0004) [2023-03-08 15:17:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10922.7, 300 sec: 10732.9). Total num frames: 90406912. Throughput: 0: 10971.5. 
Samples: 90390492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:31,893][434108] Avg episode reward: [(0, '4306.380')] [2023-03-08 15:17:34,828][434396] Updated weights for policy 0, policy_version 176640 (0.0005) [2023-03-08 15:17:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10922.7, 300 sec: 10719.0). Total num frames: 90460160. Throughput: 0: 10872.3. Samples: 90452040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:36,893][434108] Avg episode reward: [(0, '4295.983')] [2023-03-08 15:17:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000176680_90460160.pth... [2023-03-08 15:17:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000176040_90132480.pth [2023-03-08 15:17:38,811][434396] Updated weights for policy 0, policy_version 176720 (0.0005) [2023-03-08 15:17:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10786.2, 300 sec: 10705.1). Total num frames: 90509312. Throughput: 0: 10818.1. Samples: 90483016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:41,893][434108] Avg episode reward: [(0, '4469.222')] [2023-03-08 15:17:42,809][434396] Updated weights for policy 0, policy_version 176800 (0.0005) [2023-03-08 15:17:46,840][434396] Updated weights for policy 0, policy_version 176880 (0.0005) [2023-03-08 15:17:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10786.1, 300 sec: 10705.1). Total num frames: 90562560. Throughput: 0: 10787.8. Samples: 90543592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:46,893][434108] Avg episode reward: [(0, '4351.321')] [2023-03-08 15:17:50,649][434396] Updated weights for policy 0, policy_version 176960 (0.0005) [2023-03-08 15:17:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10705.1). Total num frames: 90615808. Throughput: 0: 10740.4. Samples: 90607616. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:51,893][434108] Avg episode reward: [(0, '4340.020')] [2023-03-08 15:17:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000176984_90615808.pth... [2023-03-08 15:17:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000176368_90300416.pth [2023-03-08 15:17:54,593][434396] Updated weights for policy 0, policy_version 177040 (0.0004) [2023-03-08 15:17:56,893][434108] Fps is (10 sec: 10239.8, 60 sec: 10649.6, 300 sec: 10691.2). Total num frames: 90664960. Throughput: 0: 10696.4. Samples: 90638468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:17:56,893][434108] Avg episode reward: [(0, '4203.237')] [2023-03-08 15:17:58,568][434396] Updated weights for policy 0, policy_version 177120 (0.0005) [2023-03-08 15:18:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10705.1). Total num frames: 90718208. Throughput: 0: 10604.1. Samples: 90701332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:01,893][434108] Avg episode reward: [(0, '4241.943')] [2023-03-08 15:18:02,413][434396] Updated weights for policy 0, policy_version 177200 (0.0004) [2023-03-08 15:18:06,285][434396] Updated weights for policy 0, policy_version 177280 (0.0005) [2023-03-08 15:18:06,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10705.1). Total num frames: 90771456. Throughput: 0: 10479.3. Samples: 90763648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:06,893][434108] Avg episode reward: [(0, '3962.482')] [2023-03-08 15:18:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000177288_90771456.pth... 
[2023-03-08 15:18:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000176680_90460160.pth [2023-03-08 15:18:09,988][434396] Updated weights for policy 0, policy_version 177360 (0.0005) [2023-03-08 15:18:11,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10705.1). Total num frames: 90824704. Throughput: 0: 10516.7. Samples: 90798020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:11,893][434108] Avg episode reward: [(0, '4043.759')] [2023-03-08 15:18:13,902][434396] Updated weights for policy 0, policy_version 177440 (0.0005) [2023-03-08 15:18:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10691.3). Total num frames: 90877952. Throughput: 0: 10466.4. Samples: 90861480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:16,893][434108] Avg episode reward: [(0, '4309.543')] [2023-03-08 15:18:17,819][434396] Updated weights for policy 0, policy_version 177520 (0.0005) [2023-03-08 15:18:21,790][434396] Updated weights for policy 0, policy_version 177600 (0.0005) [2023-03-08 15:18:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10691.3). Total num frames: 90931200. Throughput: 0: 10467.4. Samples: 90923072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:21,893][434108] Avg episode reward: [(0, '4192.300')] [2023-03-08 15:18:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000177600_90931200.pth... [2023-03-08 15:18:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000176984_90615808.pth [2023-03-08 15:18:25,566][434396] Updated weights for policy 0, policy_version 177680 (0.0005) [2023-03-08 15:18:26,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10691.2). Total num frames: 90984448. Throughput: 0: 10501.1. Samples: 90955568. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:26,893][434108] Avg episode reward: [(0, '4215.960')] [2023-03-08 15:18:29,608][434396] Updated weights for policy 0, policy_version 177760 (0.0005) [2023-03-08 15:18:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10677.4). Total num frames: 91033600. Throughput: 0: 10529.4. Samples: 91017416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:31,893][434108] Avg episode reward: [(0, '4153.390')] [2023-03-08 15:18:33,613][434396] Updated weights for policy 0, policy_version 177840 (0.0005) [2023-03-08 15:18:36,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10663.5). Total num frames: 91082752. Throughput: 0: 10434.4. Samples: 91077164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:36,893][434108] Avg episode reward: [(0, '4317.704')] [2023-03-08 15:18:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000177896_91082752.pth... [2023-03-08 15:18:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000177288_90771456.pth [2023-03-08 15:18:37,789][434396] Updated weights for policy 0, policy_version 177920 (0.0005) [2023-03-08 15:18:41,797][434396] Updated weights for policy 0, policy_version 178000 (0.0006) [2023-03-08 15:18:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10663.5). Total num frames: 91136000. Throughput: 0: 10419.1. Samples: 91107328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:41,893][434108] Avg episode reward: [(0, '4233.060')] [2023-03-08 15:18:45,901][434396] Updated weights for policy 0, policy_version 178080 (0.0006) [2023-03-08 15:18:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10663.5). Total num frames: 91185152. Throughput: 0: 10386.2. Samples: 91168712. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:46,893][434108] Avg episode reward: [(0, '4349.906')] [2023-03-08 15:18:50,102][434396] Updated weights for policy 0, policy_version 178160 (0.0006) [2023-03-08 15:18:51,893][434108] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10635.7). Total num frames: 91234304. Throughput: 0: 10291.0. Samples: 91226744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:51,893][434108] Avg episode reward: [(0, '4405.336')] [2023-03-08 15:18:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000178192_91234304.pth... [2023-03-08 15:18:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000177600_90931200.pth [2023-03-08 15:18:54,370][434396] Updated weights for policy 0, policy_version 178240 (0.0005) [2023-03-08 15:18:56,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10621.8). Total num frames: 91283456. Throughput: 0: 10160.9. Samples: 91255260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:18:56,893][434108] Avg episode reward: [(0, '4295.372')] [2023-03-08 15:18:58,446][434396] Updated weights for policy 0, policy_version 178320 (0.0005) [2023-03-08 15:19:01,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10594.1). Total num frames: 91332608. Throughput: 0: 10105.7. Samples: 91316236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:01,893][434108] Avg episode reward: [(0, '4358.542')] [2023-03-08 15:19:02,516][434396] Updated weights for policy 0, policy_version 178400 (0.0005) [2023-03-08 15:19:06,555][434396] Updated weights for policy 0, policy_version 178480 (0.0006) [2023-03-08 15:19:06,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10580.2). Total num frames: 91381760. Throughput: 0: 10068.0. Samples: 91376132. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:06,893][434108] Avg episode reward: [(0, '4296.637')] [2023-03-08 15:19:06,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000178480_91381760.pth... [2023-03-08 15:19:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000177896_91082752.pth [2023-03-08 15:19:10,672][434396] Updated weights for policy 0, policy_version 178560 (0.0004) [2023-03-08 15:19:11,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10580.2). Total num frames: 91435008. Throughput: 0: 9988.7. Samples: 91405056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:11,893][434108] Avg episode reward: [(0, '4451.407')] [2023-03-08 15:19:14,468][434396] Updated weights for policy 0, policy_version 178640 (0.0004) [2023-03-08 15:19:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10580.2). Total num frames: 91488256. Throughput: 0: 10048.4. Samples: 91469592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:16,893][434108] Avg episode reward: [(0, '4406.817')] [2023-03-08 15:19:18,659][434396] Updated weights for policy 0, policy_version 178720 (0.0005) [2023-03-08 15:19:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10566.3). Total num frames: 91537408. Throughput: 0: 10060.9. Samples: 91529904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:21,893][434108] Avg episode reward: [(0, '4502.472')] [2023-03-08 15:19:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000178784_91537408.pth... 
[2023-03-08 15:19:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000178192_91234304.pth [2023-03-08 15:19:22,440][434396] Updated weights for policy 0, policy_version 178800 (0.0005) [2023-03-08 15:19:26,737][434396] Updated weights for policy 0, policy_version 178880 (0.0005) [2023-03-08 15:19:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10552.4). Total num frames: 91586560. Throughput: 0: 10079.8. Samples: 91560920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:26,893][434108] Avg episode reward: [(0, '4486.688')] [2023-03-08 15:19:30,935][434396] Updated weights for policy 0, policy_version 178960 (0.0005) [2023-03-08 15:19:31,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10538.5). Total num frames: 91635712. Throughput: 0: 10013.7. Samples: 91619328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:31,893][434108] Avg episode reward: [(0, '4514.973')] [2023-03-08 15:19:34,910][434396] Updated weights for policy 0, policy_version 179040 (0.0005) [2023-03-08 15:19:36,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 10524.6). Total num frames: 91684864. Throughput: 0: 10084.4. Samples: 91680540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:36,893][434108] Avg episode reward: [(0, '4449.708')] [2023-03-08 15:19:36,923][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000179080_91688960.pth... [2023-03-08 15:19:36,925][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000178480_91381760.pth [2023-03-08 15:19:38,924][434396] Updated weights for policy 0, policy_version 179120 (0.0005) [2023-03-08 15:19:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10524.6). Total num frames: 91738112. Throughput: 0: 10109.9. Samples: 91710208. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:41,893][434108] Avg episode reward: [(0, '4460.519')] [2023-03-08 15:19:42,836][434396] Updated weights for policy 0, policy_version 179200 (0.0004) [2023-03-08 15:19:46,879][434396] Updated weights for policy 0, policy_version 179280 (0.0005) [2023-03-08 15:19:46,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10510.8). Total num frames: 91791360. Throughput: 0: 10131.5. Samples: 91772152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:46,893][434108] Avg episode reward: [(0, '4437.987')] [2023-03-08 15:19:50,890][434396] Updated weights for policy 0, policy_version 179360 (0.0005) [2023-03-08 15:19:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10496.9). Total num frames: 91840512. Throughput: 0: 10165.4. Samples: 91833576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:51,893][434108] Avg episode reward: [(0, '4322.675')] [2023-03-08 15:19:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000179376_91840512.pth... [2023-03-08 15:19:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000178784_91537408.pth [2023-03-08 15:19:54,975][434396] Updated weights for policy 0, policy_version 179440 (0.0005) [2023-03-08 15:19:56,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10469.1). Total num frames: 91889664. Throughput: 0: 10209.4. Samples: 91864480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:19:56,893][434108] Avg episode reward: [(0, '4261.904')] [2023-03-08 15:19:59,048][434396] Updated weights for policy 0, policy_version 179520 (0.0005) [2023-03-08 15:20:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10469.1). Total num frames: 91942912. Throughput: 0: 10102.0. Samples: 91924184. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:20:01,904][434108] Avg episode reward: [(0, '4289.933')] [2023-03-08 15:20:03,033][434396] Updated weights for policy 0, policy_version 179600 (0.0005) [2023-03-08 15:20:06,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10455.2). Total num frames: 91992064. Throughput: 0: 10121.1. Samples: 91985352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:20:06,893][434108] Avg episode reward: [(0, '4081.293')] [2023-03-08 15:20:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000179672_91992064.pth... [2023-03-08 15:20:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000179080_91688960.pth [2023-03-08 15:20:07,075][434396] Updated weights for policy 0, policy_version 179680 (0.0005) [2023-03-08 15:20:11,117][434396] Updated weights for policy 0, policy_version 179760 (0.0005) [2023-03-08 15:20:11,892][434108] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10427.4). Total num frames: 92041216. Throughput: 0: 10127.1. Samples: 92016640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:20:11,893][434108] Avg episode reward: [(0, '4199.758')] [2023-03-08 15:20:15,207][434396] Updated weights for policy 0, policy_version 179840 (0.0005) [2023-03-08 15:20:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10427.4). Total num frames: 92094464. Throughput: 0: 10144.9. Samples: 92075848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:20:16,893][434108] Avg episode reward: [(0, '4314.712')] [2023-03-08 15:20:18,887][434396] Updated weights for policy 0, policy_version 179920 (0.0004) [2023-03-08 15:20:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10441.3). Total num frames: 92147712. Throughput: 0: 10237.2. Samples: 92141216. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:20:21,893][434108] Avg episode reward: [(0, '4191.351')] [2023-03-08 15:20:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000179976_92147712.pth... [2023-03-08 15:20:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000179376_91840512.pth [2023-03-08 15:20:22,908][434396] Updated weights for policy 0, policy_version 180000 (0.0005) [2023-03-08 15:20:26,874][434396] Updated weights for policy 0, policy_version 180080 (0.0004) [2023-03-08 15:20:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10441.3). Total num frames: 92200960. Throughput: 0: 10252.3. Samples: 92171560. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:20:26,893][434108] Avg episode reward: [(0, '4329.818')] [2023-03-08 15:20:30,715][434396] Updated weights for policy 0, policy_version 180160 (0.0004) [2023-03-08 15:20:31,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 92250112. Throughput: 0: 10275.9. Samples: 92234568. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:20:31,893][434108] Avg episode reward: [(0, '4326.116')] [2023-03-08 15:20:34,800][434396] Updated weights for policy 0, policy_version 180240 (0.0005) [2023-03-08 15:20:36,892][434108] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10413.6). Total num frames: 92299264. Throughput: 0: 10259.2. Samples: 92295240. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:20:36,893][434108] Avg episode reward: [(0, '4378.201')] [2023-03-08 15:20:36,921][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000180280_92303360.pth... 
[2023-03-08 15:20:36,923][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000179672_91992064.pth [2023-03-08 15:20:38,925][434396] Updated weights for policy 0, policy_version 180320 (0.0005) [2023-03-08 15:20:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 92352512. Throughput: 0: 10224.8. Samples: 92324596. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:20:41,893][434108] Avg episode reward: [(0, '4493.168')] [2023-03-08 15:20:42,630][434396] Updated weights for policy 0, policy_version 180400 (0.0004) [2023-03-08 15:20:46,396][434396] Updated weights for policy 0, policy_version 180480 (0.0005) [2023-03-08 15:20:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 92409856. Throughput: 0: 10384.4. Samples: 92391480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:20:46,893][434108] Avg episode reward: [(0, '4445.664')] [2023-03-08 15:20:50,379][434396] Updated weights for policy 0, policy_version 180560 (0.0005) [2023-03-08 15:20:51,893][434108] Fps is (10 sec: 10649.4, 60 sec: 10308.2, 300 sec: 10427.4). Total num frames: 92459008. Throughput: 0: 10404.1. Samples: 92453540. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:20:51,893][434108] Avg episode reward: [(0, '4385.960')] [2023-03-08 15:20:51,924][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000180592_92463104.pth... [2023-03-08 15:20:51,925][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000179976_92147712.pth [2023-03-08 15:20:54,314][434396] Updated weights for policy 0, policy_version 180640 (0.0005) [2023-03-08 15:20:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 92512256. Throughput: 0: 10394.8. Samples: 92484408. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:20:56,893][434108] Avg episode reward: [(0, '4421.389')] [2023-03-08 15:20:58,275][434396] Updated weights for policy 0, policy_version 180720 (0.0005) [2023-03-08 15:21:01,893][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 92561408. Throughput: 0: 10454.9. Samples: 92546320. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:21:01,893][434108] Avg episode reward: [(0, '4462.601')] [2023-03-08 15:21:02,324][434396] Updated weights for policy 0, policy_version 180800 (0.0005) [2023-03-08 15:21:06,443][434396] Updated weights for policy 0, policy_version 180880 (0.0005) [2023-03-08 15:21:06,893][434108] Fps is (10 sec: 10239.8, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 92614656. Throughput: 0: 10325.1. Samples: 92605848. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:21:06,893][434108] Avg episode reward: [(0, '4590.606')] [2023-03-08 15:21:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000180888_92614656.pth... [2023-03-08 15:21:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000180280_92303360.pth [2023-03-08 15:21:10,421][434396] Updated weights for policy 0, policy_version 180960 (0.0006) [2023-03-08 15:21:11,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 92663808. Throughput: 0: 10343.6. Samples: 92637024. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:21:11,893][434108] Avg episode reward: [(0, '4494.129')] [2023-03-08 15:21:14,463][434396] Updated weights for policy 0, policy_version 181040 (0.0005) [2023-03-08 15:21:16,892][434108] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 92717056. Throughput: 0: 10308.3. Samples: 92698444. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:21:16,893][434108] Avg episode reward: [(0, '4471.990')] [2023-03-08 15:21:18,360][434396] Updated weights for policy 0, policy_version 181120 (0.0005) [2023-03-08 15:21:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10399.7). Total num frames: 92766208. Throughput: 0: 10341.8. Samples: 92760620. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:21:21,893][434108] Avg episode reward: [(0, '4590.326')] [2023-03-08 15:21:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000181184_92766208.pth... [2023-03-08 15:21:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000180592_92463104.pth [2023-03-08 15:21:22,322][434396] Updated weights for policy 0, policy_version 181200 (0.0005) [2023-03-08 15:21:26,109][434396] Updated weights for policy 0, policy_version 181280 (0.0005) [2023-03-08 15:21:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10399.7). Total num frames: 92819456. Throughput: 0: 10409.2. Samples: 92793008. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 15:21:26,893][434108] Avg episode reward: [(0, '4539.413')] [2023-03-08 15:21:29,964][434396] Updated weights for policy 0, policy_version 181360 (0.0004) [2023-03-08 15:21:31,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10444.8, 300 sec: 10413.6). Total num frames: 92876800. Throughput: 0: 10334.7. Samples: 92856540. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:21:31,893][434108] Avg episode reward: [(0, '4574.631')] [2023-03-08 15:21:33,772][434396] Updated weights for policy 0, policy_version 181440 (0.0004) [2023-03-08 15:21:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10385.8). Total num frames: 92925952. Throughput: 0: 10400.2. Samples: 92921548. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:21:36,893][434108] Avg episode reward: [(0, '4558.574')] [2023-03-08 15:21:36,923][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000181504_92930048.pth... [2023-03-08 15:21:36,924][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000180888_92614656.pth [2023-03-08 15:21:37,743][434396] Updated weights for policy 0, policy_version 181520 (0.0005) [2023-03-08 15:21:41,473][434396] Updated weights for policy 0, policy_version 181600 (0.0005) [2023-03-08 15:21:41,893][434108] Fps is (10 sec: 10649.3, 60 sec: 10513.0, 300 sec: 10399.7). Total num frames: 92983296. Throughput: 0: 10386.9. Samples: 92951820. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:21:41,893][434108] Avg episode reward: [(0, '4531.073')] [2023-03-08 15:21:45,297][434396] Updated weights for policy 0, policy_version 181680 (0.0004) [2023-03-08 15:21:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10444.8, 300 sec: 10399.7). Total num frames: 93036544. Throughput: 0: 10457.5. Samples: 93016908. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:21:46,893][434108] Avg episode reward: [(0, '4562.695')] [2023-03-08 15:21:49,062][434396] Updated weights for policy 0, policy_version 181760 (0.0003) [2023-03-08 15:21:51,892][434108] Fps is (10 sec: 10649.8, 60 sec: 10513.1, 300 sec: 10385.8). Total num frames: 93089792. Throughput: 0: 10572.6. Samples: 93081612. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:21:51,893][434108] Avg episode reward: [(0, '4537.247')] [2023-03-08 15:21:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000181816_93089792.pth... 
[2023-03-08 15:21:51,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000181184_92766208.pth [2023-03-08 15:21:52,844][434396] Updated weights for policy 0, policy_version 181840 (0.0005) [2023-03-08 15:21:56,551][434396] Updated weights for policy 0, policy_version 181920 (0.0004) [2023-03-08 15:21:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10399.7). Total num frames: 93143040. Throughput: 0: 10611.6. Samples: 93114544. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:21:56,893][434108] Avg episode reward: [(0, '4567.530')] [2023-03-08 15:22:00,396][434396] Updated weights for policy 0, policy_version 182000 (0.0005) [2023-03-08 15:22:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.4, 300 sec: 10385.8). Total num frames: 93196288. Throughput: 0: 10700.5. Samples: 93179968. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:22:01,893][434108] Avg episode reward: [(0, '4584.536')] [2023-03-08 15:22:04,198][434396] Updated weights for policy 0, policy_version 182080 (0.0005) [2023-03-08 15:22:06,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 93253632. Throughput: 0: 10772.1. Samples: 93245364. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:22:06,893][434108] Avg episode reward: [(0, '4567.114')] [2023-03-08 15:22:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000182136_93253632.pth... [2023-03-08 15:22:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000181504_92930048.pth [2023-03-08 15:22:07,889][434396] Updated weights for policy 0, policy_version 182160 (0.0004) [2023-03-08 15:22:11,629][434396] Updated weights for policy 0, policy_version 182240 (0.0004) [2023-03-08 15:22:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10385.8). 
Total num frames: 93306880. Throughput: 0: 10782.2. Samples: 93278208. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:22:11,893][434108] Avg episode reward: [(0, '4574.871')] [2023-03-08 15:22:15,585][434396] Updated weights for policy 0, policy_version 182320 (0.0004) [2023-03-08 15:22:16,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10371.9). Total num frames: 93360128. Throughput: 0: 10824.8. Samples: 93343656. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:22:16,893][434108] Avg episode reward: [(0, '4567.578')] [2023-03-08 15:22:19,696][434396] Updated weights for policy 0, policy_version 182400 (0.0005) [2023-03-08 15:22:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10717.9, 300 sec: 10358.0). Total num frames: 93409280. Throughput: 0: 10694.9. Samples: 93402820. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:22:21,893][434108] Avg episode reward: [(0, '4587.229')] [2023-03-08 15:22:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000182440_93409280.pth... [2023-03-08 15:22:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000181816_93089792.pth [2023-03-08 15:22:23,732][434396] Updated weights for policy 0, policy_version 182480 (0.0005) [2023-03-08 15:22:26,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10649.6, 300 sec: 10344.1). Total num frames: 93458432. Throughput: 0: 10689.7. Samples: 93432856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:22:26,893][434108] Avg episode reward: [(0, '4599.422')] [2023-03-08 15:22:27,766][434396] Updated weights for policy 0, policy_version 182560 (0.0005) [2023-03-08 15:22:31,702][434396] Updated weights for policy 0, policy_version 182640 (0.0005) [2023-03-08 15:22:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10344.1). Total num frames: 93511680. Throughput: 0: 10602.9. Samples: 93494040. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 15:22:31,893][434108] Avg episode reward: [(0, '4543.790')] [2023-03-08 15:22:35,516][434396] Updated weights for policy 0, policy_version 182720 (0.0005) [2023-03-08 15:22:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10358.0). Total num frames: 93564928. Throughput: 0: 10588.5. Samples: 93558092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:22:36,893][434108] Avg episode reward: [(0, '4565.336')] [2023-03-08 15:22:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000182744_93564928.pth... [2023-03-08 15:22:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000182136_93253632.pth [2023-03-08 15:22:39,141][434396] Updated weights for policy 0, policy_version 182800 (0.0004) [2023-03-08 15:22:41,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10371.9). Total num frames: 93622272. Throughput: 0: 10608.2. Samples: 93591912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:22:41,893][434108] Avg episode reward: [(0, '4545.034')] [2023-03-08 15:22:42,817][434396] Updated weights for policy 0, policy_version 182880 (0.0005) [2023-03-08 15:22:46,720][434396] Updated weights for policy 0, policy_version 182960 (0.0005) [2023-03-08 15:22:46,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10371.9). Total num frames: 93675520. Throughput: 0: 10642.7. Samples: 93658892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:22:46,893][434108] Avg episode reward: [(0, '4569.984')] [2023-03-08 15:22:50,664][434396] Updated weights for policy 0, policy_version 183040 (0.0005) [2023-03-08 15:22:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 93728768. Throughput: 0: 10559.5. Samples: 93720540. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:22:51,893][434108] Avg episode reward: [(0, '4433.573')] [2023-03-08 15:22:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000183064_93728768.pth... [2023-03-08 15:22:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000182440_93409280.pth [2023-03-08 15:22:54,544][434396] Updated weights for policy 0, policy_version 183120 (0.0004) [2023-03-08 15:22:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10371.9). Total num frames: 93777920. Throughput: 0: 10542.7. Samples: 93752628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:22:56,893][434108] Avg episode reward: [(0, '4541.812')] [2023-03-08 15:22:58,506][434396] Updated weights for policy 0, policy_version 183200 (0.0005) [2023-03-08 15:23:01,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 93835264. Throughput: 0: 10469.7. Samples: 93814792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:23:01,893][434108] Avg episode reward: [(0, '4578.575')] [2023-03-08 15:23:02,222][434396] Updated weights for policy 0, policy_version 183280 (0.0004) [2023-03-08 15:23:05,925][434396] Updated weights for policy 0, policy_version 183360 (0.0004) [2023-03-08 15:23:06,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10385.8). Total num frames: 93888512. Throughput: 0: 10628.4. Samples: 93881100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:23:06,893][434108] Avg episode reward: [(0, '4592.825')] [2023-03-08 15:23:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000183376_93888512.pth... 
[2023-03-08 15:23:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000182744_93564928.pth [2023-03-08 15:23:09,864][434396] Updated weights for policy 0, policy_version 183440 (0.0006) [2023-03-08 15:23:11,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10385.8). Total num frames: 93941760. Throughput: 0: 10672.1. Samples: 93913100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:23:11,893][434108] Avg episode reward: [(0, '4585.022')] [2023-03-08 15:23:13,805][434396] Updated weights for policy 0, policy_version 183520 (0.0005) [2023-03-08 15:23:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10385.8). Total num frames: 93995008. Throughput: 0: 10693.1. Samples: 93975228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:23:16,893][434108] Avg episode reward: [(0, '4554.559')] [2023-03-08 15:23:17,588][434396] Updated weights for policy 0, policy_version 183600 (0.0004) [2023-03-08 15:23:21,329][434396] Updated weights for policy 0, policy_version 183680 (0.0005) [2023-03-08 15:23:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 94048256. Throughput: 0: 10713.0. Samples: 94040176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:23:21,893][434108] Avg episode reward: [(0, '4551.292')] [2023-03-08 15:23:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000183688_94048256.pth... [2023-03-08 15:23:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000183064_93728768.pth [2023-03-08 15:23:25,286][434396] Updated weights for policy 0, policy_version 183760 (0.0005) [2023-03-08 15:23:26,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 94097408. Throughput: 0: 10665.0. Samples: 94071836. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:23:26,893][434108] Avg episode reward: [(0, '4482.516')] [2023-03-08 15:23:29,408][434396] Updated weights for policy 0, policy_version 183840 (0.0005) [2023-03-08 15:23:31,892][434108] Fps is (10 sec: 10240.2, 60 sec: 10649.6, 300 sec: 10399.7). Total num frames: 94150656. Throughput: 0: 10523.7. Samples: 94132456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:23:31,893][434108] Avg episode reward: [(0, '4524.984')] [2023-03-08 15:23:33,452][434396] Updated weights for policy 0, policy_version 183920 (0.0005) [2023-03-08 15:23:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10385.8). Total num frames: 94199808. Throughput: 0: 10516.1. Samples: 94193764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:23:36,893][434108] Avg episode reward: [(0, '4554.547')] [2023-03-08 15:23:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000183984_94199808.pth... [2023-03-08 15:23:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000183376_93888512.pth [2023-03-08 15:23:37,390][434396] Updated weights for policy 0, policy_version 184000 (0.0005) [2023-03-08 15:23:41,239][434396] Updated weights for policy 0, policy_version 184080 (0.0005) [2023-03-08 15:23:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10399.7). Total num frames: 94253056. Throughput: 0: 10498.4. Samples: 94225056. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:23:41,893][434108] Avg episode reward: [(0, '4551.318')] [2023-03-08 15:23:45,367][434396] Updated weights for policy 0, policy_version 184160 (0.0005) [2023-03-08 15:23:46,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10399.7). Total num frames: 94302208. Throughput: 0: 10476.1. Samples: 94286216. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:23:46,893][434108] Avg episode reward: [(0, '4534.585')] [2023-03-08 15:23:49,403][434396] Updated weights for policy 0, policy_version 184240 (0.0005) [2023-03-08 15:23:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10413.6). Total num frames: 94355456. Throughput: 0: 10359.4. Samples: 94347272. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:23:51,893][434108] Avg episode reward: [(0, '4479.044')] [2023-03-08 15:23:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000184288_94355456.pth... [2023-03-08 15:23:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000183688_94048256.pth [2023-03-08 15:23:53,295][434396] Updated weights for policy 0, policy_version 184320 (0.0005) [2023-03-08 15:23:56,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10427.4). Total num frames: 94408704. Throughput: 0: 10373.1. Samples: 94379888. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:23:56,893][434108] Avg episode reward: [(0, '4554.747')] [2023-03-08 15:23:57,264][434396] Updated weights for policy 0, policy_version 184400 (0.0006) [2023-03-08 15:24:01,198][434396] Updated weights for policy 0, policy_version 184480 (0.0005) [2023-03-08 15:24:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 94457856. Throughput: 0: 10362.4. Samples: 94441536. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:01,893][434108] Avg episode reward: [(0, '4552.968')] [2023-03-08 15:24:05,142][434396] Updated weights for policy 0, policy_version 184560 (0.0005) [2023-03-08 15:24:06,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 94511104. Throughput: 0: 10291.8. Samples: 94503308. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:06,893][434108] Avg episode reward: [(0, '4541.222')] [2023-03-08 15:24:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000184592_94511104.pth... [2023-03-08 15:24:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000183984_94199808.pth [2023-03-08 15:24:09,117][434396] Updated weights for policy 0, policy_version 184640 (0.0005) [2023-03-08 15:24:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10413.6). Total num frames: 94560256. Throughput: 0: 10294.0. Samples: 94535064. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:11,893][434108] Avg episode reward: [(0, '4568.246')] [2023-03-08 15:24:13,321][434396] Updated weights for policy 0, policy_version 184720 (0.0005) [2023-03-08 15:24:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 94613504. Throughput: 0: 10255.0. Samples: 94593932. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:16,893][434108] Avg episode reward: [(0, '4556.272')] [2023-03-08 15:24:17,258][434396] Updated weights for policy 0, policy_version 184800 (0.0005) [2023-03-08 15:24:21,283][434396] Updated weights for policy 0, policy_version 184880 (0.0005) [2023-03-08 15:24:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 94662656. Throughput: 0: 10266.4. Samples: 94655752. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:21,893][434108] Avg episode reward: [(0, '4514.652')] [2023-03-08 15:24:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000184888_94662656.pth... 
[2023-03-08 15:24:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000184288_94355456.pth [2023-03-08 15:24:25,149][434396] Updated weights for policy 0, policy_version 184960 (0.0004) [2023-03-08 15:24:26,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10441.3). Total num frames: 94715904. Throughput: 0: 10272.0. Samples: 94687296. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:26,893][434108] Avg episode reward: [(0, '4532.565')] [2023-03-08 15:24:29,048][434396] Updated weights for policy 0, policy_version 185040 (0.0005) [2023-03-08 15:24:31,893][434108] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10441.3). Total num frames: 94765056. Throughput: 0: 10328.0. Samples: 94750976. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:31,893][434108] Avg episode reward: [(0, '4523.722')] [2023-03-08 15:24:33,579][434396] Updated weights for policy 0, policy_version 185120 (0.0005) [2023-03-08 15:24:36,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 94814208. Throughput: 0: 10202.3. Samples: 94806376. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:36,893][434108] Avg episode reward: [(0, '4568.090')] [2023-03-08 15:24:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000185184_94814208.pth... [2023-03-08 15:24:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000184592_94511104.pth [2023-03-08 15:24:37,442][434396] Updated weights for policy 0, policy_version 185200 (0.0004) [2023-03-08 15:24:41,181][434396] Updated weights for policy 0, policy_version 185280 (0.0004) [2023-03-08 15:24:41,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 94867456. Throughput: 0: 10218.8. Samples: 94839736. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:41,893][434108] Avg episode reward: [(0, '4571.178')] [2023-03-08 15:24:45,091][434396] Updated weights for policy 0, policy_version 185360 (0.0005) [2023-03-08 15:24:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10441.3). Total num frames: 94920704. Throughput: 0: 10268.7. Samples: 94903628. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:24:46,893][434108] Avg episode reward: [(0, '4525.754')] [2023-03-08 15:24:48,758][434396] Updated weights for policy 0, policy_version 185440 (0.0004) [2023-03-08 15:24:51,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 94978048. Throughput: 0: 10369.2. Samples: 94969920. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:24:51,893][434108] Avg episode reward: [(0, '4528.875')] [2023-03-08 15:24:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000185504_94978048.pth... [2023-03-08 15:24:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000184888_94662656.pth [2023-03-08 15:24:52,505][434396] Updated weights for policy 0, policy_version 185520 (0.0004) [2023-03-08 15:24:56,615][434396] Updated weights for policy 0, policy_version 185600 (0.0005) [2023-03-08 15:24:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 95027200. Throughput: 0: 10389.1. Samples: 95002572. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:24:56,893][434108] Avg episode reward: [(0, '4590.620')] [2023-03-08 15:25:00,696][434396] Updated weights for policy 0, policy_version 185680 (0.0005) [2023-03-08 15:25:01,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 95076352. Throughput: 0: 10380.3. Samples: 95061048. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:01,893][434108] Avg episode reward: [(0, '4594.390')] [2023-03-08 15:25:04,840][434396] Updated weights for policy 0, policy_version 185760 (0.0005) [2023-03-08 15:25:06,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10455.2). Total num frames: 95125504. Throughput: 0: 10334.5. Samples: 95120804. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:06,893][434108] Avg episode reward: [(0, '4571.251')] [2023-03-08 15:25:06,954][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000185800_95129600.pth... [2023-03-08 15:25:06,955][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000185184_94814208.pth [2023-03-08 15:25:08,958][434396] Updated weights for policy 0, policy_version 185840 (0.0006) [2023-03-08 15:25:11,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 95178752. Throughput: 0: 10290.7. Samples: 95150376. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:11,893][434108] Avg episode reward: [(0, '4561.484')] [2023-03-08 15:25:12,734][434396] Updated weights for policy 0, policy_version 185920 (0.0004) [2023-03-08 15:25:16,376][434396] Updated weights for policy 0, policy_version 186000 (0.0005) [2023-03-08 15:25:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10376.5, 300 sec: 10469.1). Total num frames: 95236096. Throughput: 0: 10334.3. Samples: 95216020. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:16,893][434108] Avg episode reward: [(0, '4557.414')] [2023-03-08 15:25:20,171][434396] Updated weights for policy 0, policy_version 186080 (0.0005) [2023-03-08 15:25:21,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 95289344. Throughput: 0: 10574.8. Samples: 95282240. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:21,893][434108] Avg episode reward: [(0, '4555.584')] [2023-03-08 15:25:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000186112_95289344.pth... [2023-03-08 15:25:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000185504_94978048.pth [2023-03-08 15:25:23,879][434396] Updated weights for policy 0, policy_version 186160 (0.0004) [2023-03-08 15:25:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 95346688. Throughput: 0: 10568.9. Samples: 95315336. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:26,893][434108] Avg episode reward: [(0, '4552.617')] [2023-03-08 15:25:27,465][434396] Updated weights for policy 0, policy_version 186240 (0.0003) [2023-03-08 15:25:31,359][434396] Updated weights for policy 0, policy_version 186320 (0.0004) [2023-03-08 15:25:31,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.4, 300 sec: 10510.8). Total num frames: 95399936. Throughput: 0: 10644.6. Samples: 95382636. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:31,893][434108] Avg episode reward: [(0, '4580.903')] [2023-03-08 15:25:35,351][434396] Updated weights for policy 0, policy_version 186400 (0.0005) [2023-03-08 15:25:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 95449088. Throughput: 0: 10530.9. Samples: 95443812. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:36,893][434108] Avg episode reward: [(0, '4598.871')] [2023-03-08 15:25:36,913][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000186432_95453184.pth... 
[2023-03-08 15:25:36,914][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000185800_95129600.pth [2023-03-08 15:25:39,247][434396] Updated weights for policy 0, policy_version 186480 (0.0005) [2023-03-08 15:25:41,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 95502336. Throughput: 0: 10501.7. Samples: 95475148. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:41,893][434108] Avg episode reward: [(0, '4534.323')] [2023-03-08 15:25:43,288][434396] Updated weights for policy 0, policy_version 186560 (0.0005) [2023-03-08 15:25:46,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 95551488. Throughput: 0: 10547.5. Samples: 95535684. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:46,893][434108] Avg episode reward: [(0, '4508.405')] [2023-03-08 15:25:47,360][434396] Updated weights for policy 0, policy_version 186640 (0.0005) [2023-03-08 15:25:51,301][434396] Updated weights for policy 0, policy_version 186720 (0.0005) [2023-03-08 15:25:51,893][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 95604736. Throughput: 0: 10592.1. Samples: 95597448. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 15:25:51,893][434108] Avg episode reward: [(0, '4519.653')] [2023-03-08 15:25:51,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000186728_95604736.pth... [2023-03-08 15:25:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000186112_95289344.pth [2023-03-08 15:25:55,177][434396] Updated weights for policy 0, policy_version 186800 (0.0005) [2023-03-08 15:25:56,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 95657984. Throughput: 0: 10644.4. Samples: 95629376. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:25:56,893][434108] Avg episode reward: [(0, '4490.525')] [2023-03-08 15:25:59,167][434396] Updated weights for policy 0, policy_version 186880 (0.0005) [2023-03-08 15:26:01,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 95707136. Throughput: 0: 10557.8. Samples: 95691120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:01,893][434108] Avg episode reward: [(0, '4557.907')] [2023-03-08 15:26:03,129][434396] Updated weights for policy 0, policy_version 186960 (0.0005) [2023-03-08 15:26:06,893][434108] Fps is (10 sec: 10239.8, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 95760384. Throughput: 0: 10470.8. Samples: 95753428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:06,893][434108] Avg episode reward: [(0, '4540.613')] [2023-03-08 15:26:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000187032_95760384.pth... [2023-03-08 15:26:06,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000186432_95453184.pth [2023-03-08 15:26:07,102][434396] Updated weights for policy 0, policy_version 187040 (0.0005) [2023-03-08 15:26:11,039][434396] Updated weights for policy 0, policy_version 187120 (0.0005) [2023-03-08 15:26:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 95813632. Throughput: 0: 10436.1. Samples: 95784960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:11,893][434108] Avg episode reward: [(0, '4516.203')] [2023-03-08 15:26:15,131][434396] Updated weights for policy 0, policy_version 187200 (0.0005) [2023-03-08 15:26:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 95862784. Throughput: 0: 10285.1. Samples: 95845464. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:16,893][434108] Avg episode reward: [(0, '4523.555')] [2023-03-08 15:26:19,090][434396] Updated weights for policy 0, policy_version 187280 (0.0005) [2023-03-08 15:26:21,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 95916032. Throughput: 0: 10301.9. Samples: 95907396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:21,893][434108] Avg episode reward: [(0, '4530.971')] [2023-03-08 15:26:21,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000187336_95916032.pth... [2023-03-08 15:26:21,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000186728_95604736.pth [2023-03-08 15:26:23,023][434396] Updated weights for policy 0, policy_version 187360 (0.0005) [2023-03-08 15:26:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 95965184. Throughput: 0: 10303.9. Samples: 95938820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:26,893][434108] Avg episode reward: [(0, '4554.344')] [2023-03-08 15:26:26,920][434396] Updated weights for policy 0, policy_version 187440 (0.0005) [2023-03-08 15:26:30,550][434396] Updated weights for policy 0, policy_version 187520 (0.0004) [2023-03-08 15:26:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10496.9). Total num frames: 96022528. Throughput: 0: 10406.1. Samples: 96003960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:31,893][434108] Avg episode reward: [(0, '4527.627')] [2023-03-08 15:26:34,190][434396] Updated weights for policy 0, policy_version 187600 (0.0004) [2023-03-08 15:26:36,892][434108] Fps is (10 sec: 11468.6, 60 sec: 10513.0, 300 sec: 10496.9). Total num frames: 96079872. Throughput: 0: 10533.6. Samples: 96071460. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:36,893][434108] Avg episode reward: [(0, '4525.777')] [2023-03-08 15:26:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000187656_96079872.pth... [2023-03-08 15:26:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000187032_95760384.pth [2023-03-08 15:26:38,054][434396] Updated weights for policy 0, policy_version 187680 (0.0004) [2023-03-08 15:26:41,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 96129024. Throughput: 0: 10494.7. Samples: 96101640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:41,893][434108] Avg episode reward: [(0, '4545.780')] [2023-03-08 15:26:41,984][434396] Updated weights for policy 0, policy_version 187760 (0.0005) [2023-03-08 15:26:45,800][434396] Updated weights for policy 0, policy_version 187840 (0.0005) [2023-03-08 15:26:46,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 96182272. Throughput: 0: 10536.5. Samples: 96165260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:46,893][434108] Avg episode reward: [(0, '4529.928')] [2023-03-08 15:26:49,531][434396] Updated weights for policy 0, policy_version 187920 (0.0005) [2023-03-08 15:26:51,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 96235520. Throughput: 0: 10612.8. Samples: 96231004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:51,893][434108] Avg episode reward: [(0, '4524.558')] [2023-03-08 15:26:51,914][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000187968_96239616.pth... 
[2023-03-08 15:26:51,915][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000187336_95916032.pth [2023-03-08 15:26:53,435][434396] Updated weights for policy 0, policy_version 188000 (0.0005) [2023-03-08 15:26:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 96292864. Throughput: 0: 10610.2. Samples: 96262420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:26:56,893][434108] Avg episode reward: [(0, '4553.830')] [2023-03-08 15:26:57,138][434396] Updated weights for policy 0, policy_version 188080 (0.0005) [2023-03-08 15:27:00,837][434396] Updated weights for policy 0, policy_version 188160 (0.0004) [2023-03-08 15:27:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 96346112. Throughput: 0: 10742.7. Samples: 96328888. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:01,893][434108] Avg episode reward: [(0, '4528.089')] [2023-03-08 15:27:04,703][434396] Updated weights for policy 0, policy_version 188240 (0.0005) [2023-03-08 15:27:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 96399360. Throughput: 0: 10759.8. Samples: 96391588. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:06,893][434108] Avg episode reward: [(0, '4471.317')] [2023-03-08 15:27:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000188280_96399360.pth... [2023-03-08 15:27:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000187656_96079872.pth [2023-03-08 15:27:08,614][434396] Updated weights for policy 0, policy_version 188320 (0.0005) [2023-03-08 15:27:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 96452608. Throughput: 0: 10784.7. Samples: 96424132. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:11,893][434108] Avg episode reward: [(0, '4504.097')] [2023-03-08 15:27:12,381][434396] Updated weights for policy 0, policy_version 188400 (0.0005) [2023-03-08 15:27:16,144][434396] Updated weights for policy 0, policy_version 188480 (0.0004) [2023-03-08 15:27:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10510.8). Total num frames: 96509952. Throughput: 0: 10785.8. Samples: 96489320. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:16,893][434108] Avg episode reward: [(0, '4531.331')] [2023-03-08 15:27:19,981][434396] Updated weights for policy 0, policy_version 188560 (0.0004) [2023-03-08 15:27:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10510.8). Total num frames: 96559104. Throughput: 0: 10708.8. Samples: 96553356. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:21,893][434108] Avg episode reward: [(0, '4581.283')] [2023-03-08 15:27:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000188592_96559104.pth... [2023-03-08 15:27:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000187968_96239616.pth [2023-03-08 15:27:23,967][434396] Updated weights for policy 0, policy_version 188640 (0.0005) [2023-03-08 15:27:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10786.1, 300 sec: 10510.8). Total num frames: 96612352. Throughput: 0: 10715.9. Samples: 96583856. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:26,893][434108] Avg episode reward: [(0, '4561.307')] [2023-03-08 15:27:27,948][434396] Updated weights for policy 0, policy_version 188720 (0.0005) [2023-03-08 15:27:31,783][434396] Updated weights for policy 0, policy_version 188800 (0.0005) [2023-03-08 15:27:31,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10510.8). Total num frames: 96665600. Throughput: 0: 10696.3. 
Samples: 96646592. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:31,893][434108] Avg episode reward: [(0, '4588.012')] [2023-03-08 15:27:35,716][434396] Updated weights for policy 0, policy_version 188880 (0.0005) [2023-03-08 15:27:36,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 96714752. Throughput: 0: 10640.5. Samples: 96709828. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:36,893][434108] Avg episode reward: [(0, '4577.946')] [2023-03-08 15:27:36,895][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000188904_96718848.pth... [2023-03-08 15:27:36,897][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000188280_96399360.pth [2023-03-08 15:27:39,559][434396] Updated weights for policy 0, policy_version 188960 (0.0005) [2023-03-08 15:27:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 96768000. Throughput: 0: 10664.0. Samples: 96742300. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:41,893][434108] Avg episode reward: [(0, '4571.122')] [2023-03-08 15:27:43,436][434396] Updated weights for policy 0, policy_version 189040 (0.0005) [2023-03-08 15:27:46,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10496.9). Total num frames: 96825344. Throughput: 0: 10600.8. Samples: 96805924. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:46,893][434108] Avg episode reward: [(0, '4541.732')] [2023-03-08 15:27:47,128][434396] Updated weights for policy 0, policy_version 189120 (0.0004) [2023-03-08 15:27:50,950][434396] Updated weights for policy 0, policy_version 189200 (0.0004) [2023-03-08 15:27:51,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10510.8). Total num frames: 96878592. Throughput: 0: 10649.3. Samples: 96870804. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:51,893][434108] Avg episode reward: [(0, '4395.647')] [2023-03-08 15:27:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000189216_96878592.pth... [2023-03-08 15:27:51,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000188592_96559104.pth [2023-03-08 15:27:54,842][434396] Updated weights for policy 0, policy_version 189280 (0.0005) [2023-03-08 15:27:56,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 96931840. Throughput: 0: 10645.2. Samples: 96903168. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:27:56,893][434108] Avg episode reward: [(0, '4464.889')] [2023-03-08 15:27:58,522][434396] Updated weights for policy 0, policy_version 189360 (0.0004) [2023-03-08 15:28:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 96985088. Throughput: 0: 10659.9. Samples: 96969016. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 15:28:01,893][434108] Avg episode reward: [(0, '4550.543')] [2023-03-08 15:28:02,335][434396] Updated weights for policy 0, policy_version 189440 (0.0004) [2023-03-08 15:28:06,303][434396] Updated weights for policy 0, policy_version 189520 (0.0004) [2023-03-08 15:28:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 97038336. Throughput: 0: 10610.9. Samples: 97030848. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:06,893][434108] Avg episode reward: [(0, '4588.765')] [2023-03-08 15:28:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000189528_97038336.pth... 
[2023-03-08 15:28:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000188904_96718848.pth [2023-03-08 15:28:10,100][434396] Updated weights for policy 0, policy_version 189600 (0.0004) [2023-03-08 15:28:11,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 97091584. Throughput: 0: 10647.7. Samples: 97063004. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:11,893][434108] Avg episode reward: [(0, '4520.413')] [2023-03-08 15:28:14,010][434396] Updated weights for policy 0, policy_version 189680 (0.0005) [2023-03-08 15:28:16,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 97144832. Throughput: 0: 10676.4. Samples: 97127028. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:16,893][434108] Avg episode reward: [(0, '4562.055')] [2023-03-08 15:28:17,970][434396] Updated weights for policy 0, policy_version 189760 (0.0005) [2023-03-08 15:28:21,821][434396] Updated weights for policy 0, policy_version 189840 (0.0004) [2023-03-08 15:28:21,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 97198080. Throughput: 0: 10669.4. Samples: 97189952. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:21,893][434108] Avg episode reward: [(0, '4593.013')] [2023-03-08 15:28:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000189840_97198080.pth... [2023-03-08 15:28:21,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000189216_96878592.pth [2023-03-08 15:28:25,784][434396] Updated weights for policy 0, policy_version 189920 (0.0005) [2023-03-08 15:28:26,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 97247232. Throughput: 0: 10626.0. Samples: 97220472. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:26,893][434108] Avg episode reward: [(0, '4578.536')] [2023-03-08 15:28:29,805][434396] Updated weights for policy 0, policy_version 190000 (0.0005) [2023-03-08 15:28:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 97300480. Throughput: 0: 10588.7. Samples: 97282416. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:31,893][434108] Avg episode reward: [(0, '4466.769')] [2023-03-08 15:28:33,842][434396] Updated weights for policy 0, policy_version 190080 (0.0005) [2023-03-08 15:28:36,893][434108] Fps is (10 sec: 10649.4, 60 sec: 10649.6, 300 sec: 10510.7). Total num frames: 97353728. Throughput: 0: 10516.7. Samples: 97344056. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:36,893][434108] Avg episode reward: [(0, '4586.859')] [2023-03-08 15:28:36,898][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000190144_97353728.pth... [2023-03-08 15:28:36,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000189528_97038336.pth [2023-03-08 15:28:37,581][434396] Updated weights for policy 0, policy_version 190160 (0.0005) [2023-03-08 15:28:41,593][434396] Updated weights for policy 0, policy_version 190240 (0.0005) [2023-03-08 15:28:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 97402880. Throughput: 0: 10531.5. Samples: 97377084. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:41,893][434108] Avg episode reward: [(0, '4562.770')] [2023-03-08 15:28:45,418][434396] Updated weights for policy 0, policy_version 190320 (0.0005) [2023-03-08 15:28:46,892][434108] Fps is (10 sec: 10240.3, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 97456128. Throughput: 0: 10462.0. Samples: 97439808. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:46,893][434108] Avg episode reward: [(0, '4578.496')] [2023-03-08 15:28:49,317][434396] Updated weights for policy 0, policy_version 190400 (0.0005) [2023-03-08 15:28:51,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10510.7). Total num frames: 97509376. Throughput: 0: 10451.9. Samples: 97501184. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:51,893][434108] Avg episode reward: [(0, '4565.181')] [2023-03-08 15:28:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000190448_97509376.pth... [2023-03-08 15:28:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000189840_97198080.pth [2023-03-08 15:28:53,408][434396] Updated weights for policy 0, policy_version 190480 (0.0005) [2023-03-08 15:28:56,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 97562624. Throughput: 0: 10420.5. Samples: 97531928. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:28:56,893][434108] Avg episode reward: [(0, '4563.628')] [2023-03-08 15:28:57,176][434396] Updated weights for policy 0, policy_version 190560 (0.0004) [2023-03-08 15:29:01,020][434396] Updated weights for policy 0, policy_version 190640 (0.0005) [2023-03-08 15:29:01,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 97615872. Throughput: 0: 10429.5. Samples: 97596356. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:29:01,893][434108] Avg episode reward: [(0, '4590.140')] [2023-03-08 15:29:04,809][434396] Updated weights for policy 0, policy_version 190720 (0.0005) [2023-03-08 15:29:06,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 97669120. Throughput: 0: 10467.6. Samples: 97660992. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:29:06,893][434108] Avg episode reward: [(0, '4582.137')] [2023-03-08 15:29:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000190760_97669120.pth... [2023-03-08 15:29:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000190144_97353728.pth [2023-03-08 15:29:08,828][434396] Updated weights for policy 0, policy_version 190800 (0.0005) [2023-03-08 15:29:11,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 97718272. Throughput: 0: 10470.8. Samples: 97691656. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 15:29:11,893][434108] Avg episode reward: [(0, '4557.985')] [2023-03-08 15:29:12,779][434396] Updated weights for policy 0, policy_version 190880 (0.0005) [2023-03-08 15:29:16,762][434396] Updated weights for policy 0, policy_version 190960 (0.0005) [2023-03-08 15:29:16,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 97771520. Throughput: 0: 10473.1. Samples: 97753708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:29:16,893][434108] Avg episode reward: [(0, '4553.522')] [2023-03-08 15:29:20,550][434396] Updated weights for policy 0, policy_version 191040 (0.0005) [2023-03-08 15:29:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 97824768. Throughput: 0: 10516.6. Samples: 97817300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:29:21,893][434108] Avg episode reward: [(0, '4576.469')] [2023-03-08 15:29:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000191064_97824768.pth... 
[2023-03-08 15:29:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000190448_97509376.pth [2023-03-08 15:29:24,260][434396] Updated weights for policy 0, policy_version 191120 (0.0004) [2023-03-08 15:29:26,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 97882112. Throughput: 0: 10525.3. Samples: 97850724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:29:26,893][434108] Avg episode reward: [(0, '4533.353')] [2023-03-08 15:29:27,929][434396] Updated weights for policy 0, policy_version 191200 (0.0004) [2023-03-08 15:29:31,600][434396] Updated weights for policy 0, policy_version 191280 (0.0004) [2023-03-08 15:29:31,892][434108] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 97935360. Throughput: 0: 10615.1. Samples: 97917488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:29:31,893][434108] Avg episode reward: [(0, '4568.111')] [2023-03-08 15:29:35,676][434396] Updated weights for policy 0, policy_version 191360 (0.0005) [2023-03-08 15:29:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.4, 300 sec: 10580.2). Total num frames: 97988608. Throughput: 0: 10646.8. Samples: 97980292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:29:36,893][434108] Avg episode reward: [(0, '4582.307')] [2023-03-08 15:29:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000191384_97988608.pth... [2023-03-08 15:29:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000190760_97669120.pth [2023-03-08 15:29:39,659][434396] Updated weights for policy 0, policy_version 191440 (0.0004) [2023-03-08 15:29:41,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 98037760. Throughput: 0: 10648.1. Samples: 98011092. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:29:41,893][434108] Avg episode reward: [(0, '4558.760')] [2023-03-08 15:29:43,704][434396] Updated weights for policy 0, policy_version 191520 (0.0005) [2023-03-08 15:29:46,892][434108] Fps is (10 sec: 9830.4, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 98086912. Throughput: 0: 10548.9. Samples: 98071056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:29:46,893][434108] Avg episode reward: [(0, '4502.480')] [2023-03-08 15:29:47,766][434396] Updated weights for policy 0, policy_version 191600 (0.0004) [2023-03-08 15:29:51,579][434396] Updated weights for policy 0, policy_version 191680 (0.0005) [2023-03-08 15:29:51,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 98140160. Throughput: 0: 10498.8. Samples: 98133440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:29:51,893][434108] Avg episode reward: [(0, '4520.207')] [2023-03-08 15:29:51,929][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000191688_98144256.pth... [2023-03-08 15:29:51,930][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000191064_97824768.pth [2023-03-08 15:29:55,204][434396] Updated weights for policy 0, policy_version 191760 (0.0004) [2023-03-08 15:29:56,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 98197504. Throughput: 0: 10581.7. Samples: 98167832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:29:56,893][434108] Avg episode reward: [(0, '4543.599')] [2023-03-08 15:29:58,945][434396] Updated weights for policy 0, policy_version 191840 (0.0004) [2023-03-08 15:30:01,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10581.4, 300 sec: 10594.1). Total num frames: 98250752. Throughput: 0: 10675.5. Samples: 98234104. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:01,893][434108] Avg episode reward: [(0, '4587.302')] [2023-03-08 15:30:02,652][434396] Updated weights for policy 0, policy_version 191920 (0.0004) [2023-03-08 15:30:06,683][434396] Updated weights for policy 0, policy_version 192000 (0.0005) [2023-03-08 15:30:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 98304000. Throughput: 0: 10651.0. Samples: 98296596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:06,893][434108] Avg episode reward: [(0, '4520.219')] [2023-03-08 15:30:06,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000192000_98304000.pth... [2023-03-08 15:30:06,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000191384_97988608.pth [2023-03-08 15:30:10,412][434396] Updated weights for policy 0, policy_version 192080 (0.0004) [2023-03-08 15:30:11,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 98361344. Throughput: 0: 10620.5. Samples: 98328648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:11,893][434108] Avg episode reward: [(0, '4494.821')] [2023-03-08 15:30:14,084][434396] Updated weights for policy 0, policy_version 192160 (0.0004) [2023-03-08 15:30:16,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 98414592. Throughput: 0: 10631.9. Samples: 98395924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:16,893][434108] Avg episode reward: [(0, '4559.713')] [2023-03-08 15:30:17,824][434396] Updated weights for policy 0, policy_version 192240 (0.0004) [2023-03-08 15:30:21,609][434396] Updated weights for policy 0, policy_version 192320 (0.0004) [2023-03-08 15:30:21,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 98467840. Throughput: 0: 10693.8. 
Samples: 98461512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:21,893][434108] Avg episode reward: [(0, '4548.811')] [2023-03-08 15:30:21,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000192320_98467840.pth... [2023-03-08 15:30:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000191688_98144256.pth [2023-03-08 15:30:25,418][434396] Updated weights for policy 0, policy_version 192400 (0.0005) [2023-03-08 15:30:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 98521088. Throughput: 0: 10732.7. Samples: 98494064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:26,893][434108] Avg episode reward: [(0, '4542.596')] [2023-03-08 15:30:29,358][434396] Updated weights for policy 0, policy_version 192480 (0.0005) [2023-03-08 15:30:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 98574336. Throughput: 0: 10794.8. Samples: 98556824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:31,893][434108] Avg episode reward: [(0, '4545.274')] [2023-03-08 15:30:33,308][434396] Updated weights for policy 0, policy_version 192560 (0.0005) [2023-03-08 15:30:36,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 98627584. Throughput: 0: 10825.2. Samples: 98620576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:36,893][434108] Avg episode reward: [(0, '4520.070')] [2023-03-08 15:30:36,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000192632_98627584.pth... 
[2023-03-08 15:30:36,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000192000_98304000.pth [2023-03-08 15:30:37,016][434396] Updated weights for policy 0, policy_version 192640 (0.0004) [2023-03-08 15:30:40,733][434396] Updated weights for policy 0, policy_version 192720 (0.0005) [2023-03-08 15:30:41,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10621.8). Total num frames: 98684928. Throughput: 0: 10804.0. Samples: 98654012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:41,893][434108] Avg episode reward: [(0, '4507.899')] [2023-03-08 15:30:44,350][434396] Updated weights for policy 0, policy_version 192800 (0.0003) [2023-03-08 15:30:46,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10854.4, 300 sec: 10621.8). Total num frames: 98738176. Throughput: 0: 10809.0. Samples: 98720508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:46,893][434108] Avg episode reward: [(0, '4553.742')] [2023-03-08 15:30:48,345][434396] Updated weights for policy 0, policy_version 192880 (0.0005) [2023-03-08 15:30:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10854.4, 300 sec: 10621.8). Total num frames: 98791424. Throughput: 0: 10788.0. Samples: 98782056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:51,893][434108] Avg episode reward: [(0, '4528.773')] [2023-03-08 15:30:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000192952_98791424.pth... [2023-03-08 15:30:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000192320_98467840.pth [2023-03-08 15:30:52,260][434396] Updated weights for policy 0, policy_version 192960 (0.0005) [2023-03-08 15:30:56,148][434396] Updated weights for policy 0, policy_version 193040 (0.0005) [2023-03-08 15:30:56,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10621.8). 
Total num frames: 98840576. Throughput: 0: 10782.1. Samples: 98813844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:30:56,893][434108] Avg episode reward: [(0, '4560.227')] [2023-03-08 15:31:00,052][434396] Updated weights for policy 0, policy_version 193120 (0.0005) [2023-03-08 15:31:01,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 98893824. Throughput: 0: 10700.4. Samples: 98877440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:01,893][434108] Avg episode reward: [(0, '4561.942')] [2023-03-08 15:31:03,961][434396] Updated weights for policy 0, policy_version 193200 (0.0005) [2023-03-08 15:31:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 98947072. Throughput: 0: 10609.6. Samples: 98938944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:06,893][434108] Avg episode reward: [(0, '4531.158')] [2023-03-08 15:31:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000193256_98947072.pth... [2023-03-08 15:31:06,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000192632_98627584.pth [2023-03-08 15:31:07,912][434396] Updated weights for policy 0, policy_version 193280 (0.0005) [2023-03-08 15:31:11,783][434396] Updated weights for policy 0, policy_version 193360 (0.0005) [2023-03-08 15:31:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 99000320. Throughput: 0: 10610.7. Samples: 98971544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:11,893][434108] Avg episode reward: [(0, '4571.819')] [2023-03-08 15:31:15,709][434396] Updated weights for policy 0, policy_version 193440 (0.0005) [2023-03-08 15:31:16,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10621.8). Total num frames: 99049472. Throughput: 0: 10603.7. Samples: 99033992. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:16,893][434108] Avg episode reward: [(0, '4570.624')] [2023-03-08 15:31:19,727][434396] Updated weights for policy 0, policy_version 193520 (0.0005) [2023-03-08 15:31:21,892][434108] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 99102720. Throughput: 0: 10565.5. Samples: 99096024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:21,893][434108] Avg episode reward: [(0, '4543.970')] [2023-03-08 15:31:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000193560_99102720.pth... [2023-03-08 15:31:21,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000192952_98791424.pth [2023-03-08 15:31:23,639][434396] Updated weights for policy 0, policy_version 193600 (0.0005) [2023-03-08 15:31:26,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10621.8). Total num frames: 99155968. Throughput: 0: 10517.4. Samples: 99127296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:26,893][434108] Avg episode reward: [(0, '4552.128')] [2023-03-08 15:31:27,658][434396] Updated weights for policy 0, policy_version 193680 (0.0004) [2023-03-08 15:31:31,516][434396] Updated weights for policy 0, policy_version 193760 (0.0005) [2023-03-08 15:31:31,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 99209216. Throughput: 0: 10405.2. Samples: 99188744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:31,893][434108] Avg episode reward: [(0, '4559.312')] [2023-03-08 15:31:35,266][434396] Updated weights for policy 0, policy_version 193840 (0.0005) [2023-03-08 15:31:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10621.8). Total num frames: 99262464. Throughput: 0: 10495.1. Samples: 99254336. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:36,893][434108] Avg episode reward: [(0, '4448.037')] [2023-03-08 15:31:36,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000193872_99262464.pth... [2023-03-08 15:31:36,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000193256_98947072.pth [2023-03-08 15:31:39,087][434396] Updated weights for policy 0, policy_version 193920 (0.0005) [2023-03-08 15:31:41,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 99315712. Throughput: 0: 10504.4. Samples: 99286544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:41,893][434108] Avg episode reward: [(0, '4510.561')] [2023-03-08 15:31:43,050][434396] Updated weights for policy 0, policy_version 194000 (0.0006) [2023-03-08 15:31:46,868][434396] Updated weights for policy 0, policy_version 194080 (0.0005) [2023-03-08 15:31:46,892][434108] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 99368960. Throughput: 0: 10485.3. Samples: 99349276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:46,893][434108] Avg episode reward: [(0, '4532.374')] [2023-03-08 15:31:50,522][434396] Updated weights for policy 0, policy_version 194160 (0.0003) [2023-03-08 15:31:51,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 99422208. Throughput: 0: 10579.7. Samples: 99415028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:51,893][434108] Avg episode reward: [(0, '4530.746')] [2023-03-08 15:31:51,896][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000194184_99422208.pth... 
[2023-03-08 15:31:51,898][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000193560_99102720.pth [2023-03-08 15:31:54,214][434396] Updated weights for policy 0, policy_version 194240 (0.0004) [2023-03-08 15:31:56,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10621.8). Total num frames: 99479552. Throughput: 0: 10599.3. Samples: 99448512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:31:56,893][434108] Avg episode reward: [(0, '4582.819')] [2023-03-08 15:31:57,908][434396] Updated weights for policy 0, policy_version 194320 (0.0004) [2023-03-08 15:32:01,711][434396] Updated weights for policy 0, policy_version 194400 (0.0004) [2023-03-08 15:32:01,892][434108] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10621.8). Total num frames: 99532800. Throughput: 0: 10701.5. Samples: 99515560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:32:01,893][434108] Avg episode reward: [(0, '4491.439')] [2023-03-08 15:32:05,481][434396] Updated weights for policy 0, policy_version 194480 (0.0004) [2023-03-08 15:32:06,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10621.8). Total num frames: 99586048. Throughput: 0: 10740.6. Samples: 99579352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:32:06,893][434108] Avg episode reward: [(0, '4538.771')] [2023-03-08 15:32:06,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000194504_99586048.pth... [2023-03-08 15:32:06,900][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000193872_99262464.pth [2023-03-08 15:32:09,417][434396] Updated weights for policy 0, policy_version 194560 (0.0005) [2023-03-08 15:32:11,892][434108] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 99639296. Throughput: 0: 10740.8. Samples: 99610632. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:32:11,893][434108] Avg episode reward: [(0, '4552.504')] [2023-03-08 15:32:13,068][434396] Updated weights for policy 0, policy_version 194640 (0.0004) [2023-03-08 15:32:16,852][434396] Updated weights for policy 0, policy_version 194720 (0.0004) [2023-03-08 15:32:16,892][434108] Fps is (10 sec: 11059.3, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 99696640. Throughput: 0: 10833.1. Samples: 99676232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:32:16,893][434108] Avg episode reward: [(0, '4536.957')] [2023-03-08 15:32:20,687][434396] Updated weights for policy 0, policy_version 194800 (0.0004) [2023-03-08 15:32:21,893][434108] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 99749888. Throughput: 0: 10823.7. Samples: 99741404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:32:21,893][434108] Avg episode reward: [(0, '4564.677')] [2023-03-08 15:32:21,897][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000194824_99749888.pth... [2023-03-08 15:32:21,899][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000194184_99422208.pth [2023-03-08 15:32:24,821][434396] Updated weights for policy 0, policy_version 194880 (0.0005) [2023-03-08 15:32:26,892][434108] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 99799040. Throughput: 0: 10753.2. Samples: 99770440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:32:26,893][434108] Avg episode reward: [(0, '4581.394')] [2023-03-08 15:32:28,622][434396] Updated weights for policy 0, policy_version 194960 (0.0005) [2023-03-08 15:32:31,892][434108] Fps is (10 sec: 10240.1, 60 sec: 10717.9, 300 sec: 10635.7). Total num frames: 99852288. Throughput: 0: 10797.4. Samples: 99835160. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:32:31,893][434108] Avg episode reward: [(0, '4505.115')] [2023-03-08 15:32:32,354][434396] Updated weights for policy 0, policy_version 195040 (0.0005) [2023-03-08 15:32:36,173][434396] Updated weights for policy 0, policy_version 195120 (0.0005) [2023-03-08 15:32:36,892][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10635.7). Total num frames: 99905536. Throughput: 0: 10777.2. Samples: 99900004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:32:36,893][434108] Avg episode reward: [(0, '4526.354')] [2023-03-08 15:32:36,941][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000195136_99909632.pth... [2023-03-08 15:32:36,943][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000194504_99586048.pth [2023-03-08 15:32:40,220][434396] Updated weights for policy 0, policy_version 195200 (0.0006) [2023-03-08 15:32:41,893][434108] Fps is (10 sec: 10649.5, 60 sec: 10717.8, 300 sec: 10621.8). Total num frames: 99958784. Throughput: 0: 10703.8. Samples: 99930184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 15:32:41,893][434108] Avg episode reward: [(0, '4560.530')] [2023-03-08 15:32:44,282][434396] Updated weights for policy 0, policy_version 195280 (0.0005) [2023-03-08 15:32:46,326][434347] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-08 15:32:46,751][434347] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-08 15:32:46,752][434347] Stopping Batcher_0... [2023-03-08 15:32:46,752][434397] Stopping RolloutWorker_w6... [2023-03-08 15:32:46,752][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-08 15:32:46,752][434400] Stopping RolloutWorker_w4... [2023-03-08 15:32:46,752][434407] Stopping RolloutWorker_w3... 
[2023-03-08 15:32:46,752][434397] Loop rollout_proc6_evt_loop terminating... [2023-03-08 15:32:46,752][434401] Stopping RolloutWorker_w7... [2023-03-08 15:32:46,752][434400] Loop rollout_proc4_evt_loop terminating... [2023-03-08 15:32:46,752][434398] Stopping RolloutWorker_w0... [2023-03-08 15:32:46,752][434407] Loop rollout_proc3_evt_loop terminating... [2023-03-08 15:32:46,752][434399] Stopping RolloutWorker_w1... [2023-03-08 15:32:46,752][434401] Loop rollout_proc7_evt_loop terminating... [2023-03-08 15:32:46,752][434398] Loop rollout_proc0_evt_loop terminating... [2023-03-08 15:32:46,752][434402] Stopping RolloutWorker_w2... [2023-03-08 15:32:46,752][434108] Component Batcher_0 stopped! [2023-03-08 15:32:46,752][434395] Stopping RolloutWorker_w5... [2023-03-08 15:32:46,752][434399] Loop rollout_proc1_evt_loop terminating... [2023-03-08 15:32:46,752][434402] Loop rollout_proc2_evt_loop terminating... [2023-03-08 15:32:46,752][434395] Loop rollout_proc5_evt_loop terminating... [2023-03-08 15:32:46,753][434108] Component RolloutWorker_w6 stopped! [2023-03-08 15:32:46,753][434108] Component RolloutWorker_w4 stopped! [2023-03-08 15:32:46,753][434108] Component RolloutWorker_w3 stopped! [2023-03-08 15:32:46,752][434347] Loop batcher_evt_loop terminating... [2023-03-08 15:32:46,754][434347] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000194824_99749888.pth [2023-03-08 15:32:46,753][434108] Component RolloutWorker_w7 stopped! [2023-03-08 15:32:46,754][434108] Component RolloutWorker_w1 stopped! [2023-03-08 15:32:46,754][434347] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/hammer-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-08 15:32:46,754][434108] Component RolloutWorker_w0 stopped! [2023-03-08 15:32:46,754][434108] Component RolloutWorker_w2 stopped! [2023-03-08 15:32:46,755][434108] Component RolloutWorker_w5 stopped! [2023-03-08 15:32:46,755][434347] Stopping LearnerWorker_p0... 
[2023-03-08 15:32:46,756][434347] Loop learner_proc0_evt_loop terminating... [2023-03-08 15:32:46,756][434108] Component LearnerWorker_p0 stopped! [2023-03-08 15:32:46,774][434396] Weights refcount: 2 0 [2023-03-08 15:32:46,775][434396] Stopping InferenceWorker_p0-w0... [2023-03-08 15:32:46,776][434396] Loop inference_proc0-0_evt_loop terminating... [2023-03-08 15:32:46,776][434108] Component InferenceWorker_p0-w0 stopped! [2023-03-08 15:32:46,776][434108] Waiting for process learner_proc0 to stop... [2023-03-08 15:32:47,213][434108] Waiting for process inference_proc0-0 to join... [2023-03-08 15:32:47,224][434108] Waiting for process rollout_proc0 to join... [2023-03-08 15:32:47,225][434108] Waiting for process rollout_proc1 to join... [2023-03-08 15:32:47,225][434108] Waiting for process rollout_proc2 to join... [2023-03-08 15:32:47,225][434108] Waiting for process rollout_proc3 to join... [2023-03-08 15:32:47,226][434108] Waiting for process rollout_proc4 to join... [2023-03-08 15:32:47,226][434108] Waiting for process rollout_proc5 to join... [2023-03-08 15:32:47,226][434108] Waiting for process rollout_proc6 to join... [2023-03-08 15:32:47,226][434108] Waiting for process rollout_proc7 to join... 
[2023-03-08 15:32:47,226][434108] Batcher 0 profile tree view:
batching: 17.3625, releasing_batches: 15.0626
[2023-03-08 15:32:47,227][434108] InferenceWorker_p0-w0 profile tree view:
wait_policy: 0.0000
  wait_policy_total: 3605.5728
update_model: 103.0742
  weight_update: 0.0005
one_step: 0.0010
  handle_policy_step: 5248.7113
    deserialize: 221.2058, stack: 54.7389, obs_to_device_normalize: 944.9625, forward: 2592.1559, send_messages: 380.3960
    prepare_outputs: 594.4452
      to_cpu: 92.3113
[2023-03-08 15:32:47,227][434108] Learner 0 profile tree view:
misc: 0.1098, prepare_batch: 91.0656
train: 1156.8320
  epoch_init: 0.3957, minibatch_init: 12.0851, losses_postprocess: 11.8738, kl_divergence: 4.2629, after_optimizer: 5.0654
  calculate_losses: 475.8242
    losses_init: 0.4049, forward_head: 235.4963, bptt_initial: 1.2554, bptt: 1.3001, tail: 112.2067, advantages_returns: 8.6971, losses: 102.8164
  update: 631.5315
    clip: 55.6767
[2023-03-08 15:32:47,227][434108] RolloutWorker_w0 profile tree view:
wait_for_trajectories: 2.9384, enqueue_policy_requests: 130.5141, env_step: 6754.5461, overhead: 316.6412, complete_rollouts: 3.2102
save_policy_outputs: 329.5702
  split_output_tensors: 161.2125
[2023-03-08 15:32:47,227][434108] RolloutWorker_w7 profile tree view:
wait_for_trajectories: 2.9241, enqueue_policy_requests: 131.4633, env_step: 6743.2863, overhead: 313.7152, complete_rollouts: 3.1992
save_policy_outputs: 325.1334
  split_output_tensors: 159.6218
[2023-03-08 15:32:47,228][434108] Loop Runner_EvtLoop terminating...
[2023-03-08 15:32:47,228][434108] Runner profile tree view:
main_loop: 9607.4751
[2023-03-08 15:32:47,228][434108] Collected {0: 100007936}, FPS: 10409.4