[2023-03-08 18:52:37,245][547459] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/config.json... [2023-03-08 18:52:37,263][547459] Rollout worker 0 uses device cpu [2023-03-08 18:52:37,264][547459] Rollout worker 1 uses device cpu [2023-03-08 18:52:37,264][547459] Rollout worker 2 uses device cpu [2023-03-08 18:52:37,264][547459] Rollout worker 3 uses device cpu [2023-03-08 18:52:37,264][547459] Rollout worker 4 uses device cpu [2023-03-08 18:52:37,264][547459] Rollout worker 5 uses device cpu [2023-03-08 18:52:37,265][547459] Rollout worker 6 uses device cpu [2023-03-08 18:52:37,265][547459] Rollout worker 7 uses device cpu [2023-03-08 18:52:37,265][547459] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 [2023-03-08 18:52:37,282][547459] InferenceWorker_p0-w0: min num requests: 2 [2023-03-08 18:52:37,301][547459] Starting all processes... [2023-03-08 18:52:37,302][547459] Starting process learner_proc0 [2023-03-08 18:52:37,351][547459] Starting all processes... [2023-03-08 18:52:37,389][547459] Starting process inference_proc0-0 [2023-03-08 18:52:37,399][547459] Starting process rollout_proc0 [2023-03-08 18:52:37,400][547459] Starting process rollout_proc1 [2023-03-08 18:52:37,400][547459] Starting process rollout_proc2 [2023-03-08 18:52:37,400][547459] Starting process rollout_proc3 [2023-03-08 18:52:37,401][547459] Starting process rollout_proc4 [2023-03-08 18:52:37,403][547459] Starting process rollout_proc5 [2023-03-08 18:52:37,403][547459] Starting process rollout_proc6 [2023-03-08 18:52:37,403][547459] Starting process rollout_proc7 [2023-03-08 18:52:39,146][547802] Worker 2 uses CPU cores [8, 9, 10, 11] [2023-03-08 18:52:39,255][547801] Worker 1 uses CPU cores [4, 5, 6, 7] [2023-03-08 18:52:39,319][547931] Worker 5 uses CPU cores [20, 21, 22, 23] [2023-03-08 18:52:39,558][547835] Worker 4 uses CPU cores [16, 17, 18, 19] [2023-03-08 18:52:39,624][547899] Worker 7 uses CPU cores [28, 29, 30, 31] [2023-03-08 18:52:39,723][547834] Worker 3 uses CPU cores [12, 13, 14, 15] [2023-03-08 18:52:39,854][547867] Worker 6 uses CPU cores [24, 25, 26, 27] [2023-03-08 18:52:39,954][547800] Worker 0 uses CPU cores [0, 1, 2, 3] [2023-03-08 18:52:40,024][547754] Starting seed is not provided [2023-03-08 18:52:40,024][547754] Initializing actor-critic model on device cpu [2023-03-08 18:52:40,025][547754] RunningMeanStd input shape: (39,) [2023-03-08 18:52:40,025][547754] RunningMeanStd input shape: (1,) [2023-03-08 18:52:40,086][547754] Created Actor Critic model with architecture: [2023-03-08 18:52:40,087][547754] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): MlpEncoder( (mlp_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=Tanh) (2): RecursiveScriptModule(original_name=Linear) (3): RecursiveScriptModule(original_name=Tanh) ) ) ) ) (core): ModelCoreIdentity() (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=64, out_features=1, bias=True) (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev( (distribution_linear): Linear(in_features=64, out_features=4, bias=True) ) ) [2023-03-08 18:52:40,393][547754] Using optimizer [2023-03-08 18:52:40,394][547754] No checkpoints found [2023-03-08 18:52:40,394][547754] Did not load from checkpoint, starting from scratch! [2023-03-08 18:52:40,394][547754] Initialized policy 0 weights for model version 0 [2023-03-08 18:52:40,395][547754] LearnerWorker_p0 finished initialization! [2023-03-08 18:52:40,397][547799] RunningMeanStd input shape: (39,) [2023-03-08 18:52:40,397][547799] RunningMeanStd input shape: (1,) [2023-03-08 18:52:40,483][547459] Inference worker 0-0 is ready! [2023-03-08 18:52:40,484][547459] All inference workers are ready! Signal rollout workers to start! [2023-03-08 18:52:44,547][547801] Decorrelating experience for 0 frames... [2023-03-08 18:52:44,565][547801] Decorrelating experience for 64 frames... [2023-03-08 18:52:44,600][547459] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-08 18:52:44,615][547801] Decorrelating experience for 128 frames... [2023-03-08 18:52:44,685][547801] Decorrelating experience for 192 frames... [2023-03-08 18:52:44,725][547867] Decorrelating experience for 0 frames... [2023-03-08 18:52:44,737][547867] Decorrelating experience for 64 frames... [2023-03-08 18:52:44,771][547867] Decorrelating experience for 128 frames... [2023-03-08 18:52:44,777][547931] Decorrelating experience for 0 frames... [2023-03-08 18:52:44,790][547931] Decorrelating experience for 64 frames... [2023-03-08 18:52:44,824][547867] Decorrelating experience for 192 frames... [2023-03-08 18:52:44,831][547931] Decorrelating experience for 128 frames... [2023-03-08 18:52:44,840][547835] Decorrelating experience for 0 frames... [2023-03-08 18:52:44,852][547835] Decorrelating experience for 64 frames... [2023-03-08 18:52:44,874][547802] Decorrelating experience for 0 frames... [2023-03-08 18:52:44,886][547802] Decorrelating experience for 64 frames... [2023-03-08 18:52:44,887][547835] Decorrelating experience for 128 frames... [2023-03-08 18:52:44,887][547834] Decorrelating experience for 0 frames... [2023-03-08 18:52:44,899][547834] Decorrelating experience for 64 frames... [2023-03-08 18:52:44,910][547931] Decorrelating experience for 192 frames... [2023-03-08 18:52:44,920][547802] Decorrelating experience for 128 frames... [2023-03-08 18:52:44,931][547834] Decorrelating experience for 128 frames... [2023-03-08 18:52:44,942][547835] Decorrelating experience for 192 frames... [2023-03-08 18:52:44,973][547802] Decorrelating experience for 192 frames... [2023-03-08 18:52:44,983][547834] Decorrelating experience for 192 frames... [2023-03-08 18:52:45,053][547800] Decorrelating experience for 0 frames... [2023-03-08 18:52:45,061][547899] Decorrelating experience for 0 frames... [2023-03-08 18:52:45,072][547800] Decorrelating experience for 64 frames... [2023-03-08 18:52:45,073][547899] Decorrelating experience for 64 frames... [2023-03-08 18:52:45,107][547899] Decorrelating experience for 128 frames... [2023-03-08 18:52:45,123][547800] Decorrelating experience for 128 frames... [2023-03-08 18:52:45,159][547899] Decorrelating experience for 192 frames... [2023-03-08 18:52:45,195][547800] Decorrelating experience for 192 frames... [2023-03-08 18:52:48,592][547801] Decorrelating experience for 256 frames... [2023-03-08 18:52:48,709][547801] Decorrelating experience for 320 frames... [2023-03-08 18:52:48,779][547867] Decorrelating experience for 256 frames... [2023-03-08 18:52:48,877][547867] Decorrelating experience for 320 frames... [2023-03-08 18:52:48,886][547801] Decorrelating experience for 384 frames... [2023-03-08 18:52:48,948][547835] Decorrelating experience for 256 frames... [2023-03-08 18:52:49,001][547867] Decorrelating experience for 384 frames... [2023-03-08 18:52:49,020][547834] Decorrelating experience for 256 frames... [2023-03-08 18:52:49,088][547801] Decorrelating experience for 448 frames... [2023-03-08 18:52:49,093][547835] Decorrelating experience for 320 frames... [2023-03-08 18:52:49,114][547834] Decorrelating experience for 320 frames... [2023-03-08 18:52:49,171][547867] Decorrelating experience for 448 frames... [2023-03-08 18:52:49,219][547802] Decorrelating experience for 256 frames... [2023-03-08 18:52:49,224][547834] Decorrelating experience for 384 frames... [2023-03-08 18:52:49,252][547835] Decorrelating experience for 384 frames... [2023-03-08 18:52:49,296][547931] Decorrelating experience for 256 frames... [2023-03-08 18:52:49,359][547834] Decorrelating experience for 448 frames... [2023-03-08 18:52:49,370][547802] Decorrelating experience for 320 frames... [2023-03-08 18:52:49,415][547931] Decorrelating experience for 320 frames... [2023-03-08 18:52:49,455][547835] Decorrelating experience for 448 frames... [2023-03-08 18:52:49,548][547802] Decorrelating experience for 384 frames... [2023-03-08 18:52:49,552][547899] Decorrelating experience for 256 frames... [2023-03-08 18:52:49,555][547931] Decorrelating experience for 384 frames... [2023-03-08 18:52:49,600][547459] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 103.2. Samples: 516. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-08 18:52:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000000000_0.pth... [2023-03-08 18:52:49,690][547899] Decorrelating experience for 320 frames... [2023-03-08 18:52:49,726][547802] Decorrelating experience for 448 frames... [2023-03-08 18:52:49,742][547931] Decorrelating experience for 448 frames... [2023-03-08 18:52:49,801][547899] Decorrelating experience for 384 frames... [2023-03-08 18:52:49,935][547899] Decorrelating experience for 448 frames... [2023-03-08 18:52:50,168][547800] Decorrelating experience for 256 frames... [2023-03-08 18:52:50,271][547800] Decorrelating experience for 320 frames... [2023-03-08 18:52:50,389][547800] Decorrelating experience for 384 frames... [2023-03-08 18:52:50,528][547800] Decorrelating experience for 448 frames... [2023-03-08 18:52:54,600][547459] Fps is (10 sec: 2867.2, 60 sec: 2867.2, 300 sec: 2867.2). Total num frames: 28672. Throughput: 0: 1235.2. Samples: 12352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:52:54,600][547459] Avg episode reward: [(0, '261.244')] [2023-03-08 18:52:55,480][547799] Updated weights for policy 0, policy_version 80 (0.0006) [2023-03-08 18:52:57,276][547459] Heartbeat connected on Batcher_0 [2023-03-08 18:52:57,285][547459] Heartbeat connected on RolloutWorker_w0 [2023-03-08 18:52:57,287][547459] Heartbeat connected on RolloutWorker_w1 [2023-03-08 18:52:57,290][547459] Heartbeat connected on RolloutWorker_w2 [2023-03-08 18:52:57,292][547459] Heartbeat connected on RolloutWorker_w3 [2023-03-08 18:52:57,294][547459] Heartbeat connected on RolloutWorker_w4 [2023-03-08 18:52:57,296][547459] Heartbeat connected on RolloutWorker_w5 [2023-03-08 18:52:57,299][547459] Heartbeat connected on RolloutWorker_w6 [2023-03-08 18:52:57,301][547459] Heartbeat connected on RolloutWorker_w7 [2023-03-08 18:52:57,316][547459] Heartbeat connected on LearnerWorker_p0 [2023-03-08 18:52:57,319][547459] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-08 18:52:59,600][547459] Fps is (10 sec: 7372.9, 60 sec: 4915.2, 300 sec: 4915.2). Total num frames: 73728. Throughput: 0: 4534.1. Samples: 68012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:52:59,600][547459] Avg episode reward: [(0, '1434.027')] [2023-03-08 18:53:00,170][547799] Updated weights for policy 0, policy_version 160 (0.0006) [2023-03-08 18:53:04,486][547799] Updated weights for policy 0, policy_version 240 (0.0005) [2023-03-08 18:53:04,600][547459] Fps is (10 sec: 9420.8, 60 sec: 6144.0, 300 sec: 6144.0). Total num frames: 122880. Throughput: 0: 6147.2. Samples: 122944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:53:04,600][547459] Avg episode reward: [(0, '2076.935')] [2023-03-08 18:53:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000000240_122880.pth... [2023-03-08 18:53:04,607][547754] Saving new best policy, reward=2076.935! [2023-03-08 18:53:08,981][547799] Updated weights for policy 0, policy_version 320 (0.0005) [2023-03-08 18:53:09,600][547459] Fps is (10 sec: 9420.7, 60 sec: 6717.4, 300 sec: 6717.4). Total num frames: 167936. Throughput: 0: 6046.9. Samples: 151172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:53:09,600][547459] Avg episode reward: [(0, '1595.824')] [2023-03-08 18:53:13,319][547799] Updated weights for policy 0, policy_version 400 (0.0005) [2023-03-08 18:53:14,600][547459] Fps is (10 sec: 9420.8, 60 sec: 7236.3, 300 sec: 7236.3). Total num frames: 217088. Throughput: 0: 6905.3. Samples: 207160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:53:14,600][547459] Avg episode reward: [(0, '954.010')] [2023-03-08 18:53:17,589][547799] Updated weights for policy 0, policy_version 480 (0.0005) [2023-03-08 18:53:19,600][547459] Fps is (10 sec: 9830.4, 60 sec: 7606.9, 300 sec: 7606.9). Total num frames: 266240. Throughput: 0: 7595.3. Samples: 265836. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 18:53:19,600][547459] Avg episode reward: [(0, '535.218')] [2023-03-08 18:53:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000000520_266240.pth... [2023-03-08 18:53:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000000000_0.pth [2023-03-08 18:53:21,628][547799] Updated weights for policy 0, policy_version 560 (0.0005) [2023-03-08 18:53:24,600][547459] Fps is (10 sec: 9830.4, 60 sec: 7884.8, 300 sec: 7884.8). Total num frames: 315392. Throughput: 0: 7378.5. Samples: 295140. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 18:53:24,601][547459] Avg episode reward: [(0, '270.089')] [2023-03-08 18:53:25,862][547799] Updated weights for policy 0, policy_version 640 (0.0005) [2023-03-08 18:53:29,599][547459] Fps is (10 sec: 9420.9, 60 sec: 8010.0, 300 sec: 8010.0). Total num frames: 360448. Throughput: 0: 7827.9. Samples: 352256. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 18:53:29,600][547459] Avg episode reward: [(0, '312.053')] [2023-03-08 18:53:30,248][547799] Updated weights for policy 0, policy_version 720 (0.0005) [2023-03-08 18:53:34,600][547459] Fps is (10 sec: 9011.2, 60 sec: 8110.1, 300 sec: 8110.1). Total num frames: 405504. Throughput: 0: 9076.9. Samples: 408976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:53:34,600][547459] Avg episode reward: [(0, '285.386')] [2023-03-08 18:53:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000000800_409600.pth... [2023-03-08 18:53:34,604][547799] Updated weights for policy 0, policy_version 800 (0.0005) [2023-03-08 18:53:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000000240_122880.pth [2023-03-08 18:53:39,129][547799] Updated weights for policy 0, policy_version 880 (0.0006) [2023-03-08 18:53:39,600][547459] Fps is (10 sec: 9420.7, 60 sec: 8266.5, 300 sec: 8266.5). Total num frames: 454656. Throughput: 0: 9393.2. Samples: 435048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:53:39,600][547459] Avg episode reward: [(0, '344.351')] [2023-03-08 18:53:43,503][547799] Updated weights for policy 0, policy_version 960 (0.0005) [2023-03-08 18:53:44,600][547459] Fps is (10 sec: 9420.9, 60 sec: 8328.5, 300 sec: 8328.5). Total num frames: 499712. Throughput: 0: 9412.7. Samples: 491584. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:53:44,600][547459] Avg episode reward: [(0, '359.715')] [2023-03-08 18:53:47,991][547799] Updated weights for policy 0, policy_version 1040 (0.0005) [2023-03-08 18:53:49,600][547459] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 8381.0). Total num frames: 544768. Throughput: 0: 9393.0. Samples: 545628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:53:49,600][547459] Avg episode reward: [(0, '518.267')] [2023-03-08 18:53:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000001064_544768.pth... [2023-03-08 18:53:49,609][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000000520_266240.pth [2023-03-08 18:53:52,531][547799] Updated weights for policy 0, policy_version 1120 (0.0006) [2023-03-08 18:53:54,600][547459] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 8426.1). Total num frames: 589824. Throughput: 0: 9383.9. Samples: 573448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:53:54,600][547459] Avg episode reward: [(0, '378.835')] [2023-03-08 18:53:56,957][547799] Updated weights for policy 0, policy_version 1200 (0.0005) [2023-03-08 18:53:59,600][547459] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 8465.1). Total num frames: 634880. Throughput: 0: 9359.0. Samples: 628316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:53:59,600][547459] Avg episode reward: [(0, '622.633')] [2023-03-08 18:54:01,464][547799] Updated weights for policy 0, policy_version 1280 (0.0005) [2023-03-08 18:54:04,600][547459] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 8499.2). Total num frames: 679936. Throughput: 0: 9285.6. Samples: 683688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:54:04,600][547459] Avg episode reward: [(0, '434.078')] [2023-03-08 18:54:04,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000001336_684032.pth... [2023-03-08 18:54:04,616][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000000800_409600.pth [2023-03-08 18:54:05,949][547799] Updated weights for policy 0, policy_version 1360 (0.0006) [2023-03-08 18:54:09,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 8577.5). Total num frames: 729088. Throughput: 0: 9234.1. Samples: 710676. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 18:54:09,600][547459] Avg episode reward: [(0, '797.914')] [2023-03-08 18:54:10,349][547799] Updated weights for policy 0, policy_version 1440 (0.0005) [2023-03-08 18:54:14,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 8601.6). Total num frames: 774144. Throughput: 0: 9195.4. Samples: 766052. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 18:54:14,601][547459] Avg episode reward: [(0, '387.218')] [2023-03-08 18:54:14,846][547799] Updated weights for policy 0, policy_version 1520 (0.0005) [2023-03-08 18:54:19,321][547799] Updated weights for policy 0, policy_version 1600 (0.0005) [2023-03-08 18:54:19,600][547459] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 8623.2). Total num frames: 819200. Throughput: 0: 9166.6. Samples: 821472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:54:19,600][547459] Avg episode reward: [(0, '450.252')] [2023-03-08 18:54:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000001600_819200.pth... [2023-03-08 18:54:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000001064_544768.pth [2023-03-08 18:54:23,685][547799] Updated weights for policy 0, policy_version 1680 (0.0005) [2023-03-08 18:54:24,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 8683.5). Total num frames: 868352. Throughput: 0: 9225.1. Samples: 850176. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:54:24,600][547459] Avg episode reward: [(0, '850.606')] [2023-03-08 18:54:28,043][547799] Updated weights for policy 0, policy_version 1760 (0.0005) [2023-03-08 18:54:29,600][547459] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 8699.1). Total num frames: 913408. Throughput: 0: 9193.5. Samples: 905292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:54:29,600][547459] Avg episode reward: [(0, '881.582')] [2023-03-08 18:54:32,562][547799] Updated weights for policy 0, policy_version 1840 (0.0005) [2023-03-08 18:54:34,600][547459] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 8713.3). Total num frames: 958464. Throughput: 0: 9227.6. Samples: 960868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:54:34,600][547459] Avg episode reward: [(0, '745.635')] [2023-03-08 18:54:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000001872_958464.pth... [2023-03-08 18:54:34,609][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000001336_684032.pth [2023-03-08 18:54:36,999][547799] Updated weights for policy 0, policy_version 1920 (0.0006) [2023-03-08 18:54:39,600][547459] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 8726.3). Total num frames: 1003520. Throughput: 0: 9197.6. Samples: 987340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:54:39,600][547459] Avg episode reward: [(0, '929.590')] [2023-03-08 18:54:41,528][547799] Updated weights for policy 0, policy_version 2000 (0.0006) [2023-03-08 18:54:44,600][547459] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 8738.1). Total num frames: 1048576. Throughput: 0: 9198.4. Samples: 1042244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:54:44,600][547459] Avg episode reward: [(0, '787.868')] [2023-03-08 18:54:46,228][547799] Updated weights for policy 0, policy_version 2080 (0.0005) [2023-03-08 18:54:49,600][547459] Fps is (10 sec: 8191.9, 60 sec: 9011.2, 300 sec: 8683.5). Total num frames: 1085440. Throughput: 0: 8978.7. Samples: 1087732. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 18:54:49,600][547459] Avg episode reward: [(0, '581.489')] [2023-03-08 18:54:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000002120_1085440.pth... [2023-03-08 18:54:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000001600_819200.pth [2023-03-08 18:54:52,265][547799] Updated weights for policy 0, policy_version 2160 (0.0005) [2023-03-08 18:54:54,600][547459] Fps is (10 sec: 7372.8, 60 sec: 8874.7, 300 sec: 8633.1). Total num frames: 1122304. Throughput: 0: 8830.7. Samples: 1108056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:54:54,600][547459] Avg episode reward: [(0, '624.490')] [2023-03-08 18:54:57,589][547799] Updated weights for policy 0, policy_version 2240 (0.0006) [2023-03-08 18:54:59,600][547459] Fps is (10 sec: 7372.8, 60 sec: 8738.1, 300 sec: 8586.4). Total num frames: 1159168. Throughput: 0: 8616.7. Samples: 1153804. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 18:54:59,600][547459] Avg episode reward: [(0, '631.071')] [2023-03-08 18:55:02,628][547799] Updated weights for policy 0, policy_version 2320 (0.0005) [2023-03-08 18:55:04,600][547459] Fps is (10 sec: 8192.0, 60 sec: 8738.1, 300 sec: 8601.6). Total num frames: 1204224. Throughput: 0: 8507.0. Samples: 1204288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:55:04,600][547459] Avg episode reward: [(0, '247.587')] [2023-03-08 18:55:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000002352_1204224.pth... [2023-03-08 18:55:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000001872_958464.pth [2023-03-08 18:55:07,273][547799] Updated weights for policy 0, policy_version 2400 (0.0005) [2023-03-08 18:55:09,600][547459] Fps is (10 sec: 8601.5, 60 sec: 8601.6, 300 sec: 8587.5). Total num frames: 1245184. Throughput: 0: 8471.3. Samples: 1231384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:55:09,600][547459] Avg episode reward: [(0, '442.190')] [2023-03-08 18:55:11,831][547799] Updated weights for policy 0, policy_version 2480 (0.0005) [2023-03-08 18:55:14,599][547459] Fps is (10 sec: 9011.3, 60 sec: 8669.9, 300 sec: 8628.9). Total num frames: 1294336. Throughput: 0: 8462.9. Samples: 1286120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:55:14,600][547459] Avg episode reward: [(0, '555.209')] [2023-03-08 18:55:16,267][547799] Updated weights for policy 0, policy_version 2560 (0.0006) [2023-03-08 18:55:19,600][547459] Fps is (10 sec: 9420.9, 60 sec: 8669.9, 300 sec: 8641.2). Total num frames: 1339392. Throughput: 0: 8434.0. Samples: 1340396. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 18:55:19,600][547459] Avg episode reward: [(0, '522.217')] [2023-03-08 18:55:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000002616_1339392.pth... [2023-03-08 18:55:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000002120_1085440.pth [2023-03-08 18:55:20,811][547799] Updated weights for policy 0, policy_version 2640 (0.0005) [2023-03-08 18:55:24,600][547459] Fps is (10 sec: 9011.1, 60 sec: 8601.6, 300 sec: 8652.8). Total num frames: 1384448. Throughput: 0: 8459.8. Samples: 1368032. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:55:24,600][547459] Avg episode reward: [(0, '457.328')] [2023-03-08 18:55:25,391][547799] Updated weights for policy 0, policy_version 2720 (0.0006) [2023-03-08 18:55:29,600][547459] Fps is (10 sec: 9011.2, 60 sec: 8601.6, 300 sec: 8663.7). Total num frames: 1429504. Throughput: 0: 8403.8. Samples: 1420416. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 18:55:29,600][547459] Avg episode reward: [(0, '320.624')] [2023-03-08 18:55:29,985][547799] Updated weights for policy 0, policy_version 2800 (0.0005) [2023-03-08 18:55:34,341][547799] Updated weights for policy 0, policy_version 2880 (0.0005) [2023-03-08 18:55:34,600][547459] Fps is (10 sec: 9011.2, 60 sec: 8601.6, 300 sec: 8673.9). Total num frames: 1474560. Throughput: 0: 8639.1. Samples: 1476492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:55:34,600][547459] Avg episode reward: [(0, '521.245')] [2023-03-08 18:55:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000002880_1474560.pth... [2023-03-08 18:55:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000002352_1204224.pth [2023-03-08 18:55:38,554][547799] Updated weights for policy 0, policy_version 2960 (0.0004) [2023-03-08 18:55:39,600][547459] Fps is (10 sec: 9420.8, 60 sec: 8669.9, 300 sec: 8706.9). Total num frames: 1523712. Throughput: 0: 8841.3. Samples: 1505916. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 18:55:39,600][547459] Avg episode reward: [(0, '796.340')] [2023-03-08 18:55:42,841][547799] Updated weights for policy 0, policy_version 3040 (0.0005) [2023-03-08 18:55:44,600][547459] Fps is (10 sec: 9830.4, 60 sec: 8738.1, 300 sec: 8738.1). Total num frames: 1572864. Throughput: 0: 9098.9. Samples: 1563252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:55:44,600][547459] Avg episode reward: [(0, '471.785')] [2023-03-08 18:55:47,078][547799] Updated weights for policy 0, policy_version 3120 (0.0006) [2023-03-08 18:55:49,600][547459] Fps is (10 sec: 9420.8, 60 sec: 8874.7, 300 sec: 8745.5). Total num frames: 1617920. Throughput: 0: 9259.2. Samples: 1620952. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:55:49,600][547459] Avg episode reward: [(0, '948.799')] [2023-03-08 18:55:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000003160_1617920.pth... [2023-03-08 18:55:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000002616_1339392.pth [2023-03-08 18:55:51,502][547799] Updated weights for policy 0, policy_version 3200 (0.0005) [2023-03-08 18:55:54,600][547459] Fps is (10 sec: 9011.1, 60 sec: 9011.2, 300 sec: 8752.5). Total num frames: 1662976. Throughput: 0: 9255.0. Samples: 1647856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:55:54,600][547459] Avg episode reward: [(0, '928.671')] [2023-03-08 18:55:55,851][547799] Updated weights for policy 0, policy_version 3280 (0.0005) [2023-03-08 18:55:59,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 8780.1). Total num frames: 1712128. Throughput: 0: 9308.2. Samples: 1704988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:55:59,600][547459] Avg episode reward: [(0, '1224.967')] [2023-03-08 18:56:00,070][547799] Updated weights for policy 0, policy_version 3360 (0.0006) [2023-03-08 18:56:04,578][547799] Updated weights for policy 0, policy_version 3440 (0.0005) [2023-03-08 18:56:04,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 8806.4). Total num frames: 1761280. Throughput: 0: 9351.3. Samples: 1761204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:56:04,600][547459] Avg episode reward: [(0, '960.208')] [2023-03-08 18:56:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000003440_1761280.pth... [2023-03-08 18:56:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000002880_1474560.pth [2023-03-08 18:56:08,927][547799] Updated weights for policy 0, policy_version 3520 (0.0005) [2023-03-08 18:56:09,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9352.6, 300 sec: 8811.4). Total num frames: 1806336. Throughput: 0: 9371.8. Samples: 1789764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:56:09,600][547459] Avg episode reward: [(0, '1117.901')] [2023-03-08 18:56:13,121][547799] Updated weights for policy 0, policy_version 3600 (0.0005) [2023-03-08 18:56:14,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 8835.7). Total num frames: 1855488. Throughput: 0: 9485.6. Samples: 1847268. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 18:56:14,600][547459] Avg episode reward: [(0, '1017.365')] [2023-03-08 18:56:17,513][547799] Updated weights for policy 0, policy_version 3680 (0.0005) [2023-03-08 18:56:19,600][547459] Fps is (10 sec: 9420.6, 60 sec: 9352.5, 300 sec: 8839.7). Total num frames: 1900544. Throughput: 0: 9511.6. Samples: 1904516. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 18:56:19,600][547459] Avg episode reward: [(0, '924.203')] [2023-03-08 18:56:19,620][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000003720_1904640.pth... [2023-03-08 18:56:19,622][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000003160_1617920.pth [2023-03-08 18:56:21,658][547799] Updated weights for policy 0, policy_version 3760 (0.0005) [2023-03-08 18:56:24,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 8862.3). Total num frames: 1949696. Throughput: 0: 9499.1. Samples: 1933376. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 18:56:24,600][547459] Avg episode reward: [(0, '1464.468')] [2023-03-08 18:56:26,000][547799] Updated weights for policy 0, policy_version 3840 (0.0006) [2023-03-08 18:56:29,600][547459] Fps is (10 sec: 9830.5, 60 sec: 9489.1, 300 sec: 8883.8). Total num frames: 1998848. Throughput: 0: 9487.4. Samples: 1990184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:56:29,600][547459] Avg episode reward: [(0, '1749.195')] [2023-03-08 18:56:30,340][547799] Updated weights for policy 0, policy_version 3920 (0.0005) [2023-03-08 18:56:34,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 8886.5). Total num frames: 2043904. Throughput: 0: 9471.9. Samples: 2047188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:56:34,600][547459] Avg episode reward: [(0, '2043.126')] [2023-03-08 18:56:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000003992_2043904.pth... [2023-03-08 18:56:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000003440_1761280.pth [2023-03-08 18:56:34,690][547799] Updated weights for policy 0, policy_version 4000 (0.0005) [2023-03-08 18:56:39,164][547799] Updated weights for policy 0, policy_version 4080 (0.0005) [2023-03-08 18:56:39,600][547459] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 8889.2). Total num frames: 2088960. Throughput: 0: 9483.8. Samples: 2074628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:56:39,600][547459] Avg episode reward: [(0, '2754.646')] [2023-03-08 18:56:39,627][547754] Saving new best policy, reward=2754.646! [2023-03-08 18:56:43,740][547799] Updated weights for policy 0, policy_version 4160 (0.0006) [2023-03-08 18:56:44,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 8908.8). Total num frames: 2138112. Throughput: 0: 9410.8. Samples: 2128476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:56:44,600][547459] Avg episode reward: [(0, '2640.618')] [2023-03-08 18:56:48,132][547799] Updated weights for policy 0, policy_version 4240 (0.0005) [2023-03-08 18:56:49,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 8910.9). Total num frames: 2183168. Throughput: 0: 9382.5. Samples: 2183416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:56:49,600][547459] Avg episode reward: [(0, '2906.450')] [2023-03-08 18:56:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000004264_2183168.pth... [2023-03-08 18:56:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000003720_1904640.pth [2023-03-08 18:56:49,607][547754] Saving new best policy, reward=2906.450! [2023-03-08 18:56:52,389][547799] Updated weights for policy 0, policy_version 4320 (0.0005) [2023-03-08 18:56:54,600][547459] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 8912.9). Total num frames: 2228224. Throughput: 0: 9409.2. Samples: 2213176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:56:54,600][547459] Avg episode reward: [(0, '3011.194')] [2023-03-08 18:56:54,640][547754] Saving new best policy, reward=3011.194! [2023-03-08 18:56:56,847][547799] Updated weights for policy 0, policy_version 4400 (0.0005) [2023-03-08 18:56:59,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 8930.9). Total num frames: 2277376. Throughput: 0: 9375.4. Samples: 2269160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:56:59,600][547459] Avg episode reward: [(0, '2757.832')] [2023-03-08 18:57:01,269][547799] Updated weights for policy 0, policy_version 4480 (0.0005) [2023-03-08 18:57:04,600][547459] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 8932.4). Total num frames: 2322432. Throughput: 0: 9314.5. Samples: 2323668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:57:04,600][547459] Avg episode reward: [(0, '2171.961')] [2023-03-08 18:57:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000004536_2322432.pth... [2023-03-08 18:57:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000003992_2043904.pth [2023-03-08 18:57:05,654][547799] Updated weights for policy 0, policy_version 4560 (0.0006) [2023-03-08 18:57:09,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 8949.4). Total num frames: 2371584. Throughput: 0: 9307.0. Samples: 2352192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:57:09,600][547459] Avg episode reward: [(0, '2426.697')] [2023-03-08 18:57:09,888][547799] Updated weights for policy 0, policy_version 4640 (0.0005) [2023-03-08 18:57:14,229][547799] Updated weights for policy 0, policy_version 4720 (0.0006) [2023-03-08 18:57:14,600][547459] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 8950.5). Total num frames: 2416640. Throughput: 0: 9328.5. Samples: 2409964. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 18:57:14,600][547459] Avg episode reward: [(0, '2980.463')] [2023-03-08 18:57:18,439][547799] Updated weights for policy 0, policy_version 4800 (0.0005) [2023-03-08 18:57:19,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 8966.5). Total num frames: 2465792. Throughput: 0: 9361.9. Samples: 2468472. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:57:19,600][547459] Avg episode reward: [(0, '3646.199')] [2023-03-08 18:57:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000004816_2465792.pth... [2023-03-08 18:57:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000004264_2183168.pth [2023-03-08 18:57:19,608][547754] Saving new best policy, reward=3646.199! [2023-03-08 18:57:22,729][547799] Updated weights for policy 0, policy_version 4880 (0.0005) [2023-03-08 18:57:24,600][547459] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 8981.9). Total num frames: 2514944. Throughput: 0: 9369.4. Samples: 2496252. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 18:57:24,600][547459] Avg episode reward: [(0, '3865.772')] [2023-03-08 18:57:24,601][547754] Saving new best policy, reward=3865.772! [2023-03-08 18:57:27,073][547799] Updated weights for policy 0, policy_version 4960 (0.0005) [2023-03-08 18:57:29,599][547459] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 8982.5). Total num frames: 2560000. Throughput: 0: 9453.9. Samples: 2553900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:57:29,600][547459] Avg episode reward: [(0, '3971.576')] [2023-03-08 18:57:29,607][547754] Saving new best policy, reward=3971.576! [2023-03-08 18:57:31,202][547799] Updated weights for policy 0, policy_version 5040 (0.0005) [2023-03-08 18:57:34,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9011.2). Total num frames: 2613248. Throughput: 0: 9553.4. Samples: 2613320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:57:34,600][547459] Avg episode reward: [(0, '3805.758')] [2023-03-08 18:57:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000005104_2613248.pth... [2023-03-08 18:57:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000004536_2322432.pth [2023-03-08 18:57:35,302][547799] Updated weights for policy 0, policy_version 5120 (0.0005) [2023-03-08 18:57:39,231][547799] Updated weights for policy 0, policy_version 5200 (0.0006) [2023-03-08 18:57:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9557.3, 300 sec: 9025.1). Total num frames: 2662400. Throughput: 0: 9596.6. Samples: 2645024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:57:39,600][547459] Avg episode reward: [(0, '4236.115')] [2023-03-08 18:57:39,601][547754] Saving new best policy, reward=4236.115! [2023-03-08 18:57:43,531][547799] Updated weights for policy 0, policy_version 5280 (0.0005) [2023-03-08 18:57:44,599][547459] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9191.7). Total num frames: 2711552. Throughput: 0: 9649.1. Samples: 2703368. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 18:57:44,600][547459] Avg episode reward: [(0, '4131.863')] [2023-03-08 18:57:47,643][547799] Updated weights for policy 0, policy_version 5360 (0.0005) [2023-03-08 18:57:49,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9261.1). Total num frames: 2760704. Throughput: 0: 9763.9. Samples: 2763044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:57:49,600][547459] Avg episode reward: [(0, '3824.778')] [2023-03-08 18:57:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000005392_2760704.pth... [2023-03-08 18:57:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000004816_2465792.pth [2023-03-08 18:57:51,744][547799] Updated weights for policy 0, policy_version 5440 (0.0005) [2023-03-08 18:57:54,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9288.9). Total num frames: 2813952. Throughput: 0: 9808.4. Samples: 2793572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:57:54,600][547459] Avg episode reward: [(0, '4103.490')] [2023-03-08 18:57:55,708][547799] Updated weights for policy 0, policy_version 5520 (0.0005) [2023-03-08 18:57:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9288.9). Total num frames: 2863104. Throughput: 0: 9888.1. Samples: 2854928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:57:59,601][547459] Avg episode reward: [(0, '4228.775')] [2023-03-08 18:57:59,732][547799] Updated weights for policy 0, policy_version 5600 (0.0005) [2023-03-08 18:58:03,724][547799] Updated weights for policy 0, policy_version 5680 (0.0005) [2023-03-08 18:58:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9316.7). Total num frames: 2916352. Throughput: 0: 9953.1. Samples: 2916360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:58:04,600][547459] Avg episode reward: [(0, '4339.105')] [2023-03-08 18:58:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000005696_2916352.pth... [2023-03-08 18:58:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000005104_2613248.pth [2023-03-08 18:58:04,606][547754] Saving new best policy, reward=4339.105! [2023-03-08 18:58:07,734][547799] Updated weights for policy 0, policy_version 5760 (0.0005) [2023-03-08 18:58:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9316.7). Total num frames: 2965504. Throughput: 0: 10019.7. Samples: 2947136. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 18:58:09,600][547459] Avg episode reward: [(0, '3894.647')] [2023-03-08 18:58:11,897][547799] Updated weights for policy 0, policy_version 5840 (0.0005) [2023-03-08 18:58:14,600][547459] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9316.7). Total num frames: 3014656. Throughput: 0: 10058.6. Samples: 3006536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:58:14,600][547459] Avg episode reward: [(0, '3970.705')] [2023-03-08 18:58:15,871][547799] Updated weights for policy 0, policy_version 5920 (0.0005) [2023-03-08 18:58:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9330.5). Total num frames: 3067904. Throughput: 0: 10107.7. Samples: 3068168. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 18:58:19,600][547459] Avg episode reward: [(0, '4404.631')] [2023-03-08 18:58:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000005992_3067904.pth... [2023-03-08 18:58:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000005392_2760704.pth [2023-03-08 18:58:19,607][547754] Saving new best policy, reward=4404.631! [2023-03-08 18:58:19,861][547799] Updated weights for policy 0, policy_version 6000 (0.0005) [2023-03-08 18:58:23,835][547799] Updated weights for policy 0, policy_version 6080 (0.0005) [2023-03-08 18:58:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9344.4). Total num frames: 3117056. Throughput: 0: 10115.4. Samples: 3100216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:58:24,600][547459] Avg episode reward: [(0, '4311.415')] [2023-03-08 18:58:27,758][547799] Updated weights for policy 0, policy_version 6160 (0.0005) [2023-03-08 18:58:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9372.2). Total num frames: 3170304. Throughput: 0: 10194.3. Samples: 3162112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:58:29,600][547459] Avg episode reward: [(0, '4501.810')] [2023-03-08 18:58:29,601][547754] Saving new best policy, reward=4501.810! [2023-03-08 18:58:31,896][547799] Updated weights for policy 0, policy_version 6240 (0.0005) [2023-03-08 18:58:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9372.2). Total num frames: 3219456. Throughput: 0: 10204.2. Samples: 3222236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:58:34,600][547459] Avg episode reward: [(0, '4285.102')] [2023-03-08 18:58:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000006288_3219456.pth... [2023-03-08 18:58:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000005696_2916352.pth [2023-03-08 18:58:35,985][547799] Updated weights for policy 0, policy_version 6320 (0.0005) [2023-03-08 18:58:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9400.0). Total num frames: 3272704. Throughput: 0: 10190.2. Samples: 3252132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:58:39,611][547459] Avg episode reward: [(0, '4622.149')] [2023-03-08 18:58:39,612][547754] Saving new best policy, reward=4622.149! [2023-03-08 18:58:39,972][547799] Updated weights for policy 0, policy_version 6400 (0.0005) [2023-03-08 18:58:44,035][547799] Updated weights for policy 0, policy_version 6480 (0.0005) [2023-03-08 18:58:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9413.9). Total num frames: 3321856. Throughput: 0: 10182.5. Samples: 3313140. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 18:58:44,600][547459] Avg episode reward: [(0, '4539.388')] [2023-03-08 18:58:48,013][547799] Updated weights for policy 0, policy_version 6560 (0.0005) [2023-03-08 18:58:49,600][547459] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 9427.7). Total num frames: 3371008. Throughput: 0: 10162.1. Samples: 3373656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:58:49,600][547459] Avg episode reward: [(0, '4388.542')] [2023-03-08 18:58:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000006584_3371008.pth... [2023-03-08 18:58:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000005992_3067904.pth [2023-03-08 18:58:52,151][547799] Updated weights for policy 0, policy_version 6640 (0.0005) [2023-03-08 18:58:54,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9441.6). Total num frames: 3420160. Throughput: 0: 10146.2. Samples: 3403716. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:58:54,600][547459] Avg episode reward: [(0, '4458.458')] [2023-03-08 18:58:56,302][547799] Updated weights for policy 0, policy_version 6720 (0.0005) [2023-03-08 18:58:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9469.4). Total num frames: 3473408. Throughput: 0: 10157.4. Samples: 3463620. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:58:59,600][547459] Avg episode reward: [(0, '4564.423')] [2023-03-08 18:59:00,285][547799] Updated weights for policy 0, policy_version 6800 (0.0005) [2023-03-08 18:59:04,181][547799] Updated weights for policy 0, policy_version 6880 (0.0006) [2023-03-08 18:59:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9469.4). Total num frames: 3522560. Throughput: 0: 10182.5. Samples: 3526380. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:59:04,600][547459] Avg episode reward: [(0, '4572.536')] [2023-03-08 18:59:04,616][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000006888_3526656.pth... [2023-03-08 18:59:04,618][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000006288_3219456.pth [2023-03-08 18:59:08,385][547799] Updated weights for policy 0, policy_version 6960 (0.0005) [2023-03-08 18:59:09,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9483.3). Total num frames: 3571712. Throughput: 0: 10113.6. Samples: 3555328. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 18:59:09,600][547459] Avg episode reward: [(0, '4504.239')] [2023-03-08 18:59:12,534][547799] Updated weights for policy 0, policy_version 7040 (0.0006) [2023-03-08 18:59:14,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9511.1). Total num frames: 3624960. Throughput: 0: 10067.4. Samples: 3615144. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:59:14,600][547459] Avg episode reward: [(0, '4541.388')] [2023-03-08 18:59:16,494][547799] Updated weights for policy 0, policy_version 7120 (0.0005) [2023-03-08 18:59:19,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9511.0). Total num frames: 3674112. Throughput: 0: 10093.8. Samples: 3676456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:59:19,600][547459] Avg episode reward: [(0, '4664.236')] [2023-03-08 18:59:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000007176_3674112.pth... [2023-03-08 18:59:19,609][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000006584_3371008.pth [2023-03-08 18:59:19,609][547754] Saving new best policy, reward=4664.236! [2023-03-08 18:59:20,483][547799] Updated weights for policy 0, policy_version 7200 (0.0005) [2023-03-08 18:59:24,411][547799] Updated weights for policy 0, policy_version 7280 (0.0005) [2023-03-08 18:59:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 9538.8). Total num frames: 3727360. Throughput: 0: 10109.1. Samples: 3707040. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:59:24,600][547459] Avg episode reward: [(0, '4679.509')] [2023-03-08 18:59:24,601][547754] Saving new best policy, reward=4679.509! [2023-03-08 18:59:28,652][547799] Updated weights for policy 0, policy_version 7360 (0.0006) [2023-03-08 18:59:29,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9552.7). Total num frames: 3776512. Throughput: 0: 10111.8. Samples: 3768172. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 18:59:29,611][547459] Avg episode reward: [(0, '4682.638')] [2023-03-08 18:59:29,612][547754] Saving new best policy, reward=4682.638! [2023-03-08 18:59:32,726][547799] Updated weights for policy 0, policy_version 7440 (0.0005) [2023-03-08 18:59:34,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9566.6). Total num frames: 3825664. Throughput: 0: 10097.0. Samples: 3828020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:59:34,600][547459] Avg episode reward: [(0, '4726.927')] [2023-03-08 18:59:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000007472_3825664.pth... [2023-03-08 18:59:34,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000006888_3526656.pth [2023-03-08 18:59:34,608][547754] Saving new best policy, reward=4726.927! [2023-03-08 18:59:36,659][547799] Updated weights for policy 0, policy_version 7520 (0.0005) [2023-03-08 18:59:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9594.4). Total num frames: 3878912. Throughput: 0: 10121.4. Samples: 3859180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:59:39,600][547459] Avg episode reward: [(0, '4669.486')] [2023-03-08 18:59:40,720][547799] Updated weights for policy 0, policy_version 7600 (0.0005) [2023-03-08 18:59:44,576][547799] Updated weights for policy 0, policy_version 7680 (0.0005) [2023-03-08 18:59:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 9649.9). Total num frames: 3932160. Throughput: 0: 10164.3. Samples: 3921012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:59:44,601][547459] Avg episode reward: [(0, '4594.833')] [2023-03-08 18:59:48,658][547799] Updated weights for policy 0, policy_version 7760 (0.0005) [2023-03-08 18:59:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 9691.6). Total num frames: 3981312. Throughput: 0: 10128.9. Samples: 3982180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:59:49,600][547459] Avg episode reward: [(0, '4589.920')] [2023-03-08 18:59:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000007776_3981312.pth... [2023-03-08 18:59:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000007176_3674112.pth [2023-03-08 18:59:52,735][547799] Updated weights for policy 0, policy_version 7840 (0.0006) [2023-03-08 18:59:54,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9733.2). Total num frames: 4030464. Throughput: 0: 10153.5. Samples: 4012236. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 18:59:54,600][547459] Avg episode reward: [(0, '4526.762')] [2023-03-08 18:59:56,758][547799] Updated weights for policy 0, policy_version 7920 (0.0005) [2023-03-08 18:59:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9761.0). Total num frames: 4083712. Throughput: 0: 10175.2. Samples: 4073028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 18:59:59,600][547459] Avg episode reward: [(0, '4536.161')] [2023-03-08 19:00:00,706][547799] Updated weights for policy 0, policy_version 8000 (0.0005) [2023-03-08 19:00:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9788.8). Total num frames: 4132864. Throughput: 0: 10143.8. Samples: 4132928. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:00:04,600][547459] Avg episode reward: [(0, '3916.192')] [2023-03-08 19:00:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000008072_4132864.pth... [2023-03-08 19:00:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000007472_3825664.pth [2023-03-08 19:00:04,959][547799] Updated weights for policy 0, policy_version 8080 (0.0005) [2023-03-08 19:00:08,938][547799] Updated weights for policy 0, policy_version 8160 (0.0006) [2023-03-08 19:00:09,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9788.7). Total num frames: 4182016. Throughput: 0: 10138.8. Samples: 4163288. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:00:09,600][547459] Avg episode reward: [(0, '4440.342')] [2023-03-08 19:00:12,996][547799] Updated weights for policy 0, policy_version 8240 (0.0005) [2023-03-08 19:00:14,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9802.6). Total num frames: 4231168. Throughput: 0: 10130.8. Samples: 4224056. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:00:14,600][547459] Avg episode reward: [(0, '4404.825')] [2023-03-08 19:00:17,134][547799] Updated weights for policy 0, policy_version 8320 (0.0005) [2023-03-08 19:00:19,600][547459] Fps is (10 sec: 9830.2, 60 sec: 10103.4, 300 sec: 9816.5). Total num frames: 4280320. Throughput: 0: 10075.6. Samples: 4281424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:00:19,601][547459] Avg episode reward: [(0, '4590.181')] [2023-03-08 19:00:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000008360_4280320.pth... [2023-03-08 19:00:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000007776_3981312.pth [2023-03-08 19:00:21,554][547799] Updated weights for policy 0, policy_version 8400 (0.0005) [2023-03-08 19:00:24,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9816.5). Total num frames: 4325376. Throughput: 0: 9997.2. Samples: 4309056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:00:24,600][547459] Avg episode reward: [(0, '4383.500')] [2023-03-08 19:00:26,266][547799] Updated weights for policy 0, policy_version 8480 (0.0005) [2023-03-08 19:00:29,600][547459] Fps is (10 sec: 9011.4, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 4370432. Throughput: 0: 9817.1. Samples: 4362780. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:00:29,600][547459] Avg episode reward: [(0, '4572.493')] [2023-03-08 19:00:30,605][547799] Updated weights for policy 0, policy_version 8560 (0.0005) [2023-03-08 19:00:34,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 4419584. Throughput: 0: 9719.7. Samples: 4419568. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:00:34,600][547459] Avg episode reward: [(0, '4332.541')] [2023-03-08 19:00:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000008632_4419584.pth... [2023-03-08 19:00:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000008072_4132864.pth [2023-03-08 19:00:35,062][547799] Updated weights for policy 0, policy_version 8640 (0.0005) [2023-03-08 19:00:39,244][547799] Updated weights for policy 0, policy_version 8720 (0.0005) [2023-03-08 19:00:39,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 4464640. Throughput: 0: 9675.7. Samples: 4447644. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:00:39,600][547459] Avg episode reward: [(0, '4707.431')] [2023-03-08 19:00:43,414][547799] Updated weights for policy 0, policy_version 8800 (0.0005) [2023-03-08 19:00:44,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 4513792. Throughput: 0: 9632.6. Samples: 4506496. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:00:44,600][547459] Avg episode reward: [(0, '4597.176')] [2023-03-08 19:00:47,461][547799] Updated weights for policy 0, policy_version 8880 (0.0005) [2023-03-08 19:00:49,600][547459] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9844.3). Total num frames: 4567040. Throughput: 0: 9647.1. Samples: 4567048. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:00:49,600][547459] Avg episode reward: [(0, '4714.623')] [2023-03-08 19:00:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000008920_4567040.pth... [2023-03-08 19:00:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000008360_4280320.pth [2023-03-08 19:00:51,725][547799] Updated weights for policy 0, policy_version 8960 (0.0005) [2023-03-08 19:00:54,600][547459] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9844.3). Total num frames: 4616192. Throughput: 0: 9609.8. Samples: 4595728. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:00:54,600][547459] Avg episode reward: [(0, '4569.862')] [2023-03-08 19:00:55,805][547799] Updated weights for policy 0, policy_version 9040 (0.0006) [2023-03-08 19:00:59,600][547459] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9844.3). Total num frames: 4665344. Throughput: 0: 9624.2. Samples: 4657144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:00:59,600][547459] Avg episode reward: [(0, '4666.426')] [2023-03-08 19:00:59,769][547799] Updated weights for policy 0, policy_version 9120 (0.0005) [2023-03-08 19:01:03,865][547799] Updated weights for policy 0, policy_version 9200 (0.0005) [2023-03-08 19:01:04,600][547459] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9858.2). Total num frames: 4714496. Throughput: 0: 9664.0. Samples: 4716304. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:01:04,600][547459] Avg episode reward: [(0, '4672.775')] [2023-03-08 19:01:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000009208_4714496.pth... [2023-03-08 19:01:04,610][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000008632_4419584.pth [2023-03-08 19:01:07,941][547799] Updated weights for policy 0, policy_version 9280 (0.0005) [2023-03-08 19:01:09,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9872.1). Total num frames: 4767744. Throughput: 0: 9732.8. Samples: 4747032. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:01:09,600][547459] Avg episode reward: [(0, '4662.919')] [2023-03-08 19:01:12,122][547799] Updated weights for policy 0, policy_version 9360 (0.0006) [2023-03-08 19:01:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9885.9). Total num frames: 4816896. Throughput: 0: 9852.0. Samples: 4806120. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:01:14,600][547459] Avg episode reward: [(0, '4569.501')] [2023-03-08 19:01:16,201][547799] Updated weights for policy 0, policy_version 9440 (0.0006) [2023-03-08 19:01:19,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9762.2, 300 sec: 9885.9). Total num frames: 4866048. Throughput: 0: 9923.4. Samples: 4866120. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:01:19,601][547459] Avg episode reward: [(0, '4604.230')] [2023-03-08 19:01:19,607][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000009504_4866048.pth... [2023-03-08 19:01:19,611][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000008920_4567040.pth [2023-03-08 19:01:20,310][547799] Updated weights for policy 0, policy_version 9520 (0.0006) [2023-03-08 19:01:24,530][547799] Updated weights for policy 0, policy_version 9600 (0.0005) [2023-03-08 19:01:24,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 4915200. Throughput: 0: 9964.1. Samples: 4896028. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:01:24,601][547459] Avg episode reward: [(0, '4591.289')] [2023-03-08 19:01:28,579][547799] Updated weights for policy 0, policy_version 9680 (0.0005) [2023-03-08 19:01:29,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 4964352. Throughput: 0: 9991.4. Samples: 4956108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:01:29,600][547459] Avg episode reward: [(0, '4497.625')] [2023-03-08 19:01:32,696][547799] Updated weights for policy 0, policy_version 9760 (0.0005) [2023-03-08 19:01:34,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9913.7). Total num frames: 5013504. Throughput: 0: 9970.0. Samples: 5015696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:01:34,600][547459] Avg episode reward: [(0, '4582.475')] [2023-03-08 19:01:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000009792_5013504.pth... [2023-03-08 19:01:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000009208_4714496.pth [2023-03-08 19:01:36,777][547799] Updated weights for policy 0, policy_version 9840 (0.0006) [2023-03-08 19:01:39,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 5062656. Throughput: 0: 10005.9. Samples: 5045992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:01:39,600][547459] Avg episode reward: [(0, '4386.452')] [2023-03-08 19:01:40,823][547799] Updated weights for policy 0, policy_version 9920 (0.0005) [2023-03-08 19:01:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 5115904. Throughput: 0: 9982.6. Samples: 5106364. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:01:44,601][547459] Avg episode reward: [(0, '4618.552')] [2023-03-08 19:01:44,825][547799] Updated weights for policy 0, policy_version 10000 (0.0005) [2023-03-08 19:01:48,891][547799] Updated weights for policy 0, policy_version 10080 (0.0006) [2023-03-08 19:01:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 5165056. Throughput: 0: 10032.4. Samples: 5167764. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:01:49,600][547459] Avg episode reward: [(0, '4740.549')] [2023-03-08 19:01:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000010088_5165056.pth... [2023-03-08 19:01:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000009504_4866048.pth [2023-03-08 19:01:49,607][547754] Saving new best policy, reward=4740.549! [2023-03-08 19:01:52,840][547799] Updated weights for policy 0, policy_version 10160 (0.0005) [2023-03-08 19:01:54,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 5218304. Throughput: 0: 10047.2. Samples: 5199156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:01:54,600][547459] Avg episode reward: [(0, '4494.350')] [2023-03-08 19:01:56,948][547799] Updated weights for policy 0, policy_version 10240 (0.0005) [2023-03-08 19:01:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9983.1). Total num frames: 5267456. Throughput: 0: 10069.9. Samples: 5259264. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:01:59,600][547459] Avg episode reward: [(0, '4567.071')] [2023-03-08 19:02:00,985][547799] Updated weights for policy 0, policy_version 10320 (0.0005) [2023-03-08 19:02:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 5320704. Throughput: 0: 10098.0. Samples: 5320532. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:02:04,600][547459] Avg episode reward: [(0, '4614.189')] [2023-03-08 19:02:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000010392_5320704.pth... [2023-03-08 19:02:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000009792_5013504.pth [2023-03-08 19:02:04,939][547799] Updated weights for policy 0, policy_version 10400 (0.0005) [2023-03-08 19:02:09,019][547799] Updated weights for policy 0, policy_version 10480 (0.0005) [2023-03-08 19:02:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10010.9). Total num frames: 5369856. Throughput: 0: 10107.6. Samples: 5350872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:09,601][547459] Avg episode reward: [(0, '4638.129')] [2023-03-08 19:02:12,913][547799] Updated weights for policy 0, policy_version 10560 (0.0005) [2023-03-08 19:02:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 5423104. Throughput: 0: 10168.3. Samples: 5413680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:14,600][547459] Avg episode reward: [(0, '4746.312')] [2023-03-08 19:02:14,601][547754] Saving new best policy, reward=4746.312! [2023-03-08 19:02:16,793][547799] Updated weights for policy 0, policy_version 10640 (0.0005) [2023-03-08 19:02:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10038.7). Total num frames: 5476352. Throughput: 0: 10237.0. Samples: 5476360. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:02:19,600][547459] Avg episode reward: [(0, '4809.288')] [2023-03-08 19:02:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000010696_5476352.pth... [2023-03-08 19:02:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000010088_5165056.pth [2023-03-08 19:02:19,607][547754] Saving new best policy, reward=4809.288! [2023-03-08 19:02:20,747][547799] Updated weights for policy 0, policy_version 10720 (0.0005) [2023-03-08 19:02:24,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 5525504. Throughput: 0: 10259.3. Samples: 5507660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:24,600][547459] Avg episode reward: [(0, '4761.227')] [2023-03-08 19:02:24,694][547799] Updated weights for policy 0, policy_version 10800 (0.0005) [2023-03-08 19:02:28,703][547799] Updated weights for policy 0, policy_version 10880 (0.0005) [2023-03-08 19:02:29,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 5578752. Throughput: 0: 10282.5. Samples: 5569076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:29,600][547459] Avg episode reward: [(0, '4804.966')] [2023-03-08 19:02:32,771][547799] Updated weights for policy 0, policy_version 10960 (0.0005) [2023-03-08 19:02:34,600][547459] Fps is (10 sec: 10239.8, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 5627904. Throughput: 0: 10261.3. Samples: 5629524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:34,601][547459] Avg episode reward: [(0, '4791.585')] [2023-03-08 19:02:34,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000010992_5627904.pth... [2023-03-08 19:02:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000010392_5320704.pth [2023-03-08 19:02:36,809][547799] Updated weights for policy 0, policy_version 11040 (0.0005) [2023-03-08 19:02:39,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 5677056. Throughput: 0: 10240.6. Samples: 5659984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:39,600][547459] Avg episode reward: [(0, '4639.145')] [2023-03-08 19:02:40,577][547754] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000003 [2023-03-08 19:02:40,985][547799] Updated weights for policy 0, policy_version 11120 (0.0005) [2023-03-08 19:02:44,600][547459] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 5726208. Throughput: 0: 10195.9. Samples: 5718080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:44,600][547459] Avg episode reward: [(0, '4629.400')] [2023-03-08 19:02:45,183][547799] Updated weights for policy 0, policy_version 11200 (0.0005) [2023-03-08 19:02:49,209][547799] Updated weights for policy 0, policy_version 11280 (0.0005) [2023-03-08 19:02:49,600][547459] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10038.7). Total num frames: 5775360. Throughput: 0: 10193.7. Samples: 5779248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:49,601][547459] Avg episode reward: [(0, '4262.592')] [2023-03-08 19:02:49,627][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000011288_5779456.pth... [2023-03-08 19:02:49,629][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000010696_5476352.pth [2023-03-08 19:02:53,222][547799] Updated weights for policy 0, policy_version 11360 (0.0006) [2023-03-08 19:02:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 5828608. Throughput: 0: 10191.5. Samples: 5809488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:54,601][547459] Avg episode reward: [(0, '4018.885')] [2023-03-08 19:02:57,304][547799] Updated weights for policy 0, policy_version 11440 (0.0005) [2023-03-08 19:02:59,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10038.7). Total num frames: 5877760. Throughput: 0: 10130.8. Samples: 5869568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:02:59,601][547459] Avg episode reward: [(0, '4551.222')] [2023-03-08 19:03:01,428][547799] Updated weights for policy 0, policy_version 11520 (0.0005) [2023-03-08 19:03:04,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 5926912. Throughput: 0: 10094.6. Samples: 5930616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:03:04,600][547459] Avg episode reward: [(0, '4690.905')] [2023-03-08 19:03:04,643][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000011584_5931008.pth... [2023-03-08 19:03:04,645][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000010992_5627904.pth [2023-03-08 19:03:05,482][547799] Updated weights for policy 0, policy_version 11600 (0.0005) [2023-03-08 19:03:09,495][547799] Updated weights for policy 0, policy_version 11680 (0.0005) [2023-03-08 19:03:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 5980160. Throughput: 0: 10049.8. Samples: 5959904. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:03:09,601][547459] Avg episode reward: [(0, '4735.694')] [2023-03-08 19:03:13,503][547799] Updated weights for policy 0, policy_version 11760 (0.0005) [2023-03-08 19:03:14,600][547459] Fps is (10 sec: 10239.8, 60 sec: 10103.4, 300 sec: 10038.7). Total num frames: 6029312. Throughput: 0: 10046.8. Samples: 6021184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:03:14,601][547459] Avg episode reward: [(0, '4566.954')] [2023-03-08 19:03:17,736][547799] Updated weights for policy 0, policy_version 11840 (0.0005) [2023-03-08 19:03:19,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 6078464. Throughput: 0: 10025.3. Samples: 6080660. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:03:19,600][547459] Avg episode reward: [(0, '4353.091')] [2023-03-08 19:03:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000011872_6078464.pth... [2023-03-08 19:03:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000011288_5779456.pth [2023-03-08 19:03:21,861][547799] Updated weights for policy 0, policy_version 11920 (0.0005) [2023-03-08 19:03:24,600][547459] Fps is (10 sec: 10240.3, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 6131712. Throughput: 0: 10012.6. Samples: 6110552. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:03:24,600][547459] Avg episode reward: [(0, '4461.268')] [2023-03-08 19:03:25,739][547799] Updated weights for policy 0, policy_version 12000 (0.0005) [2023-03-08 19:03:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 6180864. Throughput: 0: 10102.0. Samples: 6172672. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:03:29,601][547459] Avg episode reward: [(0, '4604.388')] [2023-03-08 19:03:29,836][547799] Updated weights for policy 0, policy_version 12080 (0.0005) [2023-03-08 19:03:33,835][547799] Updated weights for policy 0, policy_version 12160 (0.0005) [2023-03-08 19:03:34,600][547459] Fps is (10 sec: 10239.8, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 6234112. Throughput: 0: 10101.0. Samples: 6233796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:03:34,601][547459] Avg episode reward: [(0, '4256.156')] [2023-03-08 19:03:34,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000012176_6234112.pth... [2023-03-08 19:03:34,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000011584_5931008.pth [2023-03-08 19:03:37,896][547799] Updated weights for policy 0, policy_version 12240 (0.0005) [2023-03-08 19:03:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 6283264. Throughput: 0: 10074.7. Samples: 6262848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:03:39,600][547459] Avg episode reward: [(0, '4405.508')] [2023-03-08 19:03:42,006][547799] Updated weights for policy 0, policy_version 12320 (0.0005) [2023-03-08 19:03:44,600][547459] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 6332416. Throughput: 0: 10101.6. Samples: 6324140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:03:44,600][547459] Avg episode reward: [(0, '4274.637')] [2023-03-08 19:03:45,993][547799] Updated weights for policy 0, policy_version 12400 (0.0005) [2023-03-08 19:03:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 6385664. Throughput: 0: 10113.6. Samples: 6385728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:03:49,601][547459] Avg episode reward: [(0, '4204.718')] [2023-03-08 19:03:49,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000012472_6385664.pth... [2023-03-08 19:03:49,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000011872_6078464.pth [2023-03-08 19:03:49,920][547799] Updated weights for policy 0, policy_version 12480 (0.0005) [2023-03-08 19:03:53,874][547799] Updated weights for policy 0, policy_version 12560 (0.0005) [2023-03-08 19:03:54,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 6434816. Throughput: 0: 10160.9. Samples: 6417144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:03:54,601][547459] Avg episode reward: [(0, '4503.406')] [2023-03-08 19:03:57,905][547799] Updated weights for policy 0, policy_version 12640 (0.0005) [2023-03-08 19:03:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 6488064. Throughput: 0: 10157.9. Samples: 6478288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:03:59,600][547459] Avg episode reward: [(0, '4606.042')] [2023-03-08 19:04:01,902][547799] Updated weights for policy 0, policy_version 12720 (0.0005) [2023-03-08 19:04:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 6537216. Throughput: 0: 10185.9. Samples: 6539024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:04,600][547459] Avg episode reward: [(0, '4335.761')] [2023-03-08 19:04:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000012768_6537216.pth... [2023-03-08 19:04:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000012176_6234112.pth [2023-03-08 19:04:05,936][547799] Updated weights for policy 0, policy_version 12800 (0.0005) [2023-03-08 19:04:09,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 6590464. Throughput: 0: 10220.3. Samples: 6570468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:09,601][547459] Avg episode reward: [(0, '4347.606')] [2023-03-08 19:04:09,721][547799] Updated weights for policy 0, policy_version 12880 (0.0004) [2023-03-08 19:04:13,639][547799] Updated weights for policy 0, policy_version 12960 (0.0005) [2023-03-08 19:04:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10240.1, 300 sec: 10066.4). Total num frames: 6643712. Throughput: 0: 10260.0. Samples: 6634372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:14,600][547459] Avg episode reward: [(0, '4409.917')] [2023-03-08 19:04:17,518][547799] Updated weights for policy 0, policy_version 13040 (0.0005) [2023-03-08 19:04:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10066.4). Total num frames: 6696960. Throughput: 0: 10294.2. Samples: 6697032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:19,600][547459] Avg episode reward: [(0, '4603.847')] [2023-03-08 19:04:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000013080_6696960.pth... [2023-03-08 19:04:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000012472_6385664.pth [2023-03-08 19:04:21,390][547799] Updated weights for policy 0, policy_version 13120 (0.0005) [2023-03-08 19:04:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 6746112. Throughput: 0: 10375.1. Samples: 6729728. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:04:24,600][547459] Avg episode reward: [(0, '4570.868')] [2023-03-08 19:04:25,445][547799] Updated weights for policy 0, policy_version 13200 (0.0005) [2023-03-08 19:04:29,434][547799] Updated weights for policy 0, policy_version 13280 (0.0005) [2023-03-08 19:04:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10080.3). Total num frames: 6799360. Throughput: 0: 10362.0. Samples: 6790432. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:04:29,600][547459] Avg episode reward: [(0, '4746.330')] [2023-03-08 19:04:33,159][547799] Updated weights for policy 0, policy_version 13360 (0.0005) [2023-03-08 19:04:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10080.3). Total num frames: 6852608. Throughput: 0: 10438.9. Samples: 6855480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:34,600][547459] Avg episode reward: [(0, '4719.384')] [2023-03-08 19:04:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000013384_6852608.pth... [2023-03-08 19:04:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000012768_6537216.pth [2023-03-08 19:04:37,040][547799] Updated weights for policy 0, policy_version 13440 (0.0005) [2023-03-08 19:04:39,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10080.3). Total num frames: 6905856. Throughput: 0: 10413.4. Samples: 6885748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:39,600][547459] Avg episode reward: [(0, '4786.765')] [2023-03-08 19:04:41,071][547799] Updated weights for policy 0, policy_version 13520 (0.0005) [2023-03-08 19:04:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10080.3). Total num frames: 6955008. Throughput: 0: 10418.5. Samples: 6947120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:44,600][547459] Avg episode reward: [(0, '4636.464')] [2023-03-08 19:04:45,018][547799] Updated weights for policy 0, policy_version 13600 (0.0005) [2023-03-08 19:04:48,938][547799] Updated weights for policy 0, policy_version 13680 (0.0005) [2023-03-08 19:04:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10094.2). Total num frames: 7008256. Throughput: 0: 10485.3. Samples: 7010864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:49,600][547459] Avg episode reward: [(0, '4491.362')] [2023-03-08 19:04:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000013688_7008256.pth... [2023-03-08 19:04:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000013080_6696960.pth [2023-03-08 19:04:52,833][547799] Updated weights for policy 0, policy_version 13760 (0.0005) [2023-03-08 19:04:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10094.2). Total num frames: 7061504. Throughput: 0: 10470.1. Samples: 7041620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:54,600][547459] Avg episode reward: [(0, '4384.989')] [2023-03-08 19:04:56,695][547799] Updated weights for policy 0, policy_version 13840 (0.0005) [2023-03-08 19:04:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10108.1). Total num frames: 7114752. Throughput: 0: 10466.0. Samples: 7105344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:04:59,600][547459] Avg episode reward: [(0, '3973.163')] [2023-03-08 19:05:00,723][547799] Updated weights for policy 0, policy_version 13920 (0.0005) [2023-03-08 19:05:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10108.1). Total num frames: 7163904. Throughput: 0: 10426.1. Samples: 7166208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:05:04,600][547459] Avg episode reward: [(0, '4366.421')] [2023-03-08 19:05:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000013992_7163904.pth... [2023-03-08 19:05:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000013384_6852608.pth [2023-03-08 19:05:04,712][547799] Updated weights for policy 0, policy_version 14000 (0.0005) [2023-03-08 19:05:08,735][547799] Updated weights for policy 0, policy_version 14080 (0.0005) [2023-03-08 19:05:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10122.0). Total num frames: 7217152. Throughput: 0: 10378.0. Samples: 7196736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:05:09,600][547459] Avg episode reward: [(0, '4272.331')] [2023-03-08 19:05:12,738][547799] Updated weights for policy 0, policy_version 14160 (0.0005) [2023-03-08 19:05:14,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 7266304. Throughput: 0: 10394.3. Samples: 7258176. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:05:14,600][547459] Avg episode reward: [(0, '4280.610')] [2023-03-08 19:05:16,707][547799] Updated weights for policy 0, policy_version 14240 (0.0005) [2023-03-08 19:05:19,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10149.7). Total num frames: 7319552. Throughput: 0: 10314.1. Samples: 7319616. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:05:19,601][547459] Avg episode reward: [(0, '4590.912')] [2023-03-08 19:05:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000014296_7319552.pth... [2023-03-08 19:05:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000013688_7008256.pth [2023-03-08 19:05:20,636][547799] Updated weights for policy 0, policy_version 14320 (0.0005) [2023-03-08 19:05:24,567][547799] Updated weights for policy 0, policy_version 14400 (0.0005) [2023-03-08 19:05:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10177.5). Total num frames: 7372800. Throughput: 0: 10368.4. Samples: 7352328. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:05:24,601][547459] Avg episode reward: [(0, '4786.075')] [2023-03-08 19:05:28,444][547799] Updated weights for policy 0, policy_version 14480 (0.0005) [2023-03-08 19:05:29,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10177.5). Total num frames: 7421952. Throughput: 0: 10383.2. Samples: 7414364. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:05:29,600][547459] Avg episode reward: [(0, '4668.914')] [2023-03-08 19:05:32,334][547799] Updated weights for policy 0, policy_version 14560 (0.0005) [2023-03-08 19:05:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10205.3). Total num frames: 7475200. Throughput: 0: 10394.8. Samples: 7478632. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:05:34,600][547459] Avg episode reward: [(0, '4558.365')] [2023-03-08 19:05:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000014600_7475200.pth... [2023-03-08 19:05:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000013992_7163904.pth [2023-03-08 19:05:36,257][547799] Updated weights for policy 0, policy_version 14640 (0.0005) [2023-03-08 19:05:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10219.2). Total num frames: 7528448. Throughput: 0: 10393.3. Samples: 7509320. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:05:39,600][547459] Avg episode reward: [(0, '4633.514')] [2023-03-08 19:05:40,009][547799] Updated weights for policy 0, policy_version 14720 (0.0005) [2023-03-08 19:05:43,999][547799] Updated weights for policy 0, policy_version 14800 (0.0005) [2023-03-08 19:05:44,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10219.2). Total num frames: 7581696. Throughput: 0: 10391.3. Samples: 7572952. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:05:44,600][547459] Avg episode reward: [(0, '4792.397')] [2023-03-08 19:05:47,912][547799] Updated weights for policy 0, policy_version 14880 (0.0005) [2023-03-08 19:05:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10233.1). Total num frames: 7634944. Throughput: 0: 10432.2. Samples: 7635656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:05:49,601][547459] Avg episode reward: [(0, '4665.864')] [2023-03-08 19:05:49,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000014912_7634944.pth... [2023-03-08 19:05:49,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000014296_7319552.pth [2023-03-08 19:05:51,673][547799] Updated weights for policy 0, policy_version 14960 (0.0005) [2023-03-08 19:05:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10246.9). Total num frames: 7688192. Throughput: 0: 10487.8. Samples: 7668688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:05:54,600][547459] Avg episode reward: [(0, '4709.524')] [2023-03-08 19:05:55,622][547799] Updated weights for policy 0, policy_version 15040 (0.0005) [2023-03-08 19:05:59,552][547799] Updated weights for policy 0, policy_version 15120 (0.0005) [2023-03-08 19:05:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 7741440. Throughput: 0: 10497.2. Samples: 7730548. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:05:59,600][547459] Avg episode reward: [(0, '4415.797')] [2023-03-08 19:06:03,397][547799] Updated weights for policy 0, policy_version 15200 (0.0005) [2023-03-08 19:06:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10260.8). Total num frames: 7794688. Throughput: 0: 10557.2. Samples: 7794688. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:06:04,600][547459] Avg episode reward: [(0, '4800.404')] [2023-03-08 19:06:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000015224_7794688.pth... [2023-03-08 19:06:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000014600_7475200.pth [2023-03-08 19:06:07,247][547799] Updated weights for policy 0, policy_version 15280 (0.0005) [2023-03-08 19:06:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10274.7). Total num frames: 7847936. Throughput: 0: 10532.6. Samples: 7826296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:09,601][547459] Avg episode reward: [(0, '4764.975')] [2023-03-08 19:06:11,104][547799] Updated weights for policy 0, policy_version 15360 (0.0005) [2023-03-08 19:06:14,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10288.6). Total num frames: 7901184. Throughput: 0: 10557.1. Samples: 7889432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:14,600][547459] Avg episode reward: [(0, '4809.781')] [2023-03-08 19:06:14,600][547754] Saving new best policy, reward=4809.781! [2023-03-08 19:06:14,941][547799] Updated weights for policy 0, policy_version 15440 (0.0005) [2023-03-08 19:06:18,905][547799] Updated weights for policy 0, policy_version 15520 (0.0005) [2023-03-08 19:06:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10288.6). Total num frames: 7950336. Throughput: 0: 10546.6. Samples: 7953228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:19,601][547459] Avg episode reward: [(0, '4675.470')] [2023-03-08 19:06:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000015528_7950336.pth... [2023-03-08 19:06:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000014912_7634944.pth [2023-03-08 19:06:22,732][547799] Updated weights for policy 0, policy_version 15600 (0.0006) [2023-03-08 19:06:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10302.5). Total num frames: 8003584. Throughput: 0: 10569.8. Samples: 7984960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:24,600][547459] Avg episode reward: [(0, '4777.931')] [2023-03-08 19:06:26,518][547799] Updated weights for policy 0, policy_version 15680 (0.0005) [2023-03-08 19:06:29,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10316.4). Total num frames: 8056832. Throughput: 0: 10572.4. Samples: 8048712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:29,600][547459] Avg episode reward: [(0, '4762.336')] [2023-03-08 19:06:30,483][547799] Updated weights for policy 0, policy_version 15760 (0.0005) [2023-03-08 19:06:34,298][547799] Updated weights for policy 0, policy_version 15840 (0.0005) [2023-03-08 19:06:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10330.3). Total num frames: 8110080. Throughput: 0: 10592.2. Samples: 8112304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:34,600][547459] Avg episode reward: [(0, '4700.757')] [2023-03-08 19:06:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000015840_8110080.pth... [2023-03-08 19:06:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000015224_7794688.pth [2023-03-08 19:06:38,207][547799] Updated weights for policy 0, policy_version 15920 (0.0005) [2023-03-08 19:06:39,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.4, 300 sec: 10330.3). Total num frames: 8163328. Throughput: 0: 10551.4. Samples: 8143500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:39,600][547459] Avg episode reward: [(0, '4685.219')] [2023-03-08 19:06:42,088][547799] Updated weights for policy 0, policy_version 16000 (0.0005) [2023-03-08 19:06:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10344.1). Total num frames: 8216576. Throughput: 0: 10608.6. Samples: 8207936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:44,600][547459] Avg episode reward: [(0, '4639.953')] [2023-03-08 19:06:45,900][547799] Updated weights for policy 0, policy_version 16080 (0.0005) [2023-03-08 19:06:49,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10581.3, 300 sec: 10344.1). Total num frames: 8269824. Throughput: 0: 10609.8. Samples: 8272132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:49,601][547459] Avg episode reward: [(0, '4831.621')] [2023-03-08 19:06:49,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000016152_8269824.pth... [2023-03-08 19:06:49,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000015528_7950336.pth [2023-03-08 19:06:49,608][547754] Saving new best policy, reward=4831.621! [2023-03-08 19:06:49,739][547799] Updated weights for policy 0, policy_version 16160 (0.0005) [2023-03-08 19:06:53,468][547799] Updated weights for policy 0, policy_version 16240 (0.0006) [2023-03-08 19:06:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.4, 300 sec: 10358.0). Total num frames: 8323072. Throughput: 0: 10638.4. Samples: 8305020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:06:54,600][547459] Avg episode reward: [(0, '4787.457')] [2023-03-08 19:06:57,280][547799] Updated weights for policy 0, policy_version 16320 (0.0005) [2023-03-08 19:06:59,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10371.9). Total num frames: 8380416. Throughput: 0: 10671.3. Samples: 8369644. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:06:59,600][547459] Avg episode reward: [(0, '4726.207')] [2023-03-08 19:07:01,046][547799] Updated weights for policy 0, policy_version 16400 (0.0005) [2023-03-08 19:07:04,600][547459] Fps is (10 sec: 11059.0, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 8433664. Throughput: 0: 10711.7. Samples: 8435256. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:07:04,600][547459] Avg episode reward: [(0, '4775.872')] [2023-03-08 19:07:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000016472_8433664.pth... [2023-03-08 19:07:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000015840_8110080.pth [2023-03-08 19:07:04,778][547799] Updated weights for policy 0, policy_version 16480 (0.0005) [2023-03-08 19:07:08,592][547799] Updated weights for policy 0, policy_version 16560 (0.0006) [2023-03-08 19:07:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 8486912. Throughput: 0: 10711.6. Samples: 8466984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:07:09,600][547459] Avg episode reward: [(0, '4779.916')] [2023-03-08 19:07:12,383][547799] Updated weights for policy 0, policy_version 16640 (0.0005) [2023-03-08 19:07:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 8540160. Throughput: 0: 10739.2. Samples: 8531976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:07:14,600][547459] Avg episode reward: [(0, '4784.392')] [2023-03-08 19:07:16,336][547799] Updated weights for policy 0, policy_version 16720 (0.0005) [2023-03-08 19:07:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10399.7). Total num frames: 8593408. Throughput: 0: 10696.9. Samples: 8593664. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:07:19,601][547459] Avg episode reward: [(0, '4739.512')] [2023-03-08 19:07:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000016784_8593408.pth... [2023-03-08 19:07:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000016152_8269824.pth [2023-03-08 19:07:20,257][547799] Updated weights for policy 0, policy_version 16800 (0.0005) [2023-03-08 19:07:24,009][547799] Updated weights for policy 0, policy_version 16880 (0.0005) [2023-03-08 19:07:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10399.7). Total num frames: 8646656. Throughput: 0: 10727.5. Samples: 8626240. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:07:24,600][547459] Avg episode reward: [(0, '4821.079')] [2023-03-08 19:07:27,804][547799] Updated weights for policy 0, policy_version 16960 (0.0005) [2023-03-08 19:07:29,599][547459] Fps is (10 sec: 10649.8, 60 sec: 10717.9, 300 sec: 10413.6). Total num frames: 8699904. Throughput: 0: 10750.6. Samples: 8691712. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:07:29,600][547459] Avg episode reward: [(0, '4467.820')] [2023-03-08 19:07:31,632][547799] Updated weights for policy 0, policy_version 17040 (0.0005) [2023-03-08 19:07:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10427.4). Total num frames: 8753152. Throughput: 0: 10757.2. Samples: 8756204. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:07:34,600][547459] Avg episode reward: [(0, '4775.774')] [2023-03-08 19:07:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000017096_8753152.pth... [2023-03-08 19:07:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000016472_8433664.pth [2023-03-08 19:07:35,504][547799] Updated weights for policy 0, policy_version 17120 (0.0005) [2023-03-08 19:07:39,255][547799] Updated weights for policy 0, policy_version 17200 (0.0005) [2023-03-08 19:07:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.8, 300 sec: 10441.3). Total num frames: 8806400. Throughput: 0: 10702.5. Samples: 8786636. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:07:39,600][547459] Avg episode reward: [(0, '4729.908')] [2023-03-08 19:07:43,152][547799] Updated weights for policy 0, policy_version 17280 (0.0005) [2023-03-08 19:07:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10455.2). Total num frames: 8859648. Throughput: 0: 10707.2. Samples: 8851468. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:07:44,600][547459] Avg episode reward: [(0, '4588.999')] [2023-03-08 19:07:46,950][547799] Updated weights for policy 0, policy_version 17360 (0.0005) [2023-03-08 19:07:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10455.2). Total num frames: 8912896. Throughput: 0: 10677.2. Samples: 8915728. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:07:49,600][547459] Avg episode reward: [(0, '4169.137')] [2023-03-08 19:07:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000017408_8912896.pth... [2023-03-08 19:07:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000016784_8593408.pth [2023-03-08 19:07:50,850][547799] Updated weights for policy 0, policy_version 17440 (0.0005) [2023-03-08 19:07:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.8, 300 sec: 10469.1). Total num frames: 8966144. Throughput: 0: 10660.1. Samples: 8946688. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:07:54,600][547459] Avg episode reward: [(0, '4203.754')] [2023-03-08 19:07:54,680][547799] Updated weights for policy 0, policy_version 17520 (0.0005) [2023-03-08 19:07:58,664][547799] Updated weights for policy 0, policy_version 17600 (0.0005) [2023-03-08 19:07:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 9019392. Throughput: 0: 10626.6. Samples: 9010172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:07:59,600][547459] Avg episode reward: [(0, '4687.204')] [2023-03-08 19:08:02,560][547799] Updated weights for policy 0, policy_version 17680 (0.0005) [2023-03-08 19:08:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 9072640. Throughput: 0: 10650.5. Samples: 9072936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:08:04,600][547459] Avg episode reward: [(0, '4666.264')] [2023-03-08 19:08:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000017720_9072640.pth... [2023-03-08 19:08:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000017096_8753152.pth [2023-03-08 19:08:06,328][547799] Updated weights for policy 0, policy_version 17760 (0.0005) [2023-03-08 19:08:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 9125888. Throughput: 0: 10657.1. Samples: 9105808. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:08:09,600][547459] Avg episode reward: [(0, '4780.094')] [2023-03-08 19:08:10,107][547799] Updated weights for policy 0, policy_version 17840 (0.0005) [2023-03-08 19:08:13,930][547799] Updated weights for policy 0, policy_version 17920 (0.0005) [2023-03-08 19:08:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 9179136. Throughput: 0: 10649.6. Samples: 9170944. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:08:14,600][547459] Avg episode reward: [(0, '4728.622')] [2023-03-08 19:08:17,665][547799] Updated weights for policy 0, policy_version 18000 (0.0005) [2023-03-08 19:08:19,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 9236480. Throughput: 0: 10673.0. Samples: 9236488. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:08:19,601][547459] Avg episode reward: [(0, '4751.865')] [2023-03-08 19:08:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000018040_9236480.pth... [2023-03-08 19:08:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000017408_8912896.pth [2023-03-08 19:08:21,382][547799] Updated weights for policy 0, policy_version 18080 (0.0005) [2023-03-08 19:08:24,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 9289728. Throughput: 0: 10728.8. Samples: 9269432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:08:24,600][547459] Avg episode reward: [(0, '4784.538')] [2023-03-08 19:08:25,166][547799] Updated weights for policy 0, policy_version 18160 (0.0005) [2023-03-08 19:08:29,037][547799] Updated weights for policy 0, policy_version 18240 (0.0005) [2023-03-08 19:08:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.8, 300 sec: 10538.5). Total num frames: 9342976. Throughput: 0: 10724.7. Samples: 9334080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:08:29,600][547459] Avg episode reward: [(0, '4781.039')] [2023-03-08 19:08:32,852][547799] Updated weights for policy 0, policy_version 18320 (0.0005) [2023-03-08 19:08:34,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10552.4). Total num frames: 9396224. Throughput: 0: 10730.3. Samples: 9398592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:08:34,600][547459] Avg episode reward: [(0, '4644.515')] [2023-03-08 19:08:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000018352_9396224.pth... [2023-03-08 19:08:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000017720_9072640.pth [2023-03-08 19:08:36,660][547799] Updated weights for policy 0, policy_version 18400 (0.0005) [2023-03-08 19:08:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 9449472. Throughput: 0: 10719.5. Samples: 9429064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:08:39,600][547459] Avg episode reward: [(0, '4770.239')] [2023-03-08 19:08:40,575][547799] Updated weights for policy 0, policy_version 18480 (0.0005) [2023-03-08 19:08:44,266][547799] Updated weights for policy 0, policy_version 18560 (0.0005) [2023-03-08 19:08:44,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 9502720. Throughput: 0: 10763.7. Samples: 9494540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:08:44,600][547459] Avg episode reward: [(0, '4451.802')] [2023-03-08 19:08:48,092][547799] Updated weights for policy 0, policy_version 18640 (0.0006) [2023-03-08 19:08:49,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 9555968. Throughput: 0: 10806.2. Samples: 9559212. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:08:49,600][547459] Avg episode reward: [(0, '4745.486')] [2023-03-08 19:08:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000018664_9555968.pth... [2023-03-08 19:08:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000018040_9236480.pth [2023-03-08 19:08:51,957][547799] Updated weights for policy 0, policy_version 18720 (0.0005) [2023-03-08 19:08:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 9609216. Throughput: 0: 10769.2. Samples: 9590424. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:08:54,600][547459] Avg episode reward: [(0, '4720.743')] [2023-03-08 19:08:55,738][547799] Updated weights for policy 0, policy_version 18800 (0.0005) [2023-03-08 19:08:59,556][547799] Updated weights for policy 0, policy_version 18880 (0.0005) [2023-03-08 19:08:59,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 9666560. Throughput: 0: 10765.3. Samples: 9655384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:08:59,600][547459] Avg episode reward: [(0, '4767.199')] [2023-03-08 19:09:03,398][547799] Updated weights for policy 0, policy_version 18960 (0.0005) [2023-03-08 19:09:04,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 9719808. Throughput: 0: 10741.9. Samples: 9719872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:09:04,600][547459] Avg episode reward: [(0, '4712.021')] [2023-03-08 19:09:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000018984_9719808.pth... [2023-03-08 19:09:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000018352_9396224.pth [2023-03-08 19:09:07,188][547799] Updated weights for policy 0, policy_version 19040 (0.0005) [2023-03-08 19:09:09,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 9773056. Throughput: 0: 10736.7. Samples: 9752584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:09:09,600][547459] Avg episode reward: [(0, '4624.377')] [2023-03-08 19:09:10,960][547799] Updated weights for policy 0, policy_version 19120 (0.0004) [2023-03-08 19:09:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10607.9). Total num frames: 9826304. Throughput: 0: 10702.6. Samples: 9815696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:09:14,600][547459] Avg episode reward: [(0, '4799.247')] [2023-03-08 19:09:14,885][547799] Updated weights for policy 0, policy_version 19200 (0.0005) [2023-03-08 19:09:18,788][547799] Updated weights for policy 0, policy_version 19280 (0.0005) [2023-03-08 19:09:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 9879552. Throughput: 0: 10687.8. Samples: 9879544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:09:19,600][547459] Avg episode reward: [(0, '4718.652')] [2023-03-08 19:09:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000019296_9879552.pth... [2023-03-08 19:09:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000018664_9555968.pth [2023-03-08 19:09:22,683][547799] Updated weights for policy 0, policy_version 19360 (0.0005) [2023-03-08 19:09:24,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10717.8, 300 sec: 10621.8). Total num frames: 9932800. Throughput: 0: 10714.4. Samples: 9911212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:09:24,601][547459] Avg episode reward: [(0, '4691.991')] [2023-03-08 19:09:26,591][547799] Updated weights for policy 0, policy_version 19440 (0.0005) [2023-03-08 19:09:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 9981952. Throughput: 0: 10648.3. Samples: 9973716. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:09:29,601][547459] Avg episode reward: [(0, '4335.280')] [2023-03-08 19:09:30,527][547799] Updated weights for policy 0, policy_version 19520 (0.0005) [2023-03-08 19:09:34,561][547799] Updated weights for policy 0, policy_version 19600 (0.0006) [2023-03-08 19:09:34,599][547459] Fps is (10 sec: 10240.2, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 10035200. Throughput: 0: 10577.5. Samples: 10035200. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:09:34,600][547459] Avg episode reward: [(0, '2964.386')] [2023-03-08 19:09:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000019600_10035200.pth... [2023-03-08 19:09:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000018984_9719808.pth [2023-03-08 19:09:38,726][547799] Updated weights for policy 0, policy_version 19680 (0.0005) [2023-03-08 19:09:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 10084352. Throughput: 0: 10522.7. Samples: 10063944. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:09:39,600][547459] Avg episode reward: [(0, '2454.747')] [2023-03-08 19:09:42,704][547799] Updated weights for policy 0, policy_version 19760 (0.0005) [2023-03-08 19:09:44,600][547459] Fps is (10 sec: 9830.3, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 10133504. Throughput: 0: 10446.8. Samples: 10125488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:09:44,600][547459] Avg episode reward: [(0, '2279.733')] [2023-03-08 19:09:46,709][547799] Updated weights for policy 0, policy_version 19840 (0.0005) [2023-03-08 19:09:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 10186752. Throughput: 0: 10375.1. Samples: 10186752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:09:49,600][547459] Avg episode reward: [(0, '2397.069')] [2023-03-08 19:09:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000019896_10186752.pth... [2023-03-08 19:09:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000019296_9879552.pth [2023-03-08 19:09:50,826][547799] Updated weights for policy 0, policy_version 19920 (0.0005) [2023-03-08 19:09:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10580.2). Total num frames: 10235904. Throughput: 0: 10298.7. Samples: 10216024. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:09:54,600][547459] Avg episode reward: [(0, '1984.669')] [2023-03-08 19:09:54,871][547799] Updated weights for policy 0, policy_version 20000 (0.0005) [2023-03-08 19:09:58,982][547799] Updated weights for policy 0, policy_version 20080 (0.0005) [2023-03-08 19:09:59,599][547459] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10580.2). Total num frames: 10285056. Throughput: 0: 10248.2. Samples: 10276864. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:09:59,600][547459] Avg episode reward: [(0, '2743.966')] [2023-03-08 19:10:02,961][547799] Updated weights for policy 0, policy_version 20160 (0.0006) [2023-03-08 19:10:04,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10566.3). Total num frames: 10334208. Throughput: 0: 10189.9. Samples: 10338088. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:10:04,600][547459] Avg episode reward: [(0, '2536.223')] [2023-03-08 19:10:04,625][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000020192_10338304.pth... [2023-03-08 19:10:04,627][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000019600_10035200.pth [2023-03-08 19:10:07,082][547799] Updated weights for policy 0, policy_version 20240 (0.0005) [2023-03-08 19:10:09,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10580.2). Total num frames: 10387456. Throughput: 0: 10137.4. Samples: 10367396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:10:09,600][547459] Avg episode reward: [(0, '2157.947')] [2023-03-08 19:10:11,005][547799] Updated weights for policy 0, policy_version 20320 (0.0005) [2023-03-08 19:10:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10580.2). Total num frames: 10440704. Throughput: 0: 10148.0. Samples: 10430376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:10:14,601][547459] Avg episode reward: [(0, '2515.808')] [2023-03-08 19:10:14,864][547799] Updated weights for policy 0, policy_version 20400 (0.0005) [2023-03-08 19:10:18,742][547799] Updated weights for policy 0, policy_version 20480 (0.0005) [2023-03-08 19:10:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10580.2). Total num frames: 10493952. Throughput: 0: 10195.9. Samples: 10494016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:10:19,600][547459] Avg episode reward: [(0, '2875.712')] [2023-03-08 19:10:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000020496_10493952.pth... [2023-03-08 19:10:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000019896_10186752.pth [2023-03-08 19:10:22,666][547799] Updated weights for policy 0, policy_version 20560 (0.0005) [2023-03-08 19:10:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10594.1). Total num frames: 10547200. Throughput: 0: 10276.2. Samples: 10526372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:10:24,600][547459] Avg episode reward: [(0, '2897.582')] [2023-03-08 19:10:26,528][547799] Updated weights for policy 0, policy_version 20640 (0.0005) [2023-03-08 19:10:29,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10594.1). Total num frames: 10600448. Throughput: 0: 10297.3. Samples: 10588868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:10:29,601][547459] Avg episode reward: [(0, '2878.933')] [2023-03-08 19:10:30,347][547799] Updated weights for policy 0, policy_version 20720 (0.0005) [2023-03-08 19:10:34,311][547799] Updated weights for policy 0, policy_version 20800 (0.0006) [2023-03-08 19:10:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10580.2). Total num frames: 10649600. Throughput: 0: 10338.7. Samples: 10651996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:10:34,600][547459] Avg episode reward: [(0, '3066.600')] [2023-03-08 19:10:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000020800_10649600.pth... [2023-03-08 19:10:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000020192_10338304.pth [2023-03-08 19:10:38,284][547799] Updated weights for policy 0, policy_version 20880 (0.0005) [2023-03-08 19:10:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10580.2). Total num frames: 10702848. Throughput: 0: 10370.3. Samples: 10682688. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:10:39,601][547459] Avg episode reward: [(0, '3047.926')] [2023-03-08 19:10:42,104][547799] Updated weights for policy 0, policy_version 20960 (0.0005) [2023-03-08 19:10:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10580.2). Total num frames: 10756096. Throughput: 0: 10467.5. Samples: 10747904. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:10:44,600][547459] Avg episode reward: [(0, '3569.100')] [2023-03-08 19:10:45,839][547799] Updated weights for policy 0, policy_version 21040 (0.0005) [2023-03-08 19:10:49,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10580.2). Total num frames: 10809344. Throughput: 0: 10518.8. Samples: 10811432. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:10:49,600][547459] Avg episode reward: [(0, '2981.906')] [2023-03-08 19:10:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000021112_10809344.pth... [2023-03-08 19:10:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000020496_10493952.pth [2023-03-08 19:10:49,740][547799] Updated weights for policy 0, policy_version 21120 (0.0006) [2023-03-08 19:10:53,494][547799] Updated weights for policy 0, policy_version 21200 (0.0005) [2023-03-08 19:10:54,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10580.2). Total num frames: 10862592. Throughput: 0: 10586.2. Samples: 10843776. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:10:54,600][547459] Avg episode reward: [(0, '2917.725')] [2023-03-08 19:10:57,414][547799] Updated weights for policy 0, policy_version 21280 (0.0005) [2023-03-08 19:10:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10580.2). Total num frames: 10915840. Throughput: 0: 10605.8. Samples: 10907636. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:10:59,600][547459] Avg episode reward: [(0, '2581.113')] [2023-03-08 19:11:01,297][547799] Updated weights for policy 0, policy_version 21360 (0.0005) [2023-03-08 19:11:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 10969088. Throughput: 0: 10558.6. Samples: 10969152. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:11:04,600][547459] Avg episode reward: [(0, '2756.149')] [2023-03-08 19:11:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000021424_10969088.pth... [2023-03-08 19:11:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000020800_10649600.pth [2023-03-08 19:11:05,315][547799] Updated weights for policy 0, policy_version 21440 (0.0005) [2023-03-08 19:11:09,198][547799] Updated weights for policy 0, policy_version 21520 (0.0005) [2023-03-08 19:11:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 11022336. Throughput: 0: 10559.6. Samples: 11001552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:09,600][547459] Avg episode reward: [(0, '2366.282')] [2023-03-08 19:11:13,020][547799] Updated weights for policy 0, policy_version 21600 (0.0005) [2023-03-08 19:11:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 11075584. Throughput: 0: 10570.7. Samples: 11064548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:14,600][547459] Avg episode reward: [(0, '3107.842')] [2023-03-08 19:11:16,841][547799] Updated weights for policy 0, policy_version 21680 (0.0005) [2023-03-08 19:11:19,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 11128832. Throughput: 0: 10598.0. Samples: 11128904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:19,600][547459] Avg episode reward: [(0, '3239.763')] [2023-03-08 19:11:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000021736_11128832.pth... [2023-03-08 19:11:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000021112_10809344.pth [2023-03-08 19:11:20,677][547799] Updated weights for policy 0, policy_version 21760 (0.0005) [2023-03-08 19:11:24,503][547799] Updated weights for policy 0, policy_version 21840 (0.0005) [2023-03-08 19:11:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 11182080. Throughput: 0: 10641.8. Samples: 11161568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:24,600][547459] Avg episode reward: [(0, '2811.116')] [2023-03-08 19:11:28,507][547799] Updated weights for policy 0, policy_version 21920 (0.0005) [2023-03-08 19:11:29,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10580.2). Total num frames: 11231232. Throughput: 0: 10560.0. Samples: 11223104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:29,600][547459] Avg episode reward: [(0, '3029.905')] [2023-03-08 19:11:32,246][547799] Updated weights for policy 0, policy_version 22000 (0.0005) [2023-03-08 19:11:34,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.4, 300 sec: 10580.2). Total num frames: 11284480. Throughput: 0: 10590.1. Samples: 11287988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:34,600][547459] Avg episode reward: [(0, '3080.842')] [2023-03-08 19:11:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000022048_11288576.pth... [2023-03-08 19:11:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000021424_10969088.pth [2023-03-08 19:11:36,135][547799] Updated weights for policy 0, policy_version 22080 (0.0005) [2023-03-08 19:11:39,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 11337728. Throughput: 0: 10602.4. Samples: 11320884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:39,600][547459] Avg episode reward: [(0, '3443.272')] [2023-03-08 19:11:40,072][547799] Updated weights for policy 0, policy_version 22160 (0.0005) [2023-03-08 19:11:43,930][547799] Updated weights for policy 0, policy_version 22240 (0.0005) [2023-03-08 19:11:44,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 11390976. Throughput: 0: 10559.0. Samples: 11382792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:44,600][547459] Avg episode reward: [(0, '3103.902')] [2023-03-08 19:11:47,801][547799] Updated weights for policy 0, policy_version 22320 (0.0005) [2023-03-08 19:11:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 11444224. Throughput: 0: 10587.4. Samples: 11445584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:49,601][547459] Avg episode reward: [(0, '3359.089')] [2023-03-08 19:11:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000022352_11444224.pth... [2023-03-08 19:11:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000021736_11128832.pth [2023-03-08 19:11:51,764][547799] Updated weights for policy 0, policy_version 22400 (0.0006) [2023-03-08 19:11:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 11497472. Throughput: 0: 10565.3. Samples: 11476992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:54,600][547459] Avg episode reward: [(0, '3489.976')] [2023-03-08 19:11:55,606][547799] Updated weights for policy 0, policy_version 22480 (0.0005) [2023-03-08 19:11:59,562][547799] Updated weights for policy 0, policy_version 22560 (0.0004) [2023-03-08 19:11:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 11550720. Throughput: 0: 10562.3. Samples: 11539852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:11:59,601][547459] Avg episode reward: [(0, '3757.253')] [2023-03-08 19:12:03,513][547799] Updated weights for policy 0, policy_version 22640 (0.0005) [2023-03-08 19:12:04,599][547459] Fps is (10 sec: 10240.2, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 11599872. Throughput: 0: 10542.9. Samples: 11603336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:12:04,600][547459] Avg episode reward: [(0, '3940.260')] [2023-03-08 19:12:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000022656_11599872.pth... [2023-03-08 19:12:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000022048_11288576.pth [2023-03-08 19:12:07,339][547799] Updated weights for policy 0, policy_version 22720 (0.0005) [2023-03-08 19:12:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 11653120. Throughput: 0: 10526.0. Samples: 11635240. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:12:09,600][547459] Avg episode reward: [(0, '3419.885')] [2023-03-08 19:12:11,123][547799] Updated weights for policy 0, policy_version 22800 (0.0005) [2023-03-08 19:12:14,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 11710464. Throughput: 0: 10587.8. Samples: 11699556. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:12:14,600][547459] Avg episode reward: [(0, '3760.480')] [2023-03-08 19:12:14,946][547799] Updated weights for policy 0, policy_version 22880 (0.0005) [2023-03-08 19:12:18,915][547799] Updated weights for policy 0, policy_version 22960 (0.0005) [2023-03-08 19:12:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10552.4). Total num frames: 11759616. Throughput: 0: 10545.1. Samples: 11762520. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:12:19,600][547459] Avg episode reward: [(0, '3346.888')] [2023-03-08 19:12:19,660][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000022976_11763712.pth... [2023-03-08 19:12:19,662][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000022352_11444224.pth [2023-03-08 19:12:22,872][547799] Updated weights for policy 0, policy_version 23040 (0.0005) [2023-03-08 19:12:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 11812864. Throughput: 0: 10486.8. Samples: 11792792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:12:24,601][547459] Avg episode reward: [(0, '3604.705')] [2023-03-08 19:12:26,869][547799] Updated weights for policy 0, policy_version 23120 (0.0005) [2023-03-08 19:12:29,600][547459] Fps is (10 sec: 10240.2, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 11862016. Throughput: 0: 10486.9. Samples: 11854704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:12:29,600][547459] Avg episode reward: [(0, '3575.881')] [2023-03-08 19:12:30,796][547799] Updated weights for policy 0, policy_version 23200 (0.0005) [2023-03-08 19:12:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 11915264. Throughput: 0: 10460.5. Samples: 11916304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:12:34,600][547459] Avg episode reward: [(0, '3592.288')] [2023-03-08 19:12:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000023272_11915264.pth... [2023-03-08 19:12:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000022656_11599872.pth [2023-03-08 19:12:34,765][547799] Updated weights for policy 0, policy_version 23280 (0.0005) [2023-03-08 19:12:38,695][547799] Updated weights for policy 0, policy_version 23360 (0.0005) [2023-03-08 19:12:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 11968512. Throughput: 0: 10467.6. Samples: 11948032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:12:39,601][547459] Avg episode reward: [(0, '3464.618')] [2023-03-08 19:12:42,646][547799] Updated weights for policy 0, policy_version 23440 (0.0005) [2023-03-08 19:12:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 12021760. Throughput: 0: 10451.4. Samples: 12010164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:12:44,600][547459] Avg episode reward: [(0, '3842.995')] [2023-03-08 19:12:46,580][547799] Updated weights for policy 0, policy_version 23520 (0.0005) [2023-03-08 19:12:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 12070912. Throughput: 0: 10449.3. Samples: 12073556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:12:49,600][547459] Avg episode reward: [(0, '3691.154')] [2023-03-08 19:12:49,663][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000023584_12075008.pth... [2023-03-08 19:12:49,665][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000022976_11763712.pth [2023-03-08 19:12:50,519][547799] Updated weights for policy 0, policy_version 23600 (0.0005) [2023-03-08 19:12:54,334][547799] Updated weights for policy 0, policy_version 23680 (0.0005) [2023-03-08 19:12:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 12124160. Throughput: 0: 10435.0. Samples: 12104816. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:12:54,600][547459] Avg episode reward: [(0, '4039.290')] [2023-03-08 19:12:58,200][547799] Updated weights for policy 0, policy_version 23760 (0.0005) [2023-03-08 19:12:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 12177408. Throughput: 0: 10437.5. Samples: 12169244. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:12:59,600][547459] Avg episode reward: [(0, '4324.906')] [2023-03-08 19:13:02,009][547799] Updated weights for policy 0, policy_version 23840 (0.0005) [2023-03-08 19:13:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10524.6). Total num frames: 12230656. Throughput: 0: 10459.5. Samples: 12233196. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:13:04,600][547459] Avg episode reward: [(0, '3893.073')] [2023-03-08 19:13:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000023888_12230656.pth... [2023-03-08 19:13:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000023272_11915264.pth [2023-03-08 19:13:05,845][547799] Updated weights for policy 0, policy_version 23920 (0.0006) [2023-03-08 19:13:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 12283904. Throughput: 0: 10509.6. Samples: 12265724. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:13:09,600][547459] Avg episode reward: [(0, '4531.813')] [2023-03-08 19:13:09,663][547799] Updated weights for policy 0, policy_version 24000 (0.0005) [2023-03-08 19:13:13,530][547799] Updated weights for policy 0, policy_version 24080 (0.0005) [2023-03-08 19:13:14,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 12337152. Throughput: 0: 10539.4. Samples: 12328976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:14,600][547459] Avg episode reward: [(0, '4274.510')] [2023-03-08 19:13:17,444][547799] Updated weights for policy 0, policy_version 24160 (0.0005) [2023-03-08 19:13:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10510.7). Total num frames: 12390400. Throughput: 0: 10565.0. Samples: 12391732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:19,601][547459] Avg episode reward: [(0, '4197.488')] [2023-03-08 19:13:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000024200_12390400.pth... [2023-03-08 19:13:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000023584_12075008.pth [2023-03-08 19:13:21,343][547799] Updated weights for policy 0, policy_version 24240 (0.0005) [2023-03-08 19:13:24,600][547459] Fps is (10 sec: 10648.7, 60 sec: 10512.9, 300 sec: 10510.7). Total num frames: 12443648. Throughput: 0: 10560.6. Samples: 12423268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:24,601][547459] Avg episode reward: [(0, '4369.947')] [2023-03-08 19:13:25,191][547799] Updated weights for policy 0, policy_version 24320 (0.0005) [2023-03-08 19:13:28,975][547799] Updated weights for policy 0, policy_version 24400 (0.0005) [2023-03-08 19:13:29,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 12496896. Throughput: 0: 10631.1. Samples: 12488564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:29,600][547459] Avg episode reward: [(0, '4126.043')] [2023-03-08 19:13:32,861][547799] Updated weights for policy 0, policy_version 24480 (0.0005) [2023-03-08 19:13:34,600][547459] Fps is (10 sec: 10650.5, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 12550144. Throughput: 0: 10632.8. Samples: 12552032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:34,600][547459] Avg episode reward: [(0, '4248.829')] [2023-03-08 19:13:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000024512_12550144.pth... [2023-03-08 19:13:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000023888_12230656.pth [2023-03-08 19:13:36,675][547799] Updated weights for policy 0, policy_version 24560 (0.0005) [2023-03-08 19:13:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 12603392. Throughput: 0: 10625.8. Samples: 12582976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:39,600][547459] Avg episode reward: [(0, '3867.775')] [2023-03-08 19:13:40,656][547799] Updated weights for policy 0, policy_version 24640 (0.0005) [2023-03-08 19:13:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 12652544. Throughput: 0: 10571.2. Samples: 12644948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:44,600][547459] Avg episode reward: [(0, '4319.259')] [2023-03-08 19:13:44,622][547799] Updated weights for policy 0, policy_version 24720 (0.0005) [2023-03-08 19:13:48,692][547799] Updated weights for policy 0, policy_version 24800 (0.0005) [2023-03-08 19:13:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 12705792. Throughput: 0: 10502.1. Samples: 12705792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:49,600][547459] Avg episode reward: [(0, '4170.531')] [2023-03-08 19:13:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000024816_12705792.pth... [2023-03-08 19:13:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000024200_12390400.pth [2023-03-08 19:13:52,858][547799] Updated weights for policy 0, policy_version 24880 (0.0005) [2023-03-08 19:13:54,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 12754944. Throughput: 0: 10427.5. Samples: 12734960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:54,600][547459] Avg episode reward: [(0, '4103.687')] [2023-03-08 19:13:56,807][547799] Updated weights for policy 0, policy_version 24960 (0.0006) [2023-03-08 19:13:59,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 12808192. Throughput: 0: 10449.6. Samples: 12799208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:13:59,600][547459] Avg episode reward: [(0, '4262.563')] [2023-03-08 19:14:00,627][547799] Updated weights for policy 0, policy_version 25040 (0.0005) [2023-03-08 19:14:04,514][547799] Updated weights for policy 0, policy_version 25120 (0.0005) [2023-03-08 19:14:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 12861440. Throughput: 0: 10439.4. Samples: 12861504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:14:04,601][547459] Avg episode reward: [(0, '4558.343')] [2023-03-08 19:14:04,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000025120_12861440.pth... [2023-03-08 19:14:04,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000024512_12550144.pth [2023-03-08 19:14:08,457][547799] Updated weights for policy 0, policy_version 25200 (0.0005) [2023-03-08 19:14:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 12910592. Throughput: 0: 10442.0. Samples: 12893152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:14:09,600][547459] Avg episode reward: [(0, '4344.258')] [2023-03-08 19:14:12,416][547799] Updated weights for policy 0, policy_version 25280 (0.0005) [2023-03-08 19:14:14,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 12963840. Throughput: 0: 10360.5. Samples: 12954788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:14:14,600][547459] Avg episode reward: [(0, '4255.015')] [2023-03-08 19:14:16,391][547799] Updated weights for policy 0, policy_version 25360 (0.0005) [2023-03-08 19:14:19,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 13017088. Throughput: 0: 10343.8. Samples: 13017504. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:14:19,600][547459] Avg episode reward: [(0, '3826.697')] [2023-03-08 19:14:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000025424_13017088.pth... [2023-03-08 19:14:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000024816_12705792.pth [2023-03-08 19:14:20,210][547799] Updated weights for policy 0, policy_version 25440 (0.0005) [2023-03-08 19:14:24,169][547799] Updated weights for policy 0, policy_version 25520 (0.0005) [2023-03-08 19:14:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.9, 300 sec: 10469.1). Total num frames: 13070336. Throughput: 0: 10374.9. Samples: 13049848. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:14:24,600][547459] Avg episode reward: [(0, '4087.234')] [2023-03-08 19:14:27,929][547799] Updated weights for policy 0, policy_version 25600 (0.0005) [2023-03-08 19:14:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 13123584. Throughput: 0: 10401.0. Samples: 13112992. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:14:29,600][547459] Avg episode reward: [(0, '4034.349')] [2023-03-08 19:14:31,700][547799] Updated weights for policy 0, policy_version 25680 (0.0005) [2023-03-08 19:14:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 13176832. Throughput: 0: 10505.3. Samples: 13178528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:14:34,600][547459] Avg episode reward: [(0, '3766.067')] [2023-03-08 19:14:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000025736_13176832.pth... [2023-03-08 19:14:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000025120_12861440.pth [2023-03-08 19:14:35,491][547799] Updated weights for policy 0, policy_version 25760 (0.0005) [2023-03-08 19:14:39,368][547799] Updated weights for policy 0, policy_version 25840 (0.0005) [2023-03-08 19:14:39,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 13230080. Throughput: 0: 10560.0. Samples: 13210160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:14:39,600][547459] Avg episode reward: [(0, '4314.938')] [2023-03-08 19:14:43,076][547799] Updated weights for policy 0, policy_version 25920 (0.0005) [2023-03-08 19:14:44,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 13283328. Throughput: 0: 10579.6. Samples: 13275288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:14:44,600][547459] Avg episode reward: [(0, '4254.527')] [2023-03-08 19:14:46,909][547799] Updated weights for policy 0, policy_version 26000 (0.0005) [2023-03-08 19:14:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 13336576. Throughput: 0: 10615.5. Samples: 13339200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:14:49,600][547459] Avg episode reward: [(0, '4118.778')] [2023-03-08 19:14:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000026048_13336576.pth... [2023-03-08 19:14:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000025424_13017088.pth [2023-03-08 19:14:50,860][547799] Updated weights for policy 0, policy_version 26080 (0.0005) [2023-03-08 19:14:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 13389824. Throughput: 0: 10604.1. Samples: 13370336. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:14:54,600][547459] Avg episode reward: [(0, '4259.117')] [2023-03-08 19:14:54,676][547799] Updated weights for policy 0, policy_version 26160 (0.0005) [2023-03-08 19:14:58,543][547799] Updated weights for policy 0, policy_version 26240 (0.0005) [2023-03-08 19:14:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 13443072. Throughput: 0: 10668.9. Samples: 13434888. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:14:59,600][547459] Avg episode reward: [(0, '4330.122')] [2023-03-08 19:15:02,372][547799] Updated weights for policy 0, policy_version 26320 (0.0005) [2023-03-08 19:15:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 13496320. Throughput: 0: 10697.2. Samples: 13498880. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:15:04,600][547459] Avg episode reward: [(0, '4648.153')] [2023-03-08 19:15:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000026360_13496320.pth... [2023-03-08 19:15:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000025736_13176832.pth [2023-03-08 19:15:06,279][547799] Updated weights for policy 0, policy_version 26400 (0.0005) [2023-03-08 19:15:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10538.5). Total num frames: 13549568. Throughput: 0: 10670.6. Samples: 13530024. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:15:09,600][547459] Avg episode reward: [(0, '4704.469')] [2023-03-08 19:15:10,085][547799] Updated weights for policy 0, policy_version 26480 (0.0005) [2023-03-08 19:15:14,044][547799] Updated weights for policy 0, policy_version 26560 (0.0005) [2023-03-08 19:15:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10538.5). Total num frames: 13602816. Throughput: 0: 10688.1. Samples: 13593956. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:15:14,600][547459] Avg episode reward: [(0, '4119.890')] [2023-03-08 19:15:18,049][547799] Updated weights for policy 0, policy_version 26640 (0.0005) [2023-03-08 19:15:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 13651968. Throughput: 0: 10610.8. Samples: 13656012. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:15:19,600][547459] Avg episode reward: [(0, '4281.810')] [2023-03-08 19:15:19,613][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000026672_13656064.pth... [2023-03-08 19:15:19,614][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000026048_13336576.pth [2023-03-08 19:15:22,070][547799] Updated weights for policy 0, policy_version 26720 (0.0005) [2023-03-08 19:15:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 13705216. Throughput: 0: 10547.5. Samples: 13684800. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:15:24,601][547459] Avg episode reward: [(0, '4473.604')] [2023-03-08 19:15:26,125][547799] Updated weights for policy 0, policy_version 26800 (0.0005) [2023-03-08 19:15:29,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 13758464. Throughput: 0: 10500.9. Samples: 13747828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:15:29,600][547459] Avg episode reward: [(0, '4698.088')] [2023-03-08 19:15:30,009][547799] Updated weights for policy 0, policy_version 26880 (0.0005) [2023-03-08 19:15:33,980][547799] Updated weights for policy 0, policy_version 26960 (0.0005) [2023-03-08 19:15:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 13807616. Throughput: 0: 10452.7. Samples: 13809572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:15:34,600][547459] Avg episode reward: [(0, '4639.617')] [2023-03-08 19:15:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000026968_13807616.pth... [2023-03-08 19:15:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000026360_13496320.pth [2023-03-08 19:15:37,772][547799] Updated weights for policy 0, policy_version 27040 (0.0005) [2023-03-08 19:15:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 13860864. Throughput: 0: 10488.0. Samples: 13842296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:15:39,600][547459] Avg episode reward: [(0, '4663.395')] [2023-03-08 19:15:41,716][547799] Updated weights for policy 0, policy_version 27120 (0.0005) [2023-03-08 19:15:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 13914112. Throughput: 0: 10421.8. Samples: 13903868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:15:44,601][547459] Avg episode reward: [(0, '4446.786')] [2023-03-08 19:15:45,774][547799] Updated weights for policy 0, policy_version 27200 (0.0005) [2023-03-08 19:15:49,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 13963264. Throughput: 0: 10356.7. Samples: 13964932. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:15:49,600][547459] Avg episode reward: [(0, '4719.907')] [2023-03-08 19:15:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000027272_13963264.pth... [2023-03-08 19:15:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000026672_13656064.pth [2023-03-08 19:15:49,734][547799] Updated weights for policy 0, policy_version 27280 (0.0005) [2023-03-08 19:15:53,865][547799] Updated weights for policy 0, policy_version 27360 (0.0005) [2023-03-08 19:15:54,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10496.9). Total num frames: 14012416. Throughput: 0: 10345.8. Samples: 13995584. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:15:54,600][547459] Avg episode reward: [(0, '4675.672')] [2023-03-08 19:15:57,811][547799] Updated weights for policy 0, policy_version 27440 (0.0005) [2023-03-08 19:15:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10496.9). Total num frames: 14065664. Throughput: 0: 10296.4. Samples: 14057296. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:15:59,601][547459] Avg episode reward: [(0, '4288.426')] [2023-03-08 19:16:01,805][547799] Updated weights for policy 0, policy_version 27520 (0.0005) [2023-03-08 19:16:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10483.0). Total num frames: 14114816. Throughput: 0: 10267.6. Samples: 14118052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:16:04,600][547459] Avg episode reward: [(0, '4175.226')] [2023-03-08 19:16:04,656][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000027576_14118912.pth... [2023-03-08 19:16:04,657][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000026968_13807616.pth [2023-03-08 19:16:05,853][547799] Updated weights for policy 0, policy_version 27600 (0.0005) [2023-03-08 19:16:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10483.0). Total num frames: 14168064. Throughput: 0: 10285.5. Samples: 14147648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:16:09,601][547459] Avg episode reward: [(0, '4482.014')] [2023-03-08 19:16:09,971][547799] Updated weights for policy 0, policy_version 27680 (0.0005) [2023-03-08 19:16:13,941][547799] Updated weights for policy 0, policy_version 27760 (0.0005) [2023-03-08 19:16:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10469.1). Total num frames: 14217216. Throughput: 0: 10244.2. Samples: 14208816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:16:14,600][547459] Avg episode reward: [(0, '4622.418')] [2023-03-08 19:16:17,856][547799] Updated weights for policy 0, policy_version 27840 (0.0005) [2023-03-08 19:16:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 14270464. Throughput: 0: 10244.4. Samples: 14270568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:16:19,601][547459] Avg episode reward: [(0, '4633.607')] [2023-03-08 19:16:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000027872_14270464.pth... [2023-03-08 19:16:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000027272_13963264.pth [2023-03-08 19:16:21,855][547799] Updated weights for policy 0, policy_version 27920 (0.0005) [2023-03-08 19:16:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10469.1). Total num frames: 14319616. Throughput: 0: 10219.7. Samples: 14302184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:16:24,601][547459] Avg episode reward: [(0, '4819.284')] [2023-03-08 19:16:25,952][547799] Updated weights for policy 0, policy_version 28000 (0.0005) [2023-03-08 19:16:29,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10469.1). Total num frames: 14372864. Throughput: 0: 10194.2. Samples: 14362608. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:16:29,600][547459] Avg episode reward: [(0, '4395.766')] [2023-03-08 19:16:29,965][547799] Updated weights for policy 0, policy_version 28080 (0.0005) [2023-03-08 19:16:34,032][547799] Updated weights for policy 0, policy_version 28160 (0.0006) [2023-03-08 19:16:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10455.2). Total num frames: 14422016. Throughput: 0: 10184.8. Samples: 14423248. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:16:34,600][547459] Avg episode reward: [(0, '4714.794')] [2023-03-08 19:16:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000028168_14422016.pth... [2023-03-08 19:16:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000027576_14118912.pth [2023-03-08 19:16:38,008][547799] Updated weights for policy 0, policy_version 28240 (0.0005) [2023-03-08 19:16:39,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10455.2). Total num frames: 14475264. Throughput: 0: 10199.3. Samples: 14454552. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:16:39,600][547459] Avg episode reward: [(0, '4716.885')] [2023-03-08 19:16:41,908][547799] Updated weights for policy 0, policy_version 28320 (0.0005) [2023-03-08 19:16:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10441.3). Total num frames: 14524416. Throughput: 0: 10200.0. Samples: 14516296. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:16:44,600][547459] Avg episode reward: [(0, '4623.920')] [2023-03-08 19:16:45,854][547799] Updated weights for policy 0, policy_version 28400 (0.0005) [2023-03-08 19:16:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10441.3). Total num frames: 14577664. Throughput: 0: 10215.0. Samples: 14577728. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:16:49,601][547459] Avg episode reward: [(0, '4201.425')] [2023-03-08 19:16:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000028472_14577664.pth... [2023-03-08 19:16:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000027872_14270464.pth [2023-03-08 19:16:49,893][547799] Updated weights for policy 0, policy_version 28480 (0.0005) [2023-03-08 19:16:53,809][547799] Updated weights for policy 0, policy_version 28560 (0.0005) [2023-03-08 19:16:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 14626816. Throughput: 0: 10252.4. Samples: 14609008. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:16:54,600][547459] Avg episode reward: [(0, '3943.324')] [2023-03-08 19:16:57,869][547799] Updated weights for policy 0, policy_version 28640 (0.0005) [2023-03-08 19:16:59,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10441.3). Total num frames: 14680064. Throughput: 0: 10282.6. Samples: 14671532. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:16:59,600][547459] Avg episode reward: [(0, '4303.112')] [2023-03-08 19:17:01,733][547799] Updated weights for policy 0, policy_version 28720 (0.0005) [2023-03-08 19:17:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 14729216. Throughput: 0: 10282.4. Samples: 14733276. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:17:04,600][547459] Avg episode reward: [(0, '4126.996')] [2023-03-08 19:17:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000028776_14733312.pth... [2023-03-08 19:17:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000028168_14422016.pth [2023-03-08 19:17:05,822][547799] Updated weights for policy 0, policy_version 28800 (0.0005) [2023-03-08 19:17:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10413.6). Total num frames: 14782464. Throughput: 0: 10228.4. Samples: 14762464. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:17:09,600][547459] Avg episode reward: [(0, '4205.624')] [2023-03-08 19:17:09,933][547799] Updated weights for policy 0, policy_version 28880 (0.0006) [2023-03-08 19:17:13,988][547799] Updated weights for policy 0, policy_version 28960 (0.0005) [2023-03-08 19:17:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10413.6). Total num frames: 14831616. Throughput: 0: 10221.4. Samples: 14822568. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:17:14,600][547459] Avg episode reward: [(0, '4651.777')] [2023-03-08 19:17:17,928][547799] Updated weights for policy 0, policy_version 29040 (0.0005) [2023-03-08 19:17:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10413.6). Total num frames: 14884864. Throughput: 0: 10264.8. Samples: 14885164. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:17:19,600][547459] Avg episode reward: [(0, '4418.728')] [2023-03-08 19:17:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000029072_14884864.pth... [2023-03-08 19:17:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000028472_14577664.pth [2023-03-08 19:17:21,754][547799] Updated weights for policy 0, policy_version 29120 (0.0005) [2023-03-08 19:17:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 14938112. Throughput: 0: 10289.0. Samples: 14917556. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:17:24,600][547459] Avg episode reward: [(0, '4754.469')] [2023-03-08 19:17:25,736][547799] Updated weights for policy 0, policy_version 29200 (0.0005) [2023-03-08 19:17:29,560][547799] Updated weights for policy 0, policy_version 29280 (0.0005) [2023-03-08 19:17:29,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 14991360. Throughput: 0: 10318.8. Samples: 14980640. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:17:29,600][547459] Avg episode reward: [(0, '4671.288')] [2023-03-08 19:17:33,485][547799] Updated weights for policy 0, policy_version 29360 (0.0005) [2023-03-08 19:17:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10413.6). Total num frames: 15040512. Throughput: 0: 10350.6. Samples: 15043504. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:17:34,600][547459] Avg episode reward: [(0, '3874.433')] [2023-03-08 19:17:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000029376_15040512.pth... [2023-03-08 19:17:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000028776_14733312.pth [2023-03-08 19:17:37,376][547799] Updated weights for policy 0, policy_version 29440 (0.0005) [2023-03-08 19:17:39,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10413.6). Total num frames: 15093760. Throughput: 0: 10350.2. Samples: 15074768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:17:39,600][547459] Avg episode reward: [(0, '4520.788')] [2023-03-08 19:17:41,300][547799] Updated weights for policy 0, policy_version 29520 (0.0005) [2023-03-08 19:17:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 15147008. Throughput: 0: 10358.6. Samples: 15137668. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:17:44,600][547459] Avg episode reward: [(0, '4449.184')] [2023-03-08 19:17:45,267][547799] Updated weights for policy 0, policy_version 29600 (0.0006) [2023-03-08 19:17:49,259][547799] Updated weights for policy 0, policy_version 29680 (0.0005) [2023-03-08 19:17:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10413.6). Total num frames: 15196160. Throughput: 0: 10347.4. Samples: 15198908. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:17:49,600][547459] Avg episode reward: [(0, '4731.138')] [2023-03-08 19:17:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000029680_15196160.pth... [2023-03-08 19:17:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000029072_14884864.pth [2023-03-08 19:17:53,296][547799] Updated weights for policy 0, policy_version 29760 (0.0005) [2023-03-08 19:17:54,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 15249408. Throughput: 0: 10367.3. Samples: 15228992. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:17:54,600][547459] Avg episode reward: [(0, '4664.251')] [2023-03-08 19:17:57,217][547799] Updated weights for policy 0, policy_version 29840 (0.0005) [2023-03-08 19:17:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10399.7). Total num frames: 15298560. Throughput: 0: 10409.4. Samples: 15290988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:17:59,600][547459] Avg episode reward: [(0, '4695.513')] [2023-03-08 19:18:01,295][547799] Updated weights for policy 0, policy_version 29920 (0.0005) [2023-03-08 19:18:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 15351808. Throughput: 0: 10397.3. Samples: 15353044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:04,600][547459] Avg episode reward: [(0, '4795.435')] [2023-03-08 19:18:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000029984_15351808.pth... [2023-03-08 19:18:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000029376_15040512.pth [2023-03-08 19:18:05,164][547799] Updated weights for policy 0, policy_version 30000 (0.0005) [2023-03-08 19:18:09,221][547799] Updated weights for policy 0, policy_version 30080 (0.0005) [2023-03-08 19:18:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 15405056. Throughput: 0: 10378.2. Samples: 15384576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:09,600][547459] Avg episode reward: [(0, '4488.178')] [2023-03-08 19:18:13,239][547799] Updated weights for policy 0, policy_version 30160 (0.0005) [2023-03-08 19:18:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10385.8). Total num frames: 15454208. Throughput: 0: 10329.3. Samples: 15445460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:14,600][547459] Avg episode reward: [(0, '4154.777')] [2023-03-08 19:18:17,317][547799] Updated weights for policy 0, policy_version 30240 (0.0005) [2023-03-08 19:18:19,600][547459] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10371.9). Total num frames: 15503360. Throughput: 0: 10253.8. Samples: 15504924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:19,600][547459] Avg episode reward: [(0, '4428.660')] [2023-03-08 19:18:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000030280_15503360.pth... [2023-03-08 19:18:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000029680_15196160.pth [2023-03-08 19:18:21,410][547799] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-08 19:18:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10371.9). Total num frames: 15556608. Throughput: 0: 10239.8. Samples: 15535560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:24,600][547459] Avg episode reward: [(0, '4186.345')] [2023-03-08 19:18:25,324][547799] Updated weights for policy 0, policy_version 30400 (0.0006) [2023-03-08 19:18:29,384][547799] Updated weights for policy 0, policy_version 30480 (0.0005) [2023-03-08 19:18:29,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 15605760. Throughput: 0: 10220.3. Samples: 15597580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:29,600][547459] Avg episode reward: [(0, '4605.396')] [2023-03-08 19:18:33,371][547799] Updated weights for policy 0, policy_version 30560 (0.0005) [2023-03-08 19:18:34,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 15659008. Throughput: 0: 10224.6. Samples: 15659016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:34,600][547459] Avg episode reward: [(0, '4476.640')] [2023-03-08 19:18:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000030584_15659008.pth... [2023-03-08 19:18:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000029984_15351808.pth [2023-03-08 19:18:37,312][547799] Updated weights for policy 0, policy_version 30640 (0.0006) [2023-03-08 19:18:39,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 15708160. Throughput: 0: 10254.1. Samples: 15690424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:39,600][547459] Avg episode reward: [(0, '4069.590')] [2023-03-08 19:18:41,255][547799] Updated weights for policy 0, policy_version 30720 (0.0005) [2023-03-08 19:18:44,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 15761408. Throughput: 0: 10243.5. Samples: 15751948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:44,600][547459] Avg episode reward: [(0, '3408.819')] [2023-03-08 19:18:45,319][547799] Updated weights for policy 0, policy_version 30800 (0.0005) [2023-03-08 19:18:49,334][547799] Updated weights for policy 0, policy_version 30880 (0.0005) [2023-03-08 19:18:49,600][547459] Fps is (10 sec: 10239.8, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 15810560. Throughput: 0: 10215.7. Samples: 15812752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:49,600][547459] Avg episode reward: [(0, '4388.774')] [2023-03-08 19:18:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000030880_15810560.pth... [2023-03-08 19:18:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000030280_15503360.pth [2023-03-08 19:18:53,219][547799] Updated weights for policy 0, policy_version 30960 (0.0005) [2023-03-08 19:18:54,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10358.0). Total num frames: 15863808. Throughput: 0: 10196.1. Samples: 15843400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:18:54,600][547459] Avg episode reward: [(0, '4585.261')] [2023-03-08 19:18:57,001][547799] Updated weights for policy 0, policy_version 31040 (0.0005) [2023-03-08 19:18:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10358.0). Total num frames: 15917056. Throughput: 0: 10300.0. Samples: 15908960. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:18:59,600][547459] Avg episode reward: [(0, '4569.486')] [2023-03-08 19:19:00,869][547799] Updated weights for policy 0, policy_version 31120 (0.0005) [2023-03-08 19:19:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10371.9). Total num frames: 15970304. Throughput: 0: 10389.2. Samples: 15972436. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:19:04,600][547459] Avg episode reward: [(0, '4784.801')] [2023-03-08 19:19:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000031192_15970304.pth... [2023-03-08 19:19:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000030584_15659008.pth [2023-03-08 19:19:04,712][547799] Updated weights for policy 0, policy_version 31200 (0.0005) [2023-03-08 19:19:08,575][547799] Updated weights for policy 0, policy_version 31280 (0.0005) [2023-03-08 19:19:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10371.9). Total num frames: 16023552. Throughput: 0: 10390.8. Samples: 16003144. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:19:09,601][547459] Avg episode reward: [(0, '4456.364')] [2023-03-08 19:19:12,450][547799] Updated weights for policy 0, policy_version 31360 (0.0005) [2023-03-08 19:19:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10371.9). Total num frames: 16076800. Throughput: 0: 10463.0. Samples: 16068416. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:19:14,601][547459] Avg episode reward: [(0, '4754.961')] [2023-03-08 19:19:16,305][547799] Updated weights for policy 0, policy_version 31440 (0.0005) [2023-03-08 19:19:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 16130048. Throughput: 0: 10483.5. Samples: 16130776. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:19:19,611][547459] Avg episode reward: [(0, '4749.643')] [2023-03-08 19:19:19,616][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000031504_16130048.pth... [2023-03-08 19:19:19,618][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000030880_15810560.pth [2023-03-08 19:19:20,214][547799] Updated weights for policy 0, policy_version 31520 (0.0006) [2023-03-08 19:19:24,024][547799] Updated weights for policy 0, policy_version 31600 (0.0005) [2023-03-08 19:19:24,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10371.9). Total num frames: 16183296. Throughput: 0: 10500.0. Samples: 16162924. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:19:24,610][547459] Avg episode reward: [(0, '4523.933')] [2023-03-08 19:19:27,914][547799] Updated weights for policy 0, policy_version 31680 (0.0006) [2023-03-08 19:19:29,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10371.9). Total num frames: 16236544. Throughput: 0: 10563.5. Samples: 16227304. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:19:29,611][547459] Avg episode reward: [(0, '4546.234')] [2023-03-08 19:19:31,754][547799] Updated weights for policy 0, policy_version 31760 (0.0005) [2023-03-08 19:19:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10358.0). Total num frames: 16285696. Throughput: 0: 10598.7. Samples: 16289692. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:19:34,611][547459] Avg episode reward: [(0, '4647.293')] [2023-03-08 19:19:34,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000031816_16289792.pth... [2023-03-08 19:19:34,615][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000031192_15970304.pth [2023-03-08 19:19:35,818][547799] Updated weights for policy 0, policy_version 31840 (0.0005) [2023-03-08 19:19:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.0, 300 sec: 10358.0). Total num frames: 16338944. Throughput: 0: 10578.5. Samples: 16319432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:19:39,600][547459] Avg episode reward: [(0, '4152.458')] [2023-03-08 19:19:39,738][547799] Updated weights for policy 0, policy_version 31920 (0.0005) [2023-03-08 19:19:43,385][547799] Updated weights for policy 0, policy_version 32000 (0.0005) [2023-03-08 19:19:44,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10371.9). Total num frames: 16396288. Throughput: 0: 10587.7. Samples: 16385408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:19:44,600][547459] Avg episode reward: [(0, '4668.405')] [2023-03-08 19:19:47,245][547799] Updated weights for policy 0, policy_version 32080 (0.0005) [2023-03-08 19:19:49,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.4, 300 sec: 10358.0). Total num frames: 16445440. Throughput: 0: 10598.9. Samples: 16449384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:19:49,600][547459] Avg episode reward: [(0, '4738.934')] [2023-03-08 19:19:49,613][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000032128_16449536.pth... [2023-03-08 19:19:49,614][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000031504_16130048.pth [2023-03-08 19:19:51,119][547799] Updated weights for policy 0, policy_version 32160 (0.0005) [2023-03-08 19:19:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10371.9). Total num frames: 16502784. Throughput: 0: 10607.5. Samples: 16480480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:19:54,600][547459] Avg episode reward: [(0, '4202.767')] [2023-03-08 19:19:55,033][547799] Updated weights for policy 0, policy_version 32240 (0.0005) [2023-03-08 19:19:58,887][547799] Updated weights for policy 0, policy_version 32320 (0.0005) [2023-03-08 19:19:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10358.0). Total num frames: 16551936. Throughput: 0: 10565.9. Samples: 16543880. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:19:59,600][547459] Avg episode reward: [(0, '4548.437')] [2023-03-08 19:20:02,619][547799] Updated weights for policy 0, policy_version 32400 (0.0005) [2023-03-08 19:20:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10371.9). Total num frames: 16609280. Throughput: 0: 10639.6. Samples: 16609560. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:20:04,600][547459] Avg episode reward: [(0, '4682.941')] [2023-03-08 19:20:04,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000032440_16609280.pth... [2023-03-08 19:20:04,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000031816_16289792.pth [2023-03-08 19:20:06,437][547799] Updated weights for policy 0, policy_version 32480 (0.0005) [2023-03-08 19:20:09,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10371.9). Total num frames: 16662528. Throughput: 0: 10647.4. Samples: 16642060. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:20:09,600][547459] Avg episode reward: [(0, '4642.101')] [2023-03-08 19:20:10,336][547799] Updated weights for policy 0, policy_version 32560 (0.0005) [2023-03-08 19:20:14,207][547799] Updated weights for policy 0, policy_version 32640 (0.0005) [2023-03-08 19:20:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 16715776. Throughput: 0: 10592.0. Samples: 16703944. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:20:14,600][547459] Avg episode reward: [(0, '4533.452')] [2023-03-08 19:20:18,019][547799] Updated weights for policy 0, policy_version 32720 (0.0005) [2023-03-08 19:20:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10371.9). Total num frames: 16764928. Throughput: 0: 10648.0. Samples: 16768852. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:20:19,600][547459] Avg episode reward: [(0, '4637.877')] [2023-03-08 19:20:19,607][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000032752_16769024.pth... [2023-03-08 19:20:19,609][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000032128_16449536.pth [2023-03-08 19:20:21,916][547799] Updated weights for policy 0, policy_version 32800 (0.0005) [2023-03-08 19:20:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10385.8). Total num frames: 16822272. Throughput: 0: 10689.8. Samples: 16800472. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:20:24,601][547459] Avg episode reward: [(0, '4276.504')] [2023-03-08 19:20:25,668][547799] Updated weights for policy 0, policy_version 32880 (0.0005) [2023-03-08 19:20:29,410][547799] Updated weights for policy 0, policy_version 32960 (0.0005) [2023-03-08 19:20:29,600][547459] Fps is (10 sec: 11059.0, 60 sec: 10649.6, 300 sec: 10399.7). Total num frames: 16875520. Throughput: 0: 10705.6. Samples: 16867160. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:20:29,601][547459] Avg episode reward: [(0, '4679.350')] [2023-03-08 19:20:33,110][547799] Updated weights for policy 0, policy_version 33040 (0.0005) [2023-03-08 19:20:34,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10399.7). Total num frames: 16928768. Throughput: 0: 10718.2. Samples: 16931704. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:20:34,600][547459] Avg episode reward: [(0, '4672.613')] [2023-03-08 19:20:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000033064_16928768.pth... [2023-03-08 19:20:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000032440_16609280.pth [2023-03-08 19:20:36,910][547799] Updated weights for policy 0, policy_version 33120 (0.0005) [2023-03-08 19:20:39,600][547459] Fps is (10 sec: 11059.4, 60 sec: 10786.1, 300 sec: 10413.6). Total num frames: 16986112. Throughput: 0: 10749.7. Samples: 16964216. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:20:39,600][547459] Avg episode reward: [(0, '4629.820')] [2023-03-08 19:20:40,731][547799] Updated weights for policy 0, policy_version 33200 (0.0005) [2023-03-08 19:20:44,458][547799] Updated weights for policy 0, policy_version 33280 (0.0005) [2023-03-08 19:20:44,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10427.4). Total num frames: 17039360. Throughput: 0: 10797.9. Samples: 17029784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:20:44,600][547459] Avg episode reward: [(0, '4660.604')] [2023-03-08 19:20:48,219][547799] Updated weights for policy 0, policy_version 33360 (0.0005) [2023-03-08 19:20:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10441.3). Total num frames: 17092608. Throughput: 0: 10787.3. Samples: 17094988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:20:49,600][547459] Avg episode reward: [(0, '4586.345')] [2023-03-08 19:20:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000033384_17092608.pth... [2023-03-08 19:20:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000032752_16769024.pth [2023-03-08 19:20:52,137][547799] Updated weights for policy 0, policy_version 33440 (0.0005) [2023-03-08 19:20:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10441.3). Total num frames: 17145856. Throughput: 0: 10741.8. Samples: 17125440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:20:54,601][547459] Avg episode reward: [(0, '4772.740')] [2023-03-08 19:20:56,041][547799] Updated weights for policy 0, policy_version 33520 (0.0005) [2023-03-08 19:20:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10455.2). Total num frames: 17199104. Throughput: 0: 10754.6. Samples: 17187900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:20:59,600][547459] Avg episode reward: [(0, '4544.584')] [2023-03-08 19:20:59,901][547799] Updated weights for policy 0, policy_version 33600 (0.0005) [2023-03-08 19:21:03,921][547799] Updated weights for policy 0, policy_version 33680 (0.0005) [2023-03-08 19:21:04,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10441.3). Total num frames: 17248256. Throughput: 0: 10703.0. Samples: 17250488. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:21:04,600][547459] Avg episode reward: [(0, '4719.137')] [2023-03-08 19:21:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000033688_17248256.pth... [2023-03-08 19:21:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000033064_16928768.pth [2023-03-08 19:21:07,821][547799] Updated weights for policy 0, policy_version 33760 (0.0005) [2023-03-08 19:21:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10455.2). Total num frames: 17301504. Throughput: 0: 10693.7. Samples: 17281688. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:21:09,600][547459] Avg episode reward: [(0, '4703.571')] [2023-03-08 19:21:11,624][547799] Updated weights for policy 0, policy_version 33840 (0.0005) [2023-03-08 19:21:14,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10455.2). Total num frames: 17354752. Throughput: 0: 10653.7. Samples: 17346576. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:21:14,600][547459] Avg episode reward: [(0, '4783.403')] [2023-03-08 19:21:15,382][547799] Updated weights for policy 0, policy_version 33920 (0.0005) [2023-03-08 19:21:19,170][547799] Updated weights for policy 0, policy_version 34000 (0.0005) [2023-03-08 19:21:19,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10483.0). Total num frames: 17412096. Throughput: 0: 10675.8. Samples: 17412116. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:21:19,601][547459] Avg episode reward: [(0, '4725.775')] [2023-03-08 19:21:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000034008_17412096.pth... [2023-03-08 19:21:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000033384_17092608.pth [2023-03-08 19:21:22,958][547799] Updated weights for policy 0, policy_version 34080 (0.0005) [2023-03-08 19:21:24,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10483.0). Total num frames: 17465344. Throughput: 0: 10681.1. Samples: 17444864. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:21:24,600][547459] Avg episode reward: [(0, '4517.949')] [2023-03-08 19:21:26,804][547799] Updated weights for policy 0, policy_version 34160 (0.0005) [2023-03-08 19:21:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10496.9). Total num frames: 17518592. Throughput: 0: 10658.2. Samples: 17509404. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:21:29,600][547459] Avg episode reward: [(0, '4663.879')] [2023-03-08 19:21:30,488][547799] Updated weights for policy 0, policy_version 34240 (0.0005) [2023-03-08 19:21:34,305][547799] Updated weights for policy 0, policy_version 34320 (0.0005) [2023-03-08 19:21:34,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10496.9). Total num frames: 17571840. Throughput: 0: 10656.8. Samples: 17574544. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:21:34,600][547459] Avg episode reward: [(0, '4684.374')] [2023-03-08 19:21:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000034320_17571840.pth... [2023-03-08 19:21:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000033688_17248256.pth [2023-03-08 19:21:38,264][547799] Updated weights for policy 0, policy_version 34400 (0.0005) [2023-03-08 19:21:39,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 17625088. Throughput: 0: 10656.6. Samples: 17604988. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:21:39,601][547459] Avg episode reward: [(0, '4419.401')] [2023-03-08 19:21:42,000][547799] Updated weights for policy 0, policy_version 34480 (0.0005) [2023-03-08 19:21:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 17678336. Throughput: 0: 10716.7. Samples: 17670152. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:21:44,600][547459] Avg episode reward: [(0, '4766.934')] [2023-03-08 19:21:45,796][547799] Updated weights for policy 0, policy_version 34560 (0.0005) [2023-03-08 19:21:49,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 17731584. Throughput: 0: 10778.1. Samples: 17735504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:21:49,600][547459] Avg episode reward: [(0, '4632.742')] [2023-03-08 19:21:49,609][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000034640_17735680.pth... [2023-03-08 19:21:49,609][547799] Updated weights for policy 0, policy_version 34640 (0.0005) [2023-03-08 19:21:49,610][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000034008_17412096.pth [2023-03-08 19:21:53,385][547799] Updated weights for policy 0, policy_version 34720 (0.0005) [2023-03-08 19:21:54,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 17788928. Throughput: 0: 10800.2. Samples: 17767696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:21:54,601][547459] Avg episode reward: [(0, '4784.279')] [2023-03-08 19:21:57,181][547799] Updated weights for policy 0, policy_version 34800 (0.0005) [2023-03-08 19:21:59,600][547459] Fps is (10 sec: 11059.4, 60 sec: 10717.9, 300 sec: 10552.4). Total num frames: 17842176. Throughput: 0: 10788.2. Samples: 17832044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:21:59,601][547459] Avg episode reward: [(0, '4438.478')] [2023-03-08 19:22:01,046][547799] Updated weights for policy 0, policy_version 34880 (0.0005) [2023-03-08 19:22:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10552.4). Total num frames: 17895424. Throughput: 0: 10761.3. Samples: 17896376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:22:04,601][547459] Avg episode reward: [(0, '4817.440')] [2023-03-08 19:22:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000034952_17895424.pth... [2023-03-08 19:22:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000034320_17571840.pth [2023-03-08 19:22:04,798][547799] Updated weights for policy 0, policy_version 34960 (0.0005) [2023-03-08 19:22:08,508][547799] Updated weights for policy 0, policy_version 35040 (0.0005) [2023-03-08 19:22:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10566.3). Total num frames: 17948672. Throughput: 0: 10779.6. Samples: 17929944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:22:09,600][547459] Avg episode reward: [(0, '4667.526')] [2023-03-08 19:22:12,372][547799] Updated weights for policy 0, policy_version 35120 (0.0005) [2023-03-08 19:22:14,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10566.3). Total num frames: 18001920. Throughput: 0: 10762.8. Samples: 17993728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:22:14,600][547459] Avg episode reward: [(0, '4779.848')] [2023-03-08 19:22:16,203][547799] Updated weights for policy 0, policy_version 35200 (0.0005) [2023-03-08 19:22:19,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10580.2). Total num frames: 18059264. Throughput: 0: 10771.6. Samples: 18059264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:22:19,600][547459] Avg episode reward: [(0, '4681.254')] [2023-03-08 19:22:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000035272_18059264.pth... [2023-03-08 19:22:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000034640_17735680.pth [2023-03-08 19:22:19,940][547799] Updated weights for policy 0, policy_version 35280 (0.0005) [2023-03-08 19:22:24,007][547799] Updated weights for policy 0, policy_version 35360 (0.0005) [2023-03-08 19:22:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 18108416. Throughput: 0: 10784.3. Samples: 18090284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:22:24,600][547459] Avg episode reward: [(0, '4089.970')] [2023-03-08 19:22:27,987][547799] Updated weights for policy 0, policy_version 35440 (0.0005) [2023-03-08 19:22:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 18161664. Throughput: 0: 10691.3. Samples: 18151260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:22:29,601][547459] Avg episode reward: [(0, '3965.055')] [2023-03-08 19:22:31,816][547799] Updated weights for policy 0, policy_version 35520 (0.0005) [2023-03-08 19:22:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.8, 300 sec: 10580.2). Total num frames: 18214912. Throughput: 0: 10657.1. Samples: 18215072. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:22:34,600][547459] Avg episode reward: [(0, '4639.156')] [2023-03-08 19:22:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000035576_18214912.pth... [2023-03-08 19:22:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000034952_17895424.pth [2023-03-08 19:22:35,673][547799] Updated weights for policy 0, policy_version 35600 (0.0005) [2023-03-08 19:22:39,581][547799] Updated weights for policy 0, policy_version 35680 (0.0005) [2023-03-08 19:22:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.8, 300 sec: 10580.2). Total num frames: 18268160. Throughput: 0: 10665.3. Samples: 18247636. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:22:39,600][547459] Avg episode reward: [(0, '4725.447')] [2023-03-08 19:22:43,425][547799] Updated weights for policy 0, policy_version 35760 (0.0005) [2023-03-08 19:22:44,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 18321408. Throughput: 0: 10623.0. Samples: 18310080. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:22:44,600][547459] Avg episode reward: [(0, '4770.896')] [2023-03-08 19:22:47,346][547799] Updated weights for policy 0, policy_version 35840 (0.0005) [2023-03-08 19:22:49,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 18370560. Throughput: 0: 10619.5. Samples: 18374252. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:22:49,600][547459] Avg episode reward: [(0, '4380.121')] [2023-03-08 19:22:49,636][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000035888_18374656.pth... [2023-03-08 19:22:49,638][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000035272_18059264.pth [2023-03-08 19:22:51,175][547799] Updated weights for policy 0, policy_version 35920 (0.0005) [2023-03-08 19:22:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 18427904. Throughput: 0: 10573.5. Samples: 18405752. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:22:54,601][547459] Avg episode reward: [(0, '4703.684')] [2023-03-08 19:22:54,935][547799] Updated weights for policy 0, policy_version 36000 (0.0006) [2023-03-08 19:22:58,747][547799] Updated weights for policy 0, policy_version 36080 (0.0005) [2023-03-08 19:22:59,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 18481152. Throughput: 0: 10605.0. Samples: 18470952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:22:59,600][547459] Avg episode reward: [(0, '4543.756')] [2023-03-08 19:23:02,583][547799] Updated weights for policy 0, policy_version 36160 (0.0005) [2023-03-08 19:23:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 18534400. Throughput: 0: 10558.6. Samples: 18534400. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:23:04,600][547459] Avg episode reward: [(0, '4800.784')] [2023-03-08 19:23:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000036200_18534400.pth... [2023-03-08 19:23:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000035576_18214912.pth [2023-03-08 19:23:06,472][547799] Updated weights for policy 0, policy_version 36240 (0.0005) [2023-03-08 19:23:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 18583552. Throughput: 0: 10585.0. Samples: 18566608. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:23:09,600][547459] Avg episode reward: [(0, '4512.245')] [2023-03-08 19:23:10,413][547799] Updated weights for policy 0, policy_version 36320 (0.0005) [2023-03-08 19:23:14,197][547799] Updated weights for policy 0, policy_version 36400 (0.0005) [2023-03-08 19:23:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 18640896. Throughput: 0: 10628.6. Samples: 18629548. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:23:14,600][547459] Avg episode reward: [(0, '4207.682')] [2023-03-08 19:23:18,227][547799] Updated weights for policy 0, policy_version 36480 (0.0005) [2023-03-08 19:23:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 18690048. Throughput: 0: 10586.1. Samples: 18691448. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:23:19,600][547459] Avg episode reward: [(0, '4352.202')] [2023-03-08 19:23:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000036504_18690048.pth... [2023-03-08 19:23:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000035888_18374656.pth [2023-03-08 19:23:22,126][547799] Updated weights for policy 0, policy_version 36560 (0.0005) [2023-03-08 19:23:24,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 18743296. Throughput: 0: 10559.6. Samples: 18722816. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:23:24,600][547459] Avg episode reward: [(0, '3588.600')] [2023-03-08 19:23:25,950][547799] Updated weights for policy 0, policy_version 36640 (0.0005) [2023-03-08 19:23:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 18796544. Throughput: 0: 10620.8. Samples: 18788016. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:23:29,600][547459] Avg episode reward: [(0, '3420.416')] [2023-03-08 19:23:29,780][547799] Updated weights for policy 0, policy_version 36720 (0.0005) [2023-03-08 19:23:33,633][547799] Updated weights for policy 0, policy_version 36800 (0.0005) [2023-03-08 19:23:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 18849792. Throughput: 0: 10596.9. Samples: 18851112. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:23:34,600][547459] Avg episode reward: [(0, '3883.434')] [2023-03-08 19:23:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000036816_18849792.pth... [2023-03-08 19:23:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000036200_18534400.pth [2023-03-08 19:23:37,446][547799] Updated weights for policy 0, policy_version 36880 (0.0004) [2023-03-08 19:23:39,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 18903040. Throughput: 0: 10612.6. Samples: 18883320. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:23:39,600][547459] Avg episode reward: [(0, '4267.223')] [2023-03-08 19:23:41,297][547799] Updated weights for policy 0, policy_version 36960 (0.0005) [2023-03-08 19:23:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10663.5). Total num frames: 18956288. Throughput: 0: 10604.6. Samples: 18948160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:23:44,600][547459] Avg episode reward: [(0, '4398.733')] [2023-03-08 19:23:45,044][547799] Updated weights for policy 0, policy_version 37040 (0.0005) [2023-03-08 19:23:48,863][547799] Updated weights for policy 0, policy_version 37120 (0.0005) [2023-03-08 19:23:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 19009536. Throughput: 0: 10641.5. Samples: 19013268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:23:49,600][547459] Avg episode reward: [(0, '4671.870')] [2023-03-08 19:23:49,657][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000037136_19013632.pth... [2023-03-08 19:23:49,660][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000036504_18690048.pth [2023-03-08 19:23:52,699][547799] Updated weights for policy 0, policy_version 37200 (0.0006) [2023-03-08 19:23:54,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 19066880. Throughput: 0: 10621.9. Samples: 19044592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:23:54,600][547459] Avg episode reward: [(0, '4741.895')] [2023-03-08 19:23:56,514][547799] Updated weights for policy 0, policy_version 37280 (0.0005) [2023-03-08 19:23:59,600][547459] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 19120128. Throughput: 0: 10664.2. Samples: 19109436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:23:59,600][547459] Avg episode reward: [(0, '4733.374')] [2023-03-08 19:24:00,289][547799] Updated weights for policy 0, policy_version 37360 (0.0006) [2023-03-08 19:24:04,158][547799] Updated weights for policy 0, policy_version 37440 (0.0005) [2023-03-08 19:24:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 19173376. Throughput: 0: 10709.5. Samples: 19173376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:24:04,601][547459] Avg episode reward: [(0, '4223.011')] [2023-03-08 19:24:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000037448_19173376.pth... [2023-03-08 19:24:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000036816_18849792.pth [2023-03-08 19:24:08,027][547799] Updated weights for policy 0, policy_version 37520 (0.0005) [2023-03-08 19:24:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10677.4). Total num frames: 19226624. Throughput: 0: 10727.0. Samples: 19205532. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:24:09,600][547459] Avg episode reward: [(0, '4673.016')] [2023-03-08 19:24:11,795][547799] Updated weights for policy 0, policy_version 37600 (0.0005) [2023-03-08 19:24:14,599][547459] Fps is (10 sec: 10649.8, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 19279872. Throughput: 0: 10704.2. Samples: 19269704. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:24:14,600][547459] Avg episode reward: [(0, '4685.531')] [2023-03-08 19:24:15,785][547799] Updated weights for policy 0, policy_version 37680 (0.0005) [2023-03-08 19:24:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 19329024. Throughput: 0: 10703.2. Samples: 19332756. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:24:19,600][547459] Avg episode reward: [(0, '3797.589')] [2023-03-08 19:24:19,608][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000037760_19333120.pth... [2023-03-08 19:24:19,609][547799] Updated weights for policy 0, policy_version 37760 (0.0005) [2023-03-08 19:24:19,610][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000037136_19013632.pth [2023-03-08 19:24:23,425][547799] Updated weights for policy 0, policy_version 37840 (0.0005) [2023-03-08 19:24:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10677.4). Total num frames: 19386368. Throughput: 0: 10694.4. Samples: 19364568. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:24:24,600][547459] Avg episode reward: [(0, '4153.194')] [2023-03-08 19:24:27,295][547799] Updated weights for policy 0, policy_version 37920 (0.0005) [2023-03-08 19:24:29,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 19435520. Throughput: 0: 10649.6. Samples: 19427392. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:24:29,600][547459] Avg episode reward: [(0, '4511.030')] [2023-03-08 19:24:31,145][547799] Updated weights for policy 0, policy_version 38000 (0.0005) [2023-03-08 19:24:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10691.3). Total num frames: 19492864. Throughput: 0: 10657.9. Samples: 19492872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:24:34,600][547459] Avg episode reward: [(0, '4658.058')] [2023-03-08 19:24:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000038072_19492864.pth... [2023-03-08 19:24:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000037448_19173376.pth [2023-03-08 19:24:34,902][547799] Updated weights for policy 0, policy_version 38080 (0.0005) [2023-03-08 19:24:38,648][547799] Updated weights for policy 0, policy_version 38160 (0.0005) [2023-03-08 19:24:39,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10677.4). Total num frames: 19546112. Throughput: 0: 10690.0. Samples: 19525644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:24:39,600][547459] Avg episode reward: [(0, '4777.106')] [2023-03-08 19:24:42,450][547799] Updated weights for policy 0, policy_version 38240 (0.0005) [2023-03-08 19:24:44,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10691.3). Total num frames: 19599360. Throughput: 0: 10700.8. Samples: 19590972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:24:44,600][547459] Avg episode reward: [(0, '4483.694')] [2023-03-08 19:24:46,204][547799] Updated weights for policy 0, policy_version 38320 (0.0005) [2023-03-08 19:24:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10677.4). Total num frames: 19652608. Throughput: 0: 10733.0. Samples: 19656360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:24:49,600][547459] Avg episode reward: [(0, '4539.476')] [2023-03-08 19:24:49,639][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000038392_19656704.pth... [2023-03-08 19:24:49,641][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000037760_19333120.pth [2023-03-08 19:24:49,988][547799] Updated weights for policy 0, policy_version 38400 (0.0005) [2023-03-08 19:24:53,816][547799] Updated weights for policy 0, policy_version 38480 (0.0005) [2023-03-08 19:24:54,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10705.1). Total num frames: 19709952. Throughput: 0: 10726.6. Samples: 19688228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:24:54,600][547459] Avg episode reward: [(0, '4568.227')] [2023-03-08 19:24:57,627][547799] Updated weights for policy 0, policy_version 38560 (0.0006) [2023-03-08 19:24:59,600][547459] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 10691.3). Total num frames: 19763200. Throughput: 0: 10713.9. Samples: 19751832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:24:59,600][547459] Avg episode reward: [(0, '4731.920')] [2023-03-08 19:25:01,464][547799] Updated weights for policy 0, policy_version 38640 (0.0005) [2023-03-08 19:25:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 19812352. Throughput: 0: 10748.4. Samples: 19816436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:04,600][547459] Avg episode reward: [(0, '4765.304')] [2023-03-08 19:25:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000038704_19816448.pth... [2023-03-08 19:25:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000038072_19492864.pth [2023-03-08 19:25:05,374][547799] Updated weights for policy 0, policy_version 38720 (0.0005) [2023-03-08 19:25:09,153][547799] Updated weights for policy 0, policy_version 38800 (0.0005) [2023-03-08 19:25:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10691.3). Total num frames: 19869696. Throughput: 0: 10757.7. Samples: 19848664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:09,600][547459] Avg episode reward: [(0, '4636.936')] [2023-03-08 19:25:12,947][547799] Updated weights for policy 0, policy_version 38880 (0.0005) [2023-03-08 19:25:14,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10717.8, 300 sec: 10705.1). Total num frames: 19922944. Throughput: 0: 10788.9. Samples: 19912892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:14,600][547459] Avg episode reward: [(0, '4816.766')] [2023-03-08 19:25:16,759][547799] Updated weights for policy 0, policy_version 38960 (0.0005) [2023-03-08 19:25:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10691.2). Total num frames: 19976192. Throughput: 0: 10790.6. Samples: 19978448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:19,601][547459] Avg episode reward: [(0, '4746.133')] [2023-03-08 19:25:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000039016_19976192.pth... [2023-03-08 19:25:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000038392_19656704.pth [2023-03-08 19:25:20,508][547799] Updated weights for policy 0, policy_version 39040 (0.0005) [2023-03-08 19:25:24,353][547799] Updated weights for policy 0, policy_version 39120 (0.0006) [2023-03-08 19:25:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10691.3). Total num frames: 20029440. Throughput: 0: 10762.9. Samples: 20009976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:24,600][547459] Avg episode reward: [(0, '4439.683')] [2023-03-08 19:25:28,145][547799] Updated weights for policy 0, policy_version 39200 (0.0005) [2023-03-08 19:25:29,600][547459] Fps is (10 sec: 10649.8, 60 sec: 10786.1, 300 sec: 10691.3). Total num frames: 20082688. Throughput: 0: 10745.3. Samples: 20074512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:29,600][547459] Avg episode reward: [(0, '3906.884')] [2023-03-08 19:25:31,988][547799] Updated weights for policy 0, policy_version 39280 (0.0005) [2023-03-08 19:25:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10677.4). Total num frames: 20135936. Throughput: 0: 10697.1. Samples: 20137728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:34,600][547459] Avg episode reward: [(0, '4310.338')] [2023-03-08 19:25:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000039328_20135936.pth... [2023-03-08 19:25:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000038704_19816448.pth [2023-03-08 19:25:35,890][547799] Updated weights for policy 0, policy_version 39360 (0.0005) [2023-03-08 19:25:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10677.4). Total num frames: 20189184. Throughput: 0: 10691.7. Samples: 20169352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:39,601][547459] Avg episode reward: [(0, '4436.601')] [2023-03-08 19:25:39,699][547799] Updated weights for policy 0, policy_version 39440 (0.0005) [2023-03-08 19:25:43,449][547799] Updated weights for policy 0, policy_version 39520 (0.0005) [2023-03-08 19:25:44,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10677.4). Total num frames: 20242432. Throughput: 0: 10730.7. Samples: 20234712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:44,600][547459] Avg episode reward: [(0, '4549.543')] [2023-03-08 19:25:47,283][547799] Updated weights for policy 0, policy_version 39600 (0.0005) [2023-03-08 19:25:49,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10786.2, 300 sec: 10691.3). Total num frames: 20299776. Throughput: 0: 10737.2. Samples: 20299608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:49,600][547459] Avg episode reward: [(0, '4692.892')] [2023-03-08 19:25:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000039648_20299776.pth... [2023-03-08 19:25:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000039016_19976192.pth [2023-03-08 19:25:51,091][547799] Updated weights for policy 0, policy_version 39680 (0.0005) [2023-03-08 19:25:54,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10691.3). Total num frames: 20353024. Throughput: 0: 10743.7. Samples: 20332132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:54,600][547459] Avg episode reward: [(0, '4731.214')] [2023-03-08 19:25:54,884][547799] Updated weights for policy 0, policy_version 39760 (0.0006) [2023-03-08 19:25:58,748][547799] Updated weights for policy 0, policy_version 39840 (0.0005) [2023-03-08 19:25:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10705.1). Total num frames: 20406272. Throughput: 0: 10736.1. Samples: 20396016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:25:59,600][547459] Avg episode reward: [(0, '4652.872')] [2023-03-08 19:26:02,613][547799] Updated weights for policy 0, policy_version 39920 (0.0005) [2023-03-08 19:26:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10691.3). Total num frames: 20455424. Throughput: 0: 10690.5. Samples: 20459520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:26:04,600][547459] Avg episode reward: [(0, '3851.449')] [2023-03-08 19:26:04,606][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000039960_20459520.pth... [2023-03-08 19:26:04,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000039328_20135936.pth [2023-03-08 19:26:06,535][547799] Updated weights for policy 0, policy_version 40000 (0.0005) [2023-03-08 19:26:09,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10691.3). Total num frames: 20508672. Throughput: 0: 10703.7. Samples: 20491644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:26:09,600][547459] Avg episode reward: [(0, '4007.402')] [2023-03-08 19:26:10,459][547799] Updated weights for policy 0, policy_version 40080 (0.0005) [2023-03-08 19:26:14,227][547799] Updated weights for policy 0, policy_version 40160 (0.0005) [2023-03-08 19:26:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 20561920. Throughput: 0: 10656.4. Samples: 20554052. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:26:14,600][547459] Avg episode reward: [(0, '4345.131')] [2023-03-08 19:26:18,050][547799] Updated weights for policy 0, policy_version 40240 (0.0006) [2023-03-08 19:26:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 20615168. Throughput: 0: 10686.6. Samples: 20618624. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:26:19,600][547459] Avg episode reward: [(0, '4538.491')] [2023-03-08 19:26:19,648][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000040272_20619264.pth... [2023-03-08 19:26:19,651][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000039648_20299776.pth [2023-03-08 19:26:21,986][547799] Updated weights for policy 0, policy_version 40320 (0.0005) [2023-03-08 19:26:24,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 20668416. Throughput: 0: 10666.8. Samples: 20649356. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:26:24,600][547459] Avg episode reward: [(0, '4384.808')] [2023-03-08 19:26:25,957][547799] Updated weights for policy 0, policy_version 40400 (0.0005) [2023-03-08 19:26:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 20721664. Throughput: 0: 10624.2. Samples: 20712800. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:26:29,600][547459] Avg episode reward: [(0, '4479.328')] [2023-03-08 19:26:29,822][547799] Updated weights for policy 0, policy_version 40480 (0.0005) [2023-03-08 19:26:33,727][547799] Updated weights for policy 0, policy_version 40560 (0.0005) [2023-03-08 19:26:34,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 20774912. Throughput: 0: 10562.3. Samples: 20774912. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:26:34,600][547459] Avg episode reward: [(0, '4492.518')] [2023-03-08 19:26:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000040576_20774912.pth... [2023-03-08 19:26:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000039960_20459520.pth [2023-03-08 19:26:37,696][547799] Updated weights for policy 0, policy_version 40640 (0.0005) [2023-03-08 19:26:39,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10649.6, 300 sec: 10677.4). Total num frames: 20828160. Throughput: 0: 10524.0. Samples: 20805712. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:26:39,600][547459] Avg episode reward: [(0, '4613.552')] [2023-03-08 19:26:41,572][547799] Updated weights for policy 0, policy_version 40720 (0.0005) [2023-03-08 19:26:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10663.5). Total num frames: 20877312. Throughput: 0: 10515.1. Samples: 20869196. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:26:44,600][547459] Avg episode reward: [(0, '4105.902')] [2023-03-08 19:26:45,398][547799] Updated weights for policy 0, policy_version 40800 (0.0005) [2023-03-08 19:26:49,211][547799] Updated weights for policy 0, policy_version 40880 (0.0005) [2023-03-08 19:26:49,600][547459] Fps is (10 sec: 10649.8, 60 sec: 10581.3, 300 sec: 10663.5). Total num frames: 20934656. Throughput: 0: 10543.9. Samples: 20933996. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:26:49,600][547459] Avg episode reward: [(0, '4080.762')] [2023-03-08 19:26:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000040888_20934656.pth... [2023-03-08 19:26:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000040272_20619264.pth [2023-03-08 19:26:53,066][547799] Updated weights for policy 0, policy_version 40960 (0.0005) [2023-03-08 19:26:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10649.6). Total num frames: 20983808. Throughput: 0: 10535.1. Samples: 20965724. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:26:54,600][547459] Avg episode reward: [(0, '3608.246')] [2023-03-08 19:26:56,917][547799] Updated weights for policy 0, policy_version 41040 (0.0005) [2023-03-08 19:26:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10649.6). Total num frames: 21037056. Throughput: 0: 10561.5. Samples: 21029320. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:26:59,600][547459] Avg episode reward: [(0, '3299.950')] [2023-03-08 19:27:00,747][547799] Updated weights for policy 0, policy_version 41120 (0.0005) [2023-03-08 19:27:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 21090304. Throughput: 0: 10555.4. Samples: 21093616. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:27:04,600][547459] Avg episode reward: [(0, '4022.747')] [2023-03-08 19:27:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000041192_21090304.pth... [2023-03-08 19:27:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000040576_20774912.pth [2023-03-08 19:27:04,663][547799] Updated weights for policy 0, policy_version 41200 (0.0005) [2023-03-08 19:27:08,668][547799] Updated weights for policy 0, policy_version 41280 (0.0005) [2023-03-08 19:27:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 21143552. Throughput: 0: 10528.6. Samples: 21123144. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:27:09,600][547459] Avg episode reward: [(0, '3872.842')] [2023-03-08 19:27:12,642][547799] Updated weights for policy 0, policy_version 41360 (0.0005) [2023-03-08 19:27:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 21196800. Throughput: 0: 10510.7. Samples: 21185780. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:27:14,600][547459] Avg episode reward: [(0, '4321.788')] [2023-03-08 19:27:16,561][547799] Updated weights for policy 0, policy_version 41440 (0.0005) [2023-03-08 19:27:19,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 21245952. Throughput: 0: 10515.0. Samples: 21248084. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:27:19,600][547459] Avg episode reward: [(0, '3990.015')] [2023-03-08 19:27:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000041496_21245952.pth... [2023-03-08 19:27:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000040888_20934656.pth [2023-03-08 19:27:20,500][547799] Updated weights for policy 0, policy_version 41520 (0.0005) [2023-03-08 19:27:24,362][547799] Updated weights for policy 0, policy_version 41600 (0.0006) [2023-03-08 19:27:24,603][547459] Fps is (10 sec: 10236.0, 60 sec: 10512.4, 300 sec: 10635.6). Total num frames: 21299200. Throughput: 0: 10517.0. Samples: 21279016. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:27:24,604][547459] Avg episode reward: [(0, '4427.290')] [2023-03-08 19:27:28,212][547799] Updated weights for policy 0, policy_version 41680 (0.0005) [2023-03-08 19:27:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 21352448. Throughput: 0: 10554.9. Samples: 21344168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:27:29,600][547459] Avg episode reward: [(0, '4531.409')] [2023-03-08 19:27:32,002][547799] Updated weights for policy 0, policy_version 41760 (0.0005) [2023-03-08 19:27:34,600][547459] Fps is (10 sec: 10653.7, 60 sec: 10513.1, 300 sec: 10635.7). Total num frames: 21405696. Throughput: 0: 10566.8. Samples: 21409504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:27:34,600][547459] Avg episode reward: [(0, '4529.985')] [2023-03-08 19:27:34,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000041816_21409792.pth... [2023-03-08 19:27:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000041192_21090304.pth [2023-03-08 19:27:35,797][547799] Updated weights for policy 0, policy_version 41840 (0.0005) [2023-03-08 19:27:39,583][547799] Updated weights for policy 0, policy_version 41920 (0.0005) [2023-03-08 19:27:39,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10581.4, 300 sec: 10649.6). Total num frames: 21463040. Throughput: 0: 10547.1. Samples: 21440344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:27:39,601][547459] Avg episode reward: [(0, '4698.497')] [2023-03-08 19:27:43,348][547799] Updated weights for policy 0, policy_version 42000 (0.0005) [2023-03-08 19:27:44,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 21516288. Throughput: 0: 10605.0. Samples: 21506544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:27:44,600][547459] Avg episode reward: [(0, '4485.115')] [2023-03-08 19:27:47,164][547799] Updated weights for policy 0, policy_version 42080 (0.0005) [2023-03-08 19:27:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 21569536. Throughput: 0: 10576.2. Samples: 21569544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:27:49,600][547459] Avg episode reward: [(0, '4636.679')] [2023-03-08 19:27:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000042128_21569536.pth... [2023-03-08 19:27:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000041496_21245952.pth [2023-03-08 19:27:51,113][547799] Updated weights for policy 0, policy_version 42160 (0.0005) [2023-03-08 19:27:54,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.4, 300 sec: 10635.7). Total num frames: 21618688. Throughput: 0: 10619.8. Samples: 21601032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:27:54,600][547459] Avg episode reward: [(0, '4624.485')] [2023-03-08 19:27:55,049][547799] Updated weights for policy 0, policy_version 42240 (0.0006) [2023-03-08 19:27:58,885][547799] Updated weights for policy 0, policy_version 42320 (0.0005) [2023-03-08 19:27:59,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 21671936. Throughput: 0: 10621.6. Samples: 21663752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:27:59,600][547459] Avg episode reward: [(0, '4464.422')] [2023-03-08 19:28:02,747][547799] Updated weights for policy 0, policy_version 42400 (0.0004) [2023-03-08 19:28:04,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 21729280. Throughput: 0: 10684.3. Samples: 21728876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:04,600][547459] Avg episode reward: [(0, '4811.636')] [2023-03-08 19:28:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000042440_21729280.pth... [2023-03-08 19:28:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000041816_21409792.pth [2023-03-08 19:28:06,482][547799] Updated weights for policy 0, policy_version 42480 (0.0005) [2023-03-08 19:28:09,599][547459] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 21782528. Throughput: 0: 10726.5. Samples: 21761668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:09,600][547459] Avg episode reward: [(0, '4730.603')] [2023-03-08 19:28:10,258][547799] Updated weights for policy 0, policy_version 42560 (0.0005) [2023-03-08 19:28:14,005][547799] Updated weights for policy 0, policy_version 42640 (0.0005) [2023-03-08 19:28:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 21835776. Throughput: 0: 10727.8. Samples: 21826920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:14,600][547459] Avg episode reward: [(0, '4674.629')] [2023-03-08 19:28:17,873][547799] Updated weights for policy 0, policy_version 42720 (0.0005) [2023-03-08 19:28:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10663.5). Total num frames: 21889024. Throughput: 0: 10681.3. Samples: 21890164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:19,600][547459] Avg episode reward: [(0, '4815.302')] [2023-03-08 19:28:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000042752_21889024.pth... [2023-03-08 19:28:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000042128_21569536.pth [2023-03-08 19:28:21,756][547799] Updated weights for policy 0, policy_version 42800 (0.0005) [2023-03-08 19:28:24,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10718.6, 300 sec: 10663.5). Total num frames: 21942272. Throughput: 0: 10705.6. Samples: 21922096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:24,600][547459] Avg episode reward: [(0, '4586.520')] [2023-03-08 19:28:25,529][547799] Updated weights for policy 0, policy_version 42880 (0.0005) [2023-03-08 19:28:29,494][547799] Updated weights for policy 0, policy_version 42960 (0.0005) [2023-03-08 19:28:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10663.5). Total num frames: 21995520. Throughput: 0: 10669.2. Samples: 21986656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:29,600][547459] Avg episode reward: [(0, '4284.666')] [2023-03-08 19:28:33,244][547799] Updated weights for policy 0, policy_version 43040 (0.0005) [2023-03-08 19:28:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10663.5). Total num frames: 22048768. Throughput: 0: 10673.5. Samples: 22049852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:34,600][547459] Avg episode reward: [(0, '4479.846')] [2023-03-08 19:28:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000043064_22048768.pth... [2023-03-08 19:28:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000042440_21729280.pth [2023-03-08 19:28:37,023][547799] Updated weights for policy 0, policy_version 43120 (0.0005) [2023-03-08 19:28:39,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 22102016. Throughput: 0: 10699.3. Samples: 22082500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:39,601][547459] Avg episode reward: [(0, '4648.307')] [2023-03-08 19:28:40,922][547799] Updated weights for policy 0, policy_version 43200 (0.0005) [2023-03-08 19:28:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 22155264. Throughput: 0: 10741.7. Samples: 22147128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:44,600][547459] Avg episode reward: [(0, '4354.176')] [2023-03-08 19:28:44,628][547799] Updated weights for policy 0, policy_version 43280 (0.0005) [2023-03-08 19:28:48,598][547799] Updated weights for policy 0, policy_version 43360 (0.0005) [2023-03-08 19:28:49,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 22208512. Throughput: 0: 10696.0. Samples: 22210196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:28:49,600][547459] Avg episode reward: [(0, '4211.444')] [2023-03-08 19:28:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000043376_22208512.pth... [2023-03-08 19:28:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000042752_21889024.pth [2023-03-08 19:28:52,506][547799] Updated weights for policy 0, policy_version 43440 (0.0005) [2023-03-08 19:28:54,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10649.6). Total num frames: 22261760. Throughput: 0: 10659.5. Samples: 22241344. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:28:54,601][547459] Avg episode reward: [(0, '3762.895')] [2023-03-08 19:28:56,223][547799] Updated weights for policy 0, policy_version 43520 (0.0005) [2023-03-08 19:28:59,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10649.6). Total num frames: 22315008. Throughput: 0: 10666.0. Samples: 22306888. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:28:59,600][547459] Avg episode reward: [(0, '4161.447')] [2023-03-08 19:29:00,033][547799] Updated weights for policy 0, policy_version 43600 (0.0005) [2023-03-08 19:29:03,843][547799] Updated weights for policy 0, policy_version 43680 (0.0005) [2023-03-08 19:29:04,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 22368256. Throughput: 0: 10696.7. Samples: 22371516. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:29:04,600][547459] Avg episode reward: [(0, '4072.624')] [2023-03-08 19:29:04,646][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000043696_22372352.pth... [2023-03-08 19:29:04,648][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000043064_22048768.pth [2023-03-08 19:29:07,841][547799] Updated weights for policy 0, policy_version 43760 (0.0005) [2023-03-08 19:29:09,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 22421504. Throughput: 0: 10656.4. Samples: 22401632. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:29:09,600][547459] Avg episode reward: [(0, '3028.817')] [2023-03-08 19:29:11,706][547799] Updated weights for policy 0, policy_version 43840 (0.0005) [2023-03-08 19:29:14,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 22474752. Throughput: 0: 10652.3. Samples: 22466008. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:29:14,601][547459] Avg episode reward: [(0, '3084.660')] [2023-03-08 19:29:15,550][547799] Updated weights for policy 0, policy_version 43920 (0.0005) [2023-03-08 19:29:19,496][547799] Updated weights for policy 0, policy_version 44000 (0.0005) [2023-03-08 19:29:19,599][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 22528000. Throughput: 0: 10627.1. Samples: 22528072. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:29:19,600][547459] Avg episode reward: [(0, '2870.513')] [2023-03-08 19:29:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000044000_22528000.pth... [2023-03-08 19:29:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000043376_22208512.pth [2023-03-08 19:29:23,480][547799] Updated weights for policy 0, policy_version 44080 (0.0005) [2023-03-08 19:29:24,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 22577152. Throughput: 0: 10592.8. Samples: 22559176. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:29:24,600][547459] Avg episode reward: [(0, '3887.455')] [2023-03-08 19:29:27,293][547799] Updated weights for policy 0, policy_version 44160 (0.0005) [2023-03-08 19:29:29,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 22634496. Throughput: 0: 10570.2. Samples: 22622788. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:29:29,600][547459] Avg episode reward: [(0, '3757.842')] [2023-03-08 19:29:31,063][547799] Updated weights for policy 0, policy_version 44240 (0.0005) [2023-03-08 19:29:34,599][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 22687744. Throughput: 0: 10613.9. Samples: 22687820. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:29:34,600][547459] Avg episode reward: [(0, '3802.454')] [2023-03-08 19:29:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000044312_22687744.pth... [2023-03-08 19:29:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000043696_22372352.pth [2023-03-08 19:29:34,879][547799] Updated weights for policy 0, policy_version 44320 (0.0005) [2023-03-08 19:29:38,719][547799] Updated weights for policy 0, policy_version 44400 (0.0005) [2023-03-08 19:29:39,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 22740992. Throughput: 0: 10648.2. Samples: 22720512. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:29:39,600][547459] Avg episode reward: [(0, '4225.060')] [2023-03-08 19:29:42,678][547799] Updated weights for policy 0, policy_version 44480 (0.0005) [2023-03-08 19:29:44,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 22790144. Throughput: 0: 10558.7. Samples: 22782032. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 19:29:44,600][547459] Avg episode reward: [(0, '4037.045')] [2023-03-08 19:29:46,703][547799] Updated weights for policy 0, policy_version 44560 (0.0005) [2023-03-08 19:29:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10621.8). Total num frames: 22843392. Throughput: 0: 10495.7. Samples: 22843824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:29:49,600][547459] Avg episode reward: [(0, '4132.354')] [2023-03-08 19:29:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000044616_22843392.pth... [2023-03-08 19:29:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000044000_22528000.pth [2023-03-08 19:29:50,674][547799] Updated weights for policy 0, policy_version 44640 (0.0006) [2023-03-08 19:29:54,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 22892544. Throughput: 0: 10490.1. Samples: 22873688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:29:54,611][547459] Avg episode reward: [(0, '3954.468')] [2023-03-08 19:29:54,828][547799] Updated weights for policy 0, policy_version 44720 (0.0005) [2023-03-08 19:29:58,817][547799] Updated weights for policy 0, policy_version 44800 (0.0006) [2023-03-08 19:29:59,599][547459] Fps is (10 sec: 9830.4, 60 sec: 10444.8, 300 sec: 10608.0). Total num frames: 22941696. Throughput: 0: 10407.5. Samples: 22934344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:29:59,610][547459] Avg episode reward: [(0, '4047.693')] [2023-03-08 19:30:02,812][547799] Updated weights for policy 0, policy_version 44880 (0.0006) [2023-03-08 19:30:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 22994944. Throughput: 0: 10411.6. Samples: 22996596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:30:04,612][547459] Avg episode reward: [(0, '4157.675')] [2023-03-08 19:30:04,616][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000044912_22994944.pth... [2023-03-08 19:30:04,619][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000044312_22687744.pth [2023-03-08 19:30:06,730][547799] Updated weights for policy 0, policy_version 44960 (0.0005) [2023-03-08 19:30:09,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 23048192. Throughput: 0: 10413.3. Samples: 23027776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:30:09,600][547459] Avg episode reward: [(0, '4318.737')] [2023-03-08 19:30:10,634][547799] Updated weights for policy 0, policy_version 45040 (0.0005) [2023-03-08 19:30:14,357][547799] Updated weights for policy 0, policy_version 45120 (0.0005) [2023-03-08 19:30:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 23101440. Throughput: 0: 10449.1. Samples: 23093000. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:30:14,600][547459] Avg episode reward: [(0, '4242.449')] [2023-03-08 19:30:18,209][547799] Updated weights for policy 0, policy_version 45200 (0.0005) [2023-03-08 19:30:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 23154688. Throughput: 0: 10407.0. Samples: 23156136. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:30:19,600][547459] Avg episode reward: [(0, '4506.251')] [2023-03-08 19:30:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000045224_23154688.pth... [2023-03-08 19:30:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000044616_22843392.pth [2023-03-08 19:30:22,067][547799] Updated weights for policy 0, policy_version 45280 (0.0005) [2023-03-08 19:30:24,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 23207936. Throughput: 0: 10406.3. Samples: 23188796. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:30:24,600][547459] Avg episode reward: [(0, '4803.708')] [2023-03-08 19:30:25,874][547799] Updated weights for policy 0, policy_version 45360 (0.0005) [2023-03-08 19:30:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 23261184. Throughput: 0: 10467.2. Samples: 23253056. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:30:29,600][547459] Avg episode reward: [(0, '4450.120')] [2023-03-08 19:30:29,644][547799] Updated weights for policy 0, policy_version 45440 (0.0005) [2023-03-08 19:30:33,372][547799] Updated weights for policy 0, policy_version 45520 (0.0005) [2023-03-08 19:30:34,600][547459] Fps is (10 sec: 11059.0, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 23318528. Throughput: 0: 10549.0. Samples: 23318528. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:30:34,601][547459] Avg episode reward: [(0, '4375.229')] [2023-03-08 19:30:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000045544_23318528.pth... [2023-03-08 19:30:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000044912_22994944.pth [2023-03-08 19:30:37,238][547799] Updated weights for policy 0, policy_version 45600 (0.0005) [2023-03-08 19:30:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 23367680. Throughput: 0: 10601.9. Samples: 23350776. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:30:39,600][547459] Avg episode reward: [(0, '4368.195')] [2023-03-08 19:30:41,168][547799] Updated weights for policy 0, policy_version 45680 (0.0006) [2023-03-08 19:30:44,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10580.2). Total num frames: 23420928. Throughput: 0: 10631.1. Samples: 23412744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:30:44,600][547459] Avg episode reward: [(0, '4264.202')] [2023-03-08 19:30:45,012][547799] Updated weights for policy 0, policy_version 45760 (0.0005) [2023-03-08 19:30:48,871][547799] Updated weights for policy 0, policy_version 45840 (0.0005) [2023-03-08 19:30:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10580.2). Total num frames: 23474176. Throughput: 0: 10668.9. Samples: 23476696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:30:49,601][547459] Avg episode reward: [(0, '4045.181')] [2023-03-08 19:30:49,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000045848_23474176.pth... [2023-03-08 19:30:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000045224_23154688.pth [2023-03-08 19:30:52,777][547799] Updated weights for policy 0, policy_version 45920 (0.0006) [2023-03-08 19:30:54,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 23527424. Throughput: 0: 10665.2. Samples: 23507708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:30:54,600][547459] Avg episode reward: [(0, '4462.989')] [2023-03-08 19:30:56,606][547799] Updated weights for policy 0, policy_version 46000 (0.0005) [2023-03-08 19:30:59,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10608.0). Total num frames: 23584768. Throughput: 0: 10665.5. Samples: 23572948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:30:59,600][547459] Avg episode reward: [(0, '4405.402')] [2023-03-08 19:31:00,294][547799] Updated weights for policy 0, policy_version 46080 (0.0005) [2023-03-08 19:31:04,180][547799] Updated weights for policy 0, policy_version 46160 (0.0005) [2023-03-08 19:31:04,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10607.9). Total num frames: 23638016. Throughput: 0: 10708.4. Samples: 23638016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:04,600][547459] Avg episode reward: [(0, '4403.962')] [2023-03-08 19:31:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000046168_23638016.pth... [2023-03-08 19:31:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000045544_23318528.pth [2023-03-08 19:31:08,072][547799] Updated weights for policy 0, policy_version 46240 (0.0005) [2023-03-08 19:31:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10607.9). Total num frames: 23691264. Throughput: 0: 10691.1. Samples: 23669896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:09,600][547459] Avg episode reward: [(0, '4459.624')] [2023-03-08 19:31:11,971][547799] Updated weights for policy 0, policy_version 46320 (0.0005) [2023-03-08 19:31:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 23740416. Throughput: 0: 10654.7. Samples: 23732516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:14,600][547459] Avg episode reward: [(0, '4606.191')] [2023-03-08 19:31:15,749][547799] Updated weights for policy 0, policy_version 46400 (0.0005) [2023-03-08 19:31:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 23793664. Throughput: 0: 10621.2. Samples: 23796484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:19,600][547459] Avg episode reward: [(0, '4680.850')] [2023-03-08 19:31:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000046472_23793664.pth... [2023-03-08 19:31:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000045848_23474176.pth [2023-03-08 19:31:19,678][547799] Updated weights for policy 0, policy_version 46480 (0.0005) [2023-03-08 19:31:23,595][547799] Updated weights for policy 0, policy_version 46560 (0.0005) [2023-03-08 19:31:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 23846912. Throughput: 0: 10578.8. Samples: 23826824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:24,600][547459] Avg episode reward: [(0, '4512.701')] [2023-03-08 19:31:27,375][547799] Updated weights for policy 0, policy_version 46640 (0.0005) [2023-03-08 19:31:29,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 23900160. Throughput: 0: 10650.8. Samples: 23892032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:29,600][547459] Avg episode reward: [(0, '4619.155')] [2023-03-08 19:31:31,273][547799] Updated weights for policy 0, policy_version 46720 (0.0006) [2023-03-08 19:31:34,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 23953408. Throughput: 0: 10636.9. Samples: 23955356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:34,600][547459] Avg episode reward: [(0, '3912.094')] [2023-03-08 19:31:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000046784_23953408.pth... [2023-03-08 19:31:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000046168_23638016.pth [2023-03-08 19:31:35,128][547799] Updated weights for policy 0, policy_version 46800 (0.0005) [2023-03-08 19:31:39,070][547799] Updated weights for policy 0, policy_version 46880 (0.0005) [2023-03-08 19:31:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 24006656. Throughput: 0: 10657.1. Samples: 23987280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:39,600][547459] Avg episode reward: [(0, '4690.802')] [2023-03-08 19:31:42,898][547799] Updated weights for policy 0, policy_version 46960 (0.0005) [2023-03-08 19:31:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10594.1). Total num frames: 24059904. Throughput: 0: 10624.5. Samples: 24051052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:44,600][547459] Avg episode reward: [(0, '4471.403')] [2023-03-08 19:31:46,699][547799] Updated weights for policy 0, policy_version 47040 (0.0005) [2023-03-08 19:31:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 24113152. Throughput: 0: 10560.0. Samples: 24113216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:49,600][547459] Avg episode reward: [(0, '4487.777')] [2023-03-08 19:31:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000047096_24113152.pth... [2023-03-08 19:31:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000046472_23793664.pth [2023-03-08 19:31:50,717][547799] Updated weights for policy 0, policy_version 47120 (0.0005) [2023-03-08 19:31:54,572][547799] Updated weights for policy 0, policy_version 47200 (0.0005) [2023-03-08 19:31:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10608.0). Total num frames: 24166400. Throughput: 0: 10566.9. Samples: 24145404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:54,600][547459] Avg episode reward: [(0, '4385.421')] [2023-03-08 19:31:58,507][547799] Updated weights for policy 0, policy_version 47280 (0.0005) [2023-03-08 19:31:59,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 24215552. Throughput: 0: 10553.5. Samples: 24207424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:31:59,600][547459] Avg episode reward: [(0, '4004.608')] [2023-03-08 19:32:02,375][547799] Updated weights for policy 0, policy_version 47360 (0.0005) [2023-03-08 19:32:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 24268800. Throughput: 0: 10541.8. Samples: 24270864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:04,600][547459] Avg episode reward: [(0, '3972.464')] [2023-03-08 19:32:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000047400_24268800.pth... [2023-03-08 19:32:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000046784_23953408.pth [2023-03-08 19:32:06,353][547799] Updated weights for policy 0, policy_version 47440 (0.0005) [2023-03-08 19:32:09,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 24322048. Throughput: 0: 10550.1. Samples: 24301576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:09,600][547459] Avg episode reward: [(0, '3958.172')] [2023-03-08 19:32:10,209][547799] Updated weights for policy 0, policy_version 47520 (0.0005) [2023-03-08 19:32:14,281][547799] Updated weights for policy 0, policy_version 47600 (0.0006) [2023-03-08 19:32:14,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 24371200. Throughput: 0: 10467.7. Samples: 24363080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:14,600][547459] Avg episode reward: [(0, '3970.175')] [2023-03-08 19:32:18,021][547799] Updated weights for policy 0, policy_version 47680 (0.0006) [2023-03-08 19:32:19,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10594.2). Total num frames: 24424448. Throughput: 0: 10496.9. Samples: 24427716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:19,600][547459] Avg episode reward: [(0, '4144.620')] [2023-03-08 19:32:19,638][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000047712_24428544.pth... [2023-03-08 19:32:19,639][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000047096_24113152.pth [2023-03-08 19:32:21,978][547799] Updated weights for policy 0, policy_version 47760 (0.0005) [2023-03-08 19:32:24,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 24477696. Throughput: 0: 10485.0. Samples: 24459104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:24,600][547459] Avg episode reward: [(0, '4235.970')] [2023-03-08 19:32:25,978][547799] Updated weights for policy 0, policy_version 47840 (0.0005) [2023-03-08 19:32:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 24530944. Throughput: 0: 10465.1. Samples: 24521980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:29,600][547459] Avg episode reward: [(0, '4186.495')] [2023-03-08 19:32:29,872][547799] Updated weights for policy 0, policy_version 47920 (0.0005) [2023-03-08 19:32:33,805][547799] Updated weights for policy 0, policy_version 48000 (0.0006) [2023-03-08 19:32:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10580.2). Total num frames: 24584192. Throughput: 0: 10458.0. Samples: 24583824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:34,600][547459] Avg episode reward: [(0, '4391.729')] [2023-03-08 19:32:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000048016_24584192.pth... [2023-03-08 19:32:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000047400_24268800.pth [2023-03-08 19:32:37,804][547799] Updated weights for policy 0, policy_version 48080 (0.0006) [2023-03-08 19:32:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 24633344. Throughput: 0: 10415.7. Samples: 24614112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:39,601][547459] Avg episode reward: [(0, '4519.164')] [2023-03-08 19:32:41,863][547799] Updated weights for policy 0, policy_version 48160 (0.0005) [2023-03-08 19:32:44,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 24686592. Throughput: 0: 10384.2. Samples: 24674712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:44,600][547459] Avg episode reward: [(0, '4124.384')] [2023-03-08 19:32:45,794][547799] Updated weights for policy 0, policy_version 48240 (0.0005) [2023-03-08 19:32:49,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10566.3). Total num frames: 24735744. Throughput: 0: 10383.0. Samples: 24738100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:49,600][547459] Avg episode reward: [(0, '3974.036')] [2023-03-08 19:32:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000048312_24735744.pth... [2023-03-08 19:32:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000047712_24428544.pth [2023-03-08 19:32:49,693][547799] Updated weights for policy 0, policy_version 48320 (0.0005) [2023-03-08 19:32:53,605][547799] Updated weights for policy 0, policy_version 48400 (0.0005) [2023-03-08 19:32:54,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10566.3). Total num frames: 24788992. Throughput: 0: 10394.7. Samples: 24769336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:54,600][547459] Avg episode reward: [(0, '4431.367')] [2023-03-08 19:32:57,645][547799] Updated weights for policy 0, policy_version 48480 (0.0006) [2023-03-08 19:32:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 24842240. Throughput: 0: 10390.4. Samples: 24830648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:32:59,601][547459] Avg episode reward: [(0, '4263.096')] [2023-03-08 19:33:01,584][547799] Updated weights for policy 0, policy_version 48560 (0.0005) [2023-03-08 19:33:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10538.5). Total num frames: 24891392. Throughput: 0: 10363.2. Samples: 24894060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:33:04,600][547459] Avg episode reward: [(0, '4312.797')] [2023-03-08 19:33:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000048616_24891392.pth... [2023-03-08 19:33:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000048016_24584192.pth [2023-03-08 19:33:05,431][547799] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-08 19:33:09,371][547799] Updated weights for policy 0, policy_version 48720 (0.0005) [2023-03-08 19:33:09,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10538.5). Total num frames: 24944640. Throughput: 0: 10347.6. Samples: 24924748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:33:09,600][547459] Avg episode reward: [(0, '4200.736')] [2023-03-08 19:33:13,245][547799] Updated weights for policy 0, policy_version 48800 (0.0005) [2023-03-08 19:33:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 24997888. Throughput: 0: 10370.7. Samples: 24988660. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:33:14,600][547459] Avg episode reward: [(0, '4453.266')] [2023-03-08 19:33:17,245][547799] Updated weights for policy 0, policy_version 48880 (0.0005) [2023-03-08 19:33:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 25051136. Throughput: 0: 10385.0. Samples: 25051148. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:33:19,600][547459] Avg episode reward: [(0, '4415.313')] [2023-03-08 19:33:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000048928_25051136.pth... [2023-03-08 19:33:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000048312_24735744.pth [2023-03-08 19:33:21,109][547799] Updated weights for policy 0, policy_version 48960 (0.0005) [2023-03-08 19:33:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 25104384. Throughput: 0: 10435.0. Samples: 25083688. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:33:24,600][547459] Avg episode reward: [(0, '4472.920')] [2023-03-08 19:33:25,001][547799] Updated weights for policy 0, policy_version 49040 (0.0005) [2023-03-08 19:33:28,814][547799] Updated weights for policy 0, policy_version 49120 (0.0005) [2023-03-08 19:33:29,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 25157632. Throughput: 0: 10470.4. Samples: 25145880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:33:29,600][547459] Avg episode reward: [(0, '4518.393')] [2023-03-08 19:33:32,630][547799] Updated weights for policy 0, policy_version 49200 (0.0005) [2023-03-08 19:33:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 25210880. Throughput: 0: 10506.2. Samples: 25210880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:33:34,600][547459] Avg episode reward: [(0, '4574.830')] [2023-03-08 19:33:34,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000049240_25210880.pth... [2023-03-08 19:33:34,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000048616_24891392.pth [2023-03-08 19:33:36,494][547799] Updated weights for policy 0, policy_version 49280 (0.0005) [2023-03-08 19:33:39,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 25260032. Throughput: 0: 10524.0. Samples: 25242916. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:33:39,600][547459] Avg episode reward: [(0, '4636.076')] [2023-03-08 19:33:40,457][547799] Updated weights for policy 0, policy_version 49360 (0.0004) [2023-03-08 19:33:44,358][547799] Updated weights for policy 0, policy_version 49440 (0.0005) [2023-03-08 19:33:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 25313280. Throughput: 0: 10532.4. Samples: 25304604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:33:44,600][547459] Avg episode reward: [(0, '3889.808')] [2023-03-08 19:33:48,332][547799] Updated weights for policy 0, policy_version 49520 (0.0006) [2023-03-08 19:33:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 25366528. Throughput: 0: 10498.9. Samples: 25366512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:33:49,600][547459] Avg episode reward: [(0, '3664.987')] [2023-03-08 19:33:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000049544_25366528.pth... [2023-03-08 19:33:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000048928_25051136.pth [2023-03-08 19:33:52,274][547799] Updated weights for policy 0, policy_version 49600 (0.0005) [2023-03-08 19:33:54,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 25415680. Throughput: 0: 10530.2. Samples: 25398608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:33:54,600][547459] Avg episode reward: [(0, '4353.399')] [2023-03-08 19:33:56,142][547799] Updated weights for policy 0, policy_version 49680 (0.0005) [2023-03-08 19:33:59,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 25468928. Throughput: 0: 10511.0. Samples: 25461656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:33:59,600][547459] Avg episode reward: [(0, '4620.433')] [2023-03-08 19:34:00,030][547799] Updated weights for policy 0, policy_version 49760 (0.0005) [2023-03-08 19:34:03,915][547799] Updated weights for policy 0, policy_version 49840 (0.0005) [2023-03-08 19:34:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10510.7). Total num frames: 25522176. Throughput: 0: 10529.3. Samples: 25524968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:04,600][547459] Avg episode reward: [(0, '3860.726')] [2023-03-08 19:34:04,630][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000049856_25526272.pth... [2023-03-08 19:34:04,632][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000049240_25210880.pth [2023-03-08 19:34:07,721][547799] Updated weights for policy 0, policy_version 49920 (0.0006) [2023-03-08 19:34:09,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 25575424. Throughput: 0: 10523.5. Samples: 25557244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:09,600][547459] Avg episode reward: [(0, '4367.515')] [2023-03-08 19:34:11,566][547799] Updated weights for policy 0, policy_version 50000 (0.0005) [2023-03-08 19:34:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 25628672. Throughput: 0: 10538.8. Samples: 25620124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:14,600][547459] Avg episode reward: [(0, '4185.196')] [2023-03-08 19:34:15,561][547799] Updated weights for policy 0, policy_version 50080 (0.0005) [2023-03-08 19:34:19,560][547799] Updated weights for policy 0, policy_version 50160 (0.0005) [2023-03-08 19:34:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 25681920. Throughput: 0: 10467.4. Samples: 25681912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:19,601][547459] Avg episode reward: [(0, '4182.867')] [2023-03-08 19:34:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000050160_25681920.pth... [2023-03-08 19:34:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000049544_25366528.pth [2023-03-08 19:34:23,788][547799] Updated weights for policy 0, policy_version 50240 (0.0005) [2023-03-08 19:34:24,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 25731072. Throughput: 0: 10394.2. Samples: 25710656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:24,600][547459] Avg episode reward: [(0, '4043.640')] [2023-03-08 19:34:27,816][547799] Updated weights for policy 0, policy_version 50320 (0.0005) [2023-03-08 19:34:29,599][547459] Fps is (10 sec: 9830.5, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 25780224. Throughput: 0: 10382.9. Samples: 25771832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:29,600][547459] Avg episode reward: [(0, '4337.424')] [2023-03-08 19:34:31,667][547799] Updated weights for policy 0, policy_version 50400 (0.0005) [2023-03-08 19:34:34,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 25833472. Throughput: 0: 10403.6. Samples: 25834672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:34,600][547459] Avg episode reward: [(0, '4272.878')] [2023-03-08 19:34:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000050456_25833472.pth... [2023-03-08 19:34:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000049856_25526272.pth [2023-03-08 19:34:35,529][547799] Updated weights for policy 0, policy_version 50480 (0.0005) [2023-03-08 19:34:39,507][547799] Updated weights for policy 0, policy_version 50560 (0.0006) [2023-03-08 19:34:39,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 25886720. Throughput: 0: 10391.8. Samples: 25866240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:39,600][547459] Avg episode reward: [(0, '4708.644')] [2023-03-08 19:34:43,222][547799] Updated weights for policy 0, policy_version 50640 (0.0005) [2023-03-08 19:34:44,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 25939968. Throughput: 0: 10438.8. Samples: 25931404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:44,601][547459] Avg episode reward: [(0, '4555.671')] [2023-03-08 19:34:47,146][547799] Updated weights for policy 0, policy_version 50720 (0.0005) [2023-03-08 19:34:49,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 25993216. Throughput: 0: 10412.7. Samples: 25993540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:49,600][547459] Avg episode reward: [(0, '4521.579')] [2023-03-08 19:34:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000050768_25993216.pth... [2023-03-08 19:34:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000050160_25681920.pth [2023-03-08 19:34:50,993][547799] Updated weights for policy 0, policy_version 50800 (0.0005) [2023-03-08 19:34:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 26046464. Throughput: 0: 10427.6. Samples: 26026488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:54,600][547459] Avg episode reward: [(0, '4664.465')] [2023-03-08 19:34:54,823][547799] Updated weights for policy 0, policy_version 50880 (0.0005) [2023-03-08 19:34:58,786][547799] Updated weights for policy 0, policy_version 50960 (0.0005) [2023-03-08 19:34:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 26099712. Throughput: 0: 10417.8. Samples: 26088924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:34:59,600][547459] Avg episode reward: [(0, '4507.886')] [2023-03-08 19:35:02,639][547799] Updated weights for policy 0, policy_version 51040 (0.0005) [2023-03-08 19:35:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 26152960. Throughput: 0: 10467.9. Samples: 26152968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:04,601][547459] Avg episode reward: [(0, '4410.696')] [2023-03-08 19:35:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000051080_26152960.pth... [2023-03-08 19:35:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000050456_25833472.pth [2023-03-08 19:35:06,535][547799] Updated weights for policy 0, policy_version 51120 (0.0005) [2023-03-08 19:35:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 26206208. Throughput: 0: 10528.8. Samples: 26184452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:09,600][547459] Avg episode reward: [(0, '4519.403')] [2023-03-08 19:35:10,294][547799] Updated weights for policy 0, policy_version 51200 (0.0004) [2023-03-08 19:35:14,228][547799] Updated weights for policy 0, policy_version 51280 (0.0006) [2023-03-08 19:35:14,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 26255360. Throughput: 0: 10570.2. Samples: 26247492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:14,600][547459] Avg episode reward: [(0, '4624.094')] [2023-03-08 19:35:18,182][547799] Updated weights for policy 0, policy_version 51360 (0.0005) [2023-03-08 19:35:19,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 26308608. Throughput: 0: 10568.1. Samples: 26310236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:19,600][547459] Avg episode reward: [(0, '4541.654')] [2023-03-08 19:35:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000051384_26308608.pth... [2023-03-08 19:35:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000050768_25993216.pth [2023-03-08 19:35:22,128][547799] Updated weights for policy 0, policy_version 51440 (0.0005) [2023-03-08 19:35:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.0, 300 sec: 10510.8). Total num frames: 26361856. Throughput: 0: 10558.9. Samples: 26341392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:24,601][547459] Avg episode reward: [(0, '4155.858')] [2023-03-08 19:35:25,952][547799] Updated weights for policy 0, policy_version 51520 (0.0005) [2023-03-08 19:35:29,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 26415104. Throughput: 0: 10505.3. Samples: 26404140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:29,600][547459] Avg episode reward: [(0, '4429.827')] [2023-03-08 19:35:29,905][547799] Updated weights for policy 0, policy_version 51600 (0.0005) [2023-03-08 19:35:33,857][547799] Updated weights for policy 0, policy_version 51680 (0.0005) [2023-03-08 19:35:34,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 26464256. Throughput: 0: 10530.0. Samples: 26467392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:34,600][547459] Avg episode reward: [(0, '4446.114')] [2023-03-08 19:35:34,613][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000051696_26468352.pth... [2023-03-08 19:35:34,615][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000051080_26152960.pth [2023-03-08 19:35:37,639][547799] Updated weights for policy 0, policy_version 51760 (0.0005) [2023-03-08 19:35:39,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 26517504. Throughput: 0: 10526.3. Samples: 26500172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:39,600][547459] Avg episode reward: [(0, '4588.985')] [2023-03-08 19:35:41,701][547799] Updated weights for policy 0, policy_version 51840 (0.0005) [2023-03-08 19:35:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 26570752. Throughput: 0: 10523.3. Samples: 26562472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:44,600][547459] Avg episode reward: [(0, '4309.085')] [2023-03-08 19:35:45,553][547799] Updated weights for policy 0, policy_version 51920 (0.0005) [2023-03-08 19:35:49,391][547799] Updated weights for policy 0, policy_version 52000 (0.0005) [2023-03-08 19:35:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 26624000. Throughput: 0: 10499.4. Samples: 26625440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:49,600][547459] Avg episode reward: [(0, '4546.951')] [2023-03-08 19:35:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000052000_26624000.pth... [2023-03-08 19:35:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000051384_26308608.pth [2023-03-08 19:35:53,317][547799] Updated weights for policy 0, policy_version 52080 (0.0005) [2023-03-08 19:35:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 26677248. Throughput: 0: 10496.2. Samples: 26656780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:54,600][547459] Avg episode reward: [(0, '4235.596')] [2023-03-08 19:35:57,286][547799] Updated weights for policy 0, policy_version 52160 (0.0006) [2023-03-08 19:35:59,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 26726400. Throughput: 0: 10464.4. Samples: 26718388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:35:59,600][547459] Avg episode reward: [(0, '4301.548')] [2023-03-08 19:36:01,155][547799] Updated weights for policy 0, policy_version 52240 (0.0005) [2023-03-08 19:36:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 26779648. Throughput: 0: 10501.2. Samples: 26782792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:04,600][547459] Avg episode reward: [(0, '4171.446')] [2023-03-08 19:36:04,613][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000052312_26783744.pth... [2023-03-08 19:36:04,615][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000051696_26468352.pth [2023-03-08 19:36:05,060][547799] Updated weights for policy 0, policy_version 52320 (0.0005) [2023-03-08 19:36:08,892][547799] Updated weights for policy 0, policy_version 52400 (0.0005) [2023-03-08 19:36:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 26832896. Throughput: 0: 10506.1. Samples: 26814168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:09,600][547459] Avg episode reward: [(0, '4403.815')] [2023-03-08 19:36:12,719][547799] Updated weights for policy 0, policy_version 52480 (0.0005) [2023-03-08 19:36:14,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 26886144. Throughput: 0: 10530.6. Samples: 26878016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:14,600][547459] Avg episode reward: [(0, '4457.829')] [2023-03-08 19:36:16,546][547799] Updated weights for policy 0, policy_version 52560 (0.0005) [2023-03-08 19:36:19,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 26939392. Throughput: 0: 10560.0. Samples: 26942592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:19,600][547459] Avg episode reward: [(0, '3824.016')] [2023-03-08 19:36:19,623][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000052624_26943488.pth... [2023-03-08 19:36:19,625][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000052000_26624000.pth [2023-03-08 19:36:20,429][547799] Updated weights for policy 0, policy_version 52640 (0.0005) [2023-03-08 19:36:24,310][547799] Updated weights for policy 0, policy_version 52720 (0.0005) [2023-03-08 19:36:24,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 26992640. Throughput: 0: 10546.9. Samples: 26974784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:24,600][547459] Avg episode reward: [(0, '3945.460')] [2023-03-08 19:36:28,092][547799] Updated weights for policy 0, policy_version 52800 (0.0005) [2023-03-08 19:36:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 27045888. Throughput: 0: 10570.0. Samples: 27038120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:29,600][547459] Avg episode reward: [(0, '4417.842')] [2023-03-08 19:36:31,970][547799] Updated weights for policy 0, policy_version 52880 (0.0005) [2023-03-08 19:36:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 27099136. Throughput: 0: 10609.1. Samples: 27102848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:34,600][547459] Avg episode reward: [(0, '4189.954')] [2023-03-08 19:36:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000052928_27099136.pth... [2023-03-08 19:36:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000052312_26783744.pth [2023-03-08 19:36:35,714][547799] Updated weights for policy 0, policy_version 52960 (0.0005) [2023-03-08 19:36:39,597][547799] Updated weights for policy 0, policy_version 53040 (0.0005) [2023-03-08 19:36:39,599][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 27156480. Throughput: 0: 10628.3. Samples: 27135052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:39,600][547459] Avg episode reward: [(0, '4212.472')] [2023-03-08 19:36:43,466][547799] Updated weights for policy 0, policy_version 53120 (0.0005) [2023-03-08 19:36:44,599][547459] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 27209728. Throughput: 0: 10654.0. Samples: 27197816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:44,600][547459] Avg episode reward: [(0, '4192.770')] [2023-03-08 19:36:47,200][547799] Updated weights for policy 0, policy_version 53200 (0.0005) [2023-03-08 19:36:49,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 27262976. Throughput: 0: 10670.8. Samples: 27262976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:49,600][547459] Avg episode reward: [(0, '4531.575')] [2023-03-08 19:36:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000053248_27262976.pth... [2023-03-08 19:36:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000052624_26943488.pth [2023-03-08 19:36:51,120][547799] Updated weights for policy 0, policy_version 53280 (0.0006) [2023-03-08 19:36:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 27316224. Throughput: 0: 10682.8. Samples: 27294896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:54,600][547459] Avg episode reward: [(0, '4290.026')] [2023-03-08 19:36:54,980][547799] Updated weights for policy 0, policy_version 53360 (0.0005) [2023-03-08 19:36:58,887][547799] Updated weights for policy 0, policy_version 53440 (0.0005) [2023-03-08 19:36:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10510.8). Total num frames: 27369472. Throughput: 0: 10649.6. Samples: 27357248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:36:59,600][547459] Avg episode reward: [(0, '4604.799')] [2023-03-08 19:37:02,511][547799] Updated weights for policy 0, policy_version 53520 (0.0005) [2023-03-08 19:37:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10510.7). Total num frames: 27422720. Throughput: 0: 10682.9. Samples: 27423324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:37:04,601][547459] Avg episode reward: [(0, '4539.604')] [2023-03-08 19:37:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000053560_27422720.pth... [2023-03-08 19:37:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000052928_27099136.pth [2023-03-08 19:37:06,315][547799] Updated weights for policy 0, policy_version 53600 (0.0005) [2023-03-08 19:37:09,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 27475968. Throughput: 0: 10695.5. Samples: 27456080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:37:09,600][547459] Avg episode reward: [(0, '4683.153')] [2023-03-08 19:37:10,177][547799] Updated weights for policy 0, policy_version 53680 (0.0005) [2023-03-08 19:37:14,068][547799] Updated weights for policy 0, policy_version 53760 (0.0006) [2023-03-08 19:37:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 27529216. Throughput: 0: 10690.1. Samples: 27519176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:37:14,600][547459] Avg episode reward: [(0, '4707.958')] [2023-03-08 19:37:17,908][547799] Updated weights for policy 0, policy_version 53840 (0.0005) [2023-03-08 19:37:19,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 27582464. Throughput: 0: 10671.4. Samples: 27583060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:37:19,600][547459] Avg episode reward: [(0, '4391.337')] [2023-03-08 19:37:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000053872_27582464.pth... [2023-03-08 19:37:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000053248_27262976.pth [2023-03-08 19:37:21,750][547799] Updated weights for policy 0, policy_version 53920 (0.0005) [2023-03-08 19:37:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 27635712. Throughput: 0: 10674.4. Samples: 27615400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:37:24,600][547459] Avg episode reward: [(0, '4143.486')] [2023-03-08 19:37:25,608][547799] Updated weights for policy 0, policy_version 54000 (0.0005) [2023-03-08 19:37:29,398][547799] Updated weights for policy 0, policy_version 54080 (0.0005) [2023-03-08 19:37:29,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 27688960. Throughput: 0: 10723.7. Samples: 27680384. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:37:29,600][547459] Avg episode reward: [(0, '4433.176')] [2023-03-08 19:37:33,264][547799] Updated weights for policy 0, policy_version 54160 (0.0005) [2023-03-08 19:37:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 27742208. Throughput: 0: 10673.7. Samples: 27743296. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:37:34,600][547459] Avg episode reward: [(0, '4574.912')] [2023-03-08 19:37:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000054184_27742208.pth... [2023-03-08 19:37:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000053560_27422720.pth [2023-03-08 19:37:37,139][547799] Updated weights for policy 0, policy_version 54240 (0.0005) [2023-03-08 19:37:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10538.5). Total num frames: 27795456. Throughput: 0: 10668.6. Samples: 27774984. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:37:39,600][547459] Avg episode reward: [(0, '4666.185')] [2023-03-08 19:37:41,058][547799] Updated weights for policy 0, policy_version 54320 (0.0005) [2023-03-08 19:37:44,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 27848704. Throughput: 0: 10680.4. Samples: 27837868. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:37:44,600][547459] Avg episode reward: [(0, '4757.306')] [2023-03-08 19:37:44,946][547799] Updated weights for policy 0, policy_version 54400 (0.0005) [2023-03-08 19:37:48,681][547799] Updated weights for policy 0, policy_version 54480 (0.0005) [2023-03-08 19:37:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 27901952. Throughput: 0: 10671.6. Samples: 27903544. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:37:49,601][547459] Avg episode reward: [(0, '4675.011')] [2023-03-08 19:37:49,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000054496_27901952.pth... [2023-03-08 19:37:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000053872_27582464.pth [2023-03-08 19:37:52,526][547799] Updated weights for policy 0, policy_version 54560 (0.0006) [2023-03-08 19:37:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 27955200. Throughput: 0: 10637.9. Samples: 27934784. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:37:54,600][547459] Avg episode reward: [(0, '4728.049')] [2023-03-08 19:37:56,363][547799] Updated weights for policy 0, policy_version 54640 (0.0005) [2023-03-08 19:37:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 28008448. Throughput: 0: 10668.5. Samples: 27999260. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:37:59,600][547459] Avg episode reward: [(0, '4372.761')] [2023-03-08 19:38:00,284][547799] Updated weights for policy 0, policy_version 54720 (0.0005) [2023-03-08 19:38:04,126][547799] Updated weights for policy 0, policy_version 54800 (0.0005) [2023-03-08 19:38:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 28061696. Throughput: 0: 10638.1. Samples: 28061776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:38:04,600][547459] Avg episode reward: [(0, '4654.207')] [2023-03-08 19:38:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000054808_28061696.pth... [2023-03-08 19:38:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000054184_27742208.pth [2023-03-08 19:38:07,939][547799] Updated weights for policy 0, policy_version 54880 (0.0005) [2023-03-08 19:38:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 28114944. Throughput: 0: 10646.1. Samples: 28094476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:38:09,600][547459] Avg episode reward: [(0, '4616.782')] [2023-03-08 19:38:11,862][547799] Updated weights for policy 0, policy_version 54960 (0.0006) [2023-03-08 19:38:14,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 28168192. Throughput: 0: 10590.4. Samples: 28156952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:38:14,600][547459] Avg episode reward: [(0, '4577.673')] [2023-03-08 19:38:15,705][547799] Updated weights for policy 0, policy_version 55040 (0.0005) [2023-03-08 19:38:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 28217344. Throughput: 0: 10597.8. Samples: 28220196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:38:19,600][547459] Avg episode reward: [(0, '4393.113')] [2023-03-08 19:38:19,636][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000055120_28221440.pth... [2023-03-08 19:38:19,636][547799] Updated weights for policy 0, policy_version 55120 (0.0005) [2023-03-08 19:38:19,638][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000054496_27901952.pth [2023-03-08 19:38:23,671][547799] Updated weights for policy 0, policy_version 55200 (0.0005) [2023-03-08 19:38:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 28270592. Throughput: 0: 10572.9. Samples: 28250764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:38:24,601][547459] Avg episode reward: [(0, '4212.411')] [2023-03-08 19:38:27,396][547799] Updated weights for policy 0, policy_version 55280 (0.0006) [2023-03-08 19:38:29,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 28323840. Throughput: 0: 10618.9. Samples: 28315720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:38:29,600][547459] Avg episode reward: [(0, '4357.809')] [2023-03-08 19:38:31,312][547799] Updated weights for policy 0, policy_version 55360 (0.0005) [2023-03-08 19:38:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 28377088. Throughput: 0: 10542.2. Samples: 28377944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:38:34,600][547459] Avg episode reward: [(0, '4324.899')] [2023-03-08 19:38:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000055424_28377088.pth... [2023-03-08 19:38:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000054808_28061696.pth [2023-03-08 19:38:35,201][547799] Updated weights for policy 0, policy_version 55440 (0.0005) [2023-03-08 19:38:39,082][547799] Updated weights for policy 0, policy_version 55520 (0.0005) [2023-03-08 19:38:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 28430336. Throughput: 0: 10557.2. Samples: 28409860. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:38:39,600][547459] Avg episode reward: [(0, '4686.786')] [2023-03-08 19:38:42,912][547799] Updated weights for policy 0, policy_version 55600 (0.0005) [2023-03-08 19:38:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 28483584. Throughput: 0: 10546.1. Samples: 28473836. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:38:44,600][547459] Avg episode reward: [(0, '4168.805')] [2023-03-08 19:38:46,768][547799] Updated weights for policy 0, policy_version 55680 (0.0005) [2023-03-08 19:38:49,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 28536832. Throughput: 0: 10556.8. Samples: 28536832. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:38:49,600][547459] Avg episode reward: [(0, '4322.116')] [2023-03-08 19:38:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000055736_28536832.pth... [2023-03-08 19:38:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000055120_28221440.pth [2023-03-08 19:38:50,731][547799] Updated weights for policy 0, policy_version 55760 (0.0005) [2023-03-08 19:38:54,586][547799] Updated weights for policy 0, policy_version 55840 (0.0005) [2023-03-08 19:38:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 28590080. Throughput: 0: 10513.5. Samples: 28567584. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:38:54,601][547459] Avg episode reward: [(0, '4432.137')] [2023-03-08 19:38:58,475][547799] Updated weights for policy 0, policy_version 55920 (0.0005) [2023-03-08 19:38:59,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10566.3). Total num frames: 28639232. Throughput: 0: 10539.0. Samples: 28631208. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:38:59,600][547459] Avg episode reward: [(0, '4573.241')] [2023-03-08 19:39:02,314][547799] Updated weights for policy 0, policy_version 56000 (0.0005) [2023-03-08 19:39:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10566.3). Total num frames: 28692480. Throughput: 0: 10552.7. Samples: 28695068. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:39:04,600][547459] Avg episode reward: [(0, '4501.940')] [2023-03-08 19:39:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000056040_28692480.pth... [2023-03-08 19:39:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000055424_28377088.pth [2023-03-08 19:39:06,313][547799] Updated weights for policy 0, policy_version 56080 (0.0006) [2023-03-08 19:39:09,599][547459] Fps is (10 sec: 10649.8, 60 sec: 10513.1, 300 sec: 10566.3). Total num frames: 28745728. Throughput: 0: 10549.3. Samples: 28725480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:39:09,600][547459] Avg episode reward: [(0, '4239.160')] [2023-03-08 19:39:10,237][547799] Updated weights for policy 0, policy_version 56160 (0.0005) [2023-03-08 19:39:14,017][547799] Updated weights for policy 0, policy_version 56240 (0.0005) [2023-03-08 19:39:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10566.3). Total num frames: 28798976. Throughput: 0: 10551.1. Samples: 28790520. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:39:14,600][547459] Avg episode reward: [(0, '4688.550')] [2023-03-08 19:39:17,906][547799] Updated weights for policy 0, policy_version 56320 (0.0005) [2023-03-08 19:39:19,599][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10580.2). Total num frames: 28852224. Throughput: 0: 10542.0. Samples: 28852332. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:39:19,600][547459] Avg episode reward: [(0, '4737.523')] [2023-03-08 19:39:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000056352_28852224.pth... [2023-03-08 19:39:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000055736_28536832.pth [2023-03-08 19:39:21,793][547799] Updated weights for policy 0, policy_version 56400 (0.0005) [2023-03-08 19:39:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 28905472. Throughput: 0: 10557.1. Samples: 28884928. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:39:24,600][547459] Avg episode reward: [(0, '4676.940')] [2023-03-08 19:39:25,561][547799] Updated weights for policy 0, policy_version 56480 (0.0005) [2023-03-08 19:39:29,245][547799] Updated weights for policy 0, policy_version 56560 (0.0005) [2023-03-08 19:39:29,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 28958720. Throughput: 0: 10594.6. Samples: 28950592. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:39:29,600][547459] Avg episode reward: [(0, '4728.081')] [2023-03-08 19:39:33,082][547799] Updated weights for policy 0, policy_version 56640 (0.0005) [2023-03-08 19:39:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 29011968. Throughput: 0: 10646.6. Samples: 29015932. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:39:34,600][547459] Avg episode reward: [(0, '4827.165')] [2023-03-08 19:39:34,630][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000056672_29016064.pth... [2023-03-08 19:39:34,631][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000056040_28692480.pth [2023-03-08 19:39:36,914][547799] Updated weights for policy 0, policy_version 56720 (0.0005) [2023-03-08 19:39:39,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 29065216. Throughput: 0: 10664.1. Samples: 29047468. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:39:39,600][547459] Avg episode reward: [(0, '4573.468')] [2023-03-08 19:39:40,790][547799] Updated weights for policy 0, policy_version 56800 (0.0006) [2023-03-08 19:39:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 29118464. Throughput: 0: 10647.3. Samples: 29110336. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:39:44,600][547459] Avg episode reward: [(0, '4531.334')] [2023-03-08 19:39:44,735][547799] Updated weights for policy 0, policy_version 56880 (0.0006) [2023-03-08 19:39:48,606][547799] Updated weights for policy 0, policy_version 56960 (0.0004) [2023-03-08 19:39:49,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 29171712. Throughput: 0: 10640.6. Samples: 29173896. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:39:49,600][547459] Avg episode reward: [(0, '4459.649')] [2023-03-08 19:39:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000056976_29171712.pth... [2023-03-08 19:39:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000056352_28852224.pth [2023-03-08 19:39:52,272][547799] Updated weights for policy 0, policy_version 57040 (0.0005) [2023-03-08 19:39:54,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 29229056. Throughput: 0: 10720.6. Samples: 29207908. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:39:54,600][547459] Avg episode reward: [(0, '4679.842')] [2023-03-08 19:39:56,122][547799] Updated weights for policy 0, policy_version 57120 (0.0005) [2023-03-08 19:39:59,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 10607.9). Total num frames: 29282304. Throughput: 0: 10673.7. Samples: 29270836. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:39:59,600][547459] Avg episode reward: [(0, '4641.608')] [2023-03-08 19:39:59,931][547799] Updated weights for policy 0, policy_version 57200 (0.0005) [2023-03-08 19:40:03,677][547799] Updated weights for policy 0, policy_version 57280 (0.0005) [2023-03-08 19:40:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10607.9). Total num frames: 29335552. Throughput: 0: 10757.6. Samples: 29336424. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:40:04,600][547459] Avg episode reward: [(0, '4739.969')] [2023-03-08 19:40:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000057296_29335552.pth... [2023-03-08 19:40:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000056672_29016064.pth [2023-03-08 19:40:07,466][547799] Updated weights for policy 0, policy_version 57360 (0.0005) [2023-03-08 19:40:09,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 29388800. Throughput: 0: 10747.6. Samples: 29368568. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:40:09,600][547459] Avg episode reward: [(0, '4779.778')] [2023-03-08 19:40:11,372][547799] Updated weights for policy 0, policy_version 57440 (0.0006) [2023-03-08 19:40:14,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10621.8). Total num frames: 29442048. Throughput: 0: 10732.4. Samples: 29433548. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:40:14,600][547459] Avg episode reward: [(0, '4728.008')] [2023-03-08 19:40:15,059][547799] Updated weights for policy 0, policy_version 57520 (0.0005) [2023-03-08 19:40:18,812][547799] Updated weights for policy 0, policy_version 57600 (0.0005) [2023-03-08 19:40:19,600][547459] Fps is (10 sec: 11059.0, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 29499392. Throughput: 0: 10742.9. Samples: 29499364. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:40:19,600][547459] Avg episode reward: [(0, '4719.999')] [2023-03-08 19:40:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000057616_29499392.pth... [2023-03-08 19:40:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000056976_29171712.pth [2023-03-08 19:40:22,643][547799] Updated weights for policy 0, policy_version 57680 (0.0005) [2023-03-08 19:40:24,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10786.1, 300 sec: 10635.7). Total num frames: 29552640. Throughput: 0: 10758.2. Samples: 29531588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:40:24,600][547459] Avg episode reward: [(0, '4824.794')] [2023-03-08 19:40:26,426][547799] Updated weights for policy 0, policy_version 57760 (0.0005) [2023-03-08 19:40:29,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10786.1, 300 sec: 10649.6). Total num frames: 29605888. Throughput: 0: 10796.6. Samples: 29596184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:40:29,600][547459] Avg episode reward: [(0, '4821.773')] [2023-03-08 19:40:30,178][547799] Updated weights for policy 0, policy_version 57840 (0.0005) [2023-03-08 19:40:34,061][547799] Updated weights for policy 0, policy_version 57920 (0.0005) [2023-03-08 19:40:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10649.6). Total num frames: 29659136. Throughput: 0: 10790.9. Samples: 29659488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:40:34,600][547459] Avg episode reward: [(0, '4690.151')] [2023-03-08 19:40:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000057928_29659136.pth... [2023-03-08 19:40:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000057296_29335552.pth [2023-03-08 19:40:38,132][547799] Updated weights for policy 0, policy_version 58000 (0.0005) [2023-03-08 19:40:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10717.9, 300 sec: 10635.7). Total num frames: 29708288. Throughput: 0: 10709.4. Samples: 29689832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:40:39,600][547459] Avg episode reward: [(0, '4309.125')] [2023-03-08 19:40:41,992][547799] Updated weights for policy 0, policy_version 58080 (0.0006) [2023-03-08 19:40:44,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10717.9, 300 sec: 10635.7). Total num frames: 29761536. Throughput: 0: 10720.5. Samples: 29753256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:40:44,600][547459] Avg episode reward: [(0, '4713.893')] [2023-03-08 19:40:45,830][547799] Updated weights for policy 0, policy_version 58160 (0.0006) [2023-03-08 19:40:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.8, 300 sec: 10635.7). Total num frames: 29814784. Throughput: 0: 10663.2. Samples: 29816268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:40:49,600][547459] Avg episode reward: [(0, '4611.115')] [2023-03-08 19:40:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000058232_29814784.pth... [2023-03-08 19:40:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000057616_29499392.pth [2023-03-08 19:40:49,731][547799] Updated weights for policy 0, policy_version 58240 (0.0005) [2023-03-08 19:40:53,568][547799] Updated weights for policy 0, policy_version 58320 (0.0005) [2023-03-08 19:40:54,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 29868032. Throughput: 0: 10657.1. Samples: 29848136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:40:54,600][547459] Avg episode reward: [(0, '4769.737')] [2023-03-08 19:40:57,310][547799] Updated weights for policy 0, policy_version 58400 (0.0005) [2023-03-08 19:40:59,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10663.5). Total num frames: 29925376. Throughput: 0: 10668.5. Samples: 29913632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:40:59,601][547459] Avg episode reward: [(0, '4732.108')] [2023-03-08 19:41:01,135][547799] Updated weights for policy 0, policy_version 58480 (0.0005) [2023-03-08 19:41:04,600][547459] Fps is (10 sec: 11058.9, 60 sec: 10717.8, 300 sec: 10663.5). Total num frames: 29978624. Throughput: 0: 10651.6. Samples: 29978688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:04,602][547459] Avg episode reward: [(0, '4534.817')] [2023-03-08 19:41:04,607][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000058552_29978624.pth... [2023-03-08 19:41:04,610][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000057928_29659136.pth [2023-03-08 19:41:04,940][547799] Updated weights for policy 0, policy_version 58560 (0.0005) [2023-03-08 19:41:08,823][547799] Updated weights for policy 0, policy_version 58640 (0.0005) [2023-03-08 19:41:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.8, 300 sec: 10663.5). Total num frames: 30031872. Throughput: 0: 10661.1. Samples: 30011336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:09,600][547459] Avg episode reward: [(0, '4554.130')] [2023-03-08 19:41:12,725][547799] Updated weights for policy 0, policy_version 58720 (0.0005) [2023-03-08 19:41:14,599][547459] Fps is (10 sec: 10240.3, 60 sec: 10649.6, 300 sec: 10649.6). Total num frames: 30081024. Throughput: 0: 10596.0. Samples: 30073004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:14,600][547459] Avg episode reward: [(0, '4705.424')] [2023-03-08 19:41:16,598][547799] Updated weights for policy 0, policy_version 58800 (0.0005) [2023-03-08 19:41:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 30134272. Throughput: 0: 10616.9. Samples: 30137248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:19,600][547459] Avg episode reward: [(0, '4578.452')] [2023-03-08 19:41:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000058856_30134272.pth... [2023-03-08 19:41:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000058232_29814784.pth [2023-03-08 19:41:20,469][547799] Updated weights for policy 0, policy_version 58880 (0.0005) [2023-03-08 19:41:24,232][547799] Updated weights for policy 0, policy_version 58960 (0.0005) [2023-03-08 19:41:24,599][547459] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10663.5). Total num frames: 30191616. Throughput: 0: 10648.9. Samples: 30169032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:24,600][547459] Avg episode reward: [(0, '4730.146')] [2023-03-08 19:41:28,060][547799] Updated weights for policy 0, policy_version 59040 (0.0005) [2023-03-08 19:41:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10649.6). Total num frames: 30240768. Throughput: 0: 10672.4. Samples: 30233516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:29,600][547459] Avg episode reward: [(0, '4812.314')] [2023-03-08 19:41:31,936][547799] Updated weights for policy 0, policy_version 59120 (0.0005) [2023-03-08 19:41:34,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10635.7). Total num frames: 30294016. Throughput: 0: 10705.8. Samples: 30298028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:34,600][547459] Avg episode reward: [(0, '4811.693')] [2023-03-08 19:41:34,607][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000059176_30298112.pth... [2023-03-08 19:41:34,609][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000058552_29978624.pth [2023-03-08 19:41:35,787][547799] Updated weights for policy 0, policy_version 59200 (0.0005) [2023-03-08 19:41:39,520][547799] Updated weights for policy 0, policy_version 59280 (0.0005) [2023-03-08 19:41:39,600][547459] Fps is (10 sec: 11059.3, 60 sec: 10717.9, 300 sec: 10649.6). Total num frames: 30351360. Throughput: 0: 10692.5. Samples: 30329300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:39,600][547459] Avg episode reward: [(0, '4812.966')] [2023-03-08 19:41:43,527][547799] Updated weights for policy 0, policy_version 59360 (0.0005) [2023-03-08 19:41:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 30400512. Throughput: 0: 10638.9. Samples: 30392384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:44,600][547459] Avg episode reward: [(0, '4730.897')] [2023-03-08 19:41:47,474][547799] Updated weights for policy 0, policy_version 59440 (0.0005) [2023-03-08 19:41:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 30453760. Throughput: 0: 10569.3. Samples: 30454304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:49,600][547459] Avg episode reward: [(0, '4637.600')] [2023-03-08 19:41:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000059480_30453760.pth... [2023-03-08 19:41:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000058856_30134272.pth [2023-03-08 19:41:51,363][547799] Updated weights for policy 0, policy_version 59520 (0.0005) [2023-03-08 19:41:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10635.7). Total num frames: 30507008. Throughput: 0: 10557.4. Samples: 30486416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:54,600][547459] Avg episode reward: [(0, '4447.665')] [2023-03-08 19:41:55,353][547799] Updated weights for policy 0, policy_version 59600 (0.0005) [2023-03-08 19:41:59,273][547799] Updated weights for policy 0, policy_version 59680 (0.0005) [2023-03-08 19:41:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 30556160. Throughput: 0: 10556.2. Samples: 30548032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:41:59,600][547459] Avg episode reward: [(0, '4545.838')] [2023-03-08 19:42:03,237][547799] Updated weights for policy 0, policy_version 59760 (0.0005) [2023-03-08 19:42:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 30609408. Throughput: 0: 10494.1. Samples: 30609480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:42:04,600][547459] Avg episode reward: [(0, '4633.164')] [2023-03-08 19:42:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000059784_30609408.pth... [2023-03-08 19:42:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000059176_30298112.pth [2023-03-08 19:42:07,411][547799] Updated weights for policy 0, policy_version 59840 (0.0005) [2023-03-08 19:42:09,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10607.9). Total num frames: 30658560. Throughput: 0: 10445.5. Samples: 30639080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:42:09,600][547459] Avg episode reward: [(0, '3496.166')] [2023-03-08 19:42:11,332][547799] Updated weights for policy 0, policy_version 59920 (0.0005) [2023-03-08 19:42:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 30711808. Throughput: 0: 10412.0. Samples: 30702056. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:42:14,600][547459] Avg episode reward: [(0, '4638.687')] [2023-03-08 19:42:15,231][547799] Updated weights for policy 0, policy_version 60000 (0.0005) [2023-03-08 19:42:19,145][547799] Updated weights for policy 0, policy_version 60080 (0.0005) [2023-03-08 19:42:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 30765056. Throughput: 0: 10377.9. Samples: 30765032. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:42:19,600][547459] Avg episode reward: [(0, '4435.861')] [2023-03-08 19:42:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000060088_30765056.pth... [2023-03-08 19:42:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000059480_30453760.pth [2023-03-08 19:42:23,176][547799] Updated weights for policy 0, policy_version 60160 (0.0005) [2023-03-08 19:42:24,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10594.1). Total num frames: 30814208. Throughput: 0: 10350.6. Samples: 30795076. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:42:24,600][547459] Avg episode reward: [(0, '4299.632')] [2023-03-08 19:42:27,175][547799] Updated weights for policy 0, policy_version 60240 (0.0005) [2023-03-08 19:42:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 30867456. Throughput: 0: 10346.8. Samples: 30857992. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:42:29,601][547459] Avg episode reward: [(0, '4584.194')] [2023-03-08 19:42:30,998][547799] Updated weights for policy 0, policy_version 60320 (0.0005) [2023-03-08 19:42:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 30920704. Throughput: 0: 10373.2. Samples: 30921096. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:42:34,611][547459] Avg episode reward: [(0, '4756.644')] [2023-03-08 19:42:34,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000060392_30920704.pth... [2023-03-08 19:42:34,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000059784_30609408.pth [2023-03-08 19:42:34,849][547799] Updated weights for policy 0, policy_version 60400 (0.0005) [2023-03-08 19:42:38,750][547799] Updated weights for policy 0, policy_version 60480 (0.0005) [2023-03-08 19:42:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10594.1). Total num frames: 30973952. Throughput: 0: 10374.3. Samples: 30953260. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:42:39,611][547459] Avg episode reward: [(0, '4808.088')] [2023-03-08 19:42:42,493][547799] Updated weights for policy 0, policy_version 60560 (0.0005) [2023-03-08 19:42:44,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10594.1). Total num frames: 31027200. Throughput: 0: 10451.3. Samples: 31018340. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:42:44,611][547459] Avg episode reward: [(0, '4564.632')] [2023-03-08 19:42:46,513][547799] Updated weights for policy 0, policy_version 60640 (0.0005) [2023-03-08 19:42:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10580.2). Total num frames: 31076352. Throughput: 0: 10415.0. Samples: 31078156. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:42:49,611][547459] Avg episode reward: [(0, '4183.240')] [2023-03-08 19:42:49,615][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000060696_31076352.pth... [2023-03-08 19:42:49,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000060088_30765056.pth [2023-03-08 19:42:50,541][547799] Updated weights for policy 0, policy_version 60720 (0.0005) [2023-03-08 19:42:54,433][547799] Updated weights for policy 0, policy_version 60800 (0.0005) [2023-03-08 19:42:54,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10580.2). Total num frames: 31129600. Throughput: 0: 10446.0. Samples: 31109152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:42:54,611][547459] Avg episode reward: [(0, '4503.801')] [2023-03-08 19:42:58,434][547799] Updated weights for policy 0, policy_version 60880 (0.0005) [2023-03-08 19:42:59,599][547459] Fps is (10 sec: 10240.2, 60 sec: 10376.5, 300 sec: 10566.3). Total num frames: 31178752. Throughput: 0: 10427.2. Samples: 31171280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:42:59,611][547459] Avg episode reward: [(0, '4459.366')] [2023-03-08 19:43:02,315][547799] Updated weights for policy 0, policy_version 60960 (0.0005) [2023-03-08 19:43:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10566.3). Total num frames: 31232000. Throughput: 0: 10460.9. Samples: 31235772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:04,611][547459] Avg episode reward: [(0, '4610.431')] [2023-03-08 19:43:04,645][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000061008_31236096.pth... [2023-03-08 19:43:04,647][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000060392_30920704.pth [2023-03-08 19:43:06,267][547799] Updated weights for policy 0, policy_version 61040 (0.0005) [2023-03-08 19:43:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 31285248. Throughput: 0: 10460.1. Samples: 31265780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:09,600][547459] Avg episode reward: [(0, '4709.861')] [2023-03-08 19:43:10,075][547799] Updated weights for policy 0, policy_version 61120 (0.0005) [2023-03-08 19:43:13,843][547799] Updated weights for policy 0, policy_version 61200 (0.0005) [2023-03-08 19:43:14,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10594.1). Total num frames: 31342592. Throughput: 0: 10498.8. Samples: 31330436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:14,600][547459] Avg episode reward: [(0, '4815.912')] [2023-03-08 19:43:17,681][547799] Updated weights for policy 0, policy_version 61280 (0.0005) [2023-03-08 19:43:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10580.2). Total num frames: 31391744. Throughput: 0: 10537.5. Samples: 31395284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:19,600][547459] Avg episode reward: [(0, '4817.227')] [2023-03-08 19:43:19,642][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000061320_31395840.pth... [2023-03-08 19:43:19,644][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000060696_31076352.pth [2023-03-08 19:43:21,570][547799] Updated weights for policy 0, policy_version 61360 (0.0005) [2023-03-08 19:43:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10580.2). Total num frames: 31444992. Throughput: 0: 10497.4. Samples: 31425644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:24,600][547459] Avg episode reward: [(0, '4496.633')] [2023-03-08 19:43:25,601][547799] Updated weights for policy 0, policy_version 61440 (0.0005) [2023-03-08 19:43:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 31494144. Throughput: 0: 10391.4. Samples: 31485952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:29,600][547459] Avg episode reward: [(0, '4079.993')] [2023-03-08 19:43:29,765][547799] Updated weights for policy 0, policy_version 61520 (0.0005) [2023-03-08 19:43:33,738][547799] Updated weights for policy 0, policy_version 61600 (0.0005) [2023-03-08 19:43:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 31547392. Throughput: 0: 10427.7. Samples: 31547400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:34,600][547459] Avg episode reward: [(0, '4633.731')] [2023-03-08 19:43:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000061616_31547392.pth... [2023-03-08 19:43:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000061008_31236096.pth [2023-03-08 19:43:37,661][547799] Updated weights for policy 0, policy_version 61680 (0.0005) [2023-03-08 19:43:39,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10552.4). Total num frames: 31596544. Throughput: 0: 10446.5. Samples: 31579244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:39,600][547459] Avg episode reward: [(0, '4733.668')] [2023-03-08 19:43:41,616][547799] Updated weights for policy 0, policy_version 61760 (0.0005) [2023-03-08 19:43:44,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10552.4). Total num frames: 31649792. Throughput: 0: 10459.3. Samples: 31641948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:44,600][547459] Avg episode reward: [(0, '4739.974')] [2023-03-08 19:43:45,449][547799] Updated weights for policy 0, policy_version 61840 (0.0005) [2023-03-08 19:43:49,277][547799] Updated weights for policy 0, policy_version 61920 (0.0005) [2023-03-08 19:43:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 31703040. Throughput: 0: 10456.1. Samples: 31706296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:49,600][547459] Avg episode reward: [(0, '4695.594')] [2023-03-08 19:43:49,652][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000061928_31707136.pth... [2023-03-08 19:43:49,654][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000061320_31395840.pth [2023-03-08 19:43:53,315][547799] Updated weights for policy 0, policy_version 62000 (0.0006) [2023-03-08 19:43:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 31756288. Throughput: 0: 10450.2. Samples: 31736040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:54,600][547459] Avg episode reward: [(0, '4643.421')] [2023-03-08 19:43:57,261][547799] Updated weights for policy 0, policy_version 62080 (0.0005) [2023-03-08 19:43:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 31805440. Throughput: 0: 10386.6. Samples: 31797832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:43:59,600][547459] Avg episode reward: [(0, '4437.430')] [2023-03-08 19:44:01,226][547799] Updated weights for policy 0, policy_version 62160 (0.0005) [2023-03-08 19:44:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 31858688. Throughput: 0: 10314.3. Samples: 31859428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:44:04,600][547459] Avg episode reward: [(0, '4279.237')] [2023-03-08 19:44:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000062224_31858688.pth... [2023-03-08 19:44:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000061616_31547392.pth [2023-03-08 19:44:05,209][547799] Updated weights for policy 0, policy_version 62240 (0.0005) [2023-03-08 19:44:09,113][547799] Updated weights for policy 0, policy_version 62320 (0.0005) [2023-03-08 19:44:09,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 31911936. Throughput: 0: 10352.8. Samples: 31891520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:44:09,600][547459] Avg episode reward: [(0, '4370.195')] [2023-03-08 19:44:12,933][547799] Updated weights for policy 0, policy_version 62400 (0.0006) [2023-03-08 19:44:14,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10538.5). Total num frames: 31961088. Throughput: 0: 10423.8. Samples: 31955024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:44:14,600][547459] Avg episode reward: [(0, '4589.994')] [2023-03-08 19:44:16,966][547799] Updated weights for policy 0, policy_version 62480 (0.0005) [2023-03-08 19:44:19,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10538.5). Total num frames: 32014336. Throughput: 0: 10408.5. Samples: 32015784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:44:19,600][547459] Avg episode reward: [(0, '3633.063')] [2023-03-08 19:44:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000062528_32014336.pth... [2023-03-08 19:44:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000061928_31707136.pth [2023-03-08 19:44:20,981][547799] Updated weights for policy 0, policy_version 62560 (0.0005) [2023-03-08 19:44:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10524.6). Total num frames: 32063488. Throughput: 0: 10378.1. Samples: 32046260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:44:24,600][547459] Avg episode reward: [(0, '3691.493')] [2023-03-08 19:44:25,084][547799] Updated weights for policy 0, policy_version 62640 (0.0005) [2023-03-08 19:44:29,262][547799] Updated weights for policy 0, policy_version 62720 (0.0006) [2023-03-08 19:44:29,600][547459] Fps is (10 sec: 9830.5, 60 sec: 10308.3, 300 sec: 10510.8). Total num frames: 32112640. Throughput: 0: 10301.5. Samples: 32105516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:44:29,600][547459] Avg episode reward: [(0, '3179.328')] [2023-03-08 19:44:33,249][547799] Updated weights for policy 0, policy_version 62800 (0.0005) [2023-03-08 19:44:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10510.7). Total num frames: 32165888. Throughput: 0: 10229.4. Samples: 32166620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:44:34,600][547459] Avg episode reward: [(0, '3692.147')] [2023-03-08 19:44:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000062824_32165888.pth... [2023-03-08 19:44:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000062224_31858688.pth [2023-03-08 19:44:37,209][547799] Updated weights for policy 0, policy_version 62880 (0.0005) [2023-03-08 19:44:39,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10496.9). Total num frames: 32215040. Throughput: 0: 10276.6. Samples: 32198488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:44:39,600][547459] Avg episode reward: [(0, '4211.154')] [2023-03-08 19:44:41,421][547799] Updated weights for policy 0, policy_version 62960 (0.0005) [2023-03-08 19:44:44,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10483.0). Total num frames: 32264192. Throughput: 0: 10181.7. Samples: 32256008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:44:44,600][547459] Avg episode reward: [(0, '3123.859')] [2023-03-08 19:44:45,745][547799] Updated weights for policy 0, policy_version 63040 (0.0005) [2023-03-08 19:44:49,600][547459] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10455.2). Total num frames: 32313344. Throughput: 0: 10119.7. Samples: 32314812. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:44:49,600][547459] Avg episode reward: [(0, '3565.087')] [2023-03-08 19:44:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000063112_32313344.pth... [2023-03-08 19:44:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000062528_32014336.pth [2023-03-08 19:44:49,838][547799] Updated weights for policy 0, policy_version 63120 (0.0006) [2023-03-08 19:44:54,056][547799] Updated weights for policy 0, policy_version 63200 (0.0005) [2023-03-08 19:44:54,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10441.3). Total num frames: 32362496. Throughput: 0: 10066.4. Samples: 32344508. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:44:54,601][547459] Avg episode reward: [(0, '3620.699')] [2023-03-08 19:44:58,198][547799] Updated weights for policy 0, policy_version 63280 (0.0005) [2023-03-08 19:44:59,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10427.4). Total num frames: 32411648. Throughput: 0: 9958.0. Samples: 32403136. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:44:59,600][547459] Avg episode reward: [(0, '3680.574')] [2023-03-08 19:45:02,394][547799] Updated weights for policy 0, policy_version 63360 (0.0005) [2023-03-08 19:45:04,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10413.6). Total num frames: 32460800. Throughput: 0: 9890.7. Samples: 32460864. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:45:04,600][547459] Avg episode reward: [(0, '3381.239')] [2023-03-08 19:45:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000063400_32460800.pth... [2023-03-08 19:45:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000062824_32165888.pth [2023-03-08 19:45:06,586][547799] Updated weights for policy 0, policy_version 63440 (0.0005) [2023-03-08 19:45:09,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10399.7). Total num frames: 32509952. Throughput: 0: 9881.8. Samples: 32490940. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:45:09,600][547459] Avg episode reward: [(0, '3614.365')] [2023-03-08 19:45:10,774][547799] Updated weights for policy 0, policy_version 63520 (0.0005) [2023-03-08 19:45:14,599][547459] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10371.9). Total num frames: 32559104. Throughput: 0: 9856.8. Samples: 32549072. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:45:14,600][547459] Avg episode reward: [(0, '3746.362')] [2023-03-08 19:45:14,891][547799] Updated weights for policy 0, policy_version 63600 (0.0006) [2023-03-08 19:45:19,064][547799] Updated weights for policy 0, policy_version 63680 (0.0005) [2023-03-08 19:45:19,599][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10358.0). Total num frames: 32608256. Throughput: 0: 9815.6. Samples: 32608320. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:45:19,600][547459] Avg episode reward: [(0, '3636.327')] [2023-03-08 19:45:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000063688_32608256.pth... [2023-03-08 19:45:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000063112_32313344.pth [2023-03-08 19:45:23,158][547799] Updated weights for policy 0, policy_version 63760 (0.0005) [2023-03-08 19:45:24,600][547459] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10344.1). Total num frames: 32657408. Throughput: 0: 9781.2. Samples: 32638644. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:45:24,600][547459] Avg episode reward: [(0, '3753.950')] [2023-03-08 19:45:27,383][547799] Updated weights for policy 0, policy_version 63840 (0.0005) [2023-03-08 19:45:29,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10330.3). Total num frames: 32706560. Throughput: 0: 9806.0. Samples: 32697280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:45:29,600][547459] Avg episode reward: [(0, '3357.893')] [2023-03-08 19:45:31,613][547799] Updated weights for policy 0, policy_version 63920 (0.0005) [2023-03-08 19:45:34,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10330.2). Total num frames: 32755712. Throughput: 0: 9799.2. Samples: 32755776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:45:34,600][547459] Avg episode reward: [(0, '2848.177')] [2023-03-08 19:45:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000063976_32755712.pth... [2023-03-08 19:45:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000063400_32460800.pth [2023-03-08 19:45:35,699][547799] Updated weights for policy 0, policy_version 64000 (0.0006) [2023-03-08 19:45:39,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10316.4). Total num frames: 32804864. Throughput: 0: 9790.4. Samples: 32785076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:45:39,600][547459] Avg episode reward: [(0, '3658.818')] [2023-03-08 19:45:39,980][547799] Updated weights for policy 0, policy_version 64080 (0.0005) [2023-03-08 19:45:44,244][547799] Updated weights for policy 0, policy_version 64160 (0.0005) [2023-03-08 19:45:44,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10288.6). Total num frames: 32849920. Throughput: 0: 9771.2. Samples: 32842840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:45:44,600][547459] Avg episode reward: [(0, '3560.987')] [2023-03-08 19:45:48,409][547799] Updated weights for policy 0, policy_version 64240 (0.0005) [2023-03-08 19:45:49,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10274.7). Total num frames: 32899072. Throughput: 0: 9811.0. Samples: 32902360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:45:49,600][547459] Avg episode reward: [(0, '4194.512')] [2023-03-08 19:45:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000064256_32899072.pth... [2023-03-08 19:45:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000063688_32608256.pth [2023-03-08 19:45:52,472][547799] Updated weights for policy 0, policy_version 64320 (0.0005) [2023-03-08 19:45:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10260.8). Total num frames: 32952320. Throughput: 0: 9801.7. Samples: 32932016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:45:54,600][547459] Avg episode reward: [(0, '3887.877')] [2023-03-08 19:45:56,621][547799] Updated weights for policy 0, policy_version 64400 (0.0005) [2023-03-08 19:45:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 10246.9). Total num frames: 33001472. Throughput: 0: 9856.0. Samples: 32992592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:45:59,600][547459] Avg episode reward: [(0, '3482.810')] [2023-03-08 19:46:00,708][547799] Updated weights for policy 0, policy_version 64480 (0.0005) [2023-03-08 19:46:04,599][547459] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10233.1). Total num frames: 33050624. Throughput: 0: 9882.4. Samples: 33053028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:04,600][547459] Avg episode reward: [(0, '3830.761')] [2023-03-08 19:46:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000064552_33050624.pth... [2023-03-08 19:46:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000063976_32755712.pth [2023-03-08 19:46:04,730][547799] Updated weights for policy 0, policy_version 64560 (0.0005) [2023-03-08 19:46:09,014][547799] Updated weights for policy 0, policy_version 64640 (0.0005) [2023-03-08 19:46:09,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10233.1). Total num frames: 33099776. Throughput: 0: 9851.5. Samples: 33081960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:09,600][547459] Avg episode reward: [(0, '3650.472')] [2023-03-08 19:46:13,158][547799] Updated weights for policy 0, policy_version 64720 (0.0005) [2023-03-08 19:46:14,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10219.2). Total num frames: 33148928. Throughput: 0: 9853.2. Samples: 33140672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:14,600][547459] Avg episode reward: [(0, '3117.634')] [2023-03-08 19:46:17,371][547799] Updated weights for policy 0, policy_version 64800 (0.0006) [2023-03-08 19:46:19,599][547459] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10191.4). Total num frames: 33198080. Throughput: 0: 9834.8. Samples: 33198340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:19,600][547459] Avg episode reward: [(0, '3341.481')] [2023-03-08 19:46:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000064840_33198080.pth... [2023-03-08 19:46:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000064256_32899072.pth [2023-03-08 19:46:21,509][547799] Updated weights for policy 0, policy_version 64880 (0.0005) [2023-03-08 19:46:24,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10191.4). Total num frames: 33247232. Throughput: 0: 9865.0. Samples: 33229000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:24,600][547459] Avg episode reward: [(0, '3172.034')] [2023-03-08 19:46:25,571][547799] Updated weights for policy 0, policy_version 64960 (0.0005) [2023-03-08 19:46:29,600][547459] Fps is (10 sec: 9830.2, 60 sec: 9830.4, 300 sec: 10177.5). Total num frames: 33296384. Throughput: 0: 9919.5. Samples: 33289220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:29,601][547459] Avg episode reward: [(0, '3545.168')] [2023-03-08 19:46:29,763][547799] Updated weights for policy 0, policy_version 65040 (0.0006) [2023-03-08 19:46:34,045][547799] Updated weights for policy 0, policy_version 65120 (0.0005) [2023-03-08 19:46:34,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10149.7). Total num frames: 33345536. Throughput: 0: 9852.3. Samples: 33345712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:34,601][547459] Avg episode reward: [(0, '3099.969')] [2023-03-08 19:46:34,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000065128_33345536.pth... [2023-03-08 19:46:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000064552_33050624.pth [2023-03-08 19:46:38,166][547799] Updated weights for policy 0, policy_version 65200 (0.0005) [2023-03-08 19:46:39,600][547459] Fps is (10 sec: 9830.6, 60 sec: 9830.4, 300 sec: 10149.7). Total num frames: 33394688. Throughput: 0: 9871.4. Samples: 33376228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:39,600][547459] Avg episode reward: [(0, '3130.638')] [2023-03-08 19:46:42,226][547799] Updated weights for policy 0, policy_version 65280 (0.0005) [2023-03-08 19:46:44,600][547459] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10135.9). Total num frames: 33443840. Throughput: 0: 9847.3. Samples: 33435720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:44,600][547459] Avg episode reward: [(0, '3528.664')] [2023-03-08 19:46:46,489][547799] Updated weights for policy 0, policy_version 65360 (0.0005) [2023-03-08 19:46:49,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10122.0). Total num frames: 33492992. Throughput: 0: 9822.6. Samples: 33495044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:49,600][547459] Avg episode reward: [(0, '3564.293')] [2023-03-08 19:46:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000065416_33492992.pth... [2023-03-08 19:46:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000064840_33198080.pth [2023-03-08 19:46:50,597][547799] Updated weights for policy 0, policy_version 65440 (0.0006) [2023-03-08 19:46:54,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10122.0). Total num frames: 33542144. Throughput: 0: 9850.3. Samples: 33525224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:54,600][547459] Avg episode reward: [(0, '3795.550')] [2023-03-08 19:46:54,718][547799] Updated weights for policy 0, policy_version 65520 (0.0005) [2023-03-08 19:46:58,920][547799] Updated weights for policy 0, policy_version 65600 (0.0005) [2023-03-08 19:46:59,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10108.1). Total num frames: 33591296. Throughput: 0: 9838.5. Samples: 33583404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:46:59,600][547459] Avg episode reward: [(0, '3514.333')] [2023-03-08 19:47:03,214][547799] Updated weights for policy 0, policy_version 65680 (0.0005) [2023-03-08 19:47:04,600][547459] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10108.1). Total num frames: 33640448. Throughput: 0: 9832.5. Samples: 33640804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:47:04,600][547459] Avg episode reward: [(0, '3203.265')] [2023-03-08 19:47:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000065704_33640448.pth... [2023-03-08 19:47:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000065128_33345536.pth [2023-03-08 19:47:07,398][547799] Updated weights for policy 0, policy_version 65760 (0.0005) [2023-03-08 19:47:09,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10094.2). Total num frames: 33689600. Throughput: 0: 9805.8. Samples: 33670260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:47:09,600][547459] Avg episode reward: [(0, '2958.131')] [2023-03-08 19:47:11,482][547799] Updated weights for policy 0, policy_version 65840 (0.0005) [2023-03-08 19:47:14,599][547459] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 10080.3). Total num frames: 33738752. Throughput: 0: 9809.2. Samples: 33730632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:47:14,600][547459] Avg episode reward: [(0, '3544.382')] [2023-03-08 19:47:15,547][547799] Updated weights for policy 0, policy_version 65920 (0.0005) [2023-03-08 19:47:19,569][547799] Updated weights for policy 0, policy_version 66000 (0.0005) [2023-03-08 19:47:19,600][547459] Fps is (10 sec: 10239.8, 60 sec: 9898.6, 300 sec: 10094.2). Total num frames: 33792000. Throughput: 0: 9916.8. Samples: 33791968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:47:19,601][547459] Avg episode reward: [(0, '3248.164')] [2023-03-08 19:47:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000066000_33792000.pth... [2023-03-08 19:47:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000065416_33492992.pth [2023-03-08 19:47:23,603][547799] Updated weights for policy 0, policy_version 66080 (0.0005) [2023-03-08 19:47:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10080.3). Total num frames: 33841152. Throughput: 0: 9917.4. Samples: 33822512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:47:24,600][547459] Avg episode reward: [(0, '3879.810')] [2023-03-08 19:47:27,809][547799] Updated weights for policy 0, policy_version 66160 (0.0005) [2023-03-08 19:47:29,600][547459] Fps is (10 sec: 9830.6, 60 sec: 9898.7, 300 sec: 10066.4). Total num frames: 33890304. Throughput: 0: 9913.7. Samples: 33881836. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:47:29,600][547459] Avg episode reward: [(0, '3843.841')] [2023-03-08 19:47:31,922][547799] Updated weights for policy 0, policy_version 66240 (0.0005) [2023-03-08 19:47:34,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10052.6). Total num frames: 33939456. Throughput: 0: 9905.1. Samples: 33940776. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:47:34,600][547459] Avg episode reward: [(0, '3179.871')] [2023-03-08 19:47:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000066288_33939456.pth... [2023-03-08 19:47:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000065704_33640448.pth [2023-03-08 19:47:36,046][547799] Updated weights for policy 0, policy_version 66320 (0.0006) [2023-03-08 19:47:39,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 33988608. Throughput: 0: 9918.2. Samples: 33971544. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:47:39,600][547459] Avg episode reward: [(0, '3435.103')] [2023-03-08 19:47:40,142][547799] Updated weights for policy 0, policy_version 66400 (0.0005) [2023-03-08 19:47:44,349][547799] Updated weights for policy 0, policy_version 66480 (0.0005) [2023-03-08 19:47:44,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 34037760. Throughput: 0: 9916.4. Samples: 34029640. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:47:44,600][547459] Avg episode reward: [(0, '3601.206')] [2023-03-08 19:47:48,574][547799] Updated weights for policy 0, policy_version 66560 (0.0005) [2023-03-08 19:47:49,599][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 34086912. Throughput: 0: 9920.5. Samples: 34087224. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:47:49,600][547459] Avg episode reward: [(0, '3777.901')] [2023-03-08 19:47:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000066576_34086912.pth... [2023-03-08 19:47:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000066000_33792000.pth [2023-03-08 19:47:52,853][547799] Updated weights for policy 0, policy_version 66640 (0.0005) [2023-03-08 19:47:54,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 34136064. Throughput: 0: 9901.9. Samples: 34115844. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:47:54,600][547459] Avg episode reward: [(0, '3746.359')] [2023-03-08 19:47:57,034][547799] Updated weights for policy 0, policy_version 66720 (0.0006) [2023-03-08 19:47:59,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10010.9). Total num frames: 34185216. Throughput: 0: 9891.9. Samples: 34175768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:47:59,600][547459] Avg episode reward: [(0, '3633.577')] [2023-03-08 19:48:01,222][547799] Updated weights for policy 0, policy_version 66800 (0.0005) [2023-03-08 19:48:04,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9983.1). Total num frames: 34230272. Throughput: 0: 9798.1. Samples: 34232880. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 19:48:04,600][547459] Avg episode reward: [(0, '3473.059')] [2023-03-08 19:48:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000066856_34230272.pth... [2023-03-08 19:48:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000066288_33939456.pth [2023-03-08 19:48:05,552][547799] Updated weights for policy 0, policy_version 66880 (0.0005) [2023-03-08 19:48:09,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 34279424. Throughput: 0: 9757.2. Samples: 34261584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:09,600][547459] Avg episode reward: [(0, '3484.437')] [2023-03-08 19:48:09,900][547799] Updated weights for policy 0, policy_version 66960 (0.0005) [2023-03-08 19:48:14,134][547799] Updated weights for policy 0, policy_version 67040 (0.0005) [2023-03-08 19:48:14,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 34328576. Throughput: 0: 9716.0. Samples: 34319056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:14,601][547459] Avg episode reward: [(0, '3763.934')] [2023-03-08 19:48:18,288][547799] Updated weights for policy 0, policy_version 67120 (0.0005) [2023-03-08 19:48:19,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9762.2, 300 sec: 9941.5). Total num frames: 34377728. Throughput: 0: 9710.0. Samples: 34377728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:19,600][547459] Avg episode reward: [(0, '3653.610')] [2023-03-08 19:48:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000067144_34377728.pth... [2023-03-08 19:48:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000066576_34086912.pth [2023-03-08 19:48:22,541][547799] Updated weights for policy 0, policy_version 67200 (0.0005) [2023-03-08 19:48:24,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 34426880. Throughput: 0: 9663.6. Samples: 34406408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:24,600][547459] Avg episode reward: [(0, '3659.937')] [2023-03-08 19:48:26,599][547799] Updated weights for policy 0, policy_version 67280 (0.0005) [2023-03-08 19:48:29,599][547459] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 34476032. Throughput: 0: 9719.7. Samples: 34467024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:29,600][547459] Avg episode reward: [(0, '4013.265')] [2023-03-08 19:48:30,688][547799] Updated weights for policy 0, policy_version 67360 (0.0005) [2023-03-08 19:48:34,600][547459] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 34525184. Throughput: 0: 9733.8. Samples: 34525248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:34,601][547459] Avg episode reward: [(0, '4287.510')] [2023-03-08 19:48:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000067432_34525184.pth... [2023-03-08 19:48:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000066856_34230272.pth [2023-03-08 19:48:34,934][547799] Updated weights for policy 0, policy_version 67440 (0.0005) [2023-03-08 19:48:39,124][547799] Updated weights for policy 0, policy_version 67520 (0.0005) [2023-03-08 19:48:39,600][547459] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 34574336. Throughput: 0: 9762.5. Samples: 34555156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:39,600][547459] Avg episode reward: [(0, '4216.550')] [2023-03-08 19:48:43,161][547799] Updated weights for policy 0, policy_version 67600 (0.0005) [2023-03-08 19:48:44,600][547459] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 34623488. Throughput: 0: 9767.3. Samples: 34615296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:44,600][547459] Avg episode reward: [(0, '4522.560')] [2023-03-08 19:48:47,244][547799] Updated weights for policy 0, policy_version 67680 (0.0005) [2023-03-08 19:48:49,599][547459] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9885.9). Total num frames: 34672640. Throughput: 0: 9845.6. Samples: 34675932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:49,600][547459] Avg episode reward: [(0, '4413.077')] [2023-03-08 19:48:49,664][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000067728_34676736.pth... [2023-03-08 19:48:49,666][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000067144_34377728.pth [2023-03-08 19:48:51,271][547799] Updated weights for policy 0, policy_version 67760 (0.0005) [2023-03-08 19:48:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 34725888. Throughput: 0: 9883.7. Samples: 34706352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:54,601][547459] Avg episode reward: [(0, '4701.625')] [2023-03-08 19:48:55,387][547799] Updated weights for policy 0, policy_version 67840 (0.0006) [2023-03-08 19:48:59,353][547799] Updated weights for policy 0, policy_version 67920 (0.0005) [2023-03-08 19:48:59,599][547459] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9885.9). Total num frames: 34775040. Throughput: 0: 9949.1. Samples: 34766764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:48:59,600][547459] Avg episode reward: [(0, '4610.885')] [2023-03-08 19:49:03,266][547799] Updated weights for policy 0, policy_version 68000 (0.0005) [2023-03-08 19:49:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 34828288. Throughput: 0: 10021.3. Samples: 34828688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:04,600][547459] Avg episode reward: [(0, '4746.983')] [2023-03-08 19:49:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000068024_34828288.pth... [2023-03-08 19:49:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000067432_34525184.pth [2023-03-08 19:49:07,042][547799] Updated weights for policy 0, policy_version 68080 (0.0006) [2023-03-08 19:49:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 34881536. Throughput: 0: 10128.1. Samples: 34862172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:09,600][547459] Avg episode reward: [(0, '4717.929')] [2023-03-08 19:49:10,964][547799] Updated weights for policy 0, policy_version 68160 (0.0005) [2023-03-08 19:49:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 34934784. Throughput: 0: 10155.0. Samples: 34924000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:14,600][547459] Avg episode reward: [(0, '4626.449')] [2023-03-08 19:49:14,927][547799] Updated weights for policy 0, policy_version 68240 (0.0005) [2023-03-08 19:49:18,958][547799] Updated weights for policy 0, policy_version 68320 (0.0005) [2023-03-08 19:49:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 34983936. Throughput: 0: 10235.3. Samples: 34985836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:19,600][547459] Avg episode reward: [(0, '4051.450')] [2023-03-08 19:49:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000068328_34983936.pth... [2023-03-08 19:49:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000067728_34676736.pth [2023-03-08 19:49:22,900][547799] Updated weights for policy 0, policy_version 68400 (0.0006) [2023-03-08 19:49:24,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9913.7). Total num frames: 35037184. Throughput: 0: 10259.2. Samples: 35016820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:24,600][547459] Avg episode reward: [(0, '4673.093')] [2023-03-08 19:49:26,766][547799] Updated weights for policy 0, policy_version 68480 (0.0005) [2023-03-08 19:49:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 9913.7). Total num frames: 35090432. Throughput: 0: 10331.5. Samples: 35080212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:29,600][547459] Avg episode reward: [(0, '4674.864')] [2023-03-08 19:49:30,566][547799] Updated weights for policy 0, policy_version 68560 (0.0005) [2023-03-08 19:49:34,459][547799] Updated weights for policy 0, policy_version 68640 (0.0005) [2023-03-08 19:49:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 9927.6). Total num frames: 35143680. Throughput: 0: 10402.4. Samples: 35144040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:34,600][547459] Avg episode reward: [(0, '4766.969')] [2023-03-08 19:49:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000068640_35143680.pth... [2023-03-08 19:49:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000068024_34828288.pth [2023-03-08 19:49:38,379][547799] Updated weights for policy 0, policy_version 68720 (0.0006) [2023-03-08 19:49:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9941.5). Total num frames: 35196928. Throughput: 0: 10444.3. Samples: 35176344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:39,600][547459] Avg episode reward: [(0, '4767.361')] [2023-03-08 19:49:42,134][547799] Updated weights for policy 0, policy_version 68800 (0.0005) [2023-03-08 19:49:44,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 9955.4). Total num frames: 35250176. Throughput: 0: 10532.9. Samples: 35240744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:44,600][547459] Avg episode reward: [(0, '4720.802')] [2023-03-08 19:49:46,036][547799] Updated weights for policy 0, policy_version 68880 (0.0006) [2023-03-08 19:49:49,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 9969.2). Total num frames: 35303424. Throughput: 0: 10549.7. Samples: 35303424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:49,600][547459] Avg episode reward: [(0, '4721.149')] [2023-03-08 19:49:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000068952_35303424.pth... [2023-03-08 19:49:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000068328_34983936.pth [2023-03-08 19:49:49,909][547799] Updated weights for policy 0, policy_version 68960 (0.0006) [2023-03-08 19:49:53,885][547799] Updated weights for policy 0, policy_version 69040 (0.0005) [2023-03-08 19:49:54,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 9969.2). Total num frames: 35352576. Throughput: 0: 10528.8. Samples: 35335968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:54,600][547459] Avg episode reward: [(0, '4621.255')] [2023-03-08 19:49:57,644][547799] Updated weights for policy 0, policy_version 69120 (0.0006) [2023-03-08 19:49:59,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 9997.0). Total num frames: 35409920. Throughput: 0: 10576.1. Samples: 35399924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:49:59,600][547459] Avg episode reward: [(0, '4763.878')] [2023-03-08 19:50:01,386][547799] Updated weights for policy 0, policy_version 69200 (0.0005) [2023-03-08 19:50:04,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10010.9). Total num frames: 35463168. Throughput: 0: 10608.8. Samples: 35463232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:04,600][547459] Avg episode reward: [(0, '4718.338')] [2023-03-08 19:50:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000069264_35463168.pth... [2023-03-08 19:50:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000068640_35143680.pth [2023-03-08 19:50:05,268][547799] Updated weights for policy 0, policy_version 69280 (0.0005) [2023-03-08 19:50:09,123][547799] Updated weights for policy 0, policy_version 69360 (0.0005) [2023-03-08 19:50:09,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10024.8). Total num frames: 35516416. Throughput: 0: 10647.3. Samples: 35495948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:09,600][547459] Avg episode reward: [(0, '4721.399')] [2023-03-08 19:50:13,313][547799] Updated weights for policy 0, policy_version 69440 (0.0006) [2023-03-08 19:50:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10024.8). Total num frames: 35565568. Throughput: 0: 10569.5. Samples: 35555840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:14,600][547459] Avg episode reward: [(0, '3914.689')] [2023-03-08 19:50:17,350][547799] Updated weights for policy 0, policy_version 69520 (0.0005) [2023-03-08 19:50:19,600][547459] Fps is (10 sec: 9830.3, 60 sec: 10513.1, 300 sec: 10024.8). Total num frames: 35614720. Throughput: 0: 10495.7. Samples: 35616348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:19,600][547459] Avg episode reward: [(0, '4083.811')] [2023-03-08 19:50:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000069560_35614720.pth... [2023-03-08 19:50:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000068952_35303424.pth [2023-03-08 19:50:21,292][547799] Updated weights for policy 0, policy_version 69600 (0.0006) [2023-03-08 19:50:24,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10038.7). Total num frames: 35667968. Throughput: 0: 10471.3. Samples: 35647552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:24,600][547459] Avg episode reward: [(0, '4456.339')] [2023-03-08 19:50:25,377][547799] Updated weights for policy 0, policy_version 69680 (0.0005) [2023-03-08 19:50:29,307][547799] Updated weights for policy 0, policy_version 69760 (0.0005) [2023-03-08 19:50:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10038.7). Total num frames: 35717120. Throughput: 0: 10405.8. Samples: 35709004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:29,600][547459] Avg episode reward: [(0, '4494.214')] [2023-03-08 19:50:33,253][547799] Updated weights for policy 0, policy_version 69840 (0.0005) [2023-03-08 19:50:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10052.6). Total num frames: 35770368. Throughput: 0: 10408.9. Samples: 35771824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:34,600][547459] Avg episode reward: [(0, '4590.095')] [2023-03-08 19:50:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000069864_35770368.pth... [2023-03-08 19:50:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000069264_35463168.pth [2023-03-08 19:50:37,159][547799] Updated weights for policy 0, policy_version 69920 (0.0005) [2023-03-08 19:50:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10066.4). Total num frames: 35819520. Throughput: 0: 10380.3. Samples: 35803080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:39,600][547459] Avg episode reward: [(0, '4378.885')] [2023-03-08 19:50:41,276][547799] Updated weights for policy 0, policy_version 70000 (0.0005) [2023-03-08 19:50:44,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10080.3). Total num frames: 35872768. Throughput: 0: 10270.5. Samples: 35862096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:44,600][547459] Avg episode reward: [(0, '4220.264')] [2023-03-08 19:50:45,297][547799] Updated weights for policy 0, policy_version 70080 (0.0006) [2023-03-08 19:50:49,370][547799] Updated weights for policy 0, policy_version 70160 (0.0005) [2023-03-08 19:50:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10066.4). Total num frames: 35921920. Throughput: 0: 10225.9. Samples: 35923396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:49,600][547459] Avg episode reward: [(0, '4135.992')] [2023-03-08 19:50:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000070160_35921920.pth... [2023-03-08 19:50:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000069560_35614720.pth [2023-03-08 19:50:53,470][547799] Updated weights for policy 0, policy_version 70240 (0.0005) [2023-03-08 19:50:54,599][547459] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10066.4). Total num frames: 35971072. Throughput: 0: 10177.1. Samples: 35953916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:54,600][547459] Avg episode reward: [(0, '3758.967')] [2023-03-08 19:50:57,432][547799] Updated weights for policy 0, policy_version 70320 (0.0005) [2023-03-08 19:50:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 36024320. Throughput: 0: 10226.5. Samples: 36016032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:50:59,600][547459] Avg episode reward: [(0, '4209.027')] [2023-03-08 19:51:01,272][547799] Updated weights for policy 0, policy_version 70400 (0.0005) [2023-03-08 19:51:04,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 36077568. Throughput: 0: 10289.6. Samples: 36079380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:04,600][547459] Avg episode reward: [(0, '4499.594')] [2023-03-08 19:51:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000070464_36077568.pth... [2023-03-08 19:51:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000069864_35770368.pth [2023-03-08 19:51:05,174][547799] Updated weights for policy 0, policy_version 70480 (0.0005) [2023-03-08 19:51:09,122][547799] Updated weights for policy 0, policy_version 70560 (0.0005) [2023-03-08 19:51:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10108.1). Total num frames: 36130816. Throughput: 0: 10284.3. Samples: 36110348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:09,601][547459] Avg episode reward: [(0, '4616.143')] [2023-03-08 19:51:13,067][547799] Updated weights for policy 0, policy_version 70640 (0.0005) [2023-03-08 19:51:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 36184064. Throughput: 0: 10288.2. Samples: 36171972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:14,600][547459] Avg episode reward: [(0, '4705.632')] [2023-03-08 19:51:16,931][547799] Updated weights for policy 0, policy_version 70720 (0.0005) [2023-03-08 19:51:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10135.9). Total num frames: 36237312. Throughput: 0: 10344.4. Samples: 36237320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:19,600][547459] Avg episode reward: [(0, '4750.414')] [2023-03-08 19:51:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000070776_36237312.pth... [2023-03-08 19:51:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000070160_35921920.pth [2023-03-08 19:51:20,667][547799] Updated weights for policy 0, policy_version 70800 (0.0005) [2023-03-08 19:51:24,527][547799] Updated weights for policy 0, policy_version 70880 (0.0006) [2023-03-08 19:51:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10149.8). Total num frames: 36290560. Throughput: 0: 10377.8. Samples: 36270080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:24,600][547459] Avg episode reward: [(0, '4767.795')] [2023-03-08 19:51:28,463][547799] Updated weights for policy 0, policy_version 70960 (0.0006) [2023-03-08 19:51:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10163.6). Total num frames: 36343808. Throughput: 0: 10438.9. Samples: 36331848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:29,600][547459] Avg episode reward: [(0, '4606.152')] [2023-03-08 19:51:32,387][547799] Updated weights for policy 0, policy_version 71040 (0.0006) [2023-03-08 19:51:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10163.6). Total num frames: 36392960. Throughput: 0: 10489.1. Samples: 36395408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:34,600][547459] Avg episode reward: [(0, '4767.446')] [2023-03-08 19:51:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000071080_36392960.pth... [2023-03-08 19:51:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000070464_36077568.pth [2023-03-08 19:51:36,260][547799] Updated weights for policy 0, policy_version 71120 (0.0006) [2023-03-08 19:51:39,599][547459] Fps is (10 sec: 10240.2, 60 sec: 10444.8, 300 sec: 10177.5). Total num frames: 36446208. Throughput: 0: 10511.7. Samples: 36426944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:39,600][547459] Avg episode reward: [(0, '4724.093')] [2023-03-08 19:51:40,108][547799] Updated weights for policy 0, policy_version 71200 (0.0005) [2023-03-08 19:51:44,071][547799] Updated weights for policy 0, policy_version 71280 (0.0005) [2023-03-08 19:51:44,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10191.4). Total num frames: 36499456. Throughput: 0: 10541.7. Samples: 36490408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:44,601][547459] Avg episode reward: [(0, '4821.859')] [2023-03-08 19:51:47,890][547799] Updated weights for policy 0, policy_version 71360 (0.0006) [2023-03-08 19:51:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10205.3). Total num frames: 36552704. Throughput: 0: 10519.7. Samples: 36552768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:49,600][547459] Avg episode reward: [(0, '4648.562')] [2023-03-08 19:51:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000071392_36552704.pth... [2023-03-08 19:51:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000070776_36237312.pth [2023-03-08 19:51:51,776][547799] Updated weights for policy 0, policy_version 71440 (0.0005) [2023-03-08 19:51:54,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10581.3, 300 sec: 10219.2). Total num frames: 36605952. Throughput: 0: 10558.5. Samples: 36585480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:54,601][547459] Avg episode reward: [(0, '4523.892')] [2023-03-08 19:51:55,580][547799] Updated weights for policy 0, policy_version 71520 (0.0005) [2023-03-08 19:51:59,577][547799] Updated weights for policy 0, policy_version 71600 (0.0005) [2023-03-08 19:51:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10233.1). Total num frames: 36659200. Throughput: 0: 10595.6. Samples: 36648772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:51:59,600][547459] Avg episode reward: [(0, '4682.090')] [2023-03-08 19:52:03,370][547799] Updated weights for policy 0, policy_version 71680 (0.0006) [2023-03-08 19:52:04,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10246.9). Total num frames: 36712448. Throughput: 0: 10567.4. Samples: 36712852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:04,600][547459] Avg episode reward: [(0, '4734.913')] [2023-03-08 19:52:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000071704_36712448.pth... [2023-03-08 19:52:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000071080_36392960.pth [2023-03-08 19:52:07,228][547799] Updated weights for policy 0, policy_version 71760 (0.0006) [2023-03-08 19:52:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10260.8). Total num frames: 36765696. Throughput: 0: 10549.9. Samples: 36744828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:09,600][547459] Avg episode reward: [(0, '4576.182')] [2023-03-08 19:52:11,085][547799] Updated weights for policy 0, policy_version 71840 (0.0005) [2023-03-08 19:52:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10260.8). Total num frames: 36818944. Throughput: 0: 10590.6. Samples: 36808424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:14,600][547459] Avg episode reward: [(0, '4591.188')] [2023-03-08 19:52:14,870][547799] Updated weights for policy 0, policy_version 71920 (0.0005) [2023-03-08 19:52:18,521][547799] Updated weights for policy 0, policy_version 72000 (0.0005) [2023-03-08 19:52:19,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.4, 300 sec: 10274.7). Total num frames: 36872192. Throughput: 0: 10670.6. Samples: 36875584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:19,600][547459] Avg episode reward: [(0, '4746.745')] [2023-03-08 19:52:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000072016_36872192.pth... [2023-03-08 19:52:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000071392_36552704.pth [2023-03-08 19:52:22,235][547799] Updated weights for policy 0, policy_version 72080 (0.0005) [2023-03-08 19:52:24,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10302.5). Total num frames: 36929536. Throughput: 0: 10694.6. Samples: 36908204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:24,600][547459] Avg episode reward: [(0, '4511.136')] [2023-03-08 19:52:26,073][547799] Updated weights for policy 0, policy_version 72160 (0.0006) [2023-03-08 19:52:29,599][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10316.4). Total num frames: 36982784. Throughput: 0: 10710.2. Samples: 36972364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:29,600][547459] Avg episode reward: [(0, '4547.820')] [2023-03-08 19:52:29,911][547799] Updated weights for policy 0, policy_version 72240 (0.0005) [2023-03-08 19:52:33,855][547799] Updated weights for policy 0, policy_version 72320 (0.0005) [2023-03-08 19:52:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10330.2). Total num frames: 37036032. Throughput: 0: 10738.1. Samples: 37035984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:34,601][547459] Avg episode reward: [(0, '4686.996')] [2023-03-08 19:52:34,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000072336_37036032.pth... [2023-03-08 19:52:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000071704_36712448.pth [2023-03-08 19:52:37,673][547799] Updated weights for policy 0, policy_version 72400 (0.0005) [2023-03-08 19:52:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10649.6, 300 sec: 10330.3). Total num frames: 37085184. Throughput: 0: 10720.3. Samples: 37067892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:39,600][547459] Avg episode reward: [(0, '4824.493')] [2023-03-08 19:52:41,632][547799] Updated weights for policy 0, policy_version 72480 (0.0006) [2023-03-08 19:52:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10344.1). Total num frames: 37138432. Throughput: 0: 10699.1. Samples: 37130232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:44,600][547459] Avg episode reward: [(0, '4361.734')] [2023-03-08 19:52:45,448][547799] Updated weights for policy 0, policy_version 72560 (0.0005) [2023-03-08 19:52:49,303][547799] Updated weights for policy 0, policy_version 72640 (0.0005) [2023-03-08 19:52:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10358.0). Total num frames: 37191680. Throughput: 0: 10693.8. Samples: 37194072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:49,600][547459] Avg episode reward: [(0, '4759.580')] [2023-03-08 19:52:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000072640_37191680.pth... [2023-03-08 19:52:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000072016_36872192.pth [2023-03-08 19:52:53,153][547799] Updated weights for policy 0, policy_version 72720 (0.0005) [2023-03-08 19:52:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10371.9). Total num frames: 37244928. Throughput: 0: 10693.2. Samples: 37226020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:54,600][547459] Avg episode reward: [(0, '4663.247')] [2023-03-08 19:52:56,985][547799] Updated weights for policy 0, policy_version 72800 (0.0005) [2023-03-08 19:52:59,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10399.7). Total num frames: 37298176. Throughput: 0: 10702.9. Samples: 37290056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:52:59,600][547459] Avg episode reward: [(0, '4779.497')] [2023-03-08 19:53:00,808][547799] Updated weights for policy 0, policy_version 72880 (0.0005) [2023-03-08 19:53:04,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10413.6). Total num frames: 37351424. Throughput: 0: 10618.5. Samples: 37353416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:04,600][547459] Avg episode reward: [(0, '4633.330')] [2023-03-08 19:53:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000072952_37351424.pth... [2023-03-08 19:53:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000072336_37036032.pth [2023-03-08 19:53:04,720][547799] Updated weights for policy 0, policy_version 72960 (0.0005) [2023-03-08 19:53:08,501][547799] Updated weights for policy 0, policy_version 73040 (0.0005) [2023-03-08 19:53:09,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10427.4). Total num frames: 37404672. Throughput: 0: 10615.1. Samples: 37385884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:09,600][547459] Avg episode reward: [(0, '4777.166')] [2023-03-08 19:53:12,393][547799] Updated weights for policy 0, policy_version 73120 (0.0006) [2023-03-08 19:53:14,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10441.3). Total num frames: 37457920. Throughput: 0: 10608.3. Samples: 37449736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:14,600][547459] Avg episode reward: [(0, '4668.582')] [2023-03-08 19:53:16,375][547799] Updated weights for policy 0, policy_version 73200 (0.0005) [2023-03-08 19:53:19,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10649.6, 300 sec: 10455.2). Total num frames: 37511168. Throughput: 0: 10559.6. Samples: 37511168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:19,601][547459] Avg episode reward: [(0, '4650.771')] [2023-03-08 19:53:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000073264_37511168.pth... [2023-03-08 19:53:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000072640_37191680.pth [2023-03-08 19:53:20,297][547799] Updated weights for policy 0, policy_version 73280 (0.0006) [2023-03-08 19:53:24,288][547799] Updated weights for policy 0, policy_version 73360 (0.0005) [2023-03-08 19:53:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10455.2). Total num frames: 37560320. Throughput: 0: 10560.5. Samples: 37543116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:24,600][547459] Avg episode reward: [(0, '4078.846')] [2023-03-08 19:53:28,373][547799] Updated weights for policy 0, policy_version 73440 (0.0005) [2023-03-08 19:53:29,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.0, 300 sec: 10469.1). Total num frames: 37613568. Throughput: 0: 10502.3. Samples: 37602836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:29,601][547459] Avg episode reward: [(0, '3177.398')] [2023-03-08 19:53:32,253][547799] Updated weights for policy 0, policy_version 73520 (0.0005) [2023-03-08 19:53:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 37662720. Throughput: 0: 10484.7. Samples: 37665884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:34,600][547459] Avg episode reward: [(0, '3897.308')] [2023-03-08 19:53:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000073560_37662720.pth... [2023-03-08 19:53:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000072952_37351424.pth [2023-03-08 19:53:36,284][547799] Updated weights for policy 0, policy_version 73600 (0.0006) [2023-03-08 19:53:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 37715968. Throughput: 0: 10440.6. Samples: 37695848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:39,600][547459] Avg episode reward: [(0, '4237.644')] [2023-03-08 19:53:40,140][547799] Updated weights for policy 0, policy_version 73680 (0.0005) [2023-03-08 19:53:44,053][547799] Updated weights for policy 0, policy_version 73760 (0.0006) [2023-03-08 19:53:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 37769216. Throughput: 0: 10450.3. Samples: 37760320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:44,600][547459] Avg episode reward: [(0, '4621.884')] [2023-03-08 19:53:47,934][547799] Updated weights for policy 0, policy_version 73840 (0.0005) [2023-03-08 19:53:49,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 37822464. Throughput: 0: 10433.5. Samples: 37822924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:53:49,600][547459] Avg episode reward: [(0, '4663.280')] [2023-03-08 19:53:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000073872_37822464.pth... [2023-03-08 19:53:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000073264_37511168.pth [2023-03-08 19:53:51,830][547799] Updated weights for policy 0, policy_version 73920 (0.0005) [2023-03-08 19:53:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 37871616. Throughput: 0: 10425.5. Samples: 37855032. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:53:54,600][547459] Avg episode reward: [(0, '4596.282')] [2023-03-08 19:53:55,784][547799] Updated weights for policy 0, policy_version 74000 (0.0005) [2023-03-08 19:53:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 37924864. Throughput: 0: 10377.8. Samples: 37916736. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:53:59,600][547459] Avg episode reward: [(0, '4603.998')] [2023-03-08 19:53:59,744][547799] Updated weights for policy 0, policy_version 74080 (0.0005) [2023-03-08 19:54:03,812][547799] Updated weights for policy 0, policy_version 74160 (0.0005) [2023-03-08 19:54:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 37974016. Throughput: 0: 10357.1. Samples: 37977236. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:54:04,600][547459] Avg episode reward: [(0, '4591.953')] [2023-03-08 19:54:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000074168_37974016.pth... [2023-03-08 19:54:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000073560_37662720.pth [2023-03-08 19:54:07,772][547799] Updated weights for policy 0, policy_version 74240 (0.0005) [2023-03-08 19:54:09,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 38027264. Throughput: 0: 10331.1. Samples: 38008016. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:54:09,600][547459] Avg episode reward: [(0, '4200.590')] [2023-03-08 19:54:11,839][547799] Updated weights for policy 0, policy_version 74320 (0.0005) [2023-03-08 19:54:14,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10483.0). Total num frames: 38076416. Throughput: 0: 10360.8. Samples: 38069072. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:54:14,600][547459] Avg episode reward: [(0, '4137.721')] [2023-03-08 19:54:15,815][547799] Updated weights for policy 0, policy_version 74400 (0.0005) [2023-03-08 19:54:19,599][547459] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10469.1). Total num frames: 38125568. Throughput: 0: 10300.3. Samples: 38129396. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:54:19,600][547459] Avg episode reward: [(0, '4166.646')] [2023-03-08 19:54:19,619][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000074472_38129664.pth... [2023-03-08 19:54:19,621][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000073872_37822464.pth [2023-03-08 19:54:20,036][547799] Updated weights for policy 0, policy_version 74480 (0.0005) [2023-03-08 19:54:24,070][547799] Updated weights for policy 0, policy_version 74560 (0.0005) [2023-03-08 19:54:24,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 38178816. Throughput: 0: 10305.3. Samples: 38159584. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:54:24,600][547459] Avg episode reward: [(0, '4000.308')] [2023-03-08 19:54:28,066][547799] Updated weights for policy 0, policy_version 74640 (0.0005) [2023-03-08 19:54:29,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10455.2). Total num frames: 38227968. Throughput: 0: 10212.6. Samples: 38219888. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:54:29,600][547459] Avg episode reward: [(0, '4079.644')] [2023-03-08 19:54:32,101][547799] Updated weights for policy 0, policy_version 74720 (0.0006) [2023-03-08 19:54:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10455.2). Total num frames: 38281216. Throughput: 0: 10185.8. Samples: 38281288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:54:34,601][547459] Avg episode reward: [(0, '4550.606')] [2023-03-08 19:54:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000074768_38281216.pth... [2023-03-08 19:54:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000074168_37974016.pth [2023-03-08 19:54:36,000][547799] Updated weights for policy 0, policy_version 74800 (0.0005) [2023-03-08 19:54:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10441.3). Total num frames: 38330368. Throughput: 0: 10198.9. Samples: 38313984. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:54:39,600][547459] Avg episode reward: [(0, '4631.742')] [2023-03-08 19:54:40,033][547799] Updated weights for policy 0, policy_version 74880 (0.0005) [2023-03-08 19:54:43,991][547799] Updated weights for policy 0, policy_version 74960 (0.0006) [2023-03-08 19:54:44,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10441.3). Total num frames: 38383616. Throughput: 0: 10185.8. Samples: 38375100. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:54:44,600][547459] Avg episode reward: [(0, '4304.957')] [2023-03-08 19:54:47,744][547799] Updated weights for policy 0, policy_version 75040 (0.0005) [2023-03-08 19:54:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10240.0, 300 sec: 10455.2). Total num frames: 38436864. Throughput: 0: 10273.7. Samples: 38439552. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:54:49,600][547459] Avg episode reward: [(0, '4729.082')] [2023-03-08 19:54:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000075072_38436864.pth... [2023-03-08 19:54:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000074472_38129664.pth [2023-03-08 19:54:51,737][547799] Updated weights for policy 0, policy_version 75120 (0.0005) [2023-03-08 19:54:54,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10427.4). Total num frames: 38486016. Throughput: 0: 10256.8. Samples: 38469572. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:54:54,600][547459] Avg episode reward: [(0, '4378.776')] [2023-03-08 19:54:55,805][547799] Updated weights for policy 0, policy_version 75200 (0.0005) [2023-03-08 19:54:59,601][547459] Fps is (10 sec: 10238.6, 60 sec: 10239.8, 300 sec: 10427.4). Total num frames: 38539264. Throughput: 0: 10266.1. Samples: 38531060. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:54:59,602][547459] Avg episode reward: [(0, '4454.847')] [2023-03-08 19:54:59,782][547799] Updated weights for policy 0, policy_version 75280 (0.0005) [2023-03-08 19:55:03,750][547799] Updated weights for policy 0, policy_version 75360 (0.0006) [2023-03-08 19:55:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 38592512. Throughput: 0: 10292.9. Samples: 38592576. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:55:04,600][547459] Avg episode reward: [(0, '4546.970')] [2023-03-08 19:55:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000075376_38592512.pth... [2023-03-08 19:55:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000074768_38281216.pth [2023-03-08 19:55:07,550][547799] Updated weights for policy 0, policy_version 75440 (0.0005) [2023-03-08 19:55:09,600][547459] Fps is (10 sec: 10651.1, 60 sec: 10308.3, 300 sec: 10441.3). Total num frames: 38645760. Throughput: 0: 10349.0. Samples: 38625288. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:55:09,600][547459] Avg episode reward: [(0, '4791.579')] [2023-03-08 19:55:11,452][547799] Updated weights for policy 0, policy_version 75520 (0.0005) [2023-03-08 19:55:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10455.2). Total num frames: 38699008. Throughput: 0: 10405.8. Samples: 38688148. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:55:14,600][547459] Avg episode reward: [(0, '4697.829')] [2023-03-08 19:55:15,239][547799] Updated weights for policy 0, policy_version 75600 (0.0005) [2023-03-08 19:55:19,142][547799] Updated weights for policy 0, policy_version 75680 (0.0005) [2023-03-08 19:55:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 38752256. Throughput: 0: 10467.4. Samples: 38752320. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:55:19,600][547459] Avg episode reward: [(0, '4803.572')] [2023-03-08 19:55:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000075688_38752256.pth... [2023-03-08 19:55:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000075072_38436864.pth [2023-03-08 19:55:22,995][547799] Updated weights for policy 0, policy_version 75760 (0.0005) [2023-03-08 19:55:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 38805504. Throughput: 0: 10462.7. Samples: 38784804. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 19:55:24,600][547459] Avg episode reward: [(0, '4585.007')] [2023-03-08 19:55:26,983][547799] Updated weights for policy 0, policy_version 75840 (0.0005) [2023-03-08 19:55:29,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 38858752. Throughput: 0: 10485.5. Samples: 38846948. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:55:29,600][547459] Avg episode reward: [(0, '4620.071')] [2023-03-08 19:55:30,831][547799] Updated weights for policy 0, policy_version 75920 (0.0005) [2023-03-08 19:55:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 38907904. Throughput: 0: 10475.8. Samples: 38910964. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:55:34,600][547459] Avg episode reward: [(0, '4729.257')] [2023-03-08 19:55:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000075992_38907904.pth... [2023-03-08 19:55:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000075376_38592512.pth [2023-03-08 19:55:34,673][547799] Updated weights for policy 0, policy_version 76000 (0.0005) [2023-03-08 19:55:38,641][547799] Updated weights for policy 0, policy_version 76080 (0.0005) [2023-03-08 19:55:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 38961152. Throughput: 0: 10477.1. Samples: 38941044. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:55:39,603][547459] Avg episode reward: [(0, '4625.630')] [2023-03-08 19:55:42,560][547799] Updated weights for policy 0, policy_version 76160 (0.0005) [2023-03-08 19:55:44,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 39014400. Throughput: 0: 10526.9. Samples: 39004756. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:55:44,600][547459] Avg episode reward: [(0, '4602.865')] [2023-03-08 19:55:46,475][547799] Updated weights for policy 0, policy_version 76240 (0.0005) [2023-03-08 19:55:49,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 39067648. Throughput: 0: 10558.8. Samples: 39067720. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:55:49,600][547459] Avg episode reward: [(0, '4510.530')] [2023-03-08 19:55:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000076304_39067648.pth... [2023-03-08 19:55:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000075688_38752256.pth [2023-03-08 19:55:50,355][547799] Updated weights for policy 0, policy_version 76320 (0.0005) [2023-03-08 19:55:54,583][547799] Updated weights for policy 0, policy_version 76400 (0.0005) [2023-03-08 19:55:54,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 39116800. Throughput: 0: 10468.8. Samples: 39096384. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:55:54,600][547459] Avg episode reward: [(0, '4353.829')] [2023-03-08 19:55:58,427][547799] Updated weights for policy 0, policy_version 76480 (0.0005) [2023-03-08 19:55:59,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10445.0, 300 sec: 10469.1). Total num frames: 39165952. Throughput: 0: 10452.5. Samples: 39158512. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:55:59,600][547459] Avg episode reward: [(0, '4585.719')] [2023-03-08 19:56:02,278][547799] Updated weights for policy 0, policy_version 76560 (0.0005) [2023-03-08 19:56:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 39223296. Throughput: 0: 10466.1. Samples: 39223296. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:56:04,600][547459] Avg episode reward: [(0, '4674.486')] [2023-03-08 19:56:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000076608_39223296.pth... [2023-03-08 19:56:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000075992_38907904.pth [2023-03-08 19:56:06,079][547799] Updated weights for policy 0, policy_version 76640 (0.0005) [2023-03-08 19:56:09,599][547459] Fps is (10 sec: 11059.3, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 39276544. Throughput: 0: 10467.5. Samples: 39255840. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 19:56:09,600][547459] Avg episode reward: [(0, '4715.593')] [2023-03-08 19:56:09,864][547799] Updated weights for policy 0, policy_version 76720 (0.0005) [2023-03-08 19:56:13,826][547799] Updated weights for policy 0, policy_version 76800 (0.0005) [2023-03-08 19:56:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 39329792. Throughput: 0: 10477.2. Samples: 39318420. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:56:14,600][547459] Avg episode reward: [(0, '4714.100')] [2023-03-08 19:56:17,711][547799] Updated weights for policy 0, policy_version 76880 (0.0005) [2023-03-08 19:56:19,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 39378944. Throughput: 0: 10465.4. Samples: 39381908. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:56:19,600][547459] Avg episode reward: [(0, '4626.577')] [2023-03-08 19:56:19,667][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000076920_39383040.pth... [2023-03-08 19:56:19,670][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000076304_39067648.pth [2023-03-08 19:56:21,611][547799] Updated weights for policy 0, policy_version 76960 (0.0005) [2023-03-08 19:56:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 39432192. Throughput: 0: 10504.5. Samples: 39413748. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:56:24,600][547459] Avg episode reward: [(0, '4668.021')] [2023-03-08 19:56:25,472][547799] Updated weights for policy 0, policy_version 77040 (0.0005) [2023-03-08 19:56:29,243][547799] Updated weights for policy 0, policy_version 77120 (0.0005) [2023-03-08 19:56:29,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 39489536. Throughput: 0: 10507.1. Samples: 39477576. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:56:29,600][547459] Avg episode reward: [(0, '4759.270')] [2023-03-08 19:56:33,044][547799] Updated weights for policy 0, policy_version 77200 (0.0005) [2023-03-08 19:56:34,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 39542784. Throughput: 0: 10556.3. Samples: 39542752. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:56:34,600][547459] Avg episode reward: [(0, '4625.945')] [2023-03-08 19:56:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000077232_39542784.pth... [2023-03-08 19:56:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000076608_39223296.pth [2023-03-08 19:56:36,862][547799] Updated weights for policy 0, policy_version 77280 (0.0005) [2023-03-08 19:56:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 39591936. Throughput: 0: 10631.2. Samples: 39574788. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:56:39,600][547459] Avg episode reward: [(0, '4635.676')] [2023-03-08 19:56:40,777][547799] Updated weights for policy 0, policy_version 77360 (0.0005) [2023-03-08 19:56:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 39645184. Throughput: 0: 10634.3. Samples: 39637056. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:56:44,600][547459] Avg episode reward: [(0, '4770.342')] [2023-03-08 19:56:44,760][547799] Updated weights for policy 0, policy_version 77440 (0.0005) [2023-03-08 19:56:48,791][547799] Updated weights for policy 0, policy_version 77520 (0.0005) [2023-03-08 19:56:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 39694336. Throughput: 0: 10556.0. Samples: 39698316. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:56:49,600][547459] Avg episode reward: [(0, '4414.885')] [2023-03-08 19:56:49,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000077536_39698432.pth... [2023-03-08 19:56:49,615][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000076920_39383040.pth [2023-03-08 19:56:52,666][547799] Updated weights for policy 0, policy_version 77600 (0.0005) [2023-03-08 19:56:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 39747584. Throughput: 0: 10530.7. Samples: 39729724. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 19:56:54,600][547459] Avg episode reward: [(0, '4706.482')] [2023-03-08 19:56:56,610][547799] Updated weights for policy 0, policy_version 77680 (0.0005) [2023-03-08 19:56:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10469.1). Total num frames: 39800832. Throughput: 0: 10533.2. Samples: 39792412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:56:59,600][547459] Avg episode reward: [(0, '4515.082')] [2023-03-08 19:57:00,490][547799] Updated weights for policy 0, policy_version 77760 (0.0005) [2023-03-08 19:57:04,434][547799] Updated weights for policy 0, policy_version 77840 (0.0005) [2023-03-08 19:57:04,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 39854080. Throughput: 0: 10509.7. Samples: 39854844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:57:04,600][547459] Avg episode reward: [(0, '4674.637')] [2023-03-08 19:57:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000077840_39854080.pth... [2023-03-08 19:57:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000077232_39542784.pth [2023-03-08 19:57:08,368][547799] Updated weights for policy 0, policy_version 77920 (0.0005) [2023-03-08 19:57:09,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 39907328. Throughput: 0: 10513.3. Samples: 39886848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:57:09,600][547459] Avg episode reward: [(0, '4777.208')] [2023-03-08 19:57:12,236][547799] Updated weights for policy 0, policy_version 78000 (0.0005) [2023-03-08 19:57:14,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 39956480. Throughput: 0: 10462.1. Samples: 39948368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:57:14,600][547459] Avg episode reward: [(0, '4620.047')] [2023-03-08 19:57:16,218][547799] Updated weights for policy 0, policy_version 78080 (0.0005) [2023-03-08 19:57:19,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 40009728. Throughput: 0: 10439.9. Samples: 40012548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:57:19,600][547459] Avg episode reward: [(0, '4724.323')] [2023-03-08 19:57:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000078144_40009728.pth... [2023-03-08 19:57:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000077536_39698432.pth [2023-03-08 19:57:20,092][547799] Updated weights for policy 0, policy_version 78160 (0.0005) [2023-03-08 19:57:24,038][547799] Updated weights for policy 0, policy_version 78240 (0.0005) [2023-03-08 19:57:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 40062976. Throughput: 0: 10400.2. Samples: 40042796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:57:24,600][547459] Avg episode reward: [(0, '4683.885')] [2023-03-08 19:57:28,013][547799] Updated weights for policy 0, policy_version 78320 (0.0005) [2023-03-08 19:57:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 40112128. Throughput: 0: 10387.4. Samples: 40104488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:57:29,600][547459] Avg episode reward: [(0, '4652.846')] [2023-03-08 19:57:32,070][547799] Updated weights for policy 0, policy_version 78400 (0.0005) [2023-03-08 19:57:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10441.3). Total num frames: 40165376. Throughput: 0: 10401.4. Samples: 40166380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:57:34,601][547459] Avg episode reward: [(0, '4376.860')] [2023-03-08 19:57:34,606][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000078448_40165376.pth... [2023-03-08 19:57:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000077840_39854080.pth [2023-03-08 19:57:36,039][547799] Updated weights for policy 0, policy_version 78480 (0.0005) [2023-03-08 19:57:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 40214528. Throughput: 0: 10392.2. Samples: 40197372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:57:39,600][547459] Avg episode reward: [(0, '4659.806')] [2023-03-08 19:57:40,040][547799] Updated weights for policy 0, policy_version 78560 (0.0005) [2023-03-08 19:57:43,963][547799] Updated weights for policy 0, policy_version 78640 (0.0005) [2023-03-08 19:57:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 40267776. Throughput: 0: 10378.8. Samples: 40259460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:57:44,600][547459] Avg episode reward: [(0, '4697.865')] [2023-03-08 19:57:47,775][547799] Updated weights for policy 0, policy_version 78720 (0.0005) [2023-03-08 19:57:49,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 40321024. Throughput: 0: 10414.4. Samples: 40323492. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:57:49,600][547459] Avg episode reward: [(0, '4801.041')] [2023-03-08 19:57:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000078752_40321024.pth... [2023-03-08 19:57:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000078144_40009728.pth [2023-03-08 19:57:51,681][547799] Updated weights for policy 0, policy_version 78800 (0.0005) [2023-03-08 19:57:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 40374272. Throughput: 0: 10406.2. Samples: 40355128. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:57:54,600][547459] Avg episode reward: [(0, '4789.971')] [2023-03-08 19:57:55,595][547799] Updated weights for policy 0, policy_version 78880 (0.0006) [2023-03-08 19:57:59,462][547799] Updated weights for policy 0, policy_version 78960 (0.0005) [2023-03-08 19:57:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 40427520. Throughput: 0: 10431.0. Samples: 40417764. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:57:59,600][547459] Avg episode reward: [(0, '4796.870')] [2023-03-08 19:58:03,569][547799] Updated weights for policy 0, policy_version 79040 (0.0005) [2023-03-08 19:58:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 40476672. Throughput: 0: 10378.4. Samples: 40479576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:58:04,601][547459] Avg episode reward: [(0, '4583.659')] [2023-03-08 19:58:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000079056_40476672.pth... [2023-03-08 19:58:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000078448_40165376.pth [2023-03-08 19:58:07,419][547799] Updated weights for policy 0, policy_version 79120 (0.0005) [2023-03-08 19:58:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 40529920. Throughput: 0: 10395.3. Samples: 40510584. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:58:09,600][547459] Avg episode reward: [(0, '4539.874')] [2023-03-08 19:58:11,442][547799] Updated weights for policy 0, policy_version 79200 (0.0005) [2023-03-08 19:58:14,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 40579072. Throughput: 0: 10365.7. Samples: 40570944. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:58:14,600][547459] Avg episode reward: [(0, '4453.485')] [2023-03-08 19:58:15,438][547799] Updated weights for policy 0, policy_version 79280 (0.0006) [2023-03-08 19:58:19,481][547799] Updated weights for policy 0, policy_version 79360 (0.0005) [2023-03-08 19:58:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 40632320. Throughput: 0: 10356.0. Samples: 40632400. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:58:19,601][547459] Avg episode reward: [(0, '4742.279')] [2023-03-08 19:58:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000079360_40632320.pth... [2023-03-08 19:58:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000078752_40321024.pth [2023-03-08 19:58:23,304][547799] Updated weights for policy 0, policy_version 79440 (0.0005) [2023-03-08 19:58:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 40685568. Throughput: 0: 10394.0. Samples: 40665104. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:58:24,600][547459] Avg episode reward: [(0, '4664.203')] [2023-03-08 19:58:27,300][547799] Updated weights for policy 0, policy_version 79520 (0.0005) [2023-03-08 19:58:29,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 40734720. Throughput: 0: 10397.9. Samples: 40727364. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:58:29,600][547459] Avg episode reward: [(0, '4684.554')] [2023-03-08 19:58:31,236][547799] Updated weights for policy 0, policy_version 79600 (0.0005) [2023-03-08 19:58:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 40787968. Throughput: 0: 10333.9. Samples: 40788520. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 19:58:34,601][547459] Avg episode reward: [(0, '4441.716')] [2023-03-08 19:58:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000079664_40787968.pth... [2023-03-08 19:58:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000079056_40476672.pth [2023-03-08 19:58:35,206][547799] Updated weights for policy 0, policy_version 79680 (0.0005) [2023-03-08 19:58:39,084][547799] Updated weights for policy 0, policy_version 79760 (0.0005) [2023-03-08 19:58:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10413.6). Total num frames: 40841216. Throughput: 0: 10347.1. Samples: 40820748. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:58:39,601][547459] Avg episode reward: [(0, '4720.897')] [2023-03-08 19:58:43,101][547799] Updated weights for policy 0, policy_version 79840 (0.0005) [2023-03-08 19:58:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 40890368. Throughput: 0: 10321.7. Samples: 40882240. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:58:44,600][547459] Avg episode reward: [(0, '4638.752')] [2023-03-08 19:58:46,968][547799] Updated weights for policy 0, policy_version 79920 (0.0005) [2023-03-08 19:58:49,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 40943616. Throughput: 0: 10352.5. Samples: 40945436. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:58:49,600][547459] Avg episode reward: [(0, '4722.569')] [2023-03-08 19:58:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000079968_40943616.pth... [2023-03-08 19:58:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000079360_40632320.pth [2023-03-08 19:58:50,814][547799] Updated weights for policy 0, policy_version 80000 (0.0005) [2023-03-08 19:58:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 40996864. Throughput: 0: 10388.6. Samples: 40978072. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:58:54,600][547459] Avg episode reward: [(0, '4784.514')] [2023-03-08 19:58:54,685][547799] Updated weights for policy 0, policy_version 80080 (0.0005) [2023-03-08 19:58:58,405][547799] Updated weights for policy 0, policy_version 80160 (0.0005) [2023-03-08 19:58:59,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 41054208. Throughput: 0: 10497.0. Samples: 41043308. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:58:59,600][547459] Avg episode reward: [(0, '4626.293')] [2023-03-08 19:59:02,308][547799] Updated weights for policy 0, policy_version 80240 (0.0005) [2023-03-08 19:59:04,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 41103360. Throughput: 0: 10545.8. Samples: 41106960. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:59:04,600][547459] Avg episode reward: [(0, '4640.337')] [2023-03-08 19:59:04,637][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000080288_41107456.pth... [2023-03-08 19:59:04,639][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000079664_40787968.pth [2023-03-08 19:59:06,157][547799] Updated weights for policy 0, policy_version 80320 (0.0005) [2023-03-08 19:59:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10455.2). Total num frames: 41160704. Throughput: 0: 10516.2. Samples: 41138332. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:59:09,600][547459] Avg episode reward: [(0, '4808.540')] [2023-03-08 19:59:09,958][547799] Updated weights for policy 0, policy_version 80400 (0.0005) [2023-03-08 19:59:13,727][547799] Updated weights for policy 0, policy_version 80480 (0.0005) [2023-03-08 19:59:14,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10469.1). Total num frames: 41213952. Throughput: 0: 10586.7. Samples: 41203764. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:59:14,600][547459] Avg episode reward: [(0, '4761.254')] [2023-03-08 19:59:17,688][547799] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-08 19:59:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10469.1). Total num frames: 41267200. Throughput: 0: 10638.8. Samples: 41267264. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:59:19,600][547459] Avg episode reward: [(0, '4772.175')] [2023-03-08 19:59:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000080600_41267200.pth... [2023-03-08 19:59:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000079968_40943616.pth [2023-03-08 19:59:21,474][547799] Updated weights for policy 0, policy_version 80640 (0.0005) [2023-03-08 19:59:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 41320448. Throughput: 0: 10648.8. Samples: 41299944. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 19:59:24,600][547459] Avg episode reward: [(0, '4708.369')] [2023-03-08 19:59:25,195][547799] Updated weights for policy 0, policy_version 80720 (0.0005) [2023-03-08 19:59:28,821][547799] Updated weights for policy 0, policy_version 80800 (0.0005) [2023-03-08 19:59:29,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10496.9). Total num frames: 41377792. Throughput: 0: 10749.0. Samples: 41365944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:59:29,600][547459] Avg episode reward: [(0, '4724.906')] [2023-03-08 19:59:32,644][547799] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-08 19:59:34,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 41426944. Throughput: 0: 10768.4. Samples: 41430016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:59:34,600][547459] Avg episode reward: [(0, '4805.367')] [2023-03-08 19:59:34,629][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000080920_41431040.pth... [2023-03-08 19:59:34,630][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000080288_41107456.pth [2023-03-08 19:59:36,547][547799] Updated weights for policy 0, policy_version 80960 (0.0006) [2023-03-08 19:59:39,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 41480192. Throughput: 0: 10751.6. Samples: 41461892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:59:39,600][547459] Avg episode reward: [(0, '4710.402')] [2023-03-08 19:59:40,398][547799] Updated weights for policy 0, policy_version 81040 (0.0005) [2023-03-08 19:59:44,208][547799] Updated weights for policy 0, policy_version 81120 (0.0005) [2023-03-08 19:59:44,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10496.9). Total num frames: 41533440. Throughput: 0: 10711.4. Samples: 41525320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:59:44,600][547459] Avg episode reward: [(0, '4681.450')] [2023-03-08 19:59:48,054][547799] Updated weights for policy 0, policy_version 81200 (0.0005) [2023-03-08 19:59:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10717.8, 300 sec: 10510.8). Total num frames: 41586688. Throughput: 0: 10713.8. Samples: 41589080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:59:49,600][547459] Avg episode reward: [(0, '4722.227')] [2023-03-08 19:59:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000081224_41586688.pth... [2023-03-08 19:59:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000080600_41267200.pth [2023-03-08 19:59:52,073][547799] Updated weights for policy 0, policy_version 81280 (0.0005) [2023-03-08 19:59:54,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10510.8). Total num frames: 41639936. Throughput: 0: 10708.8. Samples: 41620228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:59:54,600][547459] Avg episode reward: [(0, '4731.104')] [2023-03-08 19:59:56,018][547799] Updated weights for policy 0, policy_version 81360 (0.0006) [2023-03-08 19:59:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 41693184. Throughput: 0: 10665.8. Samples: 41683724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 19:59:59,601][547459] Avg episode reward: [(0, '4681.717')] [2023-03-08 19:59:59,832][547799] Updated weights for policy 0, policy_version 81440 (0.0005) [2023-03-08 20:00:03,820][547799] Updated weights for policy 0, policy_version 81520 (0.0005) [2023-03-08 20:00:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10649.6, 300 sec: 10496.9). Total num frames: 41742336. Throughput: 0: 10641.1. Samples: 41746112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:04,600][547459] Avg episode reward: [(0, '4724.900')] [2023-03-08 20:00:04,616][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000081536_41746432.pth... [2023-03-08 20:00:04,618][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000080920_41431040.pth [2023-03-08 20:00:07,728][547799] Updated weights for policy 0, policy_version 81600 (0.0005) [2023-03-08 20:00:09,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 41795584. Throughput: 0: 10600.1. Samples: 41776948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:09,600][547459] Avg episode reward: [(0, '4675.896')] [2023-03-08 20:00:11,513][547799] Updated weights for policy 0, policy_version 81680 (0.0005) [2023-03-08 20:00:14,600][547459] Fps is (10 sec: 11059.3, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 41852928. Throughput: 0: 10565.5. Samples: 41841392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:14,600][547459] Avg episode reward: [(0, '4644.596')] [2023-03-08 20:00:15,401][547799] Updated weights for policy 0, policy_version 81760 (0.0005) [2023-03-08 20:00:19,411][547799] Updated weights for policy 0, policy_version 81840 (0.0005) [2023-03-08 20:00:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 41902080. Throughput: 0: 10519.1. Samples: 41903376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:19,600][547459] Avg episode reward: [(0, '4763.275')] [2023-03-08 20:00:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000081840_41902080.pth... [2023-03-08 20:00:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000081224_41586688.pth [2023-03-08 20:00:23,217][547799] Updated weights for policy 0, policy_version 81920 (0.0005) [2023-03-08 20:00:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 41955328. Throughput: 0: 10513.0. Samples: 41934976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:24,600][547459] Avg episode reward: [(0, '4704.724')] [2023-03-08 20:00:26,948][547799] Updated weights for policy 0, policy_version 82000 (0.0005) [2023-03-08 20:00:29,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 42008576. Throughput: 0: 10558.4. Samples: 42000448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:29,600][547459] Avg episode reward: [(0, '4602.476')] [2023-03-08 20:00:30,918][547799] Updated weights for policy 0, policy_version 82080 (0.0005) [2023-03-08 20:00:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 42061824. Throughput: 0: 10524.3. Samples: 42062672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:34,600][547459] Avg episode reward: [(0, '4481.832')] [2023-03-08 20:00:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000082152_42061824.pth... [2023-03-08 20:00:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000081536_41746432.pth [2023-03-08 20:00:34,816][547799] Updated weights for policy 0, policy_version 82160 (0.0005) [2023-03-08 20:00:38,783][547799] Updated weights for policy 0, policy_version 82240 (0.0005) [2023-03-08 20:00:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 42110976. Throughput: 0: 10541.9. Samples: 42094612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:39,600][547459] Avg episode reward: [(0, '4732.793')] [2023-03-08 20:00:42,697][547799] Updated weights for policy 0, policy_version 82320 (0.0005) [2023-03-08 20:00:44,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 42164224. Throughput: 0: 10499.0. Samples: 42156176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:44,600][547459] Avg episode reward: [(0, '4779.598')] [2023-03-08 20:00:46,571][547799] Updated weights for policy 0, policy_version 82400 (0.0005) [2023-03-08 20:00:49,599][547459] Fps is (10 sec: 11059.3, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 42221568. Throughput: 0: 10565.9. Samples: 42221576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:49,600][547459] Avg episode reward: [(0, '4732.392')] [2023-03-08 20:00:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000082464_42221568.pth... [2023-03-08 20:00:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000081840_41902080.pth [2023-03-08 20:00:50,318][547799] Updated weights for policy 0, policy_version 82480 (0.0005) [2023-03-08 20:00:54,203][547799] Updated weights for policy 0, policy_version 82560 (0.0005) [2023-03-08 20:00:54,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 42274816. Throughput: 0: 10599.1. Samples: 42253908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:54,601][547459] Avg episode reward: [(0, '4586.804')] [2023-03-08 20:00:58,001][547799] Updated weights for policy 0, policy_version 82640 (0.0005) [2023-03-08 20:00:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 42328064. Throughput: 0: 10568.4. Samples: 42316972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:00:59,600][547459] Avg episode reward: [(0, '4774.449')] [2023-03-08 20:01:01,802][547799] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-08 20:01:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 42381312. Throughput: 0: 10622.2. Samples: 42381376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:04,600][547459] Avg episode reward: [(0, '4816.906')] [2023-03-08 20:01:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000082776_42381312.pth... [2023-03-08 20:01:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000082152_42061824.pth [2023-03-08 20:01:05,663][547799] Updated weights for policy 0, policy_version 82800 (0.0005) [2023-03-08 20:01:09,498][547799] Updated weights for policy 0, policy_version 82880 (0.0005) [2023-03-08 20:01:09,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 42434560. Throughput: 0: 10646.5. Samples: 42414068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:09,600][547459] Avg episode reward: [(0, '4725.740')] [2023-03-08 20:01:13,325][547799] Updated weights for policy 0, policy_version 82960 (0.0006) [2023-03-08 20:01:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 42487808. Throughput: 0: 10614.8. Samples: 42478116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:14,600][547459] Avg episode reward: [(0, '4732.837')] [2023-03-08 20:01:17,215][547799] Updated weights for policy 0, policy_version 83040 (0.0006) [2023-03-08 20:01:19,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10538.5). Total num frames: 42541056. Throughput: 0: 10630.8. Samples: 42541056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:19,600][547459] Avg episode reward: [(0, '4632.879')] [2023-03-08 20:01:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000083088_42541056.pth... [2023-03-08 20:01:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000082464_42221568.pth [2023-03-08 20:01:21,158][547799] Updated weights for policy 0, policy_version 83120 (0.0005) [2023-03-08 20:01:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 42590208. Throughput: 0: 10600.3. Samples: 42571628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:24,600][547459] Avg episode reward: [(0, '4634.334')] [2023-03-08 20:01:25,111][547799] Updated weights for policy 0, policy_version 83200 (0.0005) [2023-03-08 20:01:28,945][547799] Updated weights for policy 0, policy_version 83280 (0.0005) [2023-03-08 20:01:29,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 42643456. Throughput: 0: 10645.2. Samples: 42635212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:29,600][547459] Avg episode reward: [(0, '4572.253')] [2023-03-08 20:01:32,861][547799] Updated weights for policy 0, policy_version 83360 (0.0005) [2023-03-08 20:01:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 42696704. Throughput: 0: 10558.4. Samples: 42696704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:34,600][547459] Avg episode reward: [(0, '4690.725')] [2023-03-08 20:01:34,606][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000083392_42696704.pth... [2023-03-08 20:01:34,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000082776_42381312.pth [2023-03-08 20:01:36,857][547799] Updated weights for policy 0, policy_version 83440 (0.0005) [2023-03-08 20:01:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 42745856. Throughput: 0: 10550.0. Samples: 42728656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:39,600][547459] Avg episode reward: [(0, '4709.217')] [2023-03-08 20:01:40,751][547799] Updated weights for policy 0, policy_version 83520 (0.0005) [2023-03-08 20:01:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 42799104. Throughput: 0: 10532.4. Samples: 42790928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:44,600][547459] Avg episode reward: [(0, '4649.801')] [2023-03-08 20:01:44,710][547799] Updated weights for policy 0, policy_version 83600 (0.0005) [2023-03-08 20:01:48,636][547799] Updated weights for policy 0, policy_version 83680 (0.0005) [2023-03-08 20:01:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 42852352. Throughput: 0: 10483.3. Samples: 42853124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:49,600][547459] Avg episode reward: [(0, '4778.122')] [2023-03-08 20:01:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000083696_42852352.pth... [2023-03-08 20:01:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000083088_42541056.pth [2023-03-08 20:01:52,468][547799] Updated weights for policy 0, policy_version 83760 (0.0005) [2023-03-08 20:01:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 42905600. Throughput: 0: 10474.0. Samples: 42885396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:01:54,600][547459] Avg episode reward: [(0, '4685.865')] [2023-03-08 20:01:56,326][547799] Updated weights for policy 0, policy_version 83840 (0.0005) [2023-03-08 20:01:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 42958848. Throughput: 0: 10482.3. Samples: 42949820. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:01:59,600][547459] Avg episode reward: [(0, '4620.982')] [2023-03-08 20:02:00,191][547799] Updated weights for policy 0, policy_version 83920 (0.0005) [2023-03-08 20:02:03,981][547799] Updated weights for policy 0, policy_version 84000 (0.0005) [2023-03-08 20:02:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 43012096. Throughput: 0: 10505.3. Samples: 43013796. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:02:04,600][547459] Avg episode reward: [(0, '4597.245')] [2023-03-08 20:02:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000084008_43012096.pth... [2023-03-08 20:02:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000083392_42696704.pth [2023-03-08 20:02:07,835][547799] Updated weights for policy 0, policy_version 84080 (0.0005) [2023-03-08 20:02:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 43065344. Throughput: 0: 10533.3. Samples: 43045628. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:02:09,600][547459] Avg episode reward: [(0, '4576.832')] [2023-03-08 20:02:11,578][547799] Updated weights for policy 0, policy_version 84160 (0.0005) [2023-03-08 20:02:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 43118592. Throughput: 0: 10566.0. Samples: 43110684. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:02:14,600][547459] Avg episode reward: [(0, '4533.760')] [2023-03-08 20:02:15,504][547799] Updated weights for policy 0, policy_version 84240 (0.0005) [2023-03-08 20:02:19,409][547799] Updated weights for policy 0, policy_version 84320 (0.0005) [2023-03-08 20:02:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 43171840. Throughput: 0: 10580.0. Samples: 43172804. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:02:19,600][547459] Avg episode reward: [(0, '4628.200')] [2023-03-08 20:02:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000084320_43171840.pth... [2023-03-08 20:02:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000083696_42852352.pth [2023-03-08 20:02:23,335][547799] Updated weights for policy 0, policy_version 84400 (0.0005) [2023-03-08 20:02:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 43225088. Throughput: 0: 10576.9. Samples: 43204616. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:02:24,600][547459] Avg episode reward: [(0, '4628.682')] [2023-03-08 20:02:27,237][547799] Updated weights for policy 0, policy_version 84480 (0.0005) [2023-03-08 20:02:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 43278336. Throughput: 0: 10576.7. Samples: 43266880. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:02:29,600][547459] Avg episode reward: [(0, '4726.877')] [2023-03-08 20:02:31,069][547799] Updated weights for policy 0, policy_version 84560 (0.0005) [2023-03-08 20:02:34,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 43327488. Throughput: 0: 10596.8. Samples: 43329980. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:02:34,600][547459] Avg episode reward: [(0, '4617.899')] [2023-03-08 20:02:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000084624_43327488.pth... [2023-03-08 20:02:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000084008_43012096.pth [2023-03-08 20:02:35,093][547799] Updated weights for policy 0, policy_version 84640 (0.0005) [2023-03-08 20:02:39,009][547799] Updated weights for policy 0, policy_version 84720 (0.0005) [2023-03-08 20:02:39,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 43380736. Throughput: 0: 10566.9. Samples: 43360908. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:02:39,601][547459] Avg episode reward: [(0, '4534.713')] [2023-03-08 20:02:42,810][547799] Updated weights for policy 0, policy_version 84800 (0.0005) [2023-03-08 20:02:44,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 43433984. Throughput: 0: 10577.2. Samples: 43425792. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:02:44,600][547459] Avg episode reward: [(0, '4681.667')] [2023-03-08 20:02:46,646][547799] Updated weights for policy 0, policy_version 84880 (0.0005) [2023-03-08 20:02:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 43487232. Throughput: 0: 10549.2. Samples: 43488512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:02:49,600][547459] Avg episode reward: [(0, '4641.425')] [2023-03-08 20:02:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000084936_43487232.pth... [2023-03-08 20:02:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000084320_43171840.pth [2023-03-08 20:02:50,567][547799] Updated weights for policy 0, policy_version 84960 (0.0005) [2023-03-08 20:02:54,392][547799] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-08 20:02:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 43540480. Throughput: 0: 10547.4. Samples: 43520260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:02:54,600][547459] Avg episode reward: [(0, '4732.064')] [2023-03-08 20:02:58,295][547799] Updated weights for policy 0, policy_version 85120 (0.0006) [2023-03-08 20:02:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 43593728. Throughput: 0: 10519.6. Samples: 43584068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:02:59,600][547459] Avg episode reward: [(0, '4814.682')] [2023-03-08 20:03:02,286][547799] Updated weights for policy 0, policy_version 85200 (0.0005) [2023-03-08 20:03:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 43642880. Throughput: 0: 10514.8. Samples: 43645968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:03:04,600][547459] Avg episode reward: [(0, '4726.156')] [2023-03-08 20:03:04,650][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000085248_43646976.pth... [2023-03-08 20:03:04,652][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000084624_43327488.pth [2023-03-08 20:03:06,201][547799] Updated weights for policy 0, policy_version 85280 (0.0005) [2023-03-08 20:03:09,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10566.3). Total num frames: 43696128. Throughput: 0: 10508.9. Samples: 43677516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:03:09,600][547459] Avg episode reward: [(0, '4648.832')] [2023-03-08 20:03:10,060][547799] Updated weights for policy 0, policy_version 85360 (0.0005) [2023-03-08 20:03:13,952][547799] Updated weights for policy 0, policy_version 85440 (0.0005) [2023-03-08 20:03:14,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10566.3). Total num frames: 43749376. Throughput: 0: 10540.1. Samples: 43741184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:03:14,600][547459] Avg episode reward: [(0, '4784.616')] [2023-03-08 20:03:17,992][547799] Updated weights for policy 0, policy_version 85520 (0.0006) [2023-03-08 20:03:19,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 43798528. Throughput: 0: 10455.4. Samples: 43800476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:03:19,600][547459] Avg episode reward: [(0, '4032.038')] [2023-03-08 20:03:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000085544_43798528.pth... [2023-03-08 20:03:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000084936_43487232.pth [2023-03-08 20:03:22,253][547799] Updated weights for policy 0, policy_version 85600 (0.0006) [2023-03-08 20:03:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 43851776. Throughput: 0: 10440.7. Samples: 43830740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:03:24,600][547459] Avg episode reward: [(0, '3721.114')] [2023-03-08 20:03:26,196][547799] Updated weights for policy 0, policy_version 85680 (0.0005) [2023-03-08 20:03:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10552.4). Total num frames: 43900928. Throughput: 0: 10376.5. Samples: 43892736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:03:29,600][547459] Avg episode reward: [(0, '4723.443')] [2023-03-08 20:03:30,136][547799] Updated weights for policy 0, policy_version 85760 (0.0005) [2023-03-08 20:03:33,932][547799] Updated weights for policy 0, policy_version 85840 (0.0005) [2023-03-08 20:03:34,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 43954176. Throughput: 0: 10398.3. Samples: 43956436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:03:34,600][547459] Avg episode reward: [(0, '4722.617')] [2023-03-08 20:03:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000085848_43954176.pth... [2023-03-08 20:03:34,603][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000085248_43646976.pth [2023-03-08 20:03:37,715][547799] Updated weights for policy 0, policy_version 85920 (0.0005) [2023-03-08 20:03:39,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 44007424. Throughput: 0: 10414.7. Samples: 43988920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:03:39,600][547459] Avg episode reward: [(0, '4761.881')] [2023-03-08 20:03:41,534][547799] Updated weights for policy 0, policy_version 86000 (0.0005) [2023-03-08 20:03:44,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 44060672. Throughput: 0: 10443.4. Samples: 44054020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:03:44,600][547459] Avg episode reward: [(0, '4730.864')] [2023-03-08 20:03:45,354][547799] Updated weights for policy 0, policy_version 86080 (0.0005) [2023-03-08 20:03:49,267][547799] Updated weights for policy 0, policy_version 86160 (0.0005) [2023-03-08 20:03:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 44113920. Throughput: 0: 10474.1. Samples: 44117304. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:03:49,600][547459] Avg episode reward: [(0, '4491.910')] [2023-03-08 20:03:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000086160_44113920.pth... [2023-03-08 20:03:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000085544_43798528.pth [2023-03-08 20:03:53,246][547799] Updated weights for policy 0, policy_version 86240 (0.0005) [2023-03-08 20:03:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 44167168. Throughput: 0: 10438.5. Samples: 44147248. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:03:54,601][547459] Avg episode reward: [(0, '4499.748')] [2023-03-08 20:03:57,134][547799] Updated weights for policy 0, policy_version 86320 (0.0005) [2023-03-08 20:03:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10566.3). Total num frames: 44220416. Throughput: 0: 10445.8. Samples: 44211244. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:03:59,601][547459] Avg episode reward: [(0, '4665.762')] [2023-03-08 20:04:00,921][547799] Updated weights for policy 0, policy_version 86400 (0.0005) [2023-03-08 20:04:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 44273664. Throughput: 0: 10516.7. Samples: 44273728. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:04:04,601][547459] Avg episode reward: [(0, '4598.741')] [2023-03-08 20:04:04,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000086472_44273664.pth... [2023-03-08 20:04:04,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000085848_43954176.pth [2023-03-08 20:04:04,913][547799] Updated weights for policy 0, policy_version 86480 (0.0006) [2023-03-08 20:04:08,696][547799] Updated weights for policy 0, policy_version 86560 (0.0005) [2023-03-08 20:04:09,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 44326912. Throughput: 0: 10570.9. Samples: 44306432. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:04:09,600][547459] Avg episode reward: [(0, '4797.480')] [2023-03-08 20:04:12,607][547799] Updated weights for policy 0, policy_version 86640 (0.0005) [2023-03-08 20:04:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 44380160. Throughput: 0: 10579.3. Samples: 44368804. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:04:14,600][547459] Avg episode reward: [(0, '4761.949')] [2023-03-08 20:04:16,485][547799] Updated weights for policy 0, policy_version 86720 (0.0005) [2023-03-08 20:04:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 44433408. Throughput: 0: 10600.8. Samples: 44433472. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:04:19,600][547459] Avg episode reward: [(0, '4766.073')] [2023-03-08 20:04:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000086784_44433408.pth... [2023-03-08 20:04:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000086160_44113920.pth [2023-03-08 20:04:20,318][547799] Updated weights for policy 0, policy_version 86800 (0.0005) [2023-03-08 20:04:24,243][547799] Updated weights for policy 0, policy_version 86880 (0.0005) [2023-03-08 20:04:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 44482560. Throughput: 0: 10563.3. Samples: 44464268. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:04:24,600][547459] Avg episode reward: [(0, '4724.886')] [2023-03-08 20:04:28,189][547799] Updated weights for policy 0, policy_version 86960 (0.0005) [2023-03-08 20:04:29,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 44535808. Throughput: 0: 10523.7. Samples: 44527588. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:04:29,600][547459] Avg episode reward: [(0, '4805.713')] [2023-03-08 20:04:31,926][547799] Updated weights for policy 0, policy_version 87040 (0.0005) [2023-03-08 20:04:34,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 44593152. Throughput: 0: 10574.4. Samples: 44593152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:04:34,600][547459] Avg episode reward: [(0, '4554.809')] [2023-03-08 20:04:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000087096_44593152.pth... [2023-03-08 20:04:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000086472_44273664.pth [2023-03-08 20:04:35,742][547799] Updated weights for policy 0, policy_version 87120 (0.0005) [2023-03-08 20:04:39,519][547799] Updated weights for policy 0, policy_version 87200 (0.0005) [2023-03-08 20:04:39,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 44646400. Throughput: 0: 10613.4. Samples: 44624852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:04:39,600][547459] Avg episode reward: [(0, '4725.565')] [2023-03-08 20:04:43,369][547799] Updated weights for policy 0, policy_version 87280 (0.0006) [2023-03-08 20:04:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 44699648. Throughput: 0: 10609.1. Samples: 44688652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:04:44,600][547459] Avg episode reward: [(0, '4772.810')] [2023-03-08 20:04:47,284][547799] Updated weights for policy 0, policy_version 87360 (0.0005) [2023-03-08 20:04:49,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 44752896. Throughput: 0: 10648.1. Samples: 44752892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:04:49,600][547459] Avg episode reward: [(0, '4572.611')] [2023-03-08 20:04:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000087408_44752896.pth... [2023-03-08 20:04:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000086784_44433408.pth [2023-03-08 20:04:51,041][547799] Updated weights for policy 0, policy_version 87440 (0.0005) [2023-03-08 20:04:54,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 44806144. Throughput: 0: 10651.0. Samples: 44785728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:04:54,600][547459] Avg episode reward: [(0, '4727.296')] [2023-03-08 20:04:54,895][547799] Updated weights for policy 0, policy_version 87520 (0.0005) [2023-03-08 20:04:58,776][547799] Updated weights for policy 0, policy_version 87600 (0.0005) [2023-03-08 20:04:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 44859392. Throughput: 0: 10649.7. Samples: 44848040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:04:59,600][547459] Avg episode reward: [(0, '4728.431')] [2023-03-08 20:05:02,617][547799] Updated weights for policy 0, policy_version 87680 (0.0006) [2023-03-08 20:05:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 44912640. Throughput: 0: 10653.6. Samples: 44912884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:05:04,601][547459] Avg episode reward: [(0, '4768.729')] [2023-03-08 20:05:04,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000087720_44912640.pth... [2023-03-08 20:05:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000087096_44593152.pth [2023-03-08 20:05:06,317][547799] Updated weights for policy 0, policy_version 87760 (0.0006) [2023-03-08 20:05:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 44965888. Throughput: 0: 10721.8. Samples: 44946748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:05:09,600][547459] Avg episode reward: [(0, '4788.992')] [2023-03-08 20:05:10,111][547799] Updated weights for policy 0, policy_version 87840 (0.0004) [2023-03-08 20:05:13,837][547799] Updated weights for policy 0, policy_version 87920 (0.0005) [2023-03-08 20:05:14,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10580.2). Total num frames: 45023232. Throughput: 0: 10748.3. Samples: 45011264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:05:14,600][547459] Avg episode reward: [(0, '4682.854')] [2023-03-08 20:05:17,624][547799] Updated weights for policy 0, policy_version 88000 (0.0005) [2023-03-08 20:05:19,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10717.8, 300 sec: 10580.2). Total num frames: 45076480. Throughput: 0: 10738.2. Samples: 45076372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:05:19,600][547459] Avg episode reward: [(0, '4621.421')] [2023-03-08 20:05:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000088040_45076480.pth... [2023-03-08 20:05:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000087408_44752896.pth [2023-03-08 20:05:21,475][547799] Updated weights for policy 0, policy_version 88080 (0.0005) [2023-03-08 20:05:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10786.1, 300 sec: 10580.2). Total num frames: 45129728. Throughput: 0: 10757.0. Samples: 45108916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:05:24,600][547459] Avg episode reward: [(0, '4811.425')] [2023-03-08 20:05:25,373][547799] Updated weights for policy 0, policy_version 88160 (0.0006) [2023-03-08 20:05:29,240][547799] Updated weights for policy 0, policy_version 88240 (0.0005) [2023-03-08 20:05:29,599][547459] Fps is (10 sec: 10240.2, 60 sec: 10717.9, 300 sec: 10566.3). Total num frames: 45178880. Throughput: 0: 10724.7. Samples: 45171264. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:05:29,600][547459] Avg episode reward: [(0, '4633.781')] [2023-03-08 20:05:33,144][547799] Updated weights for policy 0, policy_version 88320 (0.0005) [2023-03-08 20:05:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 45232128. Throughput: 0: 10715.7. Samples: 45235100. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:05:34,600][547459] Avg episode reward: [(0, '4607.864')] [2023-03-08 20:05:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000088344_45232128.pth... [2023-03-08 20:05:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000087720_44912640.pth [2023-03-08 20:05:37,027][547799] Updated weights for policy 0, policy_version 88400 (0.0005) [2023-03-08 20:05:39,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 45285376. Throughput: 0: 10673.2. Samples: 45266024. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:05:39,600][547459] Avg episode reward: [(0, '4478.244')] [2023-03-08 20:05:41,065][547799] Updated weights for policy 0, policy_version 88480 (0.0005) [2023-03-08 20:05:44,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 45338624. Throughput: 0: 10681.6. Samples: 45328712. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:05:44,600][547459] Avg episode reward: [(0, '4726.138')] [2023-03-08 20:05:44,885][547799] Updated weights for policy 0, policy_version 88560 (0.0005) [2023-03-08 20:05:48,925][547799] Updated weights for policy 0, policy_version 88640 (0.0005) [2023-03-08 20:05:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 45387776. Throughput: 0: 10595.9. Samples: 45389700. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:05:49,600][547459] Avg episode reward: [(0, '4650.957')] [2023-03-08 20:05:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000088648_45387776.pth... [2023-03-08 20:05:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000088040_45076480.pth [2023-03-08 20:05:52,969][547799] Updated weights for policy 0, policy_version 88720 (0.0005) [2023-03-08 20:05:54,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 45441024. Throughput: 0: 10527.3. Samples: 45420476. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:05:54,600][547459] Avg episode reward: [(0, '4654.061')] [2023-03-08 20:05:56,858][547799] Updated weights for policy 0, policy_version 88800 (0.0005) [2023-03-08 20:05:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 45494272. Throughput: 0: 10485.2. Samples: 45483100. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:05:59,600][547459] Avg episode reward: [(0, '4624.580')] [2023-03-08 20:06:00,704][547799] Updated weights for policy 0, policy_version 88880 (0.0005) [2023-03-08 20:06:04,570][547799] Updated weights for policy 0, policy_version 88960 (0.0005) [2023-03-08 20:06:04,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 45547520. Throughput: 0: 10470.0. Samples: 45547520. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:06:04,600][547459] Avg episode reward: [(0, '4642.144')] [2023-03-08 20:06:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000088960_45547520.pth... [2023-03-08 20:06:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000088344_45232128.pth [2023-03-08 20:06:08,300][547799] Updated weights for policy 0, policy_version 89040 (0.0005) [2023-03-08 20:06:09,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 45600768. Throughput: 0: 10475.1. Samples: 45580296. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:06:09,600][547459] Avg episode reward: [(0, '4820.838')] [2023-03-08 20:06:12,197][547799] Updated weights for policy 0, policy_version 89120 (0.0006) [2023-03-08 20:06:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 45654016. Throughput: 0: 10481.6. Samples: 45642936. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:06:14,600][547459] Avg episode reward: [(0, '4769.626')] [2023-03-08 20:06:16,156][547799] Updated weights for policy 0, policy_version 89200 (0.0005) [2023-03-08 20:06:19,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 45703168. Throughput: 0: 10484.4. Samples: 45706896. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:06:19,600][547459] Avg episode reward: [(0, '4763.817')] [2023-03-08 20:06:19,656][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000089272_45707264.pth... [2023-03-08 20:06:19,659][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000088648_45387776.pth [2023-03-08 20:06:20,069][547799] Updated weights for policy 0, policy_version 89280 (0.0005) [2023-03-08 20:06:24,072][547799] Updated weights for policy 0, policy_version 89360 (0.0005) [2023-03-08 20:06:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 45756416. Throughput: 0: 10454.5. Samples: 45736476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:06:24,600][547459] Avg episode reward: [(0, '4717.988')] [2023-03-08 20:06:28,002][547799] Updated weights for policy 0, policy_version 89440 (0.0005) [2023-03-08 20:06:29,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 45809664. Throughput: 0: 10459.0. Samples: 45799368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:06:29,600][547459] Avg episode reward: [(0, '4513.969')] [2023-03-08 20:06:32,014][547799] Updated weights for policy 0, policy_version 89520 (0.0005) [2023-03-08 20:06:34,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 45858816. Throughput: 0: 10479.8. Samples: 45861292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:06:34,600][547459] Avg episode reward: [(0, '4412.749')] [2023-03-08 20:06:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000089568_45858816.pth... [2023-03-08 20:06:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000088960_45547520.pth [2023-03-08 20:06:35,961][547799] Updated weights for policy 0, policy_version 89600 (0.0005) [2023-03-08 20:06:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10552.4). Total num frames: 45912064. Throughput: 0: 10469.1. Samples: 45891584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:06:39,601][547459] Avg episode reward: [(0, '4601.237')] [2023-03-08 20:06:39,883][547799] Updated weights for policy 0, policy_version 89680 (0.0005) [2023-03-08 20:06:43,932][547799] Updated weights for policy 0, policy_version 89760 (0.0005) [2023-03-08 20:06:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10538.5). Total num frames: 45961216. Throughput: 0: 10443.0. Samples: 45953036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:06:44,600][547459] Avg episode reward: [(0, '4495.709')] [2023-03-08 20:06:47,893][547799] Updated weights for policy 0, policy_version 89840 (0.0006) [2023-03-08 20:06:49,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 46014464. Throughput: 0: 10378.2. Samples: 46014536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:06:49,600][547459] Avg episode reward: [(0, '4627.700')] [2023-03-08 20:06:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000089872_46014464.pth... [2023-03-08 20:06:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000089272_45707264.pth [2023-03-08 20:06:51,884][547799] Updated weights for policy 0, policy_version 89920 (0.0005) [2023-03-08 20:06:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 46067712. Throughput: 0: 10366.2. Samples: 46046776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:06:54,600][547459] Avg episode reward: [(0, '4391.712')] [2023-03-08 20:06:55,739][547799] Updated weights for policy 0, policy_version 90000 (0.0005) [2023-03-08 20:06:59,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 46116864. Throughput: 0: 10349.9. Samples: 46108680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:06:59,600][547459] Avg episode reward: [(0, '4584.908')] [2023-03-08 20:06:59,707][547799] Updated weights for policy 0, policy_version 90080 (0.0005) [2023-03-08 20:07:03,577][547799] Updated weights for policy 0, policy_version 90160 (0.0005) [2023-03-08 20:07:04,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 46170112. Throughput: 0: 10335.8. Samples: 46172008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:04,600][547459] Avg episode reward: [(0, '4719.532')] [2023-03-08 20:07:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000090176_46170112.pth... [2023-03-08 20:07:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000089568_45858816.pth [2023-03-08 20:07:07,464][547799] Updated weights for policy 0, policy_version 90240 (0.0005) [2023-03-08 20:07:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 46223360. Throughput: 0: 10372.4. Samples: 46203236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:09,600][547459] Avg episode reward: [(0, '4762.761')] [2023-03-08 20:07:11,313][547799] Updated weights for policy 0, policy_version 90320 (0.0006) [2023-03-08 20:07:14,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 46276608. Throughput: 0: 10410.8. Samples: 46267852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:14,600][547459] Avg episode reward: [(0, '4716.482')] [2023-03-08 20:07:15,226][547799] Updated weights for policy 0, policy_version 90400 (0.0005) [2023-03-08 20:07:19,084][547799] Updated weights for policy 0, policy_version 90480 (0.0005) [2023-03-08 20:07:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 46329856. Throughput: 0: 10413.9. Samples: 46329920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:19,600][547459] Avg episode reward: [(0, '4758.160')] [2023-03-08 20:07:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000090488_46329856.pth... [2023-03-08 20:07:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000089872_46014464.pth [2023-03-08 20:07:22,959][547799] Updated weights for policy 0, policy_version 90560 (0.0005) [2023-03-08 20:07:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 46383104. Throughput: 0: 10467.6. Samples: 46362624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:24,600][547459] Avg episode reward: [(0, '4673.273')] [2023-03-08 20:07:26,934][547799] Updated weights for policy 0, policy_version 90640 (0.0005) [2023-03-08 20:07:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10524.6). Total num frames: 46432256. Throughput: 0: 10468.7. Samples: 46424128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:29,600][547459] Avg episode reward: [(0, '4779.747')] [2023-03-08 20:07:30,868][547799] Updated weights for policy 0, policy_version 90720 (0.0005) [2023-03-08 20:07:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 46485504. Throughput: 0: 10515.2. Samples: 46487724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:34,600][547459] Avg episode reward: [(0, '4764.928')] [2023-03-08 20:07:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000090792_46485504.pth... [2023-03-08 20:07:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000090176_46170112.pth [2023-03-08 20:07:34,759][547799] Updated weights for policy 0, policy_version 90800 (0.0005) [2023-03-08 20:07:38,673][547799] Updated weights for policy 0, policy_version 90880 (0.0005) [2023-03-08 20:07:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 46538752. Throughput: 0: 10479.3. Samples: 46518344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:39,612][547459] Avg episode reward: [(0, '4430.952')] [2023-03-08 20:07:42,621][547799] Updated weights for policy 0, policy_version 90960 (0.0006) [2023-03-08 20:07:44,599][547459] Fps is (10 sec: 10649.8, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 46592000. Throughput: 0: 10518.9. Samples: 46582032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:44,610][547459] Avg episode reward: [(0, '4761.229')] [2023-03-08 20:07:46,366][547799] Updated weights for policy 0, policy_version 91040 (0.0005) [2023-03-08 20:07:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.0, 300 sec: 10524.6). Total num frames: 46645248. Throughput: 0: 10517.8. Samples: 46645312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:49,611][547459] Avg episode reward: [(0, '4722.791')] [2023-03-08 20:07:49,615][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000091104_46645248.pth... [2023-03-08 20:07:49,618][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000090488_46329856.pth [2023-03-08 20:07:50,275][547799] Updated weights for policy 0, policy_version 91120 (0.0006) [2023-03-08 20:07:54,112][547799] Updated weights for policy 0, policy_version 91200 (0.0005) [2023-03-08 20:07:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 46698496. Throughput: 0: 10549.1. Samples: 46677944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:54,601][547459] Avg episode reward: [(0, '4686.329')] [2023-03-08 20:07:58,084][547799] Updated weights for policy 0, policy_version 91280 (0.0006) [2023-03-08 20:07:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 46751744. Throughput: 0: 10489.7. Samples: 46739888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:07:59,600][547459] Avg episode reward: [(0, '4765.663')] [2023-03-08 20:08:01,891][547799] Updated weights for policy 0, policy_version 91360 (0.0005) [2023-03-08 20:08:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 46804992. Throughput: 0: 10549.3. Samples: 46804636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:08:04,600][547459] Avg episode reward: [(0, '4822.191')] [2023-03-08 20:08:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000091416_46804992.pth... [2023-03-08 20:08:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000090792_46485504.pth [2023-03-08 20:08:05,737][547799] Updated weights for policy 0, policy_version 91440 (0.0006) [2023-03-08 20:08:09,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 46854144. Throughput: 0: 10506.7. Samples: 46835424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:08:09,600][547459] Avg episode reward: [(0, '4450.792')] [2023-03-08 20:08:09,854][547799] Updated weights for policy 0, policy_version 91520 (0.0005) [2023-03-08 20:08:13,731][547799] Updated weights for policy 0, policy_version 91600 (0.0005) [2023-03-08 20:08:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 46907392. Throughput: 0: 10506.7. Samples: 46896928. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:14,601][547459] Avg episode reward: [(0, '4629.580')] [2023-03-08 20:08:17,565][547799] Updated weights for policy 0, policy_version 91680 (0.0005) [2023-03-08 20:08:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 46960640. Throughput: 0: 10522.1. Samples: 46961220. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:19,601][547459] Avg episode reward: [(0, '4620.720')] [2023-03-08 20:08:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000091720_46960640.pth... [2023-03-08 20:08:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000091104_46645248.pth [2023-03-08 20:08:21,415][547799] Updated weights for policy 0, policy_version 91760 (0.0005) [2023-03-08 20:08:24,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 47013888. Throughput: 0: 10538.0. Samples: 46992552. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:24,600][547459] Avg episode reward: [(0, '4415.424')] [2023-03-08 20:08:25,352][547799] Updated weights for policy 0, policy_version 91840 (0.0005) [2023-03-08 20:08:29,251][547799] Updated weights for policy 0, policy_version 91920 (0.0005) [2023-03-08 20:08:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 47063040. Throughput: 0: 10508.6. Samples: 47054920. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:29,600][547459] Avg episode reward: [(0, '4099.223')] [2023-03-08 20:08:33,205][547799] Updated weights for policy 0, policy_version 92000 (0.0005) [2023-03-08 20:08:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 47116288. Throughput: 0: 10494.7. Samples: 47117572. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:34,600][547459] Avg episode reward: [(0, '3932.320')] [2023-03-08 20:08:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000092024_47116288.pth... [2023-03-08 20:08:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000091416_46804992.pth [2023-03-08 20:08:37,234][547799] Updated weights for policy 0, policy_version 92080 (0.0005) [2023-03-08 20:08:39,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 47169536. Throughput: 0: 10461.0. Samples: 47148688. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:39,600][547459] Avg episode reward: [(0, '4279.784')] [2023-03-08 20:08:41,100][547799] Updated weights for policy 0, policy_version 92160 (0.0005) [2023-03-08 20:08:44,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 47218688. Throughput: 0: 10456.0. Samples: 47210408. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:44,600][547459] Avg episode reward: [(0, '4201.513')] [2023-03-08 20:08:45,206][547799] Updated weights for policy 0, policy_version 92240 (0.0005) [2023-03-08 20:08:49,046][547799] Updated weights for policy 0, policy_version 92320 (0.0005) [2023-03-08 20:08:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 47271936. Throughput: 0: 10402.8. Samples: 47272764. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:49,601][547459] Avg episode reward: [(0, '4495.181')] [2023-03-08 20:08:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000092328_47271936.pth... [2023-03-08 20:08:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000091720_46960640.pth [2023-03-08 20:08:52,956][547799] Updated weights for policy 0, policy_version 92400 (0.0005) [2023-03-08 20:08:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 47325184. Throughput: 0: 10420.8. Samples: 47304360. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:54,600][547459] Avg episode reward: [(0, '4586.493')] [2023-03-08 20:08:56,810][547799] Updated weights for policy 0, policy_version 92480 (0.0005) [2023-03-08 20:08:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 47378432. Throughput: 0: 10464.1. Samples: 47367812. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:08:59,600][547459] Avg episode reward: [(0, '4501.551')] [2023-03-08 20:09:00,729][547799] Updated weights for policy 0, policy_version 92560 (0.0005) [2023-03-08 20:09:04,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10510.8). Total num frames: 47427584. Throughput: 0: 10397.6. Samples: 47429112. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:09:04,600][547459] Avg episode reward: [(0, '4621.973')] [2023-03-08 20:09:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000092632_47427584.pth... [2023-03-08 20:09:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000092024_47116288.pth [2023-03-08 20:09:04,712][547799] Updated weights for policy 0, policy_version 92640 (0.0005) [2023-03-08 20:09:08,638][547799] Updated weights for policy 0, policy_version 92720 (0.0005) [2023-03-08 20:09:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 47480832. Throughput: 0: 10397.0. Samples: 47460416. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:09,600][547459] Avg episode reward: [(0, '4347.809')] [2023-03-08 20:09:12,507][547799] Updated weights for policy 0, policy_version 92800 (0.0005) [2023-03-08 20:09:14,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 47534080. Throughput: 0: 10420.6. Samples: 47523844. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:14,600][547459] Avg episode reward: [(0, '4452.159')] [2023-03-08 20:09:16,472][547799] Updated weights for policy 0, policy_version 92880 (0.0005) [2023-03-08 20:09:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 47587328. Throughput: 0: 10439.0. Samples: 47587328. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:19,601][547459] Avg episode reward: [(0, '4458.975')] [2023-03-08 20:09:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000092944_47587328.pth... [2023-03-08 20:09:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000092328_47271936.pth [2023-03-08 20:09:20,381][547799] Updated weights for policy 0, policy_version 92960 (0.0005) [2023-03-08 20:09:24,432][547799] Updated weights for policy 0, policy_version 93040 (0.0005) [2023-03-08 20:09:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10510.8). Total num frames: 47636480. Throughput: 0: 10434.1. Samples: 47618224. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:24,600][547459] Avg episode reward: [(0, '4564.004')] [2023-03-08 20:09:28,435][547799] Updated weights for policy 0, policy_version 93120 (0.0005) [2023-03-08 20:09:29,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 47689728. Throughput: 0: 10397.9. Samples: 47678312. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:29,600][547459] Avg episode reward: [(0, '4519.282')] [2023-03-08 20:09:32,195][547799] Updated weights for policy 0, policy_version 93200 (0.0005) [2023-03-08 20:09:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 47742976. Throughput: 0: 10450.8. Samples: 47743048. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:34,600][547459] Avg episode reward: [(0, '4821.354')] [2023-03-08 20:09:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000093248_47742976.pth... [2023-03-08 20:09:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000092632_47427584.pth [2023-03-08 20:09:36,035][547799] Updated weights for policy 0, policy_version 93280 (0.0005) [2023-03-08 20:09:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 47792128. Throughput: 0: 10469.4. Samples: 47775484. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:39,600][547459] Avg episode reward: [(0, '4761.328')] [2023-03-08 20:09:39,998][547799] Updated weights for policy 0, policy_version 93360 (0.0005) [2023-03-08 20:09:43,912][547799] Updated weights for policy 0, policy_version 93440 (0.0006) [2023-03-08 20:09:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 47845376. Throughput: 0: 10430.7. Samples: 47837192. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:44,600][547459] Avg episode reward: [(0, '4556.190')] [2023-03-08 20:09:47,758][547799] Updated weights for policy 0, policy_version 93520 (0.0006) [2023-03-08 20:09:49,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 47898624. Throughput: 0: 10481.1. Samples: 47900760. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:49,600][547459] Avg episode reward: [(0, '4733.452')] [2023-03-08 20:09:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000093552_47898624.pth... [2023-03-08 20:09:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000092944_47587328.pth [2023-03-08 20:09:51,565][547799] Updated weights for policy 0, policy_version 93600 (0.0005) [2023-03-08 20:09:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 47951872. Throughput: 0: 10506.6. Samples: 47933212. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:54,600][547459] Avg episode reward: [(0, '4809.009')] [2023-03-08 20:09:55,496][547799] Updated weights for policy 0, policy_version 93680 (0.0005) [2023-03-08 20:09:59,367][547799] Updated weights for policy 0, policy_version 93760 (0.0005) [2023-03-08 20:09:59,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 48005120. Throughput: 0: 10512.2. Samples: 47996892. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:09:59,600][547459] Avg episode reward: [(0, '4826.362')] [2023-03-08 20:10:03,180][547799] Updated weights for policy 0, policy_version 93840 (0.0006) [2023-03-08 20:10:04,599][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 48058368. Throughput: 0: 10526.4. Samples: 48061016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:04,600][547459] Avg episode reward: [(0, '4549.396')] [2023-03-08 20:10:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000093864_48058368.pth... [2023-03-08 20:10:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000093248_47742976.pth [2023-03-08 20:10:07,026][547799] Updated weights for policy 0, policy_version 93920 (0.0005) [2023-03-08 20:10:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 48111616. Throughput: 0: 10524.4. Samples: 48091820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:09,600][547459] Avg episode reward: [(0, '4732.725')] [2023-03-08 20:10:10,899][547799] Updated weights for policy 0, policy_version 94000 (0.0006) [2023-03-08 20:10:14,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10469.1). Total num frames: 48164864. Throughput: 0: 10618.4. Samples: 48156140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:14,600][547459] Avg episode reward: [(0, '4500.250')] [2023-03-08 20:10:14,821][547799] Updated weights for policy 0, policy_version 94080 (0.0005) [2023-03-08 20:10:18,748][547799] Updated weights for policy 0, policy_version 94160 (0.0005) [2023-03-08 20:10:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 48218112. Throughput: 0: 10557.2. Samples: 48218120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:19,600][547459] Avg episode reward: [(0, '4545.291')] [2023-03-08 20:10:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000094176_48218112.pth... [2023-03-08 20:10:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000093552_47898624.pth [2023-03-08 20:10:22,613][547799] Updated weights for policy 0, policy_version 94240 (0.0005) [2023-03-08 20:10:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 48267264. Throughput: 0: 10558.2. Samples: 48250604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:24,600][547459] Avg episode reward: [(0, '4530.038')] [2023-03-08 20:10:26,529][547799] Updated weights for policy 0, policy_version 94320 (0.0005) [2023-03-08 20:10:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 48320512. Throughput: 0: 10557.0. Samples: 48312256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:29,600][547459] Avg episode reward: [(0, '4597.937')] [2023-03-08 20:10:30,649][547799] Updated weights for policy 0, policy_version 94400 (0.0005) [2023-03-08 20:10:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 48369664. Throughput: 0: 10478.8. Samples: 48372304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:34,600][547459] Avg episode reward: [(0, '3957.363')] [2023-03-08 20:10:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000094472_48369664.pth... [2023-03-08 20:10:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000093864_48058368.pth [2023-03-08 20:10:34,678][547799] Updated weights for policy 0, policy_version 94480 (0.0005) [2023-03-08 20:10:38,672][547799] Updated weights for policy 0, policy_version 94560 (0.0005) [2023-03-08 20:10:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10455.2). Total num frames: 48422912. Throughput: 0: 10447.7. Samples: 48403360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:39,600][547459] Avg episode reward: [(0, '4414.499')] [2023-03-08 20:10:42,722][547799] Updated weights for policy 0, policy_version 94640 (0.0005) [2023-03-08 20:10:44,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 48472064. Throughput: 0: 10378.8. Samples: 48463936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:44,600][547459] Avg episode reward: [(0, '4164.892')] [2023-03-08 20:10:46,822][547799] Updated weights for policy 0, policy_version 94720 (0.0005) [2023-03-08 20:10:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 48525312. Throughput: 0: 10322.6. Samples: 48525536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:49,600][547459] Avg episode reward: [(0, '4191.614')] [2023-03-08 20:10:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000094776_48525312.pth... [2023-03-08 20:10:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000094176_48218112.pth [2023-03-08 20:10:50,615][547799] Updated weights for policy 0, policy_version 94800 (0.0005) [2023-03-08 20:10:54,532][547799] Updated weights for policy 0, policy_version 94880 (0.0005) [2023-03-08 20:10:54,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 48578560. Throughput: 0: 10359.7. Samples: 48558008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:54,601][547459] Avg episode reward: [(0, '4536.075')] [2023-03-08 20:10:58,316][547799] Updated weights for policy 0, policy_version 94960 (0.0005) [2023-03-08 20:10:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 48631808. Throughput: 0: 10351.1. Samples: 48621940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:10:59,600][547459] Avg episode reward: [(0, '4673.709')] [2023-03-08 20:11:02,211][547799] Updated weights for policy 0, policy_version 95040 (0.0005) [2023-03-08 20:11:04,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10441.3). Total num frames: 48680960. Throughput: 0: 10361.7. Samples: 48684396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:04,600][547459] Avg episode reward: [(0, '4557.987')] [2023-03-08 20:11:04,622][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000095088_48685056.pth... [2023-03-08 20:11:04,624][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000094472_48369664.pth [2023-03-08 20:11:06,156][547799] Updated weights for policy 0, policy_version 95120 (0.0005) [2023-03-08 20:11:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 48738304. Throughput: 0: 10337.5. Samples: 48715792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:09,600][547459] Avg episode reward: [(0, '4612.652')] [2023-03-08 20:11:09,933][547799] Updated weights for policy 0, policy_version 95200 (0.0006) [2023-03-08 20:11:13,763][547799] Updated weights for policy 0, policy_version 95280 (0.0005) [2023-03-08 20:11:14,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 48791552. Throughput: 0: 10422.4. Samples: 48781264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:14,601][547459] Avg episode reward: [(0, '4796.840')] [2023-03-08 20:11:17,599][547799] Updated weights for policy 0, policy_version 95360 (0.0005) [2023-03-08 20:11:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 48844800. Throughput: 0: 10492.6. Samples: 48844472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:19,600][547459] Avg episode reward: [(0, '4826.761')] [2023-03-08 20:11:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000095400_48844800.pth... [2023-03-08 20:11:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000094776_48525312.pth [2023-03-08 20:11:21,538][547799] Updated weights for policy 0, policy_version 95440 (0.0006) [2023-03-08 20:11:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 48893952. Throughput: 0: 10488.5. Samples: 48875344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:24,600][547459] Avg episode reward: [(0, '4631.675')] [2023-03-08 20:11:25,561][547799] Updated weights for policy 0, policy_version 95520 (0.0005) [2023-03-08 20:11:29,437][547799] Updated weights for policy 0, policy_version 95600 (0.0005) [2023-03-08 20:11:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 48947200. Throughput: 0: 10556.5. Samples: 48938980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:29,600][547459] Avg episode reward: [(0, '4665.167')] [2023-03-08 20:11:33,312][547799] Updated weights for policy 0, policy_version 95680 (0.0005) [2023-03-08 20:11:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 49000448. Throughput: 0: 10555.0. Samples: 49000512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:34,600][547459] Avg episode reward: [(0, '4522.282')] [2023-03-08 20:11:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000095704_49000448.pth... [2023-03-08 20:11:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000095088_48685056.pth [2023-03-08 20:11:37,300][547799] Updated weights for policy 0, policy_version 95760 (0.0005) [2023-03-08 20:11:39,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 49049600. Throughput: 0: 10536.2. Samples: 49032136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:39,600][547459] Avg episode reward: [(0, '4673.769')] [2023-03-08 20:11:41,066][547799] Updated weights for policy 0, policy_version 95840 (0.0005) [2023-03-08 20:11:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 49106944. Throughput: 0: 10532.4. Samples: 49095900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:44,601][547459] Avg episode reward: [(0, '4730.903')] [2023-03-08 20:11:44,807][547799] Updated weights for policy 0, policy_version 95920 (0.0005) [2023-03-08 20:11:48,535][547799] Updated weights for policy 0, policy_version 96000 (0.0005) [2023-03-08 20:11:49,599][547459] Fps is (10 sec: 11059.2, 60 sec: 10581.4, 300 sec: 10483.0). Total num frames: 49160192. Throughput: 0: 10635.0. Samples: 49162972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:49,600][547459] Avg episode reward: [(0, '4677.727')] [2023-03-08 20:11:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000096016_49160192.pth... [2023-03-08 20:11:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000095400_48844800.pth [2023-03-08 20:11:52,416][547799] Updated weights for policy 0, policy_version 96080 (0.0005) [2023-03-08 20:11:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.4, 300 sec: 10496.9). Total num frames: 49213440. Throughput: 0: 10624.5. Samples: 49193892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:54,600][547459] Avg episode reward: [(0, '4542.996')] [2023-03-08 20:11:56,455][547799] Updated weights for policy 0, policy_version 96160 (0.0005) [2023-03-08 20:11:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 49266688. Throughput: 0: 10557.1. Samples: 49256332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:11:59,601][547459] Avg episode reward: [(0, '4639.906')] [2023-03-08 20:12:00,308][547799] Updated weights for policy 0, policy_version 96240 (0.0005) [2023-03-08 20:12:04,246][547799] Updated weights for policy 0, policy_version 96320 (0.0005) [2023-03-08 20:12:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 49315840. Throughput: 0: 10560.1. Samples: 49319676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:04,600][547459] Avg episode reward: [(0, '4685.637')] [2023-03-08 20:12:04,625][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000096328_49319936.pth... [2023-03-08 20:12:04,627][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000095704_49000448.pth [2023-03-08 20:12:08,034][547799] Updated weights for policy 0, policy_version 96400 (0.0005) [2023-03-08 20:12:09,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49369088. Throughput: 0: 10574.5. Samples: 49351196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:09,600][547459] Avg episode reward: [(0, '4662.250')] [2023-03-08 20:12:11,927][547799] Updated weights for policy 0, policy_version 96480 (0.0005) [2023-03-08 20:12:14,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49422336. Throughput: 0: 10563.0. Samples: 49414312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:14,600][547459] Avg episode reward: [(0, '4276.231')] [2023-03-08 20:12:15,882][547799] Updated weights for policy 0, policy_version 96560 (0.0005) [2023-03-08 20:12:19,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49475584. Throughput: 0: 10609.1. Samples: 49477920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:19,600][547459] Avg episode reward: [(0, '4751.403')] [2023-03-08 20:12:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000096632_49475584.pth... [2023-03-08 20:12:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000096016_49160192.pth [2023-03-08 20:12:19,738][547799] Updated weights for policy 0, policy_version 96640 (0.0005) [2023-03-08 20:12:23,744][547799] Updated weights for policy 0, policy_version 96720 (0.0005) [2023-03-08 20:12:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 49528832. Throughput: 0: 10582.7. Samples: 49508360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:24,600][547459] Avg episode reward: [(0, '4582.473')] [2023-03-08 20:12:27,838][547799] Updated weights for policy 0, policy_version 96800 (0.0005) [2023-03-08 20:12:29,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49577984. Throughput: 0: 10519.0. Samples: 49569252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:29,600][547459] Avg episode reward: [(0, '4777.996')] [2023-03-08 20:12:31,618][547799] Updated weights for policy 0, policy_version 96880 (0.0005) [2023-03-08 20:12:34,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49631232. Throughput: 0: 10461.0. Samples: 49633716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:34,600][547459] Avg episode reward: [(0, '4772.825')] [2023-03-08 20:12:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000096936_49631232.pth... [2023-03-08 20:12:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000096328_49319936.pth [2023-03-08 20:12:35,467][547799] Updated weights for policy 0, policy_version 96960 (0.0005) [2023-03-08 20:12:39,241][547799] Updated weights for policy 0, policy_version 97040 (0.0005) [2023-03-08 20:12:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 49684480. Throughput: 0: 10509.9. Samples: 49666840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:39,600][547459] Avg episode reward: [(0, '4816.041')] [2023-03-08 20:12:43,135][547799] Updated weights for policy 0, policy_version 97120 (0.0005) [2023-03-08 20:12:44,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49737728. Throughput: 0: 10515.8. Samples: 49729544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:44,600][547459] Avg episode reward: [(0, '4773.340')] [2023-03-08 20:12:46,932][547799] Updated weights for policy 0, policy_version 97200 (0.0005) [2023-03-08 20:12:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49790976. Throughput: 0: 10534.1. Samples: 49793712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:49,600][547459] Avg episode reward: [(0, '4639.863')] [2023-03-08 20:12:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000097248_49790976.pth... [2023-03-08 20:12:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000096632_49475584.pth [2023-03-08 20:12:50,882][547799] Updated weights for policy 0, policy_version 97280 (0.0005) [2023-03-08 20:12:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 49844224. Throughput: 0: 10506.7. Samples: 49823996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:54,600][547459] Avg episode reward: [(0, '4452.502')] [2023-03-08 20:12:54,623][547799] Updated weights for policy 0, policy_version 97360 (0.0005) [2023-03-08 20:12:58,410][547799] Updated weights for policy 0, policy_version 97440 (0.0005) [2023-03-08 20:12:59,599][547459] Fps is (10 sec: 11059.3, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 49901568. Throughput: 0: 10578.4. Samples: 49890340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:12:59,601][547459] Avg episode reward: [(0, '4728.462')] [2023-03-08 20:13:02,318][547799] Updated weights for policy 0, policy_version 97520 (0.0005) [2023-03-08 20:13:04,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 49950720. Throughput: 0: 10574.4. Samples: 49953768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:04,600][547459] Avg episode reward: [(0, '4265.701')] [2023-03-08 20:13:04,642][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000097568_49954816.pth... [2023-03-08 20:13:04,644][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000096936_49631232.pth [2023-03-08 20:13:06,180][547799] Updated weights for policy 0, policy_version 97600 (0.0005) [2023-03-08 20:13:09,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 50003968. Throughput: 0: 10610.7. Samples: 49985840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:09,600][547459] Avg episode reward: [(0, '4731.705')] [2023-03-08 20:13:10,094][547799] Updated weights for policy 0, policy_version 97680 (0.0005) [2023-03-08 20:13:13,891][547799] Updated weights for policy 0, policy_version 97760 (0.0005) [2023-03-08 20:13:14,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 50057216. Throughput: 0: 10663.0. Samples: 50049088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:14,600][547459] Avg episode reward: [(0, '4762.893')] [2023-03-08 20:13:17,805][547799] Updated weights for policy 0, policy_version 97840 (0.0006) [2023-03-08 20:13:19,599][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.4, 300 sec: 10496.9). Total num frames: 50110464. Throughput: 0: 10645.5. Samples: 50112764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:19,600][547459] Avg episode reward: [(0, '4730.100')] [2023-03-08 20:13:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000097872_50110464.pth... [2023-03-08 20:13:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000097248_49790976.pth [2023-03-08 20:13:21,754][547799] Updated weights for policy 0, policy_version 97920 (0.0005) [2023-03-08 20:13:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 50163712. Throughput: 0: 10586.0. Samples: 50143212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:24,601][547459] Avg episode reward: [(0, '4675.747')] [2023-03-08 20:13:25,679][547799] Updated weights for policy 0, policy_version 98000 (0.0005) [2023-03-08 20:13:29,592][547799] Updated weights for policy 0, policy_version 98080 (0.0005) [2023-03-08 20:13:29,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 50216960. Throughput: 0: 10565.7. Samples: 50205000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:29,600][547459] Avg episode reward: [(0, '4694.735')] [2023-03-08 20:13:33,584][547799] Updated weights for policy 0, policy_version 98160 (0.0005) [2023-03-08 20:13:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 50266112. Throughput: 0: 10534.7. Samples: 50267772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:34,600][547459] Avg episode reward: [(0, '4600.304')] [2023-03-08 20:13:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000098176_50266112.pth... [2023-03-08 20:13:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000097568_49954816.pth [2023-03-08 20:13:37,576][547799] Updated weights for policy 0, policy_version 98240 (0.0005) [2023-03-08 20:13:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10510.7). Total num frames: 50319360. Throughput: 0: 10552.9. Samples: 50298880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:39,601][547459] Avg episode reward: [(0, '4630.989')] [2023-03-08 20:13:41,424][547799] Updated weights for policy 0, policy_version 98320 (0.0006) [2023-03-08 20:13:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 50372608. Throughput: 0: 10472.4. Samples: 50361600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:44,600][547459] Avg episode reward: [(0, '4769.838')] [2023-03-08 20:13:45,323][547799] Updated weights for policy 0, policy_version 98400 (0.0005) [2023-03-08 20:13:49,331][547799] Updated weights for policy 0, policy_version 98480 (0.0005) [2023-03-08 20:13:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 50421760. Throughput: 0: 10456.4. Samples: 50424308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:49,600][547459] Avg episode reward: [(0, '4574.230')] [2023-03-08 20:13:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000098480_50421760.pth... [2023-03-08 20:13:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000097872_50110464.pth [2023-03-08 20:13:53,251][547799] Updated weights for policy 0, policy_version 98560 (0.0005) [2023-03-08 20:13:54,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 50475008. Throughput: 0: 10417.5. Samples: 50454628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:13:54,600][547459] Avg episode reward: [(0, '4732.789')] [2023-03-08 20:13:57,098][547799] Updated weights for policy 0, policy_version 98640 (0.0005) [2023-03-08 20:13:59,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 50528256. Throughput: 0: 10458.2. Samples: 50519708. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:13:59,600][547459] Avg episode reward: [(0, '4776.779')] [2023-03-08 20:14:00,985][547799] Updated weights for policy 0, policy_version 98720 (0.0005) [2023-03-08 20:14:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10510.8). Total num frames: 50581504. Throughput: 0: 10417.8. Samples: 50581568. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:04,601][547459] Avg episode reward: [(0, '4678.991')] [2023-03-08 20:14:04,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000098792_50581504.pth... [2023-03-08 20:14:04,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000098176_50266112.pth [2023-03-08 20:14:04,895][547799] Updated weights for policy 0, policy_version 98800 (0.0005) [2023-03-08 20:14:08,780][547799] Updated weights for policy 0, policy_version 98880 (0.0005) [2023-03-08 20:14:09,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10513.0, 300 sec: 10510.7). Total num frames: 50634752. Throughput: 0: 10459.5. Samples: 50613888. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:09,600][547459] Avg episode reward: [(0, '4768.813')] [2023-03-08 20:14:12,682][547799] Updated weights for policy 0, policy_version 98960 (0.0005) [2023-03-08 20:14:14,599][547459] Fps is (10 sec: 10240.2, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 50683904. Throughput: 0: 10474.9. Samples: 50676368. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:14,600][547459] Avg episode reward: [(0, '4625.832')] [2023-03-08 20:14:16,617][547799] Updated weights for policy 0, policy_version 99040 (0.0005) [2023-03-08 20:14:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 50737152. Throughput: 0: 10443.0. Samples: 50737708. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:19,600][547459] Avg episode reward: [(0, '4306.597')] [2023-03-08 20:14:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000099096_50737152.pth... [2023-03-08 20:14:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000098480_50421760.pth [2023-03-08 20:14:20,641][547799] Updated weights for policy 0, policy_version 99120 (0.0005) [2023-03-08 20:14:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10496.9). Total num frames: 50786304. Throughput: 0: 10437.5. Samples: 50768568. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:24,600][547459] Avg episode reward: [(0, '4478.739')] [2023-03-08 20:14:24,646][547799] Updated weights for policy 0, policy_version 99200 (0.0006) [2023-03-08 20:14:28,540][547799] Updated weights for policy 0, policy_version 99280 (0.0005) [2023-03-08 20:14:29,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10496.9). Total num frames: 50839552. Throughput: 0: 10439.1. Samples: 50831360. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:29,600][547459] Avg episode reward: [(0, '4540.239')] [2023-03-08 20:14:32,530][547799] Updated weights for policy 0, policy_version 99360 (0.0005) [2023-03-08 20:14:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 50892800. Throughput: 0: 10413.1. Samples: 50892896. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:34,600][547459] Avg episode reward: [(0, '4544.478')] [2023-03-08 20:14:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000099400_50892800.pth... [2023-03-08 20:14:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000098792_50581504.pth [2023-03-08 20:14:36,353][547799] Updated weights for policy 0, policy_version 99440 (0.0005) [2023-03-08 20:14:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 50946048. Throughput: 0: 10465.3. Samples: 50925568. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:39,600][547459] Avg episode reward: [(0, '4516.577')] [2023-03-08 20:14:40,386][547799] Updated weights for policy 0, policy_version 99520 (0.0005) [2023-03-08 20:14:44,501][547799] Updated weights for policy 0, policy_version 99600 (0.0005) [2023-03-08 20:14:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10496.9). Total num frames: 50995200. Throughput: 0: 10354.2. Samples: 50985648. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:44,601][547459] Avg episode reward: [(0, '4066.169')] [2023-03-08 20:14:48,689][547799] Updated weights for policy 0, policy_version 99680 (0.0005) [2023-03-08 20:14:49,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 51044352. Throughput: 0: 10285.5. Samples: 51044416. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:49,600][547459] Avg episode reward: [(0, '4212.437')] [2023-03-08 20:14:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000099696_51044352.pth... [2023-03-08 20:14:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000099096_50737152.pth [2023-03-08 20:14:52,914][547799] Updated weights for policy 0, policy_version 99760 (0.0005) [2023-03-08 20:14:54,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10308.2, 300 sec: 10469.1). Total num frames: 51093504. Throughput: 0: 10205.9. Samples: 51073152. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:14:54,600][547459] Avg episode reward: [(0, '3573.376')] [2023-03-08 20:14:56,929][547799] Updated weights for policy 0, policy_version 99840 (0.0005) [2023-03-08 20:14:59,600][547459] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 10455.2). Total num frames: 51142656. Throughput: 0: 10175.8. Samples: 51134280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:14:59,600][547459] Avg episode reward: [(0, '3287.133')] [2023-03-08 20:15:01,227][547799] Updated weights for policy 0, policy_version 99920 (0.0005) [2023-03-08 20:15:04,600][547459] Fps is (10 sec: 9420.9, 60 sec: 10103.5, 300 sec: 10427.4). Total num frames: 51187712. Throughput: 0: 10057.0. Samples: 51190272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:04,600][547459] Avg episode reward: [(0, '3285.664')] [2023-03-08 20:15:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000099976_51187712.pth... [2023-03-08 20:15:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000099400_50892800.pth [2023-03-08 20:15:05,548][547799] Updated weights for policy 0, policy_version 100000 (0.0006) [2023-03-08 20:15:09,599][547459] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 10413.6). Total num frames: 51236864. Throughput: 0: 10022.1. Samples: 51219564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:09,600][547459] Avg episode reward: [(0, '3364.748')] [2023-03-08 20:15:09,808][547799] Updated weights for policy 0, policy_version 100080 (0.0005) [2023-03-08 20:15:14,038][547799] Updated weights for policy 0, policy_version 100160 (0.0005) [2023-03-08 20:15:14,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10399.7). Total num frames: 51286016. Throughput: 0: 9914.2. Samples: 51277500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:14,600][547459] Avg episode reward: [(0, '2938.563')] [2023-03-08 20:15:18,148][547799] Updated weights for policy 0, policy_version 100240 (0.0005) [2023-03-08 20:15:19,599][547459] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10399.7). Total num frames: 51335168. Throughput: 0: 9894.1. Samples: 51338128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:19,600][547459] Avg episode reward: [(0, '3759.029')] [2023-03-08 20:15:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000100264_51335168.pth... [2023-03-08 20:15:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000099696_51044352.pth [2023-03-08 20:15:22,086][547799] Updated weights for policy 0, policy_version 100320 (0.0005) [2023-03-08 20:15:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10399.7). Total num frames: 51388416. Throughput: 0: 9830.6. Samples: 51367944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:24,600][547459] Avg episode reward: [(0, '4180.294')] [2023-03-08 20:15:26,162][547799] Updated weights for policy 0, policy_version 100400 (0.0005) [2023-03-08 20:15:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10399.7). Total num frames: 51437568. Throughput: 0: 9817.4. Samples: 51427432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:29,600][547459] Avg episode reward: [(0, '3384.778')] [2023-03-08 20:15:30,422][547799] Updated weights for policy 0, policy_version 100480 (0.0005) [2023-03-08 20:15:34,470][547799] Updated weights for policy 0, policy_version 100560 (0.0005) [2023-03-08 20:15:34,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10385.8). Total num frames: 51486720. Throughput: 0: 9833.9. Samples: 51486940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:34,600][547459] Avg episode reward: [(0, '3345.709')] [2023-03-08 20:15:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000100560_51486720.pth... [2023-03-08 20:15:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000099976_51187712.pth [2023-03-08 20:15:38,695][547799] Updated weights for policy 0, policy_version 100640 (0.0005) [2023-03-08 20:15:39,599][547459] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10385.8). Total num frames: 51535872. Throughput: 0: 9850.1. Samples: 51516404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:39,600][547459] Avg episode reward: [(0, '3386.127')] [2023-03-08 20:15:42,993][547799] Updated weights for policy 0, policy_version 100720 (0.0005) [2023-03-08 20:15:44,600][547459] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10358.0). Total num frames: 51580928. Throughput: 0: 9778.9. Samples: 51574332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:44,600][547459] Avg episode reward: [(0, '3094.639')] [2023-03-08 20:15:47,088][547799] Updated weights for policy 0, policy_version 100800 (0.0005) [2023-03-08 20:15:49,600][547459] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10358.0). Total num frames: 51634176. Throughput: 0: 9864.5. Samples: 51634176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:49,600][547459] Avg episode reward: [(0, '3813.289')] [2023-03-08 20:15:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000100848_51634176.pth... [2023-03-08 20:15:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000100264_51335168.pth [2023-03-08 20:15:51,172][547799] Updated weights for policy 0, policy_version 100880 (0.0005) [2023-03-08 20:15:54,600][547459] Fps is (10 sec: 9830.4, 60 sec: 9762.2, 300 sec: 10330.3). Total num frames: 51679232. Throughput: 0: 9869.2. Samples: 51663680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:54,600][547459] Avg episode reward: [(0, '3973.292')] [2023-03-08 20:15:55,472][547799] Updated weights for policy 0, policy_version 100960 (0.0005) [2023-03-08 20:15:59,599][547459] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 10330.3). Total num frames: 51728384. Throughput: 0: 9867.5. Samples: 51721536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:15:59,600][547459] Avg episode reward: [(0, '3481.260')] [2023-03-08 20:15:59,617][547799] Updated weights for policy 0, policy_version 101040 (0.0005) [2023-03-08 20:16:03,583][547799] Updated weights for policy 0, policy_version 101120 (0.0005) [2023-03-08 20:16:04,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9898.6, 300 sec: 10316.4). Total num frames: 51781632. Throughput: 0: 9889.7. Samples: 51783168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:04,600][547459] Avg episode reward: [(0, '3559.550')] [2023-03-08 20:16:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000101136_51781632.pth... [2023-03-08 20:16:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000100560_51486720.pth [2023-03-08 20:16:07,720][547799] Updated weights for policy 0, policy_version 101200 (0.0005) [2023-03-08 20:16:09,599][547459] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10302.5). Total num frames: 51830784. Throughput: 0: 9888.5. Samples: 51812924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:09,600][547459] Avg episode reward: [(0, '3143.944')] [2023-03-08 20:16:11,853][547799] Updated weights for policy 0, policy_version 101280 (0.0006) [2023-03-08 20:16:14,599][547459] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10288.6). Total num frames: 51879936. Throughput: 0: 9892.4. Samples: 51872588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:14,600][547459] Avg episode reward: [(0, '3692.257')] [2023-03-08 20:16:15,891][547799] Updated weights for policy 0, policy_version 101360 (0.0005) [2023-03-08 20:16:19,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10302.5). Total num frames: 51933184. Throughput: 0: 9918.1. Samples: 51933256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:19,600][547459] Avg episode reward: [(0, '3705.568')] [2023-03-08 20:16:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000101432_51933184.pth... [2023-03-08 20:16:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000100848_51634176.pth [2023-03-08 20:16:19,912][547799] Updated weights for policy 0, policy_version 101440 (0.0005) [2023-03-08 20:16:24,089][547799] Updated weights for policy 0, policy_version 101520 (0.0005) [2023-03-08 20:16:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 10288.6). Total num frames: 51982336. Throughput: 0: 9940.3. Samples: 51963720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:24,600][547459] Avg episode reward: [(0, '3610.545')] [2023-03-08 20:16:28,143][547799] Updated weights for policy 0, policy_version 101600 (0.0006) [2023-03-08 20:16:29,599][547459] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10274.7). Total num frames: 52031488. Throughput: 0: 9976.5. Samples: 52023276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:29,600][547459] Avg episode reward: [(0, '3858.399')] [2023-03-08 20:16:32,091][547799] Updated weights for policy 0, policy_version 101680 (0.0005) [2023-03-08 20:16:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10288.6). Total num frames: 52084736. Throughput: 0: 10014.0. Samples: 52084808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:34,600][547459] Avg episode reward: [(0, '4450.279')] [2023-03-08 20:16:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000101728_52084736.pth... [2023-03-08 20:16:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000101136_51781632.pth [2023-03-08 20:16:36,123][547799] Updated weights for policy 0, policy_version 101760 (0.0005) [2023-03-08 20:16:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10260.8). Total num frames: 52133888. Throughput: 0: 10047.5. Samples: 52115816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:39,600][547459] Avg episode reward: [(0, '4248.533')] [2023-03-08 20:16:40,073][547799] Updated weights for policy 0, policy_version 101840 (0.0005) [2023-03-08 20:16:44,070][547799] Updated weights for policy 0, policy_version 101920 (0.0006) [2023-03-08 20:16:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10260.8). Total num frames: 52187136. Throughput: 0: 10135.5. Samples: 52177632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:44,600][547459] Avg episode reward: [(0, '4213.120')] [2023-03-08 20:16:48,100][547799] Updated weights for policy 0, policy_version 102000 (0.0005) [2023-03-08 20:16:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10246.9). Total num frames: 52236288. Throughput: 0: 10133.1. Samples: 52239156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:49,601][547459] Avg episode reward: [(0, '4246.691')] [2023-03-08 20:16:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000102024_52236288.pth... [2023-03-08 20:16:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000101432_51933184.pth [2023-03-08 20:16:52,036][547799] Updated weights for policy 0, policy_version 102080 (0.0005) [2023-03-08 20:16:54,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10246.9). Total num frames: 52289536. Throughput: 0: 10150.2. Samples: 52269684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:54,600][547459] Avg episode reward: [(0, '4533.183')] [2023-03-08 20:16:55,939][547799] Updated weights for policy 0, policy_version 102160 (0.0005) [2023-03-08 20:16:59,599][547459] Fps is (10 sec: 10649.8, 60 sec: 10240.0, 300 sec: 10260.8). Total num frames: 52342784. Throughput: 0: 10228.5. Samples: 52332868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:16:59,600][547459] Avg episode reward: [(0, '4490.524')] [2023-03-08 20:16:59,892][547799] Updated weights for policy 0, policy_version 102240 (0.0005) [2023-03-08 20:17:03,785][547799] Updated weights for policy 0, policy_version 102320 (0.0005) [2023-03-08 20:17:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10246.9). Total num frames: 52391936. Throughput: 0: 10279.4. Samples: 52395828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:04,600][547459] Avg episode reward: [(0, '4497.040')] [2023-03-08 20:17:04,616][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000102336_52396032.pth... [2023-03-08 20:17:04,618][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000101728_52084736.pth [2023-03-08 20:17:07,705][547799] Updated weights for policy 0, policy_version 102400 (0.0005) [2023-03-08 20:17:09,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10246.9). Total num frames: 52445184. Throughput: 0: 10298.0. Samples: 52427128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:09,600][547459] Avg episode reward: [(0, '4455.572')] [2023-03-08 20:17:11,590][547799] Updated weights for policy 0, policy_version 102480 (0.0005) [2023-03-08 20:17:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 52498432. Throughput: 0: 10377.0. Samples: 52490240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:14,600][547459] Avg episode reward: [(0, '4633.114')] [2023-03-08 20:17:15,544][547799] Updated weights for policy 0, policy_version 102560 (0.0005) [2023-03-08 20:17:19,471][547799] Updated weights for policy 0, policy_version 102640 (0.0005) [2023-03-08 20:17:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10246.9). Total num frames: 52551680. Throughput: 0: 10379.6. Samples: 52551888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:19,601][547459] Avg episode reward: [(0, '4567.990')] [2023-03-08 20:17:19,606][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000102640_52551680.pth... [2023-03-08 20:17:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000102024_52236288.pth [2023-03-08 20:17:23,239][547799] Updated weights for policy 0, policy_version 102720 (0.0005) [2023-03-08 20:17:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10260.8). Total num frames: 52604928. Throughput: 0: 10415.7. Samples: 52584524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:24,601][547459] Avg episode reward: [(0, '4661.258')] [2023-03-08 20:17:27,027][547799] Updated weights for policy 0, policy_version 102800 (0.0005) [2023-03-08 20:17:29,599][547459] Fps is (10 sec: 10649.8, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 52658176. Throughput: 0: 10493.1. Samples: 52649820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:29,600][547459] Avg episode reward: [(0, '4668.553')] [2023-03-08 20:17:31,091][547799] Updated weights for policy 0, policy_version 102880 (0.0005) [2023-03-08 20:17:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 52711424. Throughput: 0: 10496.3. Samples: 52711488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:34,600][547459] Avg episode reward: [(0, '4589.953')] [2023-03-08 20:17:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000102952_52711424.pth... [2023-03-08 20:17:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000102336_52396032.pth [2023-03-08 20:17:34,844][547799] Updated weights for policy 0, policy_version 102960 (0.0005) [2023-03-08 20:17:38,688][547799] Updated weights for policy 0, policy_version 103040 (0.0005) [2023-03-08 20:17:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10260.8). Total num frames: 52764672. Throughput: 0: 10544.9. Samples: 52744204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:39,601][547459] Avg episode reward: [(0, '4661.590')] [2023-03-08 20:17:42,677][547799] Updated weights for policy 0, policy_version 103120 (0.0006) [2023-03-08 20:17:44,599][547459] Fps is (10 sec: 10240.2, 60 sec: 10444.8, 300 sec: 10246.9). Total num frames: 52813824. Throughput: 0: 10507.3. Samples: 52805696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:44,600][547459] Avg episode reward: [(0, '4708.358')] [2023-03-08 20:17:46,690][547799] Updated weights for policy 0, policy_version 103200 (0.0005) [2023-03-08 20:17:49,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10246.9). Total num frames: 52867072. Throughput: 0: 10503.0. Samples: 52868464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:49,600][547459] Avg episode reward: [(0, '4778.039')] [2023-03-08 20:17:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000103256_52867072.pth... [2023-03-08 20:17:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000102640_52551680.pth [2023-03-08 20:17:50,542][547799] Updated weights for policy 0, policy_version 103280 (0.0006) [2023-03-08 20:17:54,543][547799] Updated weights for policy 0, policy_version 103360 (0.0005) [2023-03-08 20:17:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10233.1). Total num frames: 52920320. Throughput: 0: 10504.6. Samples: 52899836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:17:54,600][547459] Avg episode reward: [(0, '4694.348')] [2023-03-08 20:17:58,323][547799] Updated weights for policy 0, policy_version 103440 (0.0005) [2023-03-08 20:17:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10246.9). Total num frames: 52973568. Throughput: 0: 10516.4. Samples: 52963480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:17:59,600][547459] Avg episode reward: [(0, '4645.164')] [2023-03-08 20:18:02,171][547799] Updated weights for policy 0, policy_version 103520 (0.0005) [2023-03-08 20:18:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10246.9). Total num frames: 53026816. Throughput: 0: 10567.0. Samples: 53027404. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:04,601][547459] Avg episode reward: [(0, '4648.262')] [2023-03-08 20:18:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000103568_53026816.pth... [2023-03-08 20:18:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000102952_52711424.pth [2023-03-08 20:18:05,946][547799] Updated weights for policy 0, policy_version 103600 (0.0005) [2023-03-08 20:18:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10246.9). Total num frames: 53080064. Throughput: 0: 10556.0. Samples: 53059544. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:09,600][547459] Avg episode reward: [(0, '4465.928')] [2023-03-08 20:18:09,996][547799] Updated weights for policy 0, policy_version 103680 (0.0005) [2023-03-08 20:18:13,687][547799] Updated weights for policy 0, policy_version 103760 (0.0005) [2023-03-08 20:18:14,599][547459] Fps is (10 sec: 10649.8, 60 sec: 10581.4, 300 sec: 10246.9). Total num frames: 53133312. Throughput: 0: 10522.4. Samples: 53123328. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:14,600][547459] Avg episode reward: [(0, '4671.441')] [2023-03-08 20:18:17,577][547799] Updated weights for policy 0, policy_version 103840 (0.0005) [2023-03-08 20:18:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10246.9). Total num frames: 53186560. Throughput: 0: 10558.6. Samples: 53186624. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:19,600][547459] Avg episode reward: [(0, '4763.272')] [2023-03-08 20:18:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000103880_53186560.pth... [2023-03-08 20:18:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000103256_52867072.pth [2023-03-08 20:18:21,401][547799] Updated weights for policy 0, policy_version 103920 (0.0005) [2023-03-08 20:18:24,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10246.9). Total num frames: 53239808. Throughput: 0: 10558.3. Samples: 53219328. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:24,600][547459] Avg episode reward: [(0, '4770.092')] [2023-03-08 20:18:25,228][547799] Updated weights for policy 0, policy_version 104000 (0.0006) [2023-03-08 20:18:29,064][547799] Updated weights for policy 0, policy_version 104080 (0.0005) [2023-03-08 20:18:29,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10260.8). Total num frames: 53293056. Throughput: 0: 10611.4. Samples: 53283208. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:29,600][547459] Avg episode reward: [(0, '4714.096')] [2023-03-08 20:18:32,907][547799] Updated weights for policy 0, policy_version 104160 (0.0005) [2023-03-08 20:18:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10260.8). Total num frames: 53346304. Throughput: 0: 10618.9. Samples: 53346316. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:34,600][547459] Avg episode reward: [(0, '4459.279')] [2023-03-08 20:18:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000104192_53346304.pth... [2023-03-08 20:18:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000103568_53026816.pth [2023-03-08 20:18:36,892][547799] Updated weights for policy 0, policy_version 104240 (0.0005) [2023-03-08 20:18:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10246.9). Total num frames: 53395456. Throughput: 0: 10616.6. Samples: 53377584. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:39,600][547459] Avg episode reward: [(0, '4400.781')] [2023-03-08 20:18:40,901][547799] Updated weights for policy 0, policy_version 104320 (0.0006) [2023-03-08 20:18:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10260.8). Total num frames: 53448704. Throughput: 0: 10598.4. Samples: 53440408. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:44,600][547459] Avg episode reward: [(0, '4642.708')] [2023-03-08 20:18:44,863][547799] Updated weights for policy 0, policy_version 104400 (0.0005) [2023-03-08 20:18:48,882][547799] Updated weights for policy 0, policy_version 104480 (0.0005) [2023-03-08 20:18:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10246.9). Total num frames: 53497856. Throughput: 0: 10534.9. Samples: 53501476. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:49,600][547459] Avg episode reward: [(0, '4019.513')] [2023-03-08 20:18:49,650][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000104496_53501952.pth... [2023-03-08 20:18:49,653][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000103880_53186560.pth [2023-03-08 20:18:52,747][547799] Updated weights for policy 0, policy_version 104560 (0.0005) [2023-03-08 20:18:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10246.9). Total num frames: 53551104. Throughput: 0: 10497.6. Samples: 53531936. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:18:54,600][547459] Avg episode reward: [(0, '4564.505')] [2023-03-08 20:18:56,651][547799] Updated weights for policy 0, policy_version 104640 (0.0006) [2023-03-08 20:18:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10246.9). Total num frames: 53604352. Throughput: 0: 10503.4. Samples: 53595980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:18:59,600][547459] Avg episode reward: [(0, '4681.563')] [2023-03-08 20:19:00,682][547799] Updated weights for policy 0, policy_version 104720 (0.0006) [2023-03-08 20:19:04,485][547799] Updated weights for policy 0, policy_version 104800 (0.0005) [2023-03-08 20:19:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10246.9). Total num frames: 53657600. Throughput: 0: 10467.7. Samples: 53657672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:04,600][547459] Avg episode reward: [(0, '4642.636')] [2023-03-08 20:19:04,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000104800_53657600.pth... [2023-03-08 20:19:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000104192_53346304.pth [2023-03-08 20:19:08,222][547799] Updated weights for policy 0, policy_version 104880 (0.0005) [2023-03-08 20:19:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10260.8). Total num frames: 53710848. Throughput: 0: 10475.3. Samples: 53690716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:09,600][547459] Avg episode reward: [(0, '4640.083')] [2023-03-08 20:19:12,097][547799] Updated weights for policy 0, policy_version 104960 (0.0005) [2023-03-08 20:19:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.0, 300 sec: 10260.8). Total num frames: 53764096. Throughput: 0: 10502.3. Samples: 53755812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:14,601][547459] Avg episode reward: [(0, '4657.886')] [2023-03-08 20:19:15,989][547799] Updated weights for policy 0, policy_version 105040 (0.0005) [2023-03-08 20:19:19,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10274.7). Total num frames: 53817344. Throughput: 0: 10466.4. Samples: 53817304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:19,600][547459] Avg episode reward: [(0, '4693.679')] [2023-03-08 20:19:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000105112_53817344.pth... [2023-03-08 20:19:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000104496_53501952.pth [2023-03-08 20:19:19,948][547799] Updated weights for policy 0, policy_version 105120 (0.0005) [2023-03-08 20:19:23,943][547799] Updated weights for policy 0, policy_version 105200 (0.0005) [2023-03-08 20:19:24,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 53866496. Throughput: 0: 10436.8. Samples: 53847240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:24,600][547459] Avg episode reward: [(0, '4322.451')] [2023-03-08 20:19:27,885][547799] Updated weights for policy 0, policy_version 105280 (0.0005) [2023-03-08 20:19:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 53919744. Throughput: 0: 10447.2. Samples: 53910532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:29,601][547459] Avg episode reward: [(0, '4763.635')] [2023-03-08 20:19:31,908][547799] Updated weights for policy 0, policy_version 105360 (0.0005) [2023-03-08 20:19:34,600][547459] Fps is (10 sec: 10239.8, 60 sec: 10376.5, 300 sec: 10246.9). Total num frames: 53968896. Throughput: 0: 10455.8. Samples: 53971988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:34,602][547459] Avg episode reward: [(0, '4490.741')] [2023-03-08 20:19:34,641][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000105416_53972992.pth... [2023-03-08 20:19:34,644][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000104800_53657600.pth [2023-03-08 20:19:35,911][547799] Updated weights for policy 0, policy_version 105440 (0.0005) [2023-03-08 20:19:39,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10260.8). Total num frames: 54022144. Throughput: 0: 10478.2. Samples: 54003456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:39,606][547459] Avg episode reward: [(0, '4607.173')] [2023-03-08 20:19:39,674][547799] Updated weights for policy 0, policy_version 105520 (0.0005) [2023-03-08 20:19:43,597][547799] Updated weights for policy 0, policy_version 105600 (0.0005) [2023-03-08 20:19:44,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10274.7). Total num frames: 54075392. Throughput: 0: 10455.3. Samples: 54066468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:44,611][547459] Avg episode reward: [(0, '4580.795')] [2023-03-08 20:19:47,509][547799] Updated weights for policy 0, policy_version 105680 (0.0005) [2023-03-08 20:19:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10288.6). Total num frames: 54128640. Throughput: 0: 10477.5. Samples: 54129160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:49,611][547459] Avg episode reward: [(0, '4684.823')] [2023-03-08 20:19:49,615][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000105720_54128640.pth... [2023-03-08 20:19:49,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000105112_53817344.pth [2023-03-08 20:19:51,380][547799] Updated weights for policy 0, policy_version 105760 (0.0006) [2023-03-08 20:19:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10302.5). Total num frames: 54181888. Throughput: 0: 10460.0. Samples: 54161416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:19:54,611][547459] Avg episode reward: [(0, '4774.431')] [2023-03-08 20:19:55,292][547799] Updated weights for policy 0, policy_version 105840 (0.0005) [2023-03-08 20:19:59,228][547799] Updated weights for policy 0, policy_version 105920 (0.0005) [2023-03-08 20:19:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10316.4). Total num frames: 54231040. Throughput: 0: 10384.5. Samples: 54223112. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:19:59,600][547459] Avg episode reward: [(0, '4818.378')] [2023-03-08 20:20:03,132][547799] Updated weights for policy 0, policy_version 106000 (0.0005) [2023-03-08 20:20:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10330.2). Total num frames: 54284288. Throughput: 0: 10458.3. Samples: 54287928. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:04,600][547459] Avg episode reward: [(0, '4776.838')] [2023-03-08 20:20:04,615][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000106032_54288384.pth... [2023-03-08 20:20:04,618][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000105416_53972992.pth [2023-03-08 20:20:06,990][547799] Updated weights for policy 0, policy_version 106080 (0.0005) [2023-03-08 20:20:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10344.1). Total num frames: 54337536. Throughput: 0: 10479.5. Samples: 54318820. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:09,601][547459] Avg episode reward: [(0, '4535.630')] [2023-03-08 20:20:11,022][547799] Updated weights for policy 0, policy_version 106160 (0.0005) [2023-03-08 20:20:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 54386688. Throughput: 0: 10400.6. Samples: 54378560. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:14,600][547459] Avg episode reward: [(0, '3914.912')] [2023-03-08 20:20:15,095][547799] Updated weights for policy 0, policy_version 106240 (0.0005) [2023-03-08 20:20:19,151][547799] Updated weights for policy 0, policy_version 106320 (0.0005) [2023-03-08 20:20:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 54439936. Throughput: 0: 10400.5. Samples: 54440008. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:19,600][547459] Avg episode reward: [(0, '4026.658')] [2023-03-08 20:20:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000106328_54439936.pth... [2023-03-08 20:20:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000105720_54128640.pth [2023-03-08 20:20:23,086][547799] Updated weights for policy 0, policy_version 106400 (0.0005) [2023-03-08 20:20:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10344.1). Total num frames: 54489088. Throughput: 0: 10399.9. Samples: 54471452. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:24,600][547459] Avg episode reward: [(0, '4261.064')] [2023-03-08 20:20:26,950][547799] Updated weights for policy 0, policy_version 106480 (0.0005) [2023-03-08 20:20:29,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.6, 300 sec: 10358.0). Total num frames: 54542336. Throughput: 0: 10387.3. Samples: 54533896. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:29,600][547459] Avg episode reward: [(0, '3904.428')] [2023-03-08 20:20:31,089][547799] Updated weights for policy 0, policy_version 106560 (0.0005) [2023-03-08 20:20:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.6, 300 sec: 10358.0). Total num frames: 54591488. Throughput: 0: 10290.2. Samples: 54592216. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:34,600][547459] Avg episode reward: [(0, '3694.737')] [2023-03-08 20:20:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000106624_54591488.pth... [2023-03-08 20:20:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000106032_54288384.pth [2023-03-08 20:20:35,253][547799] Updated weights for policy 0, policy_version 106640 (0.0005) [2023-03-08 20:20:39,396][547799] Updated weights for policy 0, policy_version 106720 (0.0005) [2023-03-08 20:20:39,600][547459] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 10371.9). Total num frames: 54640640. Throughput: 0: 10258.6. Samples: 54623052. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:39,600][547459] Avg episode reward: [(0, '3571.589')] [2023-03-08 20:20:43,419][547799] Updated weights for policy 0, policy_version 106800 (0.0005) [2023-03-08 20:20:44,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10371.9). Total num frames: 54693888. Throughput: 0: 10216.3. Samples: 54682844. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:44,600][547459] Avg episode reward: [(0, '3969.642')] [2023-03-08 20:20:47,361][547799] Updated weights for policy 0, policy_version 106880 (0.0005) [2023-03-08 20:20:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10385.8). Total num frames: 54743040. Throughput: 0: 10173.5. Samples: 54745736. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:49,600][547459] Avg episode reward: [(0, '4362.506')] [2023-03-08 20:20:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000106920_54743040.pth... [2023-03-08 20:20:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000106328_54439936.pth [2023-03-08 20:20:51,275][547799] Updated weights for policy 0, policy_version 106960 (0.0005) [2023-03-08 20:20:54,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10399.7). Total num frames: 54796288. Throughput: 0: 10186.0. Samples: 54777188. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:54,601][547459] Avg episode reward: [(0, '4584.722')] [2023-03-08 20:20:54,981][547799] Updated weights for policy 0, policy_version 107040 (0.0005) [2023-03-08 20:20:58,969][547799] Updated weights for policy 0, policy_version 107120 (0.0005) [2023-03-08 20:20:59,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10399.7). Total num frames: 54849536. Throughput: 0: 10284.1. Samples: 54841344. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:20:59,600][547459] Avg episode reward: [(0, '4400.119')] [2023-03-08 20:21:02,813][547799] Updated weights for policy 0, policy_version 107200 (0.0005) [2023-03-08 20:21:04,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10413.6). Total num frames: 54902784. Throughput: 0: 10356.0. Samples: 54906028. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:04,600][547459] Avg episode reward: [(0, '4330.616')] [2023-03-08 20:21:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000107232_54902784.pth... [2023-03-08 20:21:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000106624_54591488.pth [2023-03-08 20:21:06,537][547799] Updated weights for policy 0, policy_version 107280 (0.0005) [2023-03-08 20:21:09,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 54956032. Throughput: 0: 10354.3. Samples: 54937396. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:09,600][547459] Avg episode reward: [(0, '4416.090')] [2023-03-08 20:21:10,466][547799] Updated weights for policy 0, policy_version 107360 (0.0006) [2023-03-08 20:21:14,521][547799] Updated weights for policy 0, policy_version 107440 (0.0005) [2023-03-08 20:21:14,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10427.4). Total num frames: 55009280. Throughput: 0: 10352.4. Samples: 54999756. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:14,600][547459] Avg episode reward: [(0, '4386.207')] [2023-03-08 20:21:18,659][547799] Updated weights for policy 0, policy_version 107520 (0.0005) [2023-03-08 20:21:19,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 55058432. Throughput: 0: 10361.8. Samples: 55058496. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:19,600][547459] Avg episode reward: [(0, '3838.023')] [2023-03-08 20:21:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000107536_55058432.pth... [2023-03-08 20:21:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000106920_54743040.pth [2023-03-08 20:21:22,808][547799] Updated weights for policy 0, policy_version 107600 (0.0005) [2023-03-08 20:21:24,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 55107584. Throughput: 0: 10327.7. Samples: 55087800. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:24,600][547459] Avg episode reward: [(0, '4305.267')] [2023-03-08 20:21:26,689][547799] Updated weights for policy 0, policy_version 107680 (0.0005) [2023-03-08 20:21:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10427.4). Total num frames: 55160832. Throughput: 0: 10432.1. Samples: 55152288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:29,600][547459] Avg episode reward: [(0, '4548.680')] [2023-03-08 20:21:30,575][547799] Updated weights for policy 0, policy_version 107760 (0.0005) [2023-03-08 20:21:34,562][547799] Updated weights for policy 0, policy_version 107840 (0.0006) [2023-03-08 20:21:34,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10441.3). Total num frames: 55214080. Throughput: 0: 10404.9. Samples: 55213956. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:34,600][547459] Avg episode reward: [(0, '4683.193')] [2023-03-08 20:21:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000107840_55214080.pth... [2023-03-08 20:21:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000107232_54902784.pth [2023-03-08 20:21:38,394][547799] Updated weights for policy 0, policy_version 107920 (0.0005) [2023-03-08 20:21:39,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 55267328. Throughput: 0: 10424.8. Samples: 55246304. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:39,600][547459] Avg episode reward: [(0, '4708.148')] [2023-03-08 20:21:42,242][547799] Updated weights for policy 0, policy_version 108000 (0.0005) [2023-03-08 20:21:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 10441.3). Total num frames: 55316480. Throughput: 0: 10392.6. Samples: 55309012. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:44,600][547459] Avg episode reward: [(0, '4706.410')] [2023-03-08 20:21:46,186][547799] Updated weights for policy 0, policy_version 108080 (0.0005) [2023-03-08 20:21:49,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 55369728. Throughput: 0: 10379.4. Samples: 55373100. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:49,600][547459] Avg episode reward: [(0, '4710.167')] [2023-03-08 20:21:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000108144_55369728.pth... [2023-03-08 20:21:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000107536_55058432.pth [2023-03-08 20:21:50,075][547799] Updated weights for policy 0, policy_version 108160 (0.0005) [2023-03-08 20:21:54,009][547799] Updated weights for policy 0, policy_version 108240 (0.0005) [2023-03-08 20:21:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 55422976. Throughput: 0: 10335.8. Samples: 55402508. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:54,600][547459] Avg episode reward: [(0, '4686.486')] [2023-03-08 20:21:57,799][547799] Updated weights for policy 0, policy_version 108320 (0.0006) [2023-03-08 20:21:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 55476224. Throughput: 0: 10403.9. Samples: 55467932. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:21:59,600][547459] Avg episode reward: [(0, '4757.058')] [2023-03-08 20:22:01,711][547799] Updated weights for policy 0, policy_version 108400 (0.0005) [2023-03-08 20:22:04,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 55529472. Throughput: 0: 10495.1. Samples: 55530776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:04,600][547459] Avg episode reward: [(0, '4759.411')] [2023-03-08 20:22:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000108456_55529472.pth... [2023-03-08 20:22:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000107840_55214080.pth [2023-03-08 20:22:05,516][547799] Updated weights for policy 0, policy_version 108480 (0.0005) [2023-03-08 20:22:09,386][547799] Updated weights for policy 0, policy_version 108560 (0.0005) [2023-03-08 20:22:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 55582720. Throughput: 0: 10544.5. Samples: 55562304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:09,600][547459] Avg episode reward: [(0, '4682.982')] [2023-03-08 20:22:13,353][547799] Updated weights for policy 0, policy_version 108640 (0.0005) [2023-03-08 20:22:14,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 55635968. Throughput: 0: 10518.9. Samples: 55625640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:14,600][547459] Avg episode reward: [(0, '4171.200')] [2023-03-08 20:22:17,308][547799] Updated weights for policy 0, policy_version 108720 (0.0005) [2023-03-08 20:22:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 55685120. Throughput: 0: 10531.8. Samples: 55687888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:19,600][547459] Avg episode reward: [(0, '4128.278')] [2023-03-08 20:22:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000108760_55685120.pth... [2023-03-08 20:22:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000108144_55369728.pth [2023-03-08 20:22:21,192][547799] Updated weights for policy 0, policy_version 108800 (0.0005) [2023-03-08 20:22:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 55738368. Throughput: 0: 10515.4. Samples: 55719500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:24,601][547459] Avg episode reward: [(0, '4167.184')] [2023-03-08 20:22:25,101][547799] Updated weights for policy 0, policy_version 108880 (0.0005) [2023-03-08 20:22:29,081][547799] Updated weights for policy 0, policy_version 108960 (0.0004) [2023-03-08 20:22:29,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 55791616. Throughput: 0: 10523.9. Samples: 55782588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:29,600][547459] Avg episode reward: [(0, '4243.185')] [2023-03-08 20:22:33,003][547799] Updated weights for policy 0, policy_version 109040 (0.0005) [2023-03-08 20:22:34,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 55840768. Throughput: 0: 10473.0. Samples: 55844384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:34,600][547459] Avg episode reward: [(0, '4380.523')] [2023-03-08 20:22:34,624][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000109072_55844864.pth... [2023-03-08 20:22:34,626][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000108456_55529472.pth [2023-03-08 20:22:37,105][547799] Updated weights for policy 0, policy_version 109120 (0.0006) [2023-03-08 20:22:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10441.3). Total num frames: 55894016. Throughput: 0: 10470.7. Samples: 55873688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:39,600][547459] Avg episode reward: [(0, '4517.143')] [2023-03-08 20:22:41,067][547799] Updated weights for policy 0, policy_version 109200 (0.0005) [2023-03-08 20:22:44,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10441.3). Total num frames: 55947264. Throughput: 0: 10395.7. Samples: 55935740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:44,601][547459] Avg episode reward: [(0, '4445.450')] [2023-03-08 20:22:44,978][547799] Updated weights for policy 0, policy_version 109280 (0.0005) [2023-03-08 20:22:48,857][547799] Updated weights for policy 0, policy_version 109360 (0.0005) [2023-03-08 20:22:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 55996416. Throughput: 0: 10432.9. Samples: 56000260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:49,600][547459] Avg episode reward: [(0, '4727.410')] [2023-03-08 20:22:49,631][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000109376_56000512.pth... [2023-03-08 20:22:49,633][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000108760_55685120.pth [2023-03-08 20:22:52,874][547799] Updated weights for policy 0, policy_version 109440 (0.0005) [2023-03-08 20:22:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 56049664. Throughput: 0: 10383.3. Samples: 56029552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:54,600][547459] Avg episode reward: [(0, '4529.447')] [2023-03-08 20:22:56,773][547799] Updated weights for policy 0, policy_version 109520 (0.0005) [2023-03-08 20:22:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10427.4). Total num frames: 56102912. Throughput: 0: 10404.7. Samples: 56093852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:22:59,600][547459] Avg episode reward: [(0, '4464.974')] [2023-03-08 20:23:00,642][547799] Updated weights for policy 0, policy_version 109600 (0.0006) [2023-03-08 20:23:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10413.6). Total num frames: 56152064. Throughput: 0: 10361.7. Samples: 56154164. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:04,600][547459] Avg episode reward: [(0, '4484.431')] [2023-03-08 20:23:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000109672_56152064.pth... [2023-03-08 20:23:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000109072_55844864.pth [2023-03-08 20:23:04,773][547799] Updated weights for policy 0, policy_version 109680 (0.0005) [2023-03-08 20:23:08,972][547799] Updated weights for policy 0, policy_version 109760 (0.0005) [2023-03-08 20:23:09,600][547459] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10399.7). Total num frames: 56201216. Throughput: 0: 10306.9. Samples: 56183312. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:09,600][547459] Avg episode reward: [(0, '4188.070')] [2023-03-08 20:23:13,039][547799] Updated weights for policy 0, policy_version 109840 (0.0005) [2023-03-08 20:23:14,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10308.2, 300 sec: 10399.7). Total num frames: 56254464. Throughput: 0: 10235.1. Samples: 56243168. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:14,601][547459] Avg episode reward: [(0, '4105.401')] [2023-03-08 20:23:16,949][547799] Updated weights for policy 0, policy_version 109920 (0.0005) [2023-03-08 20:23:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 56303616. Throughput: 0: 10283.9. Samples: 56307160. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:19,600][547459] Avg episode reward: [(0, '4811.087')] [2023-03-08 20:23:19,636][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000109976_56307712.pth... [2023-03-08 20:23:19,638][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000109376_56000512.pth [2023-03-08 20:23:20,793][547799] Updated weights for policy 0, policy_version 110000 (0.0005) [2023-03-08 20:23:24,599][547459] Fps is (10 sec: 10240.2, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 56356864. Throughput: 0: 10328.6. Samples: 56338472. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:24,600][547459] Avg episode reward: [(0, '4749.544')] [2023-03-08 20:23:24,635][547799] Updated weights for policy 0, policy_version 110080 (0.0006) [2023-03-08 20:23:28,830][547799] Updated weights for policy 0, policy_version 110160 (0.0005) [2023-03-08 20:23:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10371.9). Total num frames: 56406016. Throughput: 0: 10309.0. Samples: 56399644. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:29,600][547459] Avg episode reward: [(0, '4652.652')] [2023-03-08 20:23:32,794][547799] Updated weights for policy 0, policy_version 110240 (0.0005) [2023-03-08 20:23:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 56459264. Throughput: 0: 10243.9. Samples: 56461236. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:34,600][547459] Avg episode reward: [(0, '4567.483')] [2023-03-08 20:23:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000110272_56459264.pth... [2023-03-08 20:23:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000109672_56152064.pth [2023-03-08 20:23:36,704][547799] Updated weights for policy 0, policy_version 110320 (0.0005) [2023-03-08 20:23:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 56512512. Throughput: 0: 10290.9. Samples: 56492644. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:39,600][547459] Avg episode reward: [(0, '4587.961')] [2023-03-08 20:23:40,682][547799] Updated weights for policy 0, policy_version 110400 (0.0005) [2023-03-08 20:23:44,460][547799] Updated weights for policy 0, policy_version 110480 (0.0005) [2023-03-08 20:23:44,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10308.3, 300 sec: 10399.7). Total num frames: 56565760. Throughput: 0: 10266.0. Samples: 56555820. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:44,600][547459] Avg episode reward: [(0, '4650.424')] [2023-03-08 20:23:48,388][547799] Updated weights for policy 0, policy_version 110560 (0.0005) [2023-03-08 20:23:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 56619008. Throughput: 0: 10331.3. Samples: 56619072. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:49,600][547459] Avg episode reward: [(0, '4570.414')] [2023-03-08 20:23:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000110584_56619008.pth... [2023-03-08 20:23:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000109976_56307712.pth [2023-03-08 20:23:52,250][547799] Updated weights for policy 0, policy_version 110640 (0.0005) [2023-03-08 20:23:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10399.7). Total num frames: 56672256. Throughput: 0: 10402.0. Samples: 56651400. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:54,600][547459] Avg episode reward: [(0, '4611.109')] [2023-03-08 20:23:56,109][547799] Updated weights for policy 0, policy_version 110720 (0.0005) [2023-03-08 20:23:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10385.8). Total num frames: 56721408. Throughput: 0: 10477.8. Samples: 56714668. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:23:59,600][547459] Avg episode reward: [(0, '4726.489')] [2023-03-08 20:23:59,997][547799] Updated weights for policy 0, policy_version 110800 (0.0005) [2023-03-08 20:24:03,849][547799] Updated weights for policy 0, policy_version 110880 (0.0005) [2023-03-08 20:24:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10399.7). Total num frames: 56778752. Throughput: 0: 10474.5. Samples: 56778512. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:04,601][547459] Avg episode reward: [(0, '4689.509')] [2023-03-08 20:24:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000110896_56778752.pth... [2023-03-08 20:24:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000110272_56459264.pth [2023-03-08 20:24:07,710][547799] Updated weights for policy 0, policy_version 110960 (0.0005) [2023-03-08 20:24:09,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10385.8). Total num frames: 56827904. Throughput: 0: 10479.6. Samples: 56810052. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:09,600][547459] Avg episode reward: [(0, '4547.213')] [2023-03-08 20:24:11,597][547799] Updated weights for policy 0, policy_version 111040 (0.0005) [2023-03-08 20:24:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10385.8). Total num frames: 56881152. Throughput: 0: 10532.6. Samples: 56873612. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:14,600][547459] Avg episode reward: [(0, '4779.030')] [2023-03-08 20:24:15,483][547799] Updated weights for policy 0, policy_version 111120 (0.0005) [2023-03-08 20:24:19,323][547799] Updated weights for policy 0, policy_version 111200 (0.0005) [2023-03-08 20:24:19,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10399.7). Total num frames: 56934400. Throughput: 0: 10556.9. Samples: 56936296. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:19,600][547459] Avg episode reward: [(0, '4635.472')] [2023-03-08 20:24:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000111200_56934400.pth... [2023-03-08 20:24:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000110584_56619008.pth [2023-03-08 20:24:23,327][547799] Updated weights for policy 0, policy_version 111280 (0.0005) [2023-03-08 20:24:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10399.7). Total num frames: 56987648. Throughput: 0: 10545.2. Samples: 56967176. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:24,600][547459] Avg episode reward: [(0, '4610.076')] [2023-03-08 20:24:27,034][547799] Updated weights for policy 0, policy_version 111360 (0.0005) [2023-03-08 20:24:29,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10413.6). Total num frames: 57040896. Throughput: 0: 10594.9. Samples: 57032592. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:29,600][547459] Avg episode reward: [(0, '4398.400')] [2023-03-08 20:24:30,865][547799] Updated weights for policy 0, policy_version 111440 (0.0006) [2023-03-08 20:24:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10413.6). Total num frames: 57094144. Throughput: 0: 10597.4. Samples: 57095956. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:34,600][547459] Avg episode reward: [(0, '4678.189')] [2023-03-08 20:24:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000111512_57094144.pth... [2023-03-08 20:24:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000110896_56778752.pth [2023-03-08 20:24:34,698][547799] Updated weights for policy 0, policy_version 111520 (0.0005) [2023-03-08 20:24:38,630][547799] Updated weights for policy 0, policy_version 111600 (0.0005) [2023-03-08 20:24:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10413.6). Total num frames: 57147392. Throughput: 0: 10580.0. Samples: 57127500. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:39,600][547459] Avg episode reward: [(0, '4536.217')] [2023-03-08 20:24:42,458][547799] Updated weights for policy 0, policy_version 111680 (0.0005) [2023-03-08 20:24:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10413.6). Total num frames: 57200640. Throughput: 0: 10607.7. Samples: 57192016. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:44,600][547459] Avg episode reward: [(0, '4584.129')] [2023-03-08 20:24:46,397][547799] Updated weights for policy 0, policy_version 111760 (0.0005) [2023-03-08 20:24:49,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10413.6). Total num frames: 57253888. Throughput: 0: 10571.1. Samples: 57254212. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:49,600][547459] Avg episode reward: [(0, '4441.488')] [2023-03-08 20:24:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000111824_57253888.pth... [2023-03-08 20:24:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000111200_56934400.pth [2023-03-08 20:24:50,242][547799] Updated weights for policy 0, policy_version 111840 (0.0004) [2023-03-08 20:24:54,165][547799] Updated weights for policy 0, policy_version 111920 (0.0005) [2023-03-08 20:24:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10427.4). Total num frames: 57307136. Throughput: 0: 10591.3. Samples: 57286664. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:54,600][547459] Avg episode reward: [(0, '4444.055')] [2023-03-08 20:24:57,985][547799] Updated weights for policy 0, policy_version 112000 (0.0005) [2023-03-08 20:24:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10427.4). Total num frames: 57360384. Throughput: 0: 10567.0. Samples: 57349128. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:24:59,600][547459] Avg episode reward: [(0, '4403.739')] [2023-03-08 20:25:01,852][547799] Updated weights for policy 0, policy_version 112080 (0.0005) [2023-03-08 20:25:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10427.4). Total num frames: 57413632. Throughput: 0: 10607.4. Samples: 57413632. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:04,601][547459] Avg episode reward: [(0, '4679.649')] [2023-03-08 20:25:04,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000112136_57413632.pth... [2023-03-08 20:25:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000111512_57094144.pth [2023-03-08 20:25:05,749][547799] Updated weights for policy 0, policy_version 112160 (0.0005) [2023-03-08 20:25:09,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10427.4). Total num frames: 57462784. Throughput: 0: 10594.2. Samples: 57443916. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:09,600][547459] Avg episode reward: [(0, '4380.534')] [2023-03-08 20:25:09,728][547799] Updated weights for policy 0, policy_version 112240 (0.0005) [2023-03-08 20:25:13,658][547799] Updated weights for policy 0, policy_version 112320 (0.0004) [2023-03-08 20:25:14,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10427.4). Total num frames: 57516032. Throughput: 0: 10547.5. Samples: 57507228. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:14,600][547459] Avg episode reward: [(0, '4499.259')] [2023-03-08 20:25:17,561][547799] Updated weights for policy 0, policy_version 112400 (0.0005) [2023-03-08 20:25:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10441.3). Total num frames: 57569280. Throughput: 0: 10519.9. Samples: 57569352. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:19,600][547459] Avg episode reward: [(0, '4448.681')] [2023-03-08 20:25:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000112440_57569280.pth... [2023-03-08 20:25:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000111824_57253888.pth [2023-03-08 20:25:21,313][547799] Updated weights for policy 0, policy_version 112480 (0.0005) [2023-03-08 20:25:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10441.3). Total num frames: 57622528. Throughput: 0: 10563.7. Samples: 57602868. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:24,601][547459] Avg episode reward: [(0, '4547.085')] [2023-03-08 20:25:25,147][547799] Updated weights for policy 0, policy_version 112560 (0.0005) [2023-03-08 20:25:28,971][547799] Updated weights for policy 0, policy_version 112640 (0.0005) [2023-03-08 20:25:29,599][547459] Fps is (10 sec: 10649.8, 60 sec: 10581.4, 300 sec: 10455.2). Total num frames: 57675776. Throughput: 0: 10558.4. Samples: 57667144. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:29,600][547459] Avg episode reward: [(0, '4267.449')] [2023-03-08 20:25:32,879][547799] Updated weights for policy 0, policy_version 112720 (0.0005) [2023-03-08 20:25:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10469.1). Total num frames: 57729024. Throughput: 0: 10561.7. Samples: 57729488. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:34,600][547459] Avg episode reward: [(0, '4218.790')] [2023-03-08 20:25:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000112752_57729024.pth... [2023-03-08 20:25:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000112136_57413632.pth [2023-03-08 20:25:36,806][547799] Updated weights for policy 0, policy_version 112800 (0.0005) [2023-03-08 20:25:39,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10581.3, 300 sec: 10469.1). Total num frames: 57782272. Throughput: 0: 10546.4. Samples: 57761252. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:39,601][547459] Avg episode reward: [(0, '4221.332')] [2023-03-08 20:25:40,664][547799] Updated weights for policy 0, policy_version 112880 (0.0005) [2023-03-08 20:25:44,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 57831424. Throughput: 0: 10537.3. Samples: 57823304. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:44,600][547459] Avg episode reward: [(0, '4356.612')] [2023-03-08 20:25:44,675][547799] Updated weights for policy 0, policy_version 112960 (0.0005) [2023-03-08 20:25:48,583][547799] Updated weights for policy 0, policy_version 113040 (0.0005) [2023-03-08 20:25:49,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 57884672. Throughput: 0: 10489.8. Samples: 57885672. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:49,600][547459] Avg episode reward: [(0, '4392.213')] [2023-03-08 20:25:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000113056_57884672.pth... [2023-03-08 20:25:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000112440_57569280.pth [2023-03-08 20:25:52,615][547799] Updated weights for policy 0, policy_version 113120 (0.0005) [2023-03-08 20:25:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 57937920. Throughput: 0: 10505.7. Samples: 57916672. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:54,600][547459] Avg episode reward: [(0, '4371.254')] [2023-03-08 20:25:56,458][547799] Updated weights for policy 0, policy_version 113200 (0.0005) [2023-03-08 20:25:59,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 57991168. Throughput: 0: 10493.3. Samples: 57979428. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:25:59,601][547459] Avg episode reward: [(0, '4446.398')] [2023-03-08 20:26:00,311][547799] Updated weights for policy 0, policy_version 113280 (0.0006) [2023-03-08 20:26:04,271][547799] Updated weights for policy 0, policy_version 113360 (0.0005) [2023-03-08 20:26:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10455.2). Total num frames: 58040320. Throughput: 0: 10544.1. Samples: 58043836. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:26:04,600][547459] Avg episode reward: [(0, '4397.534')] [2023-03-08 20:26:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000113360_58040320.pth... [2023-03-08 20:26:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000112752_57729024.pth [2023-03-08 20:26:08,259][547799] Updated weights for policy 0, policy_version 113440 (0.0005) [2023-03-08 20:26:09,600][547459] Fps is (10 sec: 10239.6, 60 sec: 10513.0, 300 sec: 10455.2). Total num frames: 58093568. Throughput: 0: 10450.7. Samples: 58073152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:09,601][547459] Avg episode reward: [(0, '4256.256')] [2023-03-08 20:26:12,079][547799] Updated weights for policy 0, policy_version 113520 (0.0005) [2023-03-08 20:26:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 58146816. Throughput: 0: 10459.2. Samples: 58137808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:14,600][547459] Avg episode reward: [(0, '4501.672')] [2023-03-08 20:26:15,927][547799] Updated weights for policy 0, policy_version 113600 (0.0005) [2023-03-08 20:26:19,600][547459] Fps is (10 sec: 10649.9, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 58200064. Throughput: 0: 10468.1. Samples: 58200552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:19,600][547459] Avg episode reward: [(0, '4592.644')] [2023-03-08 20:26:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000113672_58200064.pth... [2023-03-08 20:26:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000113056_57884672.pth [2023-03-08 20:26:19,827][547799] Updated weights for policy 0, policy_version 113680 (0.0006) [2023-03-08 20:26:23,803][547799] Updated weights for policy 0, policy_version 113760 (0.0006) [2023-03-08 20:26:24,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 58253312. Throughput: 0: 10479.6. Samples: 58232832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:24,600][547459] Avg episode reward: [(0, '4680.105')] [2023-03-08 20:26:27,672][547799] Updated weights for policy 0, policy_version 113840 (0.0005) [2023-03-08 20:26:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 58302464. Throughput: 0: 10478.8. Samples: 58294852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:29,600][547459] Avg episode reward: [(0, '4540.013')] [2023-03-08 20:26:31,633][547799] Updated weights for policy 0, policy_version 113920 (0.0005) [2023-03-08 20:26:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 58355712. Throughput: 0: 10471.6. Samples: 58356896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:34,601][547459] Avg episode reward: [(0, '4734.505')] [2023-03-08 20:26:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000113976_58355712.pth... [2023-03-08 20:26:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000113360_58040320.pth [2023-03-08 20:26:35,553][547799] Updated weights for policy 0, policy_version 114000 (0.0005) [2023-03-08 20:26:39,517][547799] Updated weights for policy 0, policy_version 114080 (0.0006) [2023-03-08 20:26:39,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 58408960. Throughput: 0: 10486.1. Samples: 58388544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:39,600][547459] Avg episode reward: [(0, '4637.020')] [2023-03-08 20:26:43,307][547799] Updated weights for policy 0, policy_version 114160 (0.0006) [2023-03-08 20:26:44,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 58462208. Throughput: 0: 10519.7. Samples: 58452816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:44,600][547459] Avg episode reward: [(0, '4640.111')] [2023-03-08 20:26:47,070][547799] Updated weights for policy 0, policy_version 114240 (0.0005) [2023-03-08 20:26:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.0, 300 sec: 10483.0). Total num frames: 58515456. Throughput: 0: 10493.5. Samples: 58516044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:49,600][547459] Avg episode reward: [(0, '4737.683')] [2023-03-08 20:26:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000114288_58515456.pth... [2023-03-08 20:26:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000113672_58200064.pth [2023-03-08 20:26:51,033][547799] Updated weights for policy 0, policy_version 114320 (0.0005) [2023-03-08 20:26:54,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 58568704. Throughput: 0: 10557.3. Samples: 58548224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:54,600][547459] Avg episode reward: [(0, '4774.771')] [2023-03-08 20:26:54,937][547799] Updated weights for policy 0, policy_version 114400 (0.0005) [2023-03-08 20:26:58,788][547799] Updated weights for policy 0, policy_version 114480 (0.0005) [2023-03-08 20:26:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 58621952. Throughput: 0: 10504.8. Samples: 58610524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:26:59,600][547459] Avg episode reward: [(0, '4496.219')] [2023-03-08 20:27:02,570][547799] Updated weights for policy 0, policy_version 114560 (0.0005) [2023-03-08 20:27:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10483.0). Total num frames: 58675200. Throughput: 0: 10571.6. Samples: 58676272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:04,600][547459] Avg episode reward: [(0, '4780.377')] [2023-03-08 20:27:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000114600_58675200.pth... [2023-03-08 20:27:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000113976_58355712.pth [2023-03-08 20:27:06,276][547799] Updated weights for policy 0, policy_version 114640 (0.0005) [2023-03-08 20:27:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.4, 300 sec: 10483.0). Total num frames: 58728448. Throughput: 0: 10560.0. Samples: 58708032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:09,601][547459] Avg episode reward: [(0, '4640.897')] [2023-03-08 20:27:10,185][547799] Updated weights for policy 0, policy_version 114720 (0.0006) [2023-03-08 20:27:14,051][547799] Updated weights for policy 0, policy_version 114800 (0.0005) [2023-03-08 20:27:14,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 58781696. Throughput: 0: 10605.7. Samples: 58772108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:14,600][547459] Avg episode reward: [(0, '4822.001')] [2023-03-08 20:27:18,002][547799] Updated weights for policy 0, policy_version 114880 (0.0005) [2023-03-08 20:27:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 58834944. Throughput: 0: 10624.9. Samples: 58835016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:19,601][547459] Avg episode reward: [(0, '4779.406')] [2023-03-08 20:27:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000114912_58834944.pth... [2023-03-08 20:27:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000114288_58515456.pth [2023-03-08 20:27:21,749][547799] Updated weights for policy 0, policy_version 114960 (0.0005) [2023-03-08 20:27:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 58888192. Throughput: 0: 10649.6. Samples: 58867776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:24,601][547459] Avg episode reward: [(0, '4733.633')] [2023-03-08 20:27:25,618][547799] Updated weights for policy 0, policy_version 115040 (0.0005) [2023-03-08 20:27:29,587][547799] Updated weights for policy 0, policy_version 115120 (0.0005) [2023-03-08 20:27:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10510.7). Total num frames: 58941440. Throughput: 0: 10611.2. Samples: 58930320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:29,600][547459] Avg episode reward: [(0, '4686.395')] [2023-03-08 20:27:33,452][547799] Updated weights for policy 0, policy_version 115200 (0.0005) [2023-03-08 20:27:34,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10510.8). Total num frames: 58994688. Throughput: 0: 10636.6. Samples: 58994688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:34,600][547459] Avg episode reward: [(0, '4650.916')] [2023-03-08 20:27:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000115224_58994688.pth... [2023-03-08 20:27:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000114600_58675200.pth [2023-03-08 20:27:37,367][547799] Updated weights for policy 0, policy_version 115280 (0.0005) [2023-03-08 20:27:39,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 59043840. Throughput: 0: 10592.5. Samples: 59024888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:39,600][547459] Avg episode reward: [(0, '4109.446')] [2023-03-08 20:27:41,256][547799] Updated weights for policy 0, policy_version 115360 (0.0005) [2023-03-08 20:27:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 59097088. Throughput: 0: 10630.7. Samples: 59088904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:44,600][547459] Avg episode reward: [(0, '4278.511')] [2023-03-08 20:27:45,024][547799] Updated weights for policy 0, policy_version 115440 (0.0005) [2023-03-08 20:27:48,850][547799] Updated weights for policy 0, policy_version 115520 (0.0005) [2023-03-08 20:27:49,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 59154432. Throughput: 0: 10620.4. Samples: 59154192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:49,600][547459] Avg episode reward: [(0, '4159.713')] [2023-03-08 20:27:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000115536_59154432.pth... [2023-03-08 20:27:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000114912_58834944.pth [2023-03-08 20:27:52,681][547799] Updated weights for policy 0, policy_version 115600 (0.0005) [2023-03-08 20:27:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 59203584. Throughput: 0: 10622.6. Samples: 59186048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:54,600][547459] Avg episode reward: [(0, '4282.173')] [2023-03-08 20:27:56,672][547799] Updated weights for policy 0, policy_version 115680 (0.0005) [2023-03-08 20:27:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 59256832. Throughput: 0: 10589.6. Samples: 59248640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:27:59,601][547459] Avg episode reward: [(0, '4369.967')] [2023-03-08 20:28:00,591][547799] Updated weights for policy 0, policy_version 115760 (0.0005) [2023-03-08 20:28:04,451][547799] Updated weights for policy 0, policy_version 115840 (0.0005) [2023-03-08 20:28:04,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 59310080. Throughput: 0: 10572.3. Samples: 59310768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:04,601][547459] Avg episode reward: [(0, '4271.629')] [2023-03-08 20:28:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000115840_59310080.pth... [2023-03-08 20:28:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000115224_58994688.pth [2023-03-08 20:28:08,473][547799] Updated weights for policy 0, policy_version 115920 (0.0005) [2023-03-08 20:28:09,599][547459] Fps is (10 sec: 10240.2, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 59359232. Throughput: 0: 10531.0. Samples: 59341668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:09,600][547459] Avg episode reward: [(0, '4419.189')] [2023-03-08 20:28:12,363][547799] Updated weights for policy 0, policy_version 116000 (0.0005) [2023-03-08 20:28:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 59416576. Throughput: 0: 10539.1. Samples: 59404580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:14,600][547459] Avg episode reward: [(0, '4502.833')] [2023-03-08 20:28:16,169][547799] Updated weights for policy 0, policy_version 116080 (0.0005) [2023-03-08 20:28:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 59465728. Throughput: 0: 10528.4. Samples: 59468468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:19,600][547459] Avg episode reward: [(0, '4291.768')] [2023-03-08 20:28:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000116144_59465728.pth... [2023-03-08 20:28:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000115536_59154432.pth [2023-03-08 20:28:20,095][547799] Updated weights for policy 0, policy_version 116160 (0.0006) [2023-03-08 20:28:23,947][547799] Updated weights for policy 0, policy_version 116240 (0.0005) [2023-03-08 20:28:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 59518976. Throughput: 0: 10556.5. Samples: 59499932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:24,600][547459] Avg episode reward: [(0, '4513.112')] [2023-03-08 20:28:27,847][547799] Updated weights for policy 0, policy_version 116320 (0.0005) [2023-03-08 20:28:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 59572224. Throughput: 0: 10553.4. Samples: 59563808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:29,600][547459] Avg episode reward: [(0, '4686.825')] [2023-03-08 20:28:31,855][547799] Updated weights for policy 0, policy_version 116400 (0.0005) [2023-03-08 20:28:34,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 59621376. Throughput: 0: 10466.5. Samples: 59625184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:34,600][547459] Avg episode reward: [(0, '4567.286')] [2023-03-08 20:28:34,615][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000116456_59625472.pth... [2023-03-08 20:28:34,616][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000115840_59310080.pth [2023-03-08 20:28:35,712][547799] Updated weights for policy 0, policy_version 116480 (0.0005) [2023-03-08 20:28:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 59674624. Throughput: 0: 10450.8. Samples: 59656336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:39,600][547459] Avg episode reward: [(0, '4684.264')] [2023-03-08 20:28:39,628][547799] Updated weights for policy 0, policy_version 116560 (0.0005) [2023-03-08 20:28:43,469][547799] Updated weights for policy 0, policy_version 116640 (0.0005) [2023-03-08 20:28:44,600][547459] Fps is (10 sec: 11059.1, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 59731968. Throughput: 0: 10473.6. Samples: 59719952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:44,600][547459] Avg episode reward: [(0, '4578.942')] [2023-03-08 20:28:47,307][547799] Updated weights for policy 0, policy_version 116720 (0.0005) [2023-03-08 20:28:49,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10538.5). Total num frames: 59781120. Throughput: 0: 10518.4. Samples: 59784096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:49,600][547459] Avg episode reward: [(0, '4628.930')] [2023-03-08 20:28:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000116760_59781120.pth... [2023-03-08 20:28:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000116144_59465728.pth [2023-03-08 20:28:51,278][547799] Updated weights for policy 0, policy_version 116800 (0.0006) [2023-03-08 20:28:54,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.0, 300 sec: 10552.4). Total num frames: 59834368. Throughput: 0: 10525.5. Samples: 59815316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:54,600][547459] Avg episode reward: [(0, '4593.834')] [2023-03-08 20:28:55,088][547799] Updated weights for policy 0, policy_version 116880 (0.0005) [2023-03-08 20:28:59,085][547799] Updated weights for policy 0, policy_version 116960 (0.0005) [2023-03-08 20:28:59,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 59887616. Throughput: 0: 10527.7. Samples: 59878328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:28:59,600][547459] Avg episode reward: [(0, '4291.690')] [2023-03-08 20:29:02,915][547799] Updated weights for policy 0, policy_version 117040 (0.0005) [2023-03-08 20:29:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 59940864. Throughput: 0: 10500.0. Samples: 59940968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:04,600][547459] Avg episode reward: [(0, '4586.315')] [2023-03-08 20:29:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000117072_59940864.pth... [2023-03-08 20:29:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000116456_59625472.pth [2023-03-08 20:29:06,765][547799] Updated weights for policy 0, policy_version 117120 (0.0005) [2023-03-08 20:29:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10552.4). Total num frames: 59994112. Throughput: 0: 10526.5. Samples: 59973624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:09,600][547459] Avg episode reward: [(0, '4492.039')] [2023-03-08 20:29:10,620][547799] Updated weights for policy 0, policy_version 117200 (0.0005) [2023-03-08 20:29:14,564][547799] Updated weights for policy 0, policy_version 117280 (0.0005) [2023-03-08 20:29:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10552.4). Total num frames: 60047360. Throughput: 0: 10489.7. Samples: 60035844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:14,601][547459] Avg episode reward: [(0, '4473.449')] [2023-03-08 20:29:18,600][547799] Updated weights for policy 0, policy_version 117360 (0.0005) [2023-03-08 20:29:19,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 60096512. Throughput: 0: 10522.3. Samples: 60098688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:19,600][547459] Avg episode reward: [(0, '4495.711')] [2023-03-08 20:29:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000117376_60096512.pth... [2023-03-08 20:29:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000116760_59781120.pth [2023-03-08 20:29:22,613][547799] Updated weights for policy 0, policy_version 117440 (0.0005) [2023-03-08 20:29:24,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 60149760. Throughput: 0: 10500.4. Samples: 60128856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:24,600][547459] Avg episode reward: [(0, '4688.539')] [2023-03-08 20:29:26,498][547799] Updated weights for policy 0, policy_version 117520 (0.0005) [2023-03-08 20:29:29,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10538.5). Total num frames: 60203008. Throughput: 0: 10490.0. Samples: 60192004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:29,600][547459] Avg episode reward: [(0, '4732.162')] [2023-03-08 20:29:30,319][547799] Updated weights for policy 0, policy_version 117600 (0.0005) [2023-03-08 20:29:34,327][547799] Updated weights for policy 0, policy_version 117680 (0.0005) [2023-03-08 20:29:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 60252160. Throughput: 0: 10456.2. Samples: 60254624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:34,600][547459] Avg episode reward: [(0, '4279.874')] [2023-03-08 20:29:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000117680_60252160.pth... [2023-03-08 20:29:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000117072_59940864.pth [2023-03-08 20:29:38,332][547799] Updated weights for policy 0, policy_version 117760 (0.0005) [2023-03-08 20:29:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 60305408. Throughput: 0: 10437.2. Samples: 60284992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:39,600][547459] Avg episode reward: [(0, '4332.164')] [2023-03-08 20:29:42,238][547799] Updated weights for policy 0, policy_version 117840 (0.0005) [2023-03-08 20:29:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 60358656. Throughput: 0: 10435.4. Samples: 60347920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:44,600][547459] Avg episode reward: [(0, '4563.621')] [2023-03-08 20:29:46,030][547799] Updated weights for policy 0, policy_version 117920 (0.0005) [2023-03-08 20:29:49,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 60407808. Throughput: 0: 10448.2. Samples: 60411136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:49,600][547459] Avg episode reward: [(0, '4694.061')] [2023-03-08 20:29:49,664][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000117992_60411904.pth... [2023-03-08 20:29:49,667][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000117376_60096512.pth [2023-03-08 20:29:50,032][547799] Updated weights for policy 0, policy_version 118000 (0.0005) [2023-03-08 20:29:53,795][547799] Updated weights for policy 0, policy_version 118080 (0.0006) [2023-03-08 20:29:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 60465152. Throughput: 0: 10453.3. Samples: 60444024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:54,600][547459] Avg episode reward: [(0, '4658.099')] [2023-03-08 20:29:57,770][547799] Updated weights for policy 0, policy_version 118160 (0.0006) [2023-03-08 20:29:59,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 60514304. Throughput: 0: 10450.4. Samples: 60506112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:29:59,600][547459] Avg episode reward: [(0, '4590.530')] [2023-03-08 20:30:01,739][547799] Updated weights for policy 0, policy_version 118240 (0.0005) [2023-03-08 20:30:04,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10524.6). Total num frames: 60567552. Throughput: 0: 10441.2. Samples: 60568544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:04,600][547459] Avg episode reward: [(0, '4644.425')] [2023-03-08 20:30:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000118296_60567552.pth... [2023-03-08 20:30:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000117680_60252160.pth [2023-03-08 20:30:05,619][547799] Updated weights for policy 0, policy_version 118320 (0.0006) [2023-03-08 20:30:09,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10510.8). Total num frames: 60616704. Throughput: 0: 10455.6. Samples: 60599356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:09,600][547459] Avg episode reward: [(0, '4307.122')] [2023-03-08 20:30:09,687][547799] Updated weights for policy 0, policy_version 118400 (0.0005) [2023-03-08 20:30:13,563][547799] Updated weights for policy 0, policy_version 118480 (0.0005) [2023-03-08 20:30:14,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10510.8). Total num frames: 60669952. Throughput: 0: 10429.6. Samples: 60661336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:14,600][547459] Avg episode reward: [(0, '4672.324')] [2023-03-08 20:30:17,474][547799] Updated weights for policy 0, policy_version 118560 (0.0005) [2023-03-08 20:30:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 60723200. Throughput: 0: 10425.4. Samples: 60723768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:19,600][547459] Avg episode reward: [(0, '4749.784')] [2023-03-08 20:30:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000118600_60723200.pth... [2023-03-08 20:30:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000117992_60411904.pth [2023-03-08 20:30:21,405][547799] Updated weights for policy 0, policy_version 118640 (0.0005) [2023-03-08 20:30:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10510.7). Total num frames: 60776448. Throughput: 0: 10465.5. Samples: 60755940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:24,600][547459] Avg episode reward: [(0, '4779.004')] [2023-03-08 20:30:25,251][547799] Updated weights for policy 0, policy_version 118720 (0.0005) [2023-03-08 20:30:29,179][547799] Updated weights for policy 0, policy_version 118800 (0.0005) [2023-03-08 20:30:29,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 60829696. Throughput: 0: 10464.2. Samples: 60818808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:29,601][547459] Avg episode reward: [(0, '4681.599')] [2023-03-08 20:30:32,910][547799] Updated weights for policy 0, policy_version 118880 (0.0005) [2023-03-08 20:30:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 60882944. Throughput: 0: 10493.7. Samples: 60883352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:34,600][547459] Avg episode reward: [(0, '4690.767')] [2023-03-08 20:30:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000118912_60882944.pth... [2023-03-08 20:30:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000118296_60567552.pth [2023-03-08 20:30:36,774][547799] Updated weights for policy 0, policy_version 118960 (0.0005) [2023-03-08 20:30:39,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 60936192. Throughput: 0: 10483.4. Samples: 60915776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:39,600][547459] Avg episode reward: [(0, '4656.462')] [2023-03-08 20:30:40,518][547799] Updated weights for policy 0, policy_version 119040 (0.0005) [2023-03-08 20:30:44,482][547799] Updated weights for policy 0, policy_version 119120 (0.0005) [2023-03-08 20:30:44,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 60989440. Throughput: 0: 10521.5. Samples: 60979580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:44,600][547459] Avg episode reward: [(0, '4628.215')] [2023-03-08 20:30:48,287][547799] Updated weights for policy 0, policy_version 119200 (0.0005) [2023-03-08 20:30:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 61042688. Throughput: 0: 10561.6. Samples: 61043816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:49,600][547459] Avg episode reward: [(0, '4182.404')] [2023-03-08 20:30:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000119224_61042688.pth... [2023-03-08 20:30:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000118600_60723200.pth [2023-03-08 20:30:52,074][547799] Updated weights for policy 0, policy_version 119280 (0.0005) [2023-03-08 20:30:54,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 61095936. Throughput: 0: 10586.5. Samples: 61075748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:54,600][547459] Avg episode reward: [(0, '4502.524')] [2023-03-08 20:30:56,097][547799] Updated weights for policy 0, policy_version 119360 (0.0005) [2023-03-08 20:30:59,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 61145088. Throughput: 0: 10574.1. Samples: 61137168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:30:59,600][547459] Avg episode reward: [(0, '4197.340')] [2023-03-08 20:31:00,008][547799] Updated weights for policy 0, policy_version 119440 (0.0005) [2023-03-08 20:31:04,000][547799] Updated weights for policy 0, policy_version 119520 (0.0005) [2023-03-08 20:31:04,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.0, 300 sec: 10524.6). Total num frames: 61198336. Throughput: 0: 10581.9. Samples: 61199952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:04,601][547459] Avg episode reward: [(0, '4543.319')] [2023-03-08 20:31:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000119528_61198336.pth... [2023-03-08 20:31:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000118912_60882944.pth [2023-03-08 20:31:07,846][547799] Updated weights for policy 0, policy_version 119600 (0.0005) [2023-03-08 20:31:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 61251584. Throughput: 0: 10559.7. Samples: 61231128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:09,601][547459] Avg episode reward: [(0, '4542.274')] [2023-03-08 20:31:11,707][547799] Updated weights for policy 0, policy_version 119680 (0.0006) [2023-03-08 20:31:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 61304832. Throughput: 0: 10588.0. Samples: 61295268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:14,601][547459] Avg episode reward: [(0, '4450.441')] [2023-03-08 20:31:15,606][547799] Updated weights for policy 0, policy_version 119760 (0.0005) [2023-03-08 20:31:19,533][547799] Updated weights for policy 0, policy_version 119840 (0.0005) [2023-03-08 20:31:19,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.4, 300 sec: 10524.6). Total num frames: 61358080. Throughput: 0: 10549.7. Samples: 61358088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:19,600][547459] Avg episode reward: [(0, '4589.761')] [2023-03-08 20:31:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000119840_61358080.pth... [2023-03-08 20:31:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000119224_61042688.pth [2023-03-08 20:31:23,422][547799] Updated weights for policy 0, policy_version 119920 (0.0005) [2023-03-08 20:31:24,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 61407232. Throughput: 0: 10551.9. Samples: 61390612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:24,600][547459] Avg episode reward: [(0, '4448.814')] [2023-03-08 20:31:27,497][547799] Updated weights for policy 0, policy_version 120000 (0.0005) [2023-03-08 20:31:29,599][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 61460480. Throughput: 0: 10495.8. Samples: 61451888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:29,600][547459] Avg episode reward: [(0, '4485.461')] [2023-03-08 20:31:31,316][547799] Updated weights for policy 0, policy_version 120080 (0.0005) [2023-03-08 20:31:34,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 61513728. Throughput: 0: 10444.1. Samples: 61513800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:34,600][547459] Avg episode reward: [(0, '4636.958')] [2023-03-08 20:31:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000120144_61513728.pth... [2023-03-08 20:31:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000119528_61198336.pth [2023-03-08 20:31:35,285][547799] Updated weights for policy 0, policy_version 120160 (0.0005) [2023-03-08 20:31:39,177][547799] Updated weights for policy 0, policy_version 120240 (0.0005) [2023-03-08 20:31:39,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 61566976. Throughput: 0: 10444.6. Samples: 61545752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:39,600][547459] Avg episode reward: [(0, '4505.201')] [2023-03-08 20:31:43,049][547799] Updated weights for policy 0, policy_version 120320 (0.0005) [2023-03-08 20:31:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 61616128. Throughput: 0: 10465.4. Samples: 61608112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:44,600][547459] Avg episode reward: [(0, '4725.793')] [2023-03-08 20:31:46,940][547799] Updated weights for policy 0, policy_version 120400 (0.0005) [2023-03-08 20:31:49,600][547459] Fps is (10 sec: 10239.8, 60 sec: 10444.8, 300 sec: 10510.7). Total num frames: 61669376. Throughput: 0: 10513.2. Samples: 61673048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:49,600][547459] Avg episode reward: [(0, '4732.611')] [2023-03-08 20:31:49,634][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000120456_61673472.pth... [2023-03-08 20:31:49,636][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000119840_61358080.pth [2023-03-08 20:31:50,809][547799] Updated weights for policy 0, policy_version 120480 (0.0005) [2023-03-08 20:31:54,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 10510.8). Total num frames: 61722624. Throughput: 0: 10501.1. Samples: 61703676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:54,600][547459] Avg episode reward: [(0, '4528.231')] [2023-03-08 20:31:54,669][547799] Updated weights for policy 0, policy_version 120560 (0.0005) [2023-03-08 20:31:58,423][547799] Updated weights for policy 0, policy_version 120640 (0.0005) [2023-03-08 20:31:59,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 61779968. Throughput: 0: 10517.1. Samples: 61768536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:31:59,600][547459] Avg episode reward: [(0, '4638.529')] [2023-03-08 20:32:02,260][547799] Updated weights for policy 0, policy_version 120720 (0.0005) [2023-03-08 20:32:04,600][547459] Fps is (10 sec: 11059.0, 60 sec: 10581.3, 300 sec: 10524.6). Total num frames: 61833216. Throughput: 0: 10557.8. Samples: 61833192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:32:04,600][547459] Avg episode reward: [(0, '4728.062')] [2023-03-08 20:32:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000120768_61833216.pth... [2023-03-08 20:32:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000120144_61513728.pth [2023-03-08 20:32:06,069][547799] Updated weights for policy 0, policy_version 120800 (0.0005) [2023-03-08 20:32:09,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 61882368. Throughput: 0: 10544.9. Samples: 61865132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:32:09,600][547459] Avg episode reward: [(0, '4497.440')] [2023-03-08 20:32:10,187][547799] Updated weights for policy 0, policy_version 120880 (0.0005) [2023-03-08 20:32:14,101][547799] Updated weights for policy 0, policy_version 120960 (0.0005) [2023-03-08 20:32:14,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 61935616. Throughput: 0: 10526.4. Samples: 61925576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:14,601][547459] Avg episode reward: [(0, '4455.562')] [2023-03-08 20:32:17,854][547799] Updated weights for policy 0, policy_version 121040 (0.0005) [2023-03-08 20:32:19,600][547459] Fps is (10 sec: 10649.4, 60 sec: 10513.0, 300 sec: 10510.8). Total num frames: 61988864. Throughput: 0: 10574.6. Samples: 61989660. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:19,600][547459] Avg episode reward: [(0, '4683.072')] [2023-03-08 20:32:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000121072_61988864.pth... [2023-03-08 20:32:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000120456_61673472.pth [2023-03-08 20:32:21,704][547799] Updated weights for policy 0, policy_version 121120 (0.0005) [2023-03-08 20:32:24,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 62042112. Throughput: 0: 10576.7. Samples: 62021704. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:24,600][547459] Avg episode reward: [(0, '4821.096')] [2023-03-08 20:32:25,717][547799] Updated weights for policy 0, policy_version 121200 (0.0005) [2023-03-08 20:32:29,552][547799] Updated weights for policy 0, policy_version 121280 (0.0005) [2023-03-08 20:32:29,599][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 62095360. Throughput: 0: 10592.6. Samples: 62084780. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:29,600][547459] Avg episode reward: [(0, '4640.805')] [2023-03-08 20:32:33,613][547799] Updated weights for policy 0, policy_version 121360 (0.0006) [2023-03-08 20:32:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10513.1, 300 sec: 10510.7). Total num frames: 62144512. Throughput: 0: 10517.7. Samples: 62146344. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:34,601][547459] Avg episode reward: [(0, '4504.908')] [2023-03-08 20:32:34,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000121376_62144512.pth... [2023-03-08 20:32:34,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000120768_61833216.pth [2023-03-08 20:32:37,462][547799] Updated weights for policy 0, policy_version 121440 (0.0005) [2023-03-08 20:32:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.0, 300 sec: 10510.8). Total num frames: 62197760. Throughput: 0: 10531.1. Samples: 62177576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:39,600][547459] Avg episode reward: [(0, '4780.446')] [2023-03-08 20:32:41,357][547799] Updated weights for policy 0, policy_version 121520 (0.0005) [2023-03-08 20:32:44,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10496.9). Total num frames: 62251008. Throughput: 0: 10487.0. Samples: 62240452. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:44,600][547459] Avg episode reward: [(0, '4715.362')] [2023-03-08 20:32:45,281][547799] Updated weights for policy 0, policy_version 121600 (0.0005) [2023-03-08 20:32:49,094][547799] Updated weights for policy 0, policy_version 121680 (0.0006) [2023-03-08 20:32:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10510.7). Total num frames: 62304256. Throughput: 0: 10474.2. Samples: 62304532. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:49,600][547459] Avg episode reward: [(0, '4783.608')] [2023-03-08 20:32:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000121688_62304256.pth... [2023-03-08 20:32:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000121072_61988864.pth [2023-03-08 20:32:52,984][547799] Updated weights for policy 0, policy_version 121760 (0.0006) [2023-03-08 20:32:54,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 62357504. Throughput: 0: 10486.5. Samples: 62337024. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:54,600][547459] Avg episode reward: [(0, '4783.081')] [2023-03-08 20:32:56,808][547799] Updated weights for policy 0, policy_version 121840 (0.0005) [2023-03-08 20:32:59,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 62410752. Throughput: 0: 10545.3. Samples: 62400112. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:32:59,600][547459] Avg episode reward: [(0, '4765.717')] [2023-03-08 20:33:00,671][547799] Updated weights for policy 0, policy_version 121920 (0.0005) [2023-03-08 20:33:04,369][547799] Updated weights for policy 0, policy_version 122000 (0.0005) [2023-03-08 20:33:04,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10524.6). Total num frames: 62464000. Throughput: 0: 10578.3. Samples: 62465684. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:33:04,600][547459] Avg episode reward: [(0, '4765.791')] [2023-03-08 20:33:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000122000_62464000.pth... [2023-03-08 20:33:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000121376_62144512.pth [2023-03-08 20:33:08,311][547799] Updated weights for policy 0, policy_version 122080 (0.0005) [2023-03-08 20:33:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10510.8). Total num frames: 62517248. Throughput: 0: 10557.0. Samples: 62496768. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:33:09,601][547459] Avg episode reward: [(0, '4789.168')] [2023-03-08 20:33:12,422][547799] Updated weights for policy 0, policy_version 122160 (0.0005) [2023-03-08 20:33:14,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 62566400. Throughput: 0: 10520.2. Samples: 62558188. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:33:14,600][547459] Avg episode reward: [(0, '4367.430')] [2023-03-08 20:33:16,331][547799] Updated weights for policy 0, policy_version 122240 (0.0006) [2023-03-08 20:33:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10510.8). Total num frames: 62619648. Throughput: 0: 10517.5. Samples: 62619632. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:33:19,600][547459] Avg episode reward: [(0, '4594.519')] [2023-03-08 20:33:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000122304_62619648.pth... [2023-03-08 20:33:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000121688_62304256.pth [2023-03-08 20:33:20,347][547799] Updated weights for policy 0, policy_version 122320 (0.0005) [2023-03-08 20:33:24,192][547799] Updated weights for policy 0, policy_version 122400 (0.0006) [2023-03-08 20:33:24,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 62668800. Throughput: 0: 10528.8. Samples: 62651372. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:33:24,600][547459] Avg episode reward: [(0, '4644.082')] [2023-03-08 20:33:28,292][547799] Updated weights for policy 0, policy_version 122480 (0.0005) [2023-03-08 20:33:29,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10510.7). Total num frames: 62722048. Throughput: 0: 10504.4. Samples: 62713152. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:33:29,600][547459] Avg episode reward: [(0, '4442.884')] [2023-03-08 20:33:32,294][547799] Updated weights for policy 0, policy_version 122560 (0.0005) [2023-03-08 20:33:34,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10496.9). Total num frames: 62771200. Throughput: 0: 10429.6. Samples: 62773864. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:33:34,611][547459] Avg episode reward: [(0, '4504.201')] [2023-03-08 20:33:34,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000122600_62771200.pth... [2023-03-08 20:33:34,616][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000122000_62464000.pth [2023-03-08 20:33:36,305][547799] Updated weights for policy 0, policy_version 122640 (0.0006) [2023-03-08 20:33:39,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 62824448. Throughput: 0: 10377.9. Samples: 62804032. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:33:39,600][547459] Avg episode reward: [(0, '4729.968')] [2023-03-08 20:33:40,354][547799] Updated weights for policy 0, policy_version 122720 (0.0005) [2023-03-08 20:33:44,558][547799] Updated weights for policy 0, policy_version 122800 (0.0005) [2023-03-08 20:33:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 62873600. Throughput: 0: 10314.6. Samples: 62864268. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:33:44,611][547459] Avg episode reward: [(0, '4454.102')] [2023-03-08 20:33:48,478][547799] Updated weights for policy 0, policy_version 122880 (0.0005) [2023-03-08 20:33:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 62926848. Throughput: 0: 10243.9. Samples: 62926660. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:33:49,600][547459] Avg episode reward: [(0, '4375.647')] [2023-03-08 20:33:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000122904_62926848.pth... [2023-03-08 20:33:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000122304_62619648.pth [2023-03-08 20:33:52,259][547799] Updated weights for policy 0, policy_version 122960 (0.0005) [2023-03-08 20:33:54,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 62980096. Throughput: 0: 10259.8. Samples: 62958460. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:33:54,600][547459] Avg episode reward: [(0, '4629.609')] [2023-03-08 20:33:56,109][547799] Updated weights for policy 0, policy_version 123040 (0.0005) [2023-03-08 20:33:59,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 63029248. Throughput: 0: 10288.4. Samples: 63021164. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:33:59,600][547459] Avg episode reward: [(0, '4686.186')] [2023-03-08 20:34:00,067][547799] Updated weights for policy 0, policy_version 123120 (0.0005) [2023-03-08 20:34:03,870][547799] Updated weights for policy 0, policy_version 123200 (0.0005) [2023-03-08 20:34:04,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 63082496. Throughput: 0: 10361.7. Samples: 63085908. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:34:04,600][547459] Avg episode reward: [(0, '4678.277')] [2023-03-08 20:34:04,641][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000123216_63086592.pth... [2023-03-08 20:34:04,642][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000122600_62771200.pth [2023-03-08 20:34:07,675][547799] Updated weights for policy 0, policy_version 123280 (0.0005) [2023-03-08 20:34:09,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10469.1). Total num frames: 63135744. Throughput: 0: 10374.1. Samples: 63118204. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:34:09,600][547459] Avg episode reward: [(0, '4366.345')] [2023-03-08 20:34:11,608][547799] Updated weights for policy 0, policy_version 123360 (0.0005) [2023-03-08 20:34:14,600][547459] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 63188992. Throughput: 0: 10389.2. Samples: 63180664. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 20:34:14,600][547459] Avg episode reward: [(0, '4647.504')] [2023-03-08 20:34:15,597][547799] Updated weights for policy 0, policy_version 123440 (0.0005) [2023-03-08 20:34:19,529][547799] Updated weights for policy 0, policy_version 123520 (0.0005) [2023-03-08 20:34:19,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10376.5, 300 sec: 10483.0). Total num frames: 63242240. Throughput: 0: 10408.6. Samples: 63242252. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:34:19,600][547459] Avg episode reward: [(0, '4596.978')] [2023-03-08 20:34:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000123520_63242240.pth... [2023-03-08 20:34:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000122904_62926848.pth [2023-03-08 20:34:23,495][547799] Updated weights for policy 0, policy_version 123600 (0.0005) [2023-03-08 20:34:24,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 63295488. Throughput: 0: 10436.2. Samples: 63273660. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:34:24,600][547459] Avg episode reward: [(0, '4735.104')] [2023-03-08 20:34:27,300][547799] Updated weights for policy 0, policy_version 123680 (0.0005) [2023-03-08 20:34:29,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10376.6, 300 sec: 10483.0). Total num frames: 63344640. Throughput: 0: 10501.3. Samples: 63336824. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:34:29,600][547459] Avg episode reward: [(0, '4609.623')] [2023-03-08 20:34:31,358][547799] Updated weights for policy 0, policy_version 123760 (0.0005) [2023-03-08 20:34:34,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 63397888. Throughput: 0: 10473.2. Samples: 63397952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:34:34,600][547459] Avg episode reward: [(0, '4546.716')] [2023-03-08 20:34:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000123824_63397888.pth... [2023-03-08 20:34:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000123216_63086592.pth [2023-03-08 20:34:35,317][547799] Updated weights for policy 0, policy_version 123840 (0.0005) [2023-03-08 20:34:39,243][547799] Updated weights for policy 0, policy_version 123920 (0.0005) [2023-03-08 20:34:39,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 63451136. Throughput: 0: 10481.5. Samples: 63430128. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:34:39,600][547459] Avg episode reward: [(0, '4720.630')] [2023-03-08 20:34:43,089][547799] Updated weights for policy 0, policy_version 124000 (0.0006) [2023-03-08 20:34:44,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 63500288. Throughput: 0: 10478.2. Samples: 63492684. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:34:44,600][547459] Avg episode reward: [(0, '4620.310')] [2023-03-08 20:34:46,982][547799] Updated weights for policy 0, policy_version 124080 (0.0005) [2023-03-08 20:34:49,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 63553536. Throughput: 0: 10442.2. Samples: 63555808. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:34:49,601][547459] Avg episode reward: [(0, '4766.827')] [2023-03-08 20:34:49,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000124128_63553536.pth... [2023-03-08 20:34:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000123520_63242240.pth [2023-03-08 20:34:50,871][547799] Updated weights for policy 0, policy_version 124160 (0.0004) [2023-03-08 20:34:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 63606784. Throughput: 0: 10425.1. Samples: 63587336. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:34:54,600][547459] Avg episode reward: [(0, '4798.993')] [2023-03-08 20:34:54,670][547799] Updated weights for policy 0, policy_version 124240 (0.0005) [2023-03-08 20:34:58,534][547799] Updated weights for policy 0, policy_version 124320 (0.0005) [2023-03-08 20:34:59,599][547459] Fps is (10 sec: 10649.8, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 63660032. Throughput: 0: 10470.9. Samples: 63651852. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:34:59,600][547459] Avg episode reward: [(0, '4680.994')] [2023-03-08 20:35:02,445][547799] Updated weights for policy 0, policy_version 124400 (0.0005) [2023-03-08 20:35:04,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 63713280. Throughput: 0: 10467.3. Samples: 63713280. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:35:04,600][547459] Avg episode reward: [(0, '4622.118')] [2023-03-08 20:35:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000124440_63713280.pth... [2023-03-08 20:35:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000123824_63397888.pth [2023-03-08 20:35:06,547][547799] Updated weights for policy 0, policy_version 124480 (0.0006) [2023-03-08 20:35:09,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 63766528. Throughput: 0: 10459.7. Samples: 63744348. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:35:09,600][547459] Avg episode reward: [(0, '4678.336')] [2023-03-08 20:35:10,268][547799] Updated weights for policy 0, policy_version 124560 (0.0005) [2023-03-08 20:35:14,179][547799] Updated weights for policy 0, policy_version 124640 (0.0005) [2023-03-08 20:35:14,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10496.9). Total num frames: 63819776. Throughput: 0: 10501.2. Samples: 63809380. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:35:14,600][547459] Avg episode reward: [(0, '4829.591')] [2023-03-08 20:35:18,160][547799] Updated weights for policy 0, policy_version 124720 (0.0005) [2023-03-08 20:35:19,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 63868928. Throughput: 0: 10530.8. Samples: 63871836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:35:19,600][547459] Avg episode reward: [(0, '4789.216')] [2023-03-08 20:35:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000124744_63868928.pth... [2023-03-08 20:35:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000124128_63553536.pth [2023-03-08 20:35:22,040][547799] Updated weights for policy 0, policy_version 124800 (0.0005) [2023-03-08 20:35:24,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10483.0). Total num frames: 63922176. Throughput: 0: 10496.3. Samples: 63902460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:35:24,600][547459] Avg episode reward: [(0, '4727.559')] [2023-03-08 20:35:26,055][547799] Updated weights for policy 0, policy_version 124880 (0.0006) [2023-03-08 20:35:29,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 63975424. Throughput: 0: 10491.5. Samples: 63964800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:35:29,600][547459] Avg episode reward: [(0, '4420.395')] [2023-03-08 20:35:30,010][547799] Updated weights for policy 0, policy_version 124960 (0.0005) [2023-03-08 20:35:33,799][547799] Updated weights for policy 0, policy_version 125040 (0.0005) [2023-03-08 20:35:34,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 64028672. Throughput: 0: 10506.2. Samples: 64028588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:35:34,600][547459] Avg episode reward: [(0, '4744.549')] [2023-03-08 20:35:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000125056_64028672.pth... [2023-03-08 20:35:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000124440_63713280.pth [2023-03-08 20:35:37,702][547799] Updated weights for policy 0, policy_version 125120 (0.0005) [2023-03-08 20:35:39,600][547459] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64077824. Throughput: 0: 10510.1. Samples: 64060292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:35:39,600][547459] Avg episode reward: [(0, '4820.718')] [2023-03-08 20:35:41,671][547799] Updated weights for policy 0, policy_version 125200 (0.0005) [2023-03-08 20:35:44,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 64131072. Throughput: 0: 10442.6. Samples: 64121768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:35:44,600][547459] Avg episode reward: [(0, '4694.612')] [2023-03-08 20:35:45,666][547799] Updated weights for policy 0, policy_version 125280 (0.0006) [2023-03-08 20:35:49,560][547799] Updated weights for policy 0, policy_version 125360 (0.0005) [2023-03-08 20:35:49,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 64184320. Throughput: 0: 10467.2. Samples: 64184304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:35:49,600][547459] Avg episode reward: [(0, '4633.953')] [2023-03-08 20:35:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000125360_64184320.pth... [2023-03-08 20:35:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000124744_63868928.pth [2023-03-08 20:35:53,396][547799] Updated weights for policy 0, policy_version 125440 (0.0005) [2023-03-08 20:35:54,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 10483.0). Total num frames: 64237568. Throughput: 0: 10505.2. Samples: 64217084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:35:54,600][547459] Avg episode reward: [(0, '4660.476')] [2023-03-08 20:35:57,452][547799] Updated weights for policy 0, policy_version 125520 (0.0005) [2023-03-08 20:35:59,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64286720. Throughput: 0: 10417.7. Samples: 64278176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:35:59,601][547459] Avg episode reward: [(0, '4542.621')] [2023-03-08 20:36:01,393][547799] Updated weights for policy 0, policy_version 125600 (0.0005) [2023-03-08 20:36:04,599][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64339968. Throughput: 0: 10421.4. Samples: 64340800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:04,600][547459] Avg episode reward: [(0, '4702.824')] [2023-03-08 20:36:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000125664_64339968.pth... [2023-03-08 20:36:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000125056_64028672.pth [2023-03-08 20:36:05,199][547799] Updated weights for policy 0, policy_version 125680 (0.0005) [2023-03-08 20:36:08,973][547799] Updated weights for policy 0, policy_version 125760 (0.0005) [2023-03-08 20:36:09,600][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64393216. Throughput: 0: 10457.4. Samples: 64373044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:09,600][547459] Avg episode reward: [(0, '4735.317')] [2023-03-08 20:36:12,865][547799] Updated weights for policy 0, policy_version 125840 (0.0005) [2023-03-08 20:36:14,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64446464. Throughput: 0: 10513.6. Samples: 64437912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:14,600][547459] Avg episode reward: [(0, '4787.068')] [2023-03-08 20:36:16,884][547799] Updated weights for policy 0, policy_version 125920 (0.0005) [2023-03-08 20:36:19,600][547459] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64495616. Throughput: 0: 10446.1. Samples: 64498664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:19,600][547459] Avg episode reward: [(0, '4734.447')] [2023-03-08 20:36:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000125968_64495616.pth... [2023-03-08 20:36:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000125360_64184320.pth [2023-03-08 20:36:20,805][547799] Updated weights for policy 0, policy_version 126000 (0.0005) [2023-03-08 20:36:24,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64548864. Throughput: 0: 10423.7. Samples: 64529360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:24,600][547459] Avg episode reward: [(0, '4709.485')] [2023-03-08 20:36:24,861][547799] Updated weights for policy 0, policy_version 126080 (0.0006) [2023-03-08 20:36:28,708][547799] Updated weights for policy 0, policy_version 126160 (0.0005) [2023-03-08 20:36:29,600][547459] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64602112. Throughput: 0: 10450.8. Samples: 64592056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:29,601][547459] Avg episode reward: [(0, '4817.224')] [2023-03-08 20:36:32,751][547799] Updated weights for policy 0, policy_version 126240 (0.0005) [2023-03-08 20:36:34,600][547459] Fps is (10 sec: 10239.8, 60 sec: 10376.5, 300 sec: 10455.2). Total num frames: 64651264. Throughput: 0: 10442.5. Samples: 64654216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:34,601][547459] Avg episode reward: [(0, '4578.415')] [2023-03-08 20:36:34,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000126272_64651264.pth... [2023-03-08 20:36:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000125664_64339968.pth [2023-03-08 20:36:36,606][547799] Updated weights for policy 0, policy_version 126320 (0.0005) [2023-03-08 20:36:39,600][547459] Fps is (10 sec: 10240.1, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64704512. Throughput: 0: 10406.5. Samples: 64685376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:39,601][547459] Avg episode reward: [(0, '4657.995')] [2023-03-08 20:36:40,548][547799] Updated weights for policy 0, policy_version 126400 (0.0005) [2023-03-08 20:36:44,467][547799] Updated weights for policy 0, policy_version 126480 (0.0005) [2023-03-08 20:36:44,600][547459] Fps is (10 sec: 10649.8, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64757760. Throughput: 0: 10435.9. Samples: 64747792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:44,601][547459] Avg episode reward: [(0, '4809.415')] [2023-03-08 20:36:48,325][547799] Updated weights for policy 0, policy_version 126560 (0.0005) [2023-03-08 20:36:49,599][547459] Fps is (10 sec: 10649.6, 60 sec: 10444.8, 300 sec: 10469.1). Total num frames: 64811008. Throughput: 0: 10450.5. Samples: 64811072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:49,600][547459] Avg episode reward: [(0, '4817.628')] [2023-03-08 20:36:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000126584_64811008.pth... [2023-03-08 20:36:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000125968_64495616.pth [2023-03-08 20:36:52,072][547799] Updated weights for policy 0, policy_version 126640 (0.0006) [2023-03-08 20:36:54,600][547459] Fps is (10 sec: 11059.2, 60 sec: 10513.1, 300 sec: 10469.1). Total num frames: 64868352. Throughput: 0: 10491.0. Samples: 64845140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:54,600][547459] Avg episode reward: [(0, '4758.898')] [2023-03-08 20:36:55,525][547799] Updated weights for policy 0, policy_version 126720 (0.0005) [2023-03-08 20:36:59,001][547799] Updated weights for policy 0, policy_version 126800 (0.0004) [2023-03-08 20:36:59,599][547459] Fps is (10 sec: 11468.9, 60 sec: 10649.6, 300 sec: 10483.0). Total num frames: 64925696. Throughput: 0: 10641.1. Samples: 64916760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:36:59,600][547459] Avg episode reward: [(0, '4785.165')] [2023-03-08 20:37:02,471][547799] Updated weights for policy 0, policy_version 126880 (0.0005) [2023-03-08 20:37:04,600][547459] Fps is (10 sec: 11878.4, 60 sec: 10786.1, 300 sec: 10524.6). Total num frames: 64987136. Throughput: 0: 10865.3. Samples: 64987604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:04,600][547459] Avg episode reward: [(0, '4798.030')] [2023-03-08 20:37:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000126928_64987136.pth... [2023-03-08 20:37:04,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000126272_64651264.pth [2023-03-08 20:37:05,855][547799] Updated weights for policy 0, policy_version 126960 (0.0005) [2023-03-08 20:37:09,444][547799] Updated weights for policy 0, policy_version 127040 (0.0005) [2023-03-08 20:37:09,600][547459] Fps is (10 sec: 11877.9, 60 sec: 10854.3, 300 sec: 10538.5). Total num frames: 65044480. Throughput: 0: 10989.4. Samples: 65023888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:09,601][547459] Avg episode reward: [(0, '4525.428')] [2023-03-08 20:37:13,011][547799] Updated weights for policy 0, policy_version 127120 (0.0004) [2023-03-08 20:37:14,600][547459] Fps is (10 sec: 11878.4, 60 sec: 10990.9, 300 sec: 10566.3). Total num frames: 65105920. Throughput: 0: 11108.2. Samples: 65091924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:14,600][547459] Avg episode reward: [(0, '4782.750')] [2023-03-08 20:37:16,307][547799] Updated weights for policy 0, policy_version 127200 (0.0005) [2023-03-08 20:37:19,600][547459] Fps is (10 sec: 11878.8, 60 sec: 11127.5, 300 sec: 10580.2). Total num frames: 65163264. Throughput: 0: 11290.1. Samples: 65162268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:19,600][547459] Avg episode reward: [(0, '4589.425')] [2023-03-08 20:37:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000127272_65163264.pth... [2023-03-08 20:37:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000126584_64811008.pth [2023-03-08 20:37:19,923][547799] Updated weights for policy 0, policy_version 127280 (0.0005) [2023-03-08 20:37:23,348][547799] Updated weights for policy 0, policy_version 127360 (0.0005) [2023-03-08 20:37:24,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11195.7, 300 sec: 10594.1). Total num frames: 65220608. Throughput: 0: 11399.5. Samples: 65198356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:24,600][547459] Avg episode reward: [(0, '4693.949')] [2023-03-08 20:37:26,749][547799] Updated weights for policy 0, policy_version 127440 (0.0005) [2023-03-08 20:37:29,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11332.3, 300 sec: 10635.7). Total num frames: 65282048. Throughput: 0: 11621.7. Samples: 65270768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:29,600][547459] Avg episode reward: [(0, '4394.400')] [2023-03-08 20:37:30,064][547799] Updated weights for policy 0, policy_version 127520 (0.0005) [2023-03-08 20:37:33,372][547799] Updated weights for policy 0, policy_version 127600 (0.0005) [2023-03-08 20:37:34,600][547459] Fps is (10 sec: 12288.1, 60 sec: 11537.1, 300 sec: 10663.5). Total num frames: 65343488. Throughput: 0: 11867.1. Samples: 65345092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:34,600][547459] Avg episode reward: [(0, '4667.007')] [2023-03-08 20:37:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000127624_65343488.pth... [2023-03-08 20:37:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000126928_64987136.pth [2023-03-08 20:37:36,731][547799] Updated weights for policy 0, policy_version 127680 (0.0005) [2023-03-08 20:37:39,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11673.6, 300 sec: 10691.3). Total num frames: 65404928. Throughput: 0: 11903.7. Samples: 65380808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:39,600][547459] Avg episode reward: [(0, '4781.858')] [2023-03-08 20:37:40,188][547799] Updated weights for policy 0, policy_version 127760 (0.0005) [2023-03-08 20:37:43,613][547799] Updated weights for policy 0, policy_version 127840 (0.0005) [2023-03-08 20:37:44,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11810.2, 300 sec: 10719.0). Total num frames: 65466368. Throughput: 0: 11928.9. Samples: 65453560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:44,600][547459] Avg episode reward: [(0, '4735.504')] [2023-03-08 20:37:46,698][547799] Updated weights for policy 0, policy_version 127920 (0.0005) [2023-03-08 20:37:49,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 10746.8). Total num frames: 65527808. Throughput: 0: 12076.9. Samples: 65531064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:49,600][547459] Avg episode reward: [(0, '4801.128')] [2023-03-08 20:37:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000127984_65527808.pth... [2023-03-08 20:37:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000127272_65163264.pth [2023-03-08 20:37:49,927][547799] Updated weights for policy 0, policy_version 128000 (0.0004) [2023-03-08 20:37:53,326][547799] Updated weights for policy 0, policy_version 128080 (0.0005) [2023-03-08 20:37:54,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 10774.6). Total num frames: 65589248. Throughput: 0: 12105.8. Samples: 65568644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:54,600][547459] Avg episode reward: [(0, '4625.825')] [2023-03-08 20:37:56,910][547799] Updated weights for policy 0, policy_version 128160 (0.0005) [2023-03-08 20:37:59,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 10802.3). Total num frames: 65650688. Throughput: 0: 12116.8. Samples: 65637180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:37:59,600][547459] Avg episode reward: [(0, '4666.705')] [2023-03-08 20:38:00,372][547799] Updated weights for policy 0, policy_version 128240 (0.0005) [2023-03-08 20:38:03,792][547799] Updated weights for policy 0, policy_version 128320 (0.0005) [2023-03-08 20:38:04,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 10816.2). Total num frames: 65708032. Throughput: 0: 12144.1. Samples: 65708752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:38:04,600][547459] Avg episode reward: [(0, '4750.955')] [2023-03-08 20:38:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000128336_65708032.pth... [2023-03-08 20:38:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000127624_65343488.pth [2023-03-08 20:38:07,173][547799] Updated weights for policy 0, policy_version 128400 (0.0004) [2023-03-08 20:38:09,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12083.3, 300 sec: 10857.9). Total num frames: 65769472. Throughput: 0: 12147.0. Samples: 65744968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:38:09,600][547459] Avg episode reward: [(0, '4821.457')] [2023-03-08 20:38:10,461][547799] Updated weights for policy 0, policy_version 128480 (0.0006) [2023-03-08 20:38:13,779][547799] Updated weights for policy 0, policy_version 128560 (0.0005) [2023-03-08 20:38:14,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 10885.6). Total num frames: 65830912. Throughput: 0: 12192.2. Samples: 65819416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:38:14,600][547459] Avg episode reward: [(0, '4735.528')] [2023-03-08 20:38:17,388][547799] Updated weights for policy 0, policy_version 128640 (0.0005) [2023-03-08 20:38:19,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 10913.4). Total num frames: 65888256. Throughput: 0: 12071.9. Samples: 65888328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:38:19,600][547459] Avg episode reward: [(0, '4782.385')] [2023-03-08 20:38:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000128688_65888256.pth... [2023-03-08 20:38:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000127984_65527808.pth [2023-03-08 20:38:20,768][547799] Updated weights for policy 0, policy_version 128720 (0.0005) [2023-03-08 20:38:24,169][547799] Updated weights for policy 0, policy_version 128800 (0.0005) [2023-03-08 20:38:24,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 10941.2). Total num frames: 65949696. Throughput: 0: 12096.0. Samples: 65925128. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:38:24,600][547459] Avg episode reward: [(0, '4635.369')] [2023-03-08 20:38:27,630][547799] Updated weights for policy 0, policy_version 128880 (0.0005) [2023-03-08 20:38:29,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 10969.0). Total num frames: 66007040. Throughput: 0: 12075.4. Samples: 65996952. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:38:29,600][547459] Avg episode reward: [(0, '4735.854')] [2023-03-08 20:38:31,050][547799] Updated weights for policy 0, policy_version 128960 (0.0004) [2023-03-08 20:38:34,416][547799] Updated weights for policy 0, policy_version 129040 (0.0004) [2023-03-08 20:38:34,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 10996.7). Total num frames: 66068480. Throughput: 0: 11968.4. Samples: 66069644. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:38:34,600][547459] Avg episode reward: [(0, '4822.683')] [2023-03-08 20:38:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000129040_66068480.pth... [2023-03-08 20:38:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000128336_65708032.pth [2023-03-08 20:38:37,896][547799] Updated weights for policy 0, policy_version 129120 (0.0005) [2023-03-08 20:38:39,599][547459] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11038.4). Total num frames: 66129920. Throughput: 0: 11927.0. Samples: 66105356. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:38:39,600][547459] Avg episode reward: [(0, '4737.155')] [2023-03-08 20:38:41,283][547799] Updated weights for policy 0, policy_version 129200 (0.0004) [2023-03-08 20:38:44,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11052.3). Total num frames: 66187264. Throughput: 0: 12034.7. Samples: 66178740. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:38:44,600][547459] Avg episode reward: [(0, '4825.492')] [2023-03-08 20:38:44,709][547799] Updated weights for policy 0, policy_version 129280 (0.0005) [2023-03-08 20:38:48,256][547799] Updated weights for policy 0, policy_version 129360 (0.0004) [2023-03-08 20:38:49,600][547459] Fps is (10 sec: 11878.2, 60 sec: 12014.9, 300 sec: 11080.0). Total num frames: 66248704. Throughput: 0: 11998.7. Samples: 66248696. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:38:49,600][547459] Avg episode reward: [(0, '4684.353')] [2023-03-08 20:38:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000129392_66248704.pth... [2023-03-08 20:38:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000128688_65888256.pth [2023-03-08 20:38:51,519][547799] Updated weights for policy 0, policy_version 129440 (0.0005) [2023-03-08 20:38:54,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11107.8). Total num frames: 66306048. Throughput: 0: 12013.3. Samples: 66285568. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:38:54,600][547459] Avg episode reward: [(0, '4636.327')] [2023-03-08 20:38:55,062][547799] Updated weights for policy 0, policy_version 129520 (0.0004) [2023-03-08 20:38:58,236][547799] Updated weights for policy 0, policy_version 129600 (0.0005) [2023-03-08 20:38:59,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11135.6). Total num frames: 66367488. Throughput: 0: 11995.4. Samples: 66359208. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:38:59,600][547459] Avg episode reward: [(0, '4819.321')] [2023-03-08 20:39:01,644][547799] Updated weights for policy 0, policy_version 129680 (0.0005) [2023-03-08 20:39:04,599][547459] Fps is (10 sec: 12288.0, 60 sec: 12015.0, 300 sec: 11163.3). Total num frames: 66428928. Throughput: 0: 12093.7. Samples: 66432544. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:04,600][547459] Avg episode reward: [(0, '4812.556')] [2023-03-08 20:39:04,657][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000129752_66433024.pth... [2023-03-08 20:39:04,658][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000129040_66068480.pth [2023-03-08 20:39:04,928][547799] Updated weights for policy 0, policy_version 129760 (0.0004) [2023-03-08 20:39:08,140][547799] Updated weights for policy 0, policy_version 129840 (0.0005) [2023-03-08 20:39:09,600][547459] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 11205.0). Total num frames: 66494464. Throughput: 0: 12107.2. Samples: 66469952. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:09,600][547459] Avg episode reward: [(0, '4783.759')] [2023-03-08 20:39:11,587][547799] Updated weights for policy 0, policy_version 129920 (0.0005) [2023-03-08 20:39:14,599][547459] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 11232.8). Total num frames: 66555904. Throughput: 0: 12147.8. Samples: 66543600. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:14,600][547459] Avg episode reward: [(0, '4824.356')] [2023-03-08 20:39:14,874][547799] Updated weights for policy 0, policy_version 130000 (0.0005) [2023-03-08 20:39:18,053][547799] Updated weights for policy 0, policy_version 130080 (0.0005) [2023-03-08 20:39:19,600][547459] Fps is (10 sec: 12287.7, 60 sec: 12151.4, 300 sec: 11260.5). Total num frames: 66617344. Throughput: 0: 12189.1. Samples: 66618156. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:19,600][547459] Avg episode reward: [(0, '4818.253')] [2023-03-08 20:39:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000130112_66617344.pth... [2023-03-08 20:39:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000129392_66248704.pth [2023-03-08 20:39:21,527][547799] Updated weights for policy 0, policy_version 130160 (0.0005) [2023-03-08 20:39:24,599][547459] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11302.2). Total num frames: 66678784. Throughput: 0: 12196.9. Samples: 66654216. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:24,600][547459] Avg episode reward: [(0, '4830.328')] [2023-03-08 20:39:24,731][547799] Updated weights for policy 0, policy_version 130240 (0.0004) [2023-03-08 20:39:28,243][547799] Updated weights for policy 0, policy_version 130320 (0.0004) [2023-03-08 20:39:29,600][547459] Fps is (10 sec: 11878.7, 60 sec: 12151.5, 300 sec: 11316.1). Total num frames: 66736128. Throughput: 0: 12199.9. Samples: 66727736. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:29,600][547459] Avg episode reward: [(0, '4737.261')] [2023-03-08 20:39:31,707][547799] Updated weights for policy 0, policy_version 130400 (0.0004) [2023-03-08 20:39:34,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 11343.8). Total num frames: 66797568. Throughput: 0: 12195.8. Samples: 66797508. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:34,600][547459] Avg episode reward: [(0, '4734.442')] [2023-03-08 20:39:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000130464_66797568.pth... [2023-03-08 20:39:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000129752_66433024.pth [2023-03-08 20:39:35,202][547799] Updated weights for policy 0, policy_version 130480 (0.0005) [2023-03-08 20:39:38,693][547799] Updated weights for policy 0, policy_version 130560 (0.0005) [2023-03-08 20:39:39,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11371.6). Total num frames: 66854912. Throughput: 0: 12195.8. Samples: 66834380. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:39,600][547459] Avg episode reward: [(0, '4827.581')] [2023-03-08 20:39:42,398][547799] Updated weights for policy 0, policy_version 130640 (0.0004) [2023-03-08 20:39:44,599][547459] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 11385.5). Total num frames: 66912256. Throughput: 0: 12101.6. Samples: 66903780. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:44,600][547459] Avg episode reward: [(0, '4826.637')] [2023-03-08 20:39:45,696][547799] Updated weights for policy 0, policy_version 130720 (0.0004) [2023-03-08 20:39:49,115][547799] Updated weights for policy 0, policy_version 130800 (0.0005) [2023-03-08 20:39:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11413.3). Total num frames: 66973696. Throughput: 0: 12027.0. Samples: 66973760. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:49,600][547459] Avg episode reward: [(0, '4825.592')] [2023-03-08 20:39:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000130808_66973696.pth... [2023-03-08 20:39:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000130112_66617344.pth [2023-03-08 20:39:52,579][547799] Updated weights for policy 0, policy_version 130880 (0.0005) [2023-03-08 20:39:54,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 11441.0). Total num frames: 67035136. Throughput: 0: 12008.5. Samples: 67010336. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:54,600][547459] Avg episode reward: [(0, '4830.662')] [2023-03-08 20:39:56,006][547799] Updated weights for policy 0, policy_version 130960 (0.0005) [2023-03-08 20:39:59,301][547799] Updated weights for policy 0, policy_version 131040 (0.0005) [2023-03-08 20:39:59,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11454.9). Total num frames: 67092480. Throughput: 0: 12015.3. Samples: 67084288. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:39:59,600][547459] Avg episode reward: [(0, '4830.756')] [2023-03-08 20:40:02,695][547799] Updated weights for policy 0, policy_version 131120 (0.0005) [2023-03-08 20:40:04,600][547459] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11482.7). Total num frames: 67153920. Throughput: 0: 11949.2. Samples: 67155868. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:40:04,600][547459] Avg episode reward: [(0, '4815.625')] [2023-03-08 20:40:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000131160_67153920.pth... [2023-03-08 20:40:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000130464_66797568.pth [2023-03-08 20:40:05,939][547799] Updated weights for policy 0, policy_version 131200 (0.0005) [2023-03-08 20:40:09,426][547799] Updated weights for policy 0, policy_version 131280 (0.0005) [2023-03-08 20:40:09,600][547459] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11510.5). Total num frames: 67215360. Throughput: 0: 11947.1. Samples: 67191836. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:40:09,600][547459] Avg episode reward: [(0, '4776.879')] [2023-03-08 20:40:12,503][547799] Updated weights for policy 0, policy_version 131360 (0.0005) [2023-03-08 20:40:14,600][547459] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 11566.0). Total num frames: 67280896. Throughput: 0: 12025.0. Samples: 67268860. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:40:14,600][547459] Avg episode reward: [(0, '4761.467')] [2023-03-08 20:40:15,968][547799] Updated weights for policy 0, policy_version 131440 (0.0005) [2023-03-08 20:40:19,527][547799] Updated weights for policy 0, policy_version 131520 (0.0004) [2023-03-08 20:40:19,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12015.0, 300 sec: 11579.9). Total num frames: 67338240. Throughput: 0: 12016.3. Samples: 67338240. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:40:19,600][547459] Avg episode reward: [(0, '4826.569')] [2023-03-08 20:40:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000131520_67338240.pth... [2023-03-08 20:40:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000130808_66973696.pth [2023-03-08 20:40:23,063][547799] Updated weights for policy 0, policy_version 131600 (0.0004) [2023-03-08 20:40:24,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11593.8). Total num frames: 67395584. Throughput: 0: 11991.6. Samples: 67374000. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:40:24,600][547459] Avg episode reward: [(0, '4828.995')] [2023-03-08 20:40:26,411][547799] Updated weights for policy 0, policy_version 131680 (0.0004) [2023-03-08 20:40:29,600][547459] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11621.5). Total num frames: 67457024. Throughput: 0: 12037.3. Samples: 67445460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:40:29,600][547459] Avg episode reward: [(0, '4735.371')] [2023-03-08 20:40:29,770][547799] Updated weights for policy 0, policy_version 131760 (0.0005) [2023-03-08 20:40:33,093][547799] Updated weights for policy 0, policy_version 131840 (0.0005) [2023-03-08 20:40:34,599][547459] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11663.2). Total num frames: 67518464. Throughput: 0: 12107.9. Samples: 67518616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:40:34,600][547459] Avg episode reward: [(0, '4792.491')] [2023-03-08 20:40:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000131872_67518464.pth... [2023-03-08 20:40:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000131160_67153920.pth [2023-03-08 20:40:36,466][547799] Updated weights for policy 0, policy_version 131920 (0.0005) [2023-03-08 20:40:39,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11691.0). Total num frames: 67579904. Throughput: 0: 12112.4. Samples: 67555392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:40:39,600][547459] Avg episode reward: [(0, '4791.357')] [2023-03-08 20:40:39,842][547799] Updated weights for policy 0, policy_version 132000 (0.0005) [2023-03-08 20:40:43,075][547799] Updated weights for policy 0, policy_version 132080 (0.0005) [2023-03-08 20:40:44,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11718.7). Total num frames: 67641344. Throughput: 0: 12136.8. Samples: 67630444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:40:44,600][547459] Avg episode reward: [(0, '4788.367')] [2023-03-08 20:40:46,561][547799] Updated weights for policy 0, policy_version 132160 (0.0005) [2023-03-08 20:40:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11732.6). Total num frames: 67698688. Throughput: 0: 12082.5. Samples: 67699580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:40:49,600][547459] Avg episode reward: [(0, '4735.814')] [2023-03-08 20:40:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000132224_67698688.pth... [2023-03-08 20:40:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000131520_67338240.pth [2023-03-08 20:40:50,096][547799] Updated weights for policy 0, policy_version 132240 (0.0005) [2023-03-08 20:40:53,510][547799] Updated weights for policy 0, policy_version 132320 (0.0005) [2023-03-08 20:40:54,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11774.3). Total num frames: 67760128. Throughput: 0: 12084.0. Samples: 67735616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:40:54,600][547459] Avg episode reward: [(0, '4617.890')] [2023-03-08 20:40:56,957][547799] Updated weights for policy 0, policy_version 132400 (0.0005) [2023-03-08 20:40:59,599][547459] Fps is (10 sec: 11878.6, 60 sec: 12083.2, 300 sec: 11788.1). Total num frames: 67817472. Throughput: 0: 11972.3. Samples: 67807612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:40:59,600][547459] Avg episode reward: [(0, '4689.542')] [2023-03-08 20:41:00,475][547799] Updated weights for policy 0, policy_version 132480 (0.0005) [2023-03-08 20:41:03,803][547799] Updated weights for policy 0, policy_version 132560 (0.0005) [2023-03-08 20:41:04,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11815.9). Total num frames: 67878912. Throughput: 0: 12016.4. Samples: 67878976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:41:04,600][547459] Avg episode reward: [(0, '4727.536')] [2023-03-08 20:41:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000132576_67878912.pth... [2023-03-08 20:41:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000131872_67518464.pth [2023-03-08 20:41:07,133][547799] Updated weights for policy 0, policy_version 132640 (0.0005) [2023-03-08 20:41:09,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11829.8). Total num frames: 67936256. Throughput: 0: 12041.2. Samples: 67915856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:41:09,600][547459] Avg episode reward: [(0, '4774.576')] [2023-03-08 20:41:10,716][547799] Updated weights for policy 0, policy_version 132720 (0.0004) [2023-03-08 20:41:14,147][547799] Updated weights for policy 0, policy_version 132800 (0.0005) [2023-03-08 20:41:14,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 67997696. Throughput: 0: 12000.4. Samples: 67985480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:41:14,600][547459] Avg episode reward: [(0, '4676.138')] [2023-03-08 20:41:17,382][547799] Updated weights for policy 0, policy_version 132880 (0.0005) [2023-03-08 20:41:19,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11899.2). Total num frames: 68059136. Throughput: 0: 12072.3. Samples: 68061872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:41:19,600][547459] Avg episode reward: [(0, '4738.465')] [2023-03-08 20:41:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000132928_68059136.pth... [2023-03-08 20:41:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000132224_67698688.pth [2023-03-08 20:41:20,769][547799] Updated weights for policy 0, policy_version 132960 (0.0006) [2023-03-08 20:41:24,051][547799] Updated weights for policy 0, policy_version 133040 (0.0006) [2023-03-08 20:41:24,599][547459] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11927.0). Total num frames: 68120576. Throughput: 0: 12071.1. Samples: 68098588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:41:24,610][547459] Avg episode reward: [(0, '4686.936')] [2023-03-08 20:41:27,362][547799] Updated weights for policy 0, policy_version 133120 (0.0005) [2023-03-08 20:41:29,599][547459] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 68182016. Throughput: 0: 12025.9. Samples: 68171608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:41:29,610][547459] Avg episode reward: [(0, '4723.369')] [2023-03-08 20:41:30,681][547799] Updated weights for policy 0, policy_version 133200 (0.0006) [2023-03-08 20:41:34,054][547799] Updated weights for policy 0, policy_version 133280 (0.0005) [2023-03-08 20:41:34,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 68243456. Throughput: 0: 12102.1. Samples: 68244172. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:41:34,600][547459] Avg episode reward: [(0, '4758.904')] [2023-03-08 20:41:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000133288_68243456.pth... [2023-03-08 20:41:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000132576_67878912.pth [2023-03-08 20:41:37,559][547799] Updated weights for policy 0, policy_version 133360 (0.0005) [2023-03-08 20:41:39,599][547459] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 12010.3). Total num frames: 68300800. Throughput: 0: 12104.6. Samples: 68280320. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:41:39,600][547459] Avg episode reward: [(0, '4693.023')] [2023-03-08 20:41:41,002][547799] Updated weights for policy 0, policy_version 133440 (0.0005) [2023-03-08 20:41:44,547][547799] Updated weights for policy 0, policy_version 133520 (0.0004) [2023-03-08 20:41:44,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 68362240. Throughput: 0: 12053.4. Samples: 68350016. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:41:44,600][547459] Avg episode reward: [(0, '4768.030')] [2023-03-08 20:41:47,786][547799] Updated weights for policy 0, policy_version 133600 (0.0004) [2023-03-08 20:41:49,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 68423680. Throughput: 0: 12163.2. Samples: 68426320. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:41:49,600][547459] Avg episode reward: [(0, '4767.822')] [2023-03-08 20:41:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000133640_68423680.pth... [2023-03-08 20:41:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000132928_68059136.pth [2023-03-08 20:41:50,986][547799] Updated weights for policy 0, policy_version 133680 (0.0004) [2023-03-08 20:41:54,372][547799] Updated weights for policy 0, policy_version 133760 (0.0004) [2023-03-08 20:41:54,599][547459] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 68485120. Throughput: 0: 12151.9. Samples: 68462692. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:41:54,600][547459] Avg episode reward: [(0, '4599.261')] [2023-03-08 20:41:57,666][547799] Updated weights for policy 0, policy_version 133840 (0.0005) [2023-03-08 20:41:59,600][547459] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 68550656. Throughput: 0: 12277.6. Samples: 68537972. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:41:59,600][547459] Avg episode reward: [(0, '4414.626')] [2023-03-08 20:42:00,890][547799] Updated weights for policy 0, policy_version 133920 (0.0005) [2023-03-08 20:42:04,413][547799] Updated weights for policy 0, policy_version 134000 (0.0004) [2023-03-08 20:42:04,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 68608000. Throughput: 0: 12162.2. Samples: 68609172. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:42:04,611][547459] Avg episode reward: [(0, '4330.950')] [2023-03-08 20:42:04,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000134000_68608000.pth... [2023-03-08 20:42:04,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000133288_68243456.pth [2023-03-08 20:42:08,086][547799] Updated weights for policy 0, policy_version 134080 (0.0004) [2023-03-08 20:42:09,600][547459] Fps is (10 sec: 11468.9, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 68665344. Throughput: 0: 12076.2. Samples: 68642016. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:42:09,611][547459] Avg episode reward: [(0, '4279.725')] [2023-03-08 20:42:11,589][547799] Updated weights for policy 0, policy_version 134160 (0.0005) [2023-03-08 20:42:14,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 68726784. Throughput: 0: 12043.3. Samples: 68713556. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:42:14,610][547459] Avg episode reward: [(0, '4436.258')] [2023-03-08 20:42:14,952][547799] Updated weights for policy 0, policy_version 134240 (0.0005) [2023-03-08 20:42:18,407][547799] Updated weights for policy 0, policy_version 134320 (0.0005) [2023-03-08 20:42:19,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 68784128. Throughput: 0: 12038.9. Samples: 68785920. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:42:19,611][547459] Avg episode reward: [(0, '4645.542')] [2023-03-08 20:42:19,613][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000134344_68784128.pth... [2023-03-08 20:42:19,616][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000133640_68423680.pth [2023-03-08 20:42:21,682][547799] Updated weights for policy 0, policy_version 134400 (0.0005) [2023-03-08 20:42:24,599][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 68845568. Throughput: 0: 12064.4. Samples: 68823220. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:42:24,610][547459] Avg episode reward: [(0, '4567.923')] [2023-03-08 20:42:25,070][547799] Updated weights for policy 0, policy_version 134480 (0.0005) [2023-03-08 20:42:28,665][547799] Updated weights for policy 0, policy_version 134560 (0.0005) [2023-03-08 20:42:29,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 68902912. Throughput: 0: 12056.6. Samples: 68892564. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:42:29,611][547459] Avg episode reward: [(0, '4741.041')] [2023-03-08 20:42:31,803][547799] Updated weights for policy 0, policy_version 134640 (0.0005) [2023-03-08 20:42:34,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 68968448. Throughput: 0: 12068.6. Samples: 68969408. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:42:34,611][547459] Avg episode reward: [(0, '4746.000')] [2023-03-08 20:42:34,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000134704_68968448.pth... [2023-03-08 20:42:34,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000134000_68608000.pth [2023-03-08 20:42:35,030][547799] Updated weights for policy 0, policy_version 134720 (0.0005) [2023-03-08 20:42:38,479][547799] Updated weights for policy 0, policy_version 134800 (0.0005) [2023-03-08 20:42:39,600][547459] Fps is (10 sec: 12697.7, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 69029888. Throughput: 0: 12081.1. Samples: 69006340. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:42:39,611][547459] Avg episode reward: [(0, '4760.540')] [2023-03-08 20:42:41,698][547799] Updated weights for policy 0, policy_version 134880 (0.0005) [2023-03-08 20:42:44,599][547459] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 69091328. Throughput: 0: 12072.9. Samples: 69081252. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:42:44,611][547459] Avg episode reward: [(0, '4782.206')] [2023-03-08 20:42:45,038][547799] Updated weights for policy 0, policy_version 134960 (0.0005) [2023-03-08 20:42:48,442][547799] Updated weights for policy 0, policy_version 135040 (0.0005) [2023-03-08 20:42:49,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 69148672. Throughput: 0: 12076.1. Samples: 69152596. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:42:49,611][547459] Avg episode reward: [(0, '4734.126')] [2023-03-08 20:42:49,616][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000135064_69152768.pth... [2023-03-08 20:42:49,618][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000134344_68784128.pth [2023-03-08 20:42:51,977][547799] Updated weights for policy 0, policy_version 135120 (0.0004) [2023-03-08 20:42:54,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 69210112. Throughput: 0: 12132.5. Samples: 69187980. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:42:54,600][547459] Avg episode reward: [(0, '4778.297')] [2023-03-08 20:42:55,654][547799] Updated weights for policy 0, policy_version 135200 (0.0005) [2023-03-08 20:42:59,075][547799] Updated weights for policy 0, policy_version 135280 (0.0004) [2023-03-08 20:42:59,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 69267456. Throughput: 0: 12040.4. Samples: 69255376. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:42:59,600][547459] Avg episode reward: [(0, '4580.889')] [2023-03-08 20:43:02,538][547799] Updated weights for policy 0, policy_version 135360 (0.0005) [2023-03-08 20:43:04,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 69324800. Throughput: 0: 12034.2. Samples: 69327460. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:43:04,600][547459] Avg episode reward: [(0, '4781.313')] [2023-03-08 20:43:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000135400_69324800.pth... [2023-03-08 20:43:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000134704_68968448.pth [2023-03-08 20:43:06,139][547799] Updated weights for policy 0, policy_version 135440 (0.0005) [2023-03-08 20:43:09,460][547799] Updated weights for policy 0, policy_version 135520 (0.0004) [2023-03-08 20:43:09,599][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 69386240. Throughput: 0: 11979.4. Samples: 69362292. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:43:09,600][547459] Avg episode reward: [(0, '4818.882')] [2023-03-08 20:43:12,807][547799] Updated weights for policy 0, policy_version 135600 (0.0005) [2023-03-08 20:43:14,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 69443584. Throughput: 0: 12057.1. Samples: 69435132. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:43:14,600][547459] Avg episode reward: [(0, '4788.651')] [2023-03-08 20:43:16,445][547799] Updated weights for policy 0, policy_version 135680 (0.0005) [2023-03-08 20:43:19,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 69505024. Throughput: 0: 11911.6. Samples: 69505432. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:43:19,600][547459] Avg episode reward: [(0, '4748.966')] [2023-03-08 20:43:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000135752_69505024.pth... [2023-03-08 20:43:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000135064_69152768.pth [2023-03-08 20:43:19,853][547799] Updated weights for policy 0, policy_version 135760 (0.0005) [2023-03-08 20:43:23,418][547799] Updated weights for policy 0, policy_version 135840 (0.0005) [2023-03-08 20:43:24,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 69562368. Throughput: 0: 11843.7. Samples: 69539308. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:43:24,611][547459] Avg episode reward: [(0, '4508.746')] [2023-03-08 20:43:26,917][547799] Updated weights for policy 0, policy_version 135920 (0.0005) [2023-03-08 20:43:29,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 69619712. Throughput: 0: 11761.2. Samples: 69610508. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:43:29,600][547459] Avg episode reward: [(0, '4715.310')] [2023-03-08 20:43:30,340][547799] Updated weights for policy 0, policy_version 136000 (0.0005) [2023-03-08 20:43:33,784][547799] Updated weights for policy 0, policy_version 136080 (0.0005) [2023-03-08 20:43:34,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 69681152. Throughput: 0: 11747.3. Samples: 69681224. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:43:34,611][547459] Avg episode reward: [(0, '4759.736')] [2023-03-08 20:43:34,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000136096_69681152.pth... [2023-03-08 20:43:34,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000135400_69324800.pth [2023-03-08 20:43:37,282][547799] Updated weights for policy 0, policy_version 136160 (0.0005) [2023-03-08 20:43:39,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 69742592. Throughput: 0: 11773.1. Samples: 69717768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:43:39,611][547459] Avg episode reward: [(0, '4818.614')] [2023-03-08 20:43:40,479][547799] Updated weights for policy 0, policy_version 136240 (0.0005) [2023-03-08 20:43:43,873][547799] Updated weights for policy 0, policy_version 136320 (0.0005) [2023-03-08 20:43:44,600][547459] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 69804032. Throughput: 0: 11920.7. Samples: 69791808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:43:44,611][547459] Avg episode reward: [(0, '4827.093')] [2023-03-08 20:43:47,100][547799] Updated weights for policy 0, policy_version 136400 (0.0005) [2023-03-08 20:43:49,600][547459] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 69869568. Throughput: 0: 12041.5. Samples: 69869328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:43:49,600][547459] Avg episode reward: [(0, '4783.573')] [2023-03-08 20:43:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000136464_69869568.pth... [2023-03-08 20:43:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000135752_69505024.pth [2023-03-08 20:43:50,352][547799] Updated weights for policy 0, policy_version 136480 (0.0005) [2023-03-08 20:43:53,756][547799] Updated weights for policy 0, policy_version 136560 (0.0005) [2023-03-08 20:43:54,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 69926912. Throughput: 0: 12015.5. Samples: 69902988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:43:54,600][547459] Avg episode reward: [(0, '4541.585')] [2023-03-08 20:43:57,044][547799] Updated weights for policy 0, policy_version 136640 (0.0005) [2023-03-08 20:43:59,600][547459] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 69988352. Throughput: 0: 12022.1. Samples: 69976128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:43:59,600][547459] Avg episode reward: [(0, '4695.697')] [2023-03-08 20:44:00,574][547799] Updated weights for policy 0, policy_version 136720 (0.0006) [2023-03-08 20:44:03,916][547799] Updated weights for policy 0, policy_version 136800 (0.0005) [2023-03-08 20:44:04,599][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 70045696. Throughput: 0: 12084.8. Samples: 70049248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:44:04,600][547459] Avg episode reward: [(0, '4814.161')] [2023-03-08 20:44:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000136808_70045696.pth... [2023-03-08 20:44:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000136096_69681152.pth [2023-03-08 20:44:07,245][547799] Updated weights for policy 0, policy_version 136880 (0.0006) [2023-03-08 20:44:09,599][547459] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 70111232. Throughput: 0: 12157.0. Samples: 70086372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:44:09,600][547459] Avg episode reward: [(0, '4818.504')] [2023-03-08 20:44:10,544][547799] Updated weights for policy 0, policy_version 136960 (0.0005) [2023-03-08 20:44:13,724][547799] Updated weights for policy 0, policy_version 137040 (0.0005) [2023-03-08 20:44:14,599][547459] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 70172672. Throughput: 0: 12235.2. Samples: 70161092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:44:14,600][547459] Avg episode reward: [(0, '4677.573')] [2023-03-08 20:44:16,986][547799] Updated weights for policy 0, policy_version 137120 (0.0005) [2023-03-08 20:44:19,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 70234112. Throughput: 0: 12317.1. Samples: 70235492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:44:19,600][547459] Avg episode reward: [(0, '4544.042')] [2023-03-08 20:44:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000137176_70234112.pth... [2023-03-08 20:44:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000136464_69869568.pth [2023-03-08 20:44:20,479][547799] Updated weights for policy 0, policy_version 137200 (0.0005) [2023-03-08 20:44:24,009][547799] Updated weights for policy 0, policy_version 137280 (0.0005) [2023-03-08 20:44:24,599][547459] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 70291456. Throughput: 0: 12284.6. Samples: 70270576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:44:24,600][547459] Avg episode reward: [(0, '4695.731')] [2023-03-08 20:44:27,503][547799] Updated weights for policy 0, policy_version 137360 (0.0005) [2023-03-08 20:44:29,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12052.0). Total num frames: 70352896. Throughput: 0: 12195.4. Samples: 70340600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:44:29,600][547459] Avg episode reward: [(0, '4689.312')] [2023-03-08 20:44:30,859][547799] Updated weights for policy 0, policy_version 137440 (0.0005) [2023-03-08 20:44:34,018][547799] Updated weights for policy 0, policy_version 137520 (0.0005) [2023-03-08 20:44:34,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 70414336. Throughput: 0: 12144.7. Samples: 70415840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:44:34,600][547459] Avg episode reward: [(0, '4813.962')] [2023-03-08 20:44:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000137528_70414336.pth... [2023-03-08 20:44:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000136808_70045696.pth [2023-03-08 20:44:37,520][547799] Updated weights for policy 0, policy_version 137600 (0.0005) [2023-03-08 20:44:39,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 70471680. Throughput: 0: 12183.9. Samples: 70451264. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:44:39,600][547459] Avg episode reward: [(0, '4662.299')] [2023-03-08 20:44:40,992][547799] Updated weights for policy 0, policy_version 137680 (0.0005) [2023-03-08 20:44:44,480][547799] Updated weights for policy 0, policy_version 137760 (0.0005) [2023-03-08 20:44:44,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 70533120. Throughput: 0: 12125.2. Samples: 70521760. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:44:44,600][547459] Avg episode reward: [(0, '4625.449')] [2023-03-08 20:44:47,950][547799] Updated weights for policy 0, policy_version 137840 (0.0005) [2023-03-08 20:44:49,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 70590464. Throughput: 0: 12096.0. Samples: 70593568. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:44:49,600][547459] Avg episode reward: [(0, '4776.673')] [2023-03-08 20:44:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000137872_70590464.pth... [2023-03-08 20:44:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000137176_70234112.pth [2023-03-08 20:44:51,369][547799] Updated weights for policy 0, policy_version 137920 (0.0004) [2023-03-08 20:44:54,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12065.9). Total num frames: 70651904. Throughput: 0: 12074.3. Samples: 70629716. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:44:54,600][547459] Avg episode reward: [(0, '4641.014')] [2023-03-08 20:44:54,696][547799] Updated weights for policy 0, policy_version 138000 (0.0005) [2023-03-08 20:44:57,931][547799] Updated weights for policy 0, policy_version 138080 (0.0005) [2023-03-08 20:44:59,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 70713344. Throughput: 0: 12068.6. Samples: 70704180. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:44:59,600][547459] Avg episode reward: [(0, '4663.618')] [2023-03-08 20:45:01,259][547799] Updated weights for policy 0, policy_version 138160 (0.0005) [2023-03-08 20:45:04,464][547799] Updated weights for policy 0, policy_version 138240 (0.0005) [2023-03-08 20:45:04,600][547459] Fps is (10 sec: 12697.4, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 70778880. Throughput: 0: 12082.7. Samples: 70779216. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:45:04,600][547459] Avg episode reward: [(0, '4622.080')] [2023-03-08 20:45:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000138240_70778880.pth... [2023-03-08 20:45:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000137528_70414336.pth [2023-03-08 20:45:07,660][547799] Updated weights for policy 0, policy_version 138320 (0.0005) [2023-03-08 20:45:09,600][547459] Fps is (10 sec: 12697.6, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 70840320. Throughput: 0: 12184.5. Samples: 70818880. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:45:09,600][547459] Avg episode reward: [(0, '4563.301')] [2023-03-08 20:45:11,052][547799] Updated weights for policy 0, policy_version 138400 (0.0005) [2023-03-08 20:45:14,508][547799] Updated weights for policy 0, policy_version 138480 (0.0005) [2023-03-08 20:45:14,599][547459] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 70901760. Throughput: 0: 12209.6. Samples: 70890032. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:45:14,600][547459] Avg episode reward: [(0, '4629.766')] [2023-03-08 20:45:17,805][547799] Updated weights for policy 0, policy_version 138560 (0.0004) [2023-03-08 20:45:19,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 70963200. Throughput: 0: 12165.1. Samples: 70963272. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:45:19,600][547459] Avg episode reward: [(0, '4380.387')] [2023-03-08 20:45:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000138600_70963200.pth... [2023-03-08 20:45:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000137872_70590464.pth [2023-03-08 20:45:21,181][547799] Updated weights for policy 0, policy_version 138640 (0.0004) [2023-03-08 20:45:24,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 71020544. Throughput: 0: 12147.8. Samples: 70997916. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:45:24,600][547459] Avg episode reward: [(0, '4425.741')] [2023-03-08 20:45:24,821][547799] Updated weights for policy 0, policy_version 138720 (0.0004) [2023-03-08 20:45:28,089][547799] Updated weights for policy 0, policy_version 138800 (0.0005) [2023-03-08 20:45:29,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 71081984. Throughput: 0: 12181.4. Samples: 71069924. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:45:29,600][547459] Avg episode reward: [(0, '4602.379')] [2023-03-08 20:45:31,562][547799] Updated weights for policy 0, policy_version 138880 (0.0005) [2023-03-08 20:45:34,599][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 71139328. Throughput: 0: 12177.6. Samples: 71141560. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:45:34,600][547459] Avg episode reward: [(0, '4287.032')] [2023-03-08 20:45:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000138944_71139328.pth... [2023-03-08 20:45:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000138240_70778880.pth [2023-03-08 20:45:35,023][547799] Updated weights for policy 0, policy_version 138960 (0.0005) [2023-03-08 20:45:38,495][547799] Updated weights for policy 0, policy_version 139040 (0.0005) [2023-03-08 20:45:39,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 71200768. Throughput: 0: 12148.5. Samples: 71176400. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:45:39,600][547459] Avg episode reward: [(0, '4656.441')] [2023-03-08 20:45:42,026][547799] Updated weights for policy 0, policy_version 139120 (0.0005) [2023-03-08 20:45:44,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 71258112. Throughput: 0: 12051.7. Samples: 71246508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:45:44,600][547459] Avg episode reward: [(0, '4463.203')] [2023-03-08 20:45:45,674][547799] Updated weights for policy 0, policy_version 139200 (0.0004) [2023-03-08 20:45:49,099][547799] Updated weights for policy 0, policy_version 139280 (0.0005) [2023-03-08 20:45:49,600][547459] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 71315456. Throughput: 0: 11927.4. Samples: 71315948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:45:49,600][547459] Avg episode reward: [(0, '4548.462')] [2023-03-08 20:45:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000139288_71315456.pth... [2023-03-08 20:45:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000138600_70963200.pth [2023-03-08 20:45:52,276][547799] Updated weights for policy 0, policy_version 139360 (0.0006) [2023-03-08 20:45:54,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 71376896. Throughput: 0: 11936.7. Samples: 71356032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:45:54,600][547459] Avg episode reward: [(0, '4548.433')] [2023-03-08 20:45:55,882][547799] Updated weights for policy 0, policy_version 139440 (0.0004) [2023-03-08 20:45:59,175][547799] Updated weights for policy 0, policy_version 139520 (0.0004) [2023-03-08 20:45:59,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 71438336. Throughput: 0: 11910.7. Samples: 71426016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:45:59,600][547459] Avg episode reward: [(0, '4599.919')] [2023-03-08 20:46:02,487][547799] Updated weights for policy 0, policy_version 139600 (0.0005) [2023-03-08 20:46:04,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 71499776. Throughput: 0: 11923.7. Samples: 71499840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:46:04,600][547459] Avg episode reward: [(0, '4498.739')] [2023-03-08 20:46:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000139648_71499776.pth... [2023-03-08 20:46:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000138944_71139328.pth [2023-03-08 20:46:05,838][547799] Updated weights for policy 0, policy_version 139680 (0.0005) [2023-03-08 20:46:09,138][547799] Updated weights for policy 0, policy_version 139760 (0.0005) [2023-03-08 20:46:09,600][547459] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 71561216. Throughput: 0: 11971.6. Samples: 71536640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:46:09,600][547459] Avg episode reward: [(0, '4661.873')] [2023-03-08 20:46:12,551][547799] Updated weights for policy 0, policy_version 139840 (0.0005) [2023-03-08 20:46:14,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 71618560. Throughput: 0: 11987.2. Samples: 71609348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:46:14,600][547459] Avg episode reward: [(0, '4597.431')] [2023-03-08 20:46:16,253][547799] Updated weights for policy 0, policy_version 139920 (0.0005) [2023-03-08 20:46:19,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 71675904. Throughput: 0: 11875.7. Samples: 71675968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:46:19,600][547459] Avg episode reward: [(0, '4656.121')] [2023-03-08 20:46:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000139992_71675904.pth... [2023-03-08 20:46:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000139288_71315456.pth [2023-03-08 20:46:19,866][547799] Updated weights for policy 0, policy_version 140000 (0.0004) [2023-03-08 20:46:23,130][547799] Updated weights for policy 0, policy_version 140080 (0.0005) [2023-03-08 20:46:24,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 71737344. Throughput: 0: 11935.6. Samples: 71713504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:46:24,600][547459] Avg episode reward: [(0, '4801.660')] [2023-03-08 20:46:26,626][547799] Updated weights for policy 0, policy_version 140160 (0.0005) [2023-03-08 20:46:29,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 71798784. Throughput: 0: 11985.4. Samples: 71785852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:46:29,600][547459] Avg episode reward: [(0, '4710.770')] [2023-03-08 20:46:29,894][547799] Updated weights for policy 0, policy_version 140240 (0.0004) [2023-03-08 20:46:33,145][547799] Updated weights for policy 0, policy_version 140320 (0.0005) [2023-03-08 20:46:34,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 71860224. Throughput: 0: 12094.8. Samples: 71860216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:46:34,600][547459] Avg episode reward: [(0, '4818.685')] [2023-03-08 20:46:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000140352_71860224.pth... [2023-03-08 20:46:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000139648_71499776.pth [2023-03-08 20:46:36,525][547799] Updated weights for policy 0, policy_version 140400 (0.0005) [2023-03-08 20:46:39,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 71917568. Throughput: 0: 12024.9. Samples: 71897152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:46:39,600][547459] Avg episode reward: [(0, '4814.743')] [2023-03-08 20:46:40,039][547799] Updated weights for policy 0, policy_version 140480 (0.0005) [2023-03-08 20:46:43,632][547799] Updated weights for policy 0, policy_version 140560 (0.0005) [2023-03-08 20:46:44,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 71974912. Throughput: 0: 12010.3. Samples: 71966480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:46:44,600][547459] Avg episode reward: [(0, '4688.365')] [2023-03-08 20:46:47,092][547799] Updated weights for policy 0, policy_version 140640 (0.0005) [2023-03-08 20:46:49,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 12038.1). Total num frames: 72036352. Throughput: 0: 11919.6. Samples: 72036220. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:46:49,600][547459] Avg episode reward: [(0, '4691.141')] [2023-03-08 20:46:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000140696_72036352.pth... [2023-03-08 20:46:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000139992_71675904.pth [2023-03-08 20:46:50,611][547799] Updated weights for policy 0, policy_version 140720 (0.0005) [2023-03-08 20:46:53,965][547799] Updated weights for policy 0, policy_version 140800 (0.0005) [2023-03-08 20:46:54,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 72093696. Throughput: 0: 11914.3. Samples: 72072784. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:46:54,600][547459] Avg episode reward: [(0, '4726.417')] [2023-03-08 20:46:57,412][547799] Updated weights for policy 0, policy_version 140880 (0.0005) [2023-03-08 20:46:59,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 72155136. Throughput: 0: 11862.1. Samples: 72143140. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:46:59,600][547459] Avg episode reward: [(0, '4774.212')] [2023-03-08 20:47:00,925][547799] Updated weights for policy 0, policy_version 140960 (0.0004) [2023-03-08 20:47:04,536][547799] Updated weights for policy 0, policy_version 141040 (0.0005) [2023-03-08 20:47:04,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 72212480. Throughput: 0: 11922.8. Samples: 72212492. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:47:04,600][547459] Avg episode reward: [(0, '4779.851')] [2023-03-08 20:47:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000141040_72212480.pth... [2023-03-08 20:47:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000140352_71860224.pth [2023-03-08 20:47:08,033][547799] Updated weights for policy 0, policy_version 141120 (0.0004) [2023-03-08 20:47:09,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 12010.3). Total num frames: 72269824. Throughput: 0: 11874.0. Samples: 72247832. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:47:09,600][547459] Avg episode reward: [(0, '4616.262')] [2023-03-08 20:47:11,300][547799] Updated weights for policy 0, policy_version 141200 (0.0005) [2023-03-08 20:47:14,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 72331264. Throughput: 0: 11863.4. Samples: 72319704. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:47:14,600][547459] Avg episode reward: [(0, '4635.781')] [2023-03-08 20:47:14,846][547799] Updated weights for policy 0, policy_version 141280 (0.0006) [2023-03-08 20:47:18,407][547799] Updated weights for policy 0, policy_version 141360 (0.0005) [2023-03-08 20:47:19,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 72388608. Throughput: 0: 11782.6. Samples: 72390432. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:47:19,600][547459] Avg episode reward: [(0, '4721.100')] [2023-03-08 20:47:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000141384_72388608.pth... [2023-03-08 20:47:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000140696_72036352.pth [2023-03-08 20:47:21,664][547799] Updated weights for policy 0, policy_version 141440 (0.0005) [2023-03-08 20:47:24,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 72450048. Throughput: 0: 11804.9. Samples: 72428372. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:47:24,600][547459] Avg episode reward: [(0, '4700.795')] [2023-03-08 20:47:25,053][547799] Updated weights for policy 0, policy_version 141520 (0.0004) [2023-03-08 20:47:28,437][547799] Updated weights for policy 0, policy_version 141600 (0.0005) [2023-03-08 20:47:29,600][547459] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 72511488. Throughput: 0: 11848.9. Samples: 72499680. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:47:29,600][547459] Avg episode reward: [(0, '4738.000')] [2023-03-08 20:47:31,773][547799] Updated weights for policy 0, policy_version 141680 (0.0004) [2023-03-08 20:47:34,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 72572928. Throughput: 0: 11969.5. Samples: 72574848. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:47:34,600][547459] Avg episode reward: [(0, '4595.970')] [2023-03-08 20:47:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000141744_72572928.pth... [2023-03-08 20:47:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000141040_72212480.pth [2023-03-08 20:47:35,066][547799] Updated weights for policy 0, policy_version 141760 (0.0004) [2023-03-08 20:47:38,392][547799] Updated weights for policy 0, policy_version 141840 (0.0005) [2023-03-08 20:47:39,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 72634368. Throughput: 0: 11981.9. Samples: 72611968. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:47:39,600][547459] Avg episode reward: [(0, '4628.189')] [2023-03-08 20:47:41,791][547799] Updated weights for policy 0, policy_version 141920 (0.0004) [2023-03-08 20:47:44,599][547459] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 72695808. Throughput: 0: 12010.1. Samples: 72683592. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 20:47:44,600][547459] Avg episode reward: [(0, '4428.572')] [2023-03-08 20:47:45,105][547799] Updated weights for policy 0, policy_version 142000 (0.0005) [2023-03-08 20:47:48,640][547799] Updated weights for policy 0, policy_version 142080 (0.0005) [2023-03-08 20:47:49,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 72757248. Throughput: 0: 12107.1. Samples: 72757312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:47:49,600][547459] Avg episode reward: [(0, '4616.291')] [2023-03-08 20:47:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000142104_72757248.pth... [2023-03-08 20:47:49,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000141384_72388608.pth [2023-03-08 20:47:51,840][547799] Updated weights for policy 0, policy_version 142160 (0.0005) [2023-03-08 20:47:54,599][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 72814592. Throughput: 0: 12139.4. Samples: 72794104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:47:54,600][547459] Avg episode reward: [(0, '4627.317')] [2023-03-08 20:47:55,355][547799] Updated weights for policy 0, policy_version 142240 (0.0005) [2023-03-08 20:47:58,982][547799] Updated weights for policy 0, policy_version 142320 (0.0005) [2023-03-08 20:47:59,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 72871936. Throughput: 0: 12086.9. Samples: 72863612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:47:59,600][547459] Avg episode reward: [(0, '4716.337')] [2023-03-08 20:48:02,239][547799] Updated weights for policy 0, policy_version 142400 (0.0004) [2023-03-08 20:48:04,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 72933376. Throughput: 0: 12144.9. Samples: 72936952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:04,600][547459] Avg episode reward: [(0, '4780.147')] [2023-03-08 20:48:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000142448_72933376.pth... [2023-03-08 20:48:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000141744_72572928.pth [2023-03-08 20:48:05,719][547799] Updated weights for policy 0, policy_version 142480 (0.0005) [2023-03-08 20:48:09,074][547799] Updated weights for policy 0, policy_version 142560 (0.0004) [2023-03-08 20:48:09,599][547459] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 72994816. Throughput: 0: 12081.7. Samples: 72972048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:09,600][547459] Avg episode reward: [(0, '4685.828')] [2023-03-08 20:48:12,354][547799] Updated weights for policy 0, policy_version 142640 (0.0005) [2023-03-08 20:48:14,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 73056256. Throughput: 0: 12122.7. Samples: 73045200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:14,600][547459] Avg episode reward: [(0, '4582.664')] [2023-03-08 20:48:15,927][547799] Updated weights for policy 0, policy_version 142720 (0.0004) [2023-03-08 20:48:19,389][547799] Updated weights for policy 0, policy_version 142800 (0.0005) [2023-03-08 20:48:19,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 73113600. Throughput: 0: 12007.9. Samples: 73115204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:19,600][547459] Avg episode reward: [(0, '4582.064')] [2023-03-08 20:48:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000142800_73113600.pth... [2023-03-08 20:48:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000142104_72757248.pth [2023-03-08 20:48:22,728][547799] Updated weights for policy 0, policy_version 142880 (0.0004) [2023-03-08 20:48:24,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 73175040. Throughput: 0: 12007.6. Samples: 73152308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:24,600][547459] Avg episode reward: [(0, '4651.373')] [2023-03-08 20:48:25,965][547799] Updated weights for policy 0, policy_version 142960 (0.0005) [2023-03-08 20:48:29,401][547799] Updated weights for policy 0, policy_version 143040 (0.0005) [2023-03-08 20:48:29,600][547459] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 73236480. Throughput: 0: 12076.1. Samples: 73227016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:29,600][547459] Avg episode reward: [(0, '4574.864')] [2023-03-08 20:48:32,861][547799] Updated weights for policy 0, policy_version 143120 (0.0005) [2023-03-08 20:48:34,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 73297920. Throughput: 0: 12034.5. Samples: 73298864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:34,600][547459] Avg episode reward: [(0, '4582.855')] [2023-03-08 20:48:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000143160_73297920.pth... [2023-03-08 20:48:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000142448_72933376.pth [2023-03-08 20:48:36,203][547799] Updated weights for policy 0, policy_version 143200 (0.0006) [2023-03-08 20:48:39,421][547799] Updated weights for policy 0, policy_version 143280 (0.0005) [2023-03-08 20:48:39,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 73359360. Throughput: 0: 12018.6. Samples: 73334940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:39,600][547459] Avg episode reward: [(0, '4369.536')] [2023-03-08 20:48:42,729][547799] Updated weights for policy 0, policy_version 143360 (0.0006) [2023-03-08 20:48:44,600][547459] Fps is (10 sec: 12697.6, 60 sec: 12151.4, 300 sec: 12052.0). Total num frames: 73424896. Throughput: 0: 12185.7. Samples: 73411968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:44,600][547459] Avg episode reward: [(0, '4492.385')] [2023-03-08 20:48:45,846][547799] Updated weights for policy 0, policy_version 143440 (0.0005) [2023-03-08 20:48:49,089][547799] Updated weights for policy 0, policy_version 143520 (0.0004) [2023-03-08 20:48:49,600][547459] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 73486336. Throughput: 0: 12228.3. Samples: 73487228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:49,600][547459] Avg episode reward: [(0, '4417.189')] [2023-03-08 20:48:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000143528_73486336.pth... [2023-03-08 20:48:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000142800_73113600.pth [2023-03-08 20:48:52,281][547799] Updated weights for policy 0, policy_version 143600 (0.0005) [2023-03-08 20:48:54,600][547459] Fps is (10 sec: 12697.7, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 73551872. Throughput: 0: 12333.8. Samples: 73527068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:54,600][547459] Avg episode reward: [(0, '4472.064')] [2023-03-08 20:48:55,548][547799] Updated weights for policy 0, policy_version 143680 (0.0004) [2023-03-08 20:48:58,889][547799] Updated weights for policy 0, policy_version 143760 (0.0005) [2023-03-08 20:48:59,599][547459] Fps is (10 sec: 12697.7, 60 sec: 12356.3, 300 sec: 12093.6). Total num frames: 73613312. Throughput: 0: 12351.7. Samples: 73601024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:48:59,600][547459] Avg episode reward: [(0, '4496.140')] [2023-03-08 20:49:02,397][547799] Updated weights for policy 0, policy_version 143840 (0.0005) [2023-03-08 20:49:04,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 73670656. Throughput: 0: 12362.2. Samples: 73671504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:04,600][547459] Avg episode reward: [(0, '4563.905')] [2023-03-08 20:49:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000143888_73670656.pth... [2023-03-08 20:49:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000143160_73297920.pth [2023-03-08 20:49:05,804][547799] Updated weights for policy 0, policy_version 143920 (0.0005) [2023-03-08 20:49:09,075][547799] Updated weights for policy 0, policy_version 144000 (0.0005) [2023-03-08 20:49:09,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 73732096. Throughput: 0: 12388.5. Samples: 73709792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:09,600][547459] Avg episode reward: [(0, '4646.857')] [2023-03-08 20:49:12,511][547799] Updated weights for policy 0, policy_version 144080 (0.0005) [2023-03-08 20:49:14,600][547459] Fps is (10 sec: 12288.1, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 73793536. Throughput: 0: 12316.3. Samples: 73781248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:14,600][547459] Avg episode reward: [(0, '4435.735')] [2023-03-08 20:49:15,985][547799] Updated weights for policy 0, policy_version 144160 (0.0005) [2023-03-08 20:49:19,260][547799] Updated weights for policy 0, policy_version 144240 (0.0005) [2023-03-08 20:49:19,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 73850880. Throughput: 0: 12345.4. Samples: 73854408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:19,600][547459] Avg episode reward: [(0, '4452.817')] [2023-03-08 20:49:19,608][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000144248_73854976.pth... [2023-03-08 20:49:19,610][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000143528_73486336.pth [2023-03-08 20:49:22,660][547799] Updated weights for policy 0, policy_version 144320 (0.0005) [2023-03-08 20:49:24,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 73912320. Throughput: 0: 12330.0. Samples: 73889788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:24,600][547459] Avg episode reward: [(0, '4467.942')] [2023-03-08 20:49:26,241][547799] Updated weights for policy 0, policy_version 144400 (0.0004) [2023-03-08 20:49:29,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12052.0). Total num frames: 73969664. Throughput: 0: 12201.3. Samples: 73961028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:29,600][547459] Avg episode reward: [(0, '4632.961')] [2023-03-08 20:49:29,644][547799] Updated weights for policy 0, policy_version 144480 (0.0005) [2023-03-08 20:49:32,954][547799] Updated weights for policy 0, policy_version 144560 (0.0005) [2023-03-08 20:49:34,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 74035200. Throughput: 0: 12177.3. Samples: 74035208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:34,600][547459] Avg episode reward: [(0, '4641.289')] [2023-03-08 20:49:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000144600_74035200.pth... [2023-03-08 20:49:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000143888_73670656.pth [2023-03-08 20:49:36,291][547799] Updated weights for policy 0, policy_version 144640 (0.0005) [2023-03-08 20:49:39,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 74092544. Throughput: 0: 12094.2. Samples: 74071308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:39,600][547459] Avg episode reward: [(0, '4495.765')] [2023-03-08 20:49:39,757][547799] Updated weights for policy 0, policy_version 144720 (0.0004) [2023-03-08 20:49:43,139][547799] Updated weights for policy 0, policy_version 144800 (0.0005) [2023-03-08 20:49:44,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 74153984. Throughput: 0: 12024.0. Samples: 74142104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:44,600][547459] Avg episode reward: [(0, '4468.202')] [2023-03-08 20:49:46,508][547799] Updated weights for policy 0, policy_version 144880 (0.0005) [2023-03-08 20:49:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 74211328. Throughput: 0: 12020.7. Samples: 74212436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:49,600][547459] Avg episode reward: [(0, '4508.715')] [2023-03-08 20:49:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000144944_74211328.pth... [2023-03-08 20:49:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000144248_73854976.pth [2023-03-08 20:49:50,119][547799] Updated weights for policy 0, policy_version 144960 (0.0005) [2023-03-08 20:49:53,446][547799] Updated weights for policy 0, policy_version 145040 (0.0005) [2023-03-08 20:49:54,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 12065.8). Total num frames: 74272768. Throughput: 0: 11971.0. Samples: 74248484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:54,600][547459] Avg episode reward: [(0, '4554.871')] [2023-03-08 20:49:56,546][547799] Updated weights for policy 0, policy_version 145120 (0.0005) [2023-03-08 20:49:59,600][547459] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 74334208. Throughput: 0: 12105.4. Samples: 74325992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:49:59,600][547459] Avg episode reward: [(0, '4599.556')] [2023-03-08 20:49:59,974][547799] Updated weights for policy 0, policy_version 145200 (0.0004) [2023-03-08 20:50:03,424][547799] Updated weights for policy 0, policy_version 145280 (0.0005) [2023-03-08 20:50:04,600][547459] Fps is (10 sec: 12287.8, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 74395648. Throughput: 0: 12032.2. Samples: 74395856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:04,600][547459] Avg episode reward: [(0, '4780.411')] [2023-03-08 20:50:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000145304_74395648.pth... [2023-03-08 20:50:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000144600_74035200.pth [2023-03-08 20:50:06,872][547799] Updated weights for policy 0, policy_version 145360 (0.0005) [2023-03-08 20:50:09,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 74452992. Throughput: 0: 12057.8. Samples: 74432388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:09,600][547459] Avg episode reward: [(0, '4779.431')] [2023-03-08 20:50:10,219][547799] Updated weights for policy 0, policy_version 145440 (0.0005) [2023-03-08 20:50:13,672][547799] Updated weights for policy 0, policy_version 145520 (0.0005) [2023-03-08 20:50:14,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 74514432. Throughput: 0: 12094.6. Samples: 74505284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:14,600][547459] Avg episode reward: [(0, '4688.709')] [2023-03-08 20:50:17,017][547799] Updated weights for policy 0, policy_version 145600 (0.0005) [2023-03-08 20:50:19,600][547459] Fps is (10 sec: 12697.5, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 74579968. Throughput: 0: 12105.8. Samples: 74579968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:19,600][547459] Avg episode reward: [(0, '4762.853')] [2023-03-08 20:50:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000145664_74579968.pth... [2023-03-08 20:50:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000144944_74211328.pth [2023-03-08 20:50:20,239][547799] Updated weights for policy 0, policy_version 145680 (0.0005) [2023-03-08 20:50:23,532][547799] Updated weights for policy 0, policy_version 145760 (0.0004) [2023-03-08 20:50:24,599][547459] Fps is (10 sec: 12697.7, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 74641408. Throughput: 0: 12129.5. Samples: 74617136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:24,600][547459] Avg episode reward: [(0, '4541.877')] [2023-03-08 20:50:26,842][547799] Updated weights for policy 0, policy_version 145840 (0.0005) [2023-03-08 20:50:29,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 74698752. Throughput: 0: 12182.4. Samples: 74690312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:29,600][547459] Avg episode reward: [(0, '4677.815')] [2023-03-08 20:50:30,379][547799] Updated weights for policy 0, policy_version 145920 (0.0005) [2023-03-08 20:50:33,704][547799] Updated weights for policy 0, policy_version 146000 (0.0005) [2023-03-08 20:50:34,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 74760192. Throughput: 0: 12184.4. Samples: 74760732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:34,600][547459] Avg episode reward: [(0, '4671.705')] [2023-03-08 20:50:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000146016_74760192.pth... [2023-03-08 20:50:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000145304_74395648.pth [2023-03-08 20:50:37,189][547799] Updated weights for policy 0, policy_version 146080 (0.0005) [2023-03-08 20:50:39,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 74821632. Throughput: 0: 12190.5. Samples: 74797056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:39,600][547459] Avg episode reward: [(0, '4534.907')] [2023-03-08 20:50:40,596][547799] Updated weights for policy 0, policy_version 146160 (0.0005) [2023-03-08 20:50:43,901][547799] Updated weights for policy 0, policy_version 146240 (0.0005) [2023-03-08 20:50:44,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 74883072. Throughput: 0: 12105.9. Samples: 74870760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:44,600][547459] Avg episode reward: [(0, '4484.321')] [2023-03-08 20:50:47,301][547799] Updated weights for policy 0, policy_version 146320 (0.0005) [2023-03-08 20:50:49,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 74944512. Throughput: 0: 12192.5. Samples: 74944520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:49,600][547459] Avg episode reward: [(0, '4587.131')] [2023-03-08 20:50:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000146376_74944512.pth... [2023-03-08 20:50:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000145664_74579968.pth [2023-03-08 20:50:50,706][547799] Updated weights for policy 0, policy_version 146400 (0.0005) [2023-03-08 20:50:53,992][547799] Updated weights for policy 0, policy_version 146480 (0.0005) [2023-03-08 20:50:54,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 75001856. Throughput: 0: 12186.1. Samples: 74980760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:50:54,600][547459] Avg episode reward: [(0, '4576.328')] [2023-03-08 20:50:57,198][547799] Updated weights for policy 0, policy_version 146560 (0.0004) [2023-03-08 20:50:59,600][547459] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 75067392. Throughput: 0: 12218.4. Samples: 75055112. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:50:59,600][547459] Avg episode reward: [(0, '4660.500')] [2023-03-08 20:51:00,498][547799] Updated weights for policy 0, policy_version 146640 (0.0005) [2023-03-08 20:51:03,819][547799] Updated weights for policy 0, policy_version 146720 (0.0005) [2023-03-08 20:51:04,600][547459] Fps is (10 sec: 12697.4, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 75128832. Throughput: 0: 12198.4. Samples: 75128896. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:04,600][547459] Avg episode reward: [(0, '4631.052')] [2023-03-08 20:51:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000146736_75128832.pth... [2023-03-08 20:51:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000146016_74760192.pth [2023-03-08 20:51:07,168][547799] Updated weights for policy 0, policy_version 146800 (0.0005) [2023-03-08 20:51:09,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12107.5). Total num frames: 75190272. Throughput: 0: 12191.6. Samples: 75165760. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:09,600][547459] Avg episode reward: [(0, '4520.391')] [2023-03-08 20:51:10,482][547799] Updated weights for policy 0, policy_version 146880 (0.0005) [2023-03-08 20:51:13,865][547799] Updated weights for policy 0, policy_version 146960 (0.0005) [2023-03-08 20:51:14,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12121.4). Total num frames: 75251712. Throughput: 0: 12203.9. Samples: 75239488. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:14,600][547459] Avg episode reward: [(0, '4743.072')] [2023-03-08 20:51:17,183][547799] Updated weights for policy 0, policy_version 147040 (0.0005) [2023-03-08 20:51:19,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 75313152. Throughput: 0: 12276.0. Samples: 75313152. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:19,600][547459] Avg episode reward: [(0, '4422.079')] [2023-03-08 20:51:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000147096_75313152.pth... [2023-03-08 20:51:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000146376_74944512.pth [2023-03-08 20:51:20,732][547799] Updated weights for policy 0, policy_version 147120 (0.0005) [2023-03-08 20:51:24,260][547799] Updated weights for policy 0, policy_version 147200 (0.0004) [2023-03-08 20:51:24,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 75370496. Throughput: 0: 12198.4. Samples: 75345984. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:24,600][547459] Avg episode reward: [(0, '4655.822')] [2023-03-08 20:51:27,957][547799] Updated weights for policy 0, policy_version 147280 (0.0004) [2023-03-08 20:51:29,599][547459] Fps is (10 sec: 11059.3, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 75423744. Throughput: 0: 12067.9. Samples: 75413812. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:29,600][547459] Avg episode reward: [(0, '4656.512')] [2023-03-08 20:51:31,517][547799] Updated weights for policy 0, policy_version 147360 (0.0005) [2023-03-08 20:51:34,600][547459] Fps is (10 sec: 11059.2, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 75481088. Throughput: 0: 11936.5. Samples: 75481660. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:34,600][547459] Avg episode reward: [(0, '4563.162')] [2023-03-08 20:51:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000147424_75481088.pth... [2023-03-08 20:51:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000146736_75128832.pth [2023-03-08 20:51:35,143][547799] Updated weights for policy 0, policy_version 147440 (0.0005) [2023-03-08 20:51:38,677][547799] Updated weights for policy 0, policy_version 147520 (0.0005) [2023-03-08 20:51:39,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 75538432. Throughput: 0: 11908.4. Samples: 75516640. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:39,600][547459] Avg episode reward: [(0, '4426.025')] [2023-03-08 20:51:42,187][547799] Updated weights for policy 0, policy_version 147600 (0.0005) [2023-03-08 20:51:44,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 75599872. Throughput: 0: 11830.7. Samples: 75587492. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:44,600][547459] Avg episode reward: [(0, '4430.009')] [2023-03-08 20:51:45,654][547799] Updated weights for policy 0, policy_version 147680 (0.0004) [2023-03-08 20:51:49,241][547799] Updated weights for policy 0, policy_version 147760 (0.0005) [2023-03-08 20:51:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12079.7). Total num frames: 75657216. Throughput: 0: 11735.6. Samples: 75657000. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:49,600][547459] Avg episode reward: [(0, '4662.226')] [2023-03-08 20:51:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000147768_75657216.pth... [2023-03-08 20:51:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000147096_75313152.pth [2023-03-08 20:51:52,771][547799] Updated weights for policy 0, policy_version 147840 (0.0005) [2023-03-08 20:51:54,600][547459] Fps is (10 sec: 11059.2, 60 sec: 11810.1, 300 sec: 12052.0). Total num frames: 75710464. Throughput: 0: 11660.0. Samples: 75690460. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:54,600][547459] Avg episode reward: [(0, '4272.472')] [2023-03-08 20:51:56,398][547799] Updated weights for policy 0, policy_version 147920 (0.0004) [2023-03-08 20:51:59,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 12065.8). Total num frames: 75771904. Throughput: 0: 11612.4. Samples: 75762044. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:51:59,600][547459] Avg episode reward: [(0, '4614.931')] [2023-03-08 20:51:59,673][547799] Updated weights for policy 0, policy_version 148000 (0.0005) [2023-03-08 20:52:03,091][547799] Updated weights for policy 0, policy_version 148080 (0.0005) [2023-03-08 20:52:04,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11741.9, 300 sec: 12079.7). Total num frames: 75833344. Throughput: 0: 11593.6. Samples: 75834864. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:04,600][547459] Avg episode reward: [(0, '4778.194')] [2023-03-08 20:52:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000148112_75833344.pth... [2023-03-08 20:52:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000147424_75481088.pth [2023-03-08 20:52:06,382][547799] Updated weights for policy 0, policy_version 148160 (0.0005) [2023-03-08 20:52:09,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 12065.8). Total num frames: 75890688. Throughput: 0: 11659.2. Samples: 75870648. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:09,600][547459] Avg episode reward: [(0, '4804.811')] [2023-03-08 20:52:10,048][547799] Updated weights for policy 0, policy_version 148240 (0.0005) [2023-03-08 20:52:13,097][547799] Updated weights for policy 0, policy_version 148320 (0.0005) [2023-03-08 20:52:14,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11741.9, 300 sec: 12093.6). Total num frames: 75956224. Throughput: 0: 11812.2. Samples: 75945364. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:14,600][547459] Avg episode reward: [(0, '4772.128')] [2023-03-08 20:52:16,461][547799] Updated weights for policy 0, policy_version 148400 (0.0005) [2023-03-08 20:52:19,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11673.6, 300 sec: 12079.7). Total num frames: 76013568. Throughput: 0: 11906.9. Samples: 76017468. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:19,600][547459] Avg episode reward: [(0, '4775.174')] [2023-03-08 20:52:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000148472_76017664.pth... [2023-03-08 20:52:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000147768_75657216.pth [2023-03-08 20:52:19,960][547799] Updated weights for policy 0, policy_version 148480 (0.0005) [2023-03-08 20:52:23,305][547799] Updated weights for policy 0, policy_version 148560 (0.0006) [2023-03-08 20:52:24,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 12079.7). Total num frames: 76075008. Throughput: 0: 11931.7. Samples: 76053564. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:24,611][547459] Avg episode reward: [(0, '4804.657')] [2023-03-08 20:52:26,670][547799] Updated weights for policy 0, policy_version 148640 (0.0005) [2023-03-08 20:52:29,600][547459] Fps is (10 sec: 12697.5, 60 sec: 11946.6, 300 sec: 12093.6). Total num frames: 76140544. Throughput: 0: 11974.3. Samples: 76126336. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:29,611][547459] Avg episode reward: [(0, '4804.231')] [2023-03-08 20:52:29,916][547799] Updated weights for policy 0, policy_version 148720 (0.0005) [2023-03-08 20:52:33,388][547799] Updated weights for policy 0, policy_version 148800 (0.0005) [2023-03-08 20:52:34,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 76197888. Throughput: 0: 12065.8. Samples: 76199960. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:34,610][547459] Avg episode reward: [(0, '4760.392')] [2023-03-08 20:52:34,613][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000148824_76197888.pth... [2023-03-08 20:52:34,615][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000148112_75833344.pth [2023-03-08 20:52:36,778][547799] Updated weights for policy 0, policy_version 148880 (0.0005) [2023-03-08 20:52:39,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 76259328. Throughput: 0: 12115.3. Samples: 76235648. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:39,600][547459] Avg episode reward: [(0, '4672.585')] [2023-03-08 20:52:40,212][547799] Updated weights for policy 0, policy_version 148960 (0.0005) [2023-03-08 20:52:43,294][547799] Updated weights for policy 0, policy_version 149040 (0.0005) [2023-03-08 20:52:44,600][547459] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 76324864. Throughput: 0: 12223.4. Samples: 76312096. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:44,600][547459] Avg episode reward: [(0, '4762.465')] [2023-03-08 20:52:46,389][547799] Updated weights for policy 0, policy_version 149120 (0.0005) [2023-03-08 20:52:49,600][547459] Fps is (10 sec: 12697.7, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 76386304. Throughput: 0: 12330.2. Samples: 76389720. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:49,600][547459] Avg episode reward: [(0, '4819.373')] [2023-03-08 20:52:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000149192_76386304.pth... [2023-03-08 20:52:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000148472_76017664.pth [2023-03-08 20:52:49,684][547799] Updated weights for policy 0, policy_version 149200 (0.0005) [2023-03-08 20:52:52,951][547799] Updated weights for policy 0, policy_version 149280 (0.0005) [2023-03-08 20:52:54,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12121.4). Total num frames: 76447744. Throughput: 0: 12362.0. Samples: 76426940. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:54,600][547459] Avg episode reward: [(0, '4772.386')] [2023-03-08 20:52:56,557][547799] Updated weights for policy 0, policy_version 149360 (0.0005) [2023-03-08 20:52:59,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12121.4). Total num frames: 76509184. Throughput: 0: 12272.0. Samples: 76497604. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 20:52:59,600][547459] Avg episode reward: [(0, '4612.649')] [2023-03-08 20:52:59,785][547799] Updated weights for policy 0, policy_version 149440 (0.0004) [2023-03-08 20:53:03,044][547799] Updated weights for policy 0, policy_version 149520 (0.0004) [2023-03-08 20:53:04,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12121.4). Total num frames: 76570624. Throughput: 0: 12306.8. Samples: 76571276. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:04,600][547459] Avg episode reward: [(0, '4824.385')] [2023-03-08 20:53:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000149552_76570624.pth... [2023-03-08 20:53:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000148824_76197888.pth [2023-03-08 20:53:06,546][547799] Updated weights for policy 0, policy_version 149600 (0.0004) [2023-03-08 20:53:09,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12356.3, 300 sec: 12121.4). Total num frames: 76632064. Throughput: 0: 12309.8. Samples: 76607504. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:09,600][547459] Avg episode reward: [(0, '4814.879')] [2023-03-08 20:53:09,890][547799] Updated weights for policy 0, policy_version 149680 (0.0005) [2023-03-08 20:53:13,066][547799] Updated weights for policy 0, policy_version 149760 (0.0004) [2023-03-08 20:53:14,599][547459] Fps is (10 sec: 12288.2, 60 sec: 12288.0, 300 sec: 12135.3). Total num frames: 76693504. Throughput: 0: 12397.7. Samples: 76684232. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:14,600][547459] Avg episode reward: [(0, '4817.485')] [2023-03-08 20:53:16,390][547799] Updated weights for policy 0, policy_version 149840 (0.0005) [2023-03-08 20:53:19,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12356.2, 300 sec: 12135.3). Total num frames: 76754944. Throughput: 0: 12334.4. Samples: 76755008. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:19,600][547459] Avg episode reward: [(0, '4696.830')] [2023-03-08 20:53:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000149912_76754944.pth... [2023-03-08 20:53:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000149192_76386304.pth [2023-03-08 20:53:19,872][547799] Updated weights for policy 0, policy_version 149920 (0.0004) [2023-03-08 20:53:23,285][547799] Updated weights for policy 0, policy_version 150000 (0.0005) [2023-03-08 20:53:24,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12288.0, 300 sec: 12121.4). Total num frames: 76812288. Throughput: 0: 12326.1. Samples: 76790324. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:24,600][547459] Avg episode reward: [(0, '4806.639')] [2023-03-08 20:53:26,823][547799] Updated weights for policy 0, policy_version 150080 (0.0005) [2023-03-08 20:53:29,600][547459] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 76873728. Throughput: 0: 12270.7. Samples: 76864276. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:29,600][547459] Avg episode reward: [(0, '4819.607')] [2023-03-08 20:53:29,994][547799] Updated weights for policy 0, policy_version 150160 (0.0005) [2023-03-08 20:53:33,556][547799] Updated weights for policy 0, policy_version 150240 (0.0005) [2023-03-08 20:53:34,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12121.4). Total num frames: 76935168. Throughput: 0: 12120.5. Samples: 76935144. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:34,600][547459] Avg episode reward: [(0, '4727.252')] [2023-03-08 20:53:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000150264_76935168.pth... [2023-03-08 20:53:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000149552_76570624.pth [2023-03-08 20:53:37,038][547799] Updated weights for policy 0, policy_version 150320 (0.0005) [2023-03-08 20:53:39,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12219.8, 300 sec: 12093.6). Total num frames: 76992512. Throughput: 0: 12084.4. Samples: 76970736. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:39,600][547459] Avg episode reward: [(0, '4782.165')] [2023-03-08 20:53:40,604][547799] Updated weights for policy 0, policy_version 150400 (0.0005) [2023-03-08 20:53:44,023][547799] Updated weights for policy 0, policy_version 150480 (0.0005) [2023-03-08 20:53:44,599][547459] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 77049856. Throughput: 0: 12027.9. Samples: 77038860. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:44,600][547459] Avg episode reward: [(0, '4735.048')] [2023-03-08 20:53:47,157][547799] Updated weights for policy 0, policy_version 150560 (0.0005) [2023-03-08 20:53:49,600][547459] Fps is (10 sec: 12287.7, 60 sec: 12151.4, 300 sec: 12079.7). Total num frames: 77115392. Throughput: 0: 12090.1. Samples: 77115332. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:49,600][547459] Avg episode reward: [(0, '4819.616')] [2023-03-08 20:53:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000150616_77115392.pth... [2023-03-08 20:53:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000149912_76754944.pth [2023-03-08 20:53:50,641][547799] Updated weights for policy 0, policy_version 150640 (0.0005) [2023-03-08 20:53:53,882][547799] Updated weights for policy 0, policy_version 150720 (0.0004) [2023-03-08 20:53:54,599][547459] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 77176832. Throughput: 0: 12076.0. Samples: 77150924. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:54,600][547459] Avg episode reward: [(0, '4782.735')] [2023-03-08 20:53:57,131][547799] Updated weights for policy 0, policy_version 150800 (0.0004) [2023-03-08 20:53:59,599][547459] Fps is (10 sec: 12288.2, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 77238272. Throughput: 0: 12084.3. Samples: 77228028. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:53:59,600][547459] Avg episode reward: [(0, '4716.510')] [2023-03-08 20:54:00,597][547799] Updated weights for policy 0, policy_version 150880 (0.0004) [2023-03-08 20:54:03,837][547799] Updated weights for policy 0, policy_version 150960 (0.0005) [2023-03-08 20:54:04,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 77299712. Throughput: 0: 12106.1. Samples: 77299784. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 20:54:04,600][547459] Avg episode reward: [(0, '4699.025')] [2023-03-08 20:54:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000150976_77299712.pth... [2023-03-08 20:54:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000150264_76935168.pth [2023-03-08 20:54:06,970][547799] Updated weights for policy 0, policy_version 151040 (0.0005) [2023-03-08 20:54:09,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 77361152. Throughput: 0: 12230.0. Samples: 77340672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:09,600][547459] Avg episode reward: [(0, '4663.358')] [2023-03-08 20:54:10,448][547799] Updated weights for policy 0, policy_version 151120 (0.0005) [2023-03-08 20:54:13,732][547799] Updated weights for policy 0, policy_version 151200 (0.0005) [2023-03-08 20:54:14,599][547459] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 77422592. Throughput: 0: 12141.3. Samples: 77410632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:14,600][547459] Avg episode reward: [(0, '4608.373')] [2023-03-08 20:54:17,007][547799] Updated weights for policy 0, policy_version 151280 (0.0005) [2023-03-08 20:54:19,599][547459] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 77484032. Throughput: 0: 12204.4. Samples: 77484340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:19,600][547459] Avg episode reward: [(0, '4816.698')] [2023-03-08 20:54:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000151336_77484032.pth... [2023-03-08 20:54:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000150616_77115392.pth [2023-03-08 20:54:20,479][547799] Updated weights for policy 0, policy_version 151360 (0.0004) [2023-03-08 20:54:23,699][547799] Updated weights for policy 0, policy_version 151440 (0.0005) [2023-03-08 20:54:24,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 77545472. Throughput: 0: 12286.0. Samples: 77523608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:24,600][547459] Avg episode reward: [(0, '4626.251')] [2023-03-08 20:54:27,104][547799] Updated weights for policy 0, policy_version 151520 (0.0005) [2023-03-08 20:54:29,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 77606912. Throughput: 0: 12364.9. Samples: 77595280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:29,600][547459] Avg episode reward: [(0, '4690.097')] [2023-03-08 20:54:30,470][547799] Updated weights for policy 0, policy_version 151600 (0.0005) [2023-03-08 20:54:33,979][547799] Updated weights for policy 0, policy_version 151680 (0.0004) [2023-03-08 20:54:34,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 77664256. Throughput: 0: 12270.8. Samples: 77667516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:34,600][547459] Avg episode reward: [(0, '4690.395')] [2023-03-08 20:54:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000151696_77668352.pth... [2023-03-08 20:54:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000150976_77299712.pth [2023-03-08 20:54:37,576][547799] Updated weights for policy 0, policy_version 151760 (0.0004) [2023-03-08 20:54:39,599][547459] Fps is (10 sec: 11468.9, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 77721600. Throughput: 0: 12226.4. Samples: 77701112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:39,600][547459] Avg episode reward: [(0, '4405.042')] [2023-03-08 20:54:41,279][547799] Updated weights for policy 0, policy_version 151840 (0.0005) [2023-03-08 20:54:44,600][547459] Fps is (10 sec: 11468.7, 60 sec: 12151.4, 300 sec: 12093.6). Total num frames: 77778944. Throughput: 0: 11969.5. Samples: 77766656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:44,600][547459] Avg episode reward: [(0, '4531.625')] [2023-03-08 20:54:44,908][547799] Updated weights for policy 0, policy_version 151920 (0.0006) [2023-03-08 20:54:48,464][547799] Updated weights for policy 0, policy_version 152000 (0.0004) [2023-03-08 20:54:49,600][547459] Fps is (10 sec: 11468.6, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 77836288. Throughput: 0: 11923.9. Samples: 77836360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:49,600][547459] Avg episode reward: [(0, '4640.101')] [2023-03-08 20:54:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000152024_77836288.pth... [2023-03-08 20:54:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000151336_77484032.pth [2023-03-08 20:54:52,003][547799] Updated weights for policy 0, policy_version 152080 (0.0005) [2023-03-08 20:54:54,600][547459] Fps is (10 sec: 11468.9, 60 sec: 11946.6, 300 sec: 12065.8). Total num frames: 77893632. Throughput: 0: 11784.7. Samples: 77870984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:54,600][547459] Avg episode reward: [(0, '4450.400')] [2023-03-08 20:54:55,766][547799] Updated weights for policy 0, policy_version 152160 (0.0005) [2023-03-08 20:54:59,502][547799] Updated weights for policy 0, policy_version 152240 (0.0005) [2023-03-08 20:54:59,599][547459] Fps is (10 sec: 11059.3, 60 sec: 11810.1, 300 sec: 12038.1). Total num frames: 77946880. Throughput: 0: 11655.9. Samples: 77935148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:54:59,600][547459] Avg episode reward: [(0, '4782.960')] [2023-03-08 20:55:02,992][547799] Updated weights for policy 0, policy_version 152320 (0.0005) [2023-03-08 20:55:04,600][547459] Fps is (10 sec: 11059.2, 60 sec: 11741.9, 300 sec: 12038.1). Total num frames: 78004224. Throughput: 0: 11600.2. Samples: 78006348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:55:04,600][547459] Avg episode reward: [(0, '4515.868')] [2023-03-08 20:55:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000152352_78004224.pth... [2023-03-08 20:55:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000151696_77668352.pth [2023-03-08 20:55:06,442][547799] Updated weights for policy 0, policy_version 152400 (0.0005) [2023-03-08 20:55:09,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 12038.1). Total num frames: 78065664. Throughput: 0: 11505.9. Samples: 78041372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:55:09,600][547459] Avg episode reward: [(0, '4696.256')] [2023-03-08 20:55:09,858][547799] Updated weights for policy 0, policy_version 152480 (0.0005) [2023-03-08 20:55:13,268][547799] Updated weights for policy 0, policy_version 152560 (0.0005) [2023-03-08 20:55:14,599][547459] Fps is (10 sec: 11878.6, 60 sec: 11673.6, 300 sec: 12010.3). Total num frames: 78123008. Throughput: 0: 11532.0. Samples: 78114220. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:14,600][547459] Avg episode reward: [(0, '4743.988')] [2023-03-08 20:55:16,850][547799] Updated weights for policy 0, policy_version 152640 (0.0005) [2023-03-08 20:55:19,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 12010.3). Total num frames: 78184448. Throughput: 0: 11487.5. Samples: 78184456. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:19,600][547459] Avg episode reward: [(0, '4824.604')] [2023-03-08 20:55:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000152704_78184448.pth... [2023-03-08 20:55:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000152024_77836288.pth [2023-03-08 20:55:20,279][547799] Updated weights for policy 0, policy_version 152720 (0.0005) [2023-03-08 20:55:23,809][547799] Updated weights for policy 0, policy_version 152800 (0.0005) [2023-03-08 20:55:24,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11996.4). Total num frames: 78237696. Throughput: 0: 11486.7. Samples: 78218012. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:24,600][547459] Avg episode reward: [(0, '4817.410')] [2023-03-08 20:55:27,457][547799] Updated weights for policy 0, policy_version 152880 (0.0005) [2023-03-08 20:55:29,599][547459] Fps is (10 sec: 11059.3, 60 sec: 11468.8, 300 sec: 11982.5). Total num frames: 78295040. Throughput: 0: 11559.9. Samples: 78286848. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:29,600][547459] Avg episode reward: [(0, '4682.412')] [2023-03-08 20:55:31,105][547799] Updated weights for policy 0, policy_version 152960 (0.0005) [2023-03-08 20:55:34,578][547799] Updated weights for policy 0, policy_version 153040 (0.0005) [2023-03-08 20:55:34,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11982.5). Total num frames: 78356480. Throughput: 0: 11556.9. Samples: 78356420. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:34,600][547459] Avg episode reward: [(0, '4772.397')] [2023-03-08 20:55:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000153040_78356480.pth... [2023-03-08 20:55:34,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000152352_78004224.pth [2023-03-08 20:55:38,307][547799] Updated weights for policy 0, policy_version 153120 (0.0005) [2023-03-08 20:55:39,599][547459] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11954.8). Total num frames: 78409728. Throughput: 0: 11516.0. Samples: 78389204. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:39,600][547459] Avg episode reward: [(0, '4681.082')] [2023-03-08 20:55:41,823][547799] Updated weights for policy 0, policy_version 153200 (0.0005) [2023-03-08 20:55:44,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11954.8). Total num frames: 78471168. Throughput: 0: 11652.8. Samples: 78459524. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:44,600][547459] Avg episode reward: [(0, '4822.571')] [2023-03-08 20:55:45,211][547799] Updated weights for policy 0, policy_version 153280 (0.0005) [2023-03-08 20:55:48,813][547799] Updated weights for policy 0, policy_version 153360 (0.0004) [2023-03-08 20:55:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11537.1, 300 sec: 11954.8). Total num frames: 78528512. Throughput: 0: 11605.1. Samples: 78528576. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:49,600][547459] Avg episode reward: [(0, '4823.116')] [2023-03-08 20:55:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000153376_78528512.pth... [2023-03-08 20:55:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000152704_78184448.pth [2023-03-08 20:55:52,277][547799] Updated weights for policy 0, policy_version 153440 (0.0005) [2023-03-08 20:55:54,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11927.0). Total num frames: 78585856. Throughput: 0: 11642.1. Samples: 78565264. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:54,600][547459] Avg episode reward: [(0, '4765.799')] [2023-03-08 20:55:55,767][547799] Updated weights for policy 0, policy_version 153520 (0.0005) [2023-03-08 20:55:59,085][547799] Updated weights for policy 0, policy_version 153600 (0.0005) [2023-03-08 20:55:59,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11927.0). Total num frames: 78647296. Throughput: 0: 11606.7. Samples: 78636524. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:55:59,600][547459] Avg episode reward: [(0, '4680.268')] [2023-03-08 20:56:02,473][547799] Updated weights for policy 0, policy_version 153680 (0.0005) [2023-03-08 20:56:04,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11913.1). Total num frames: 78704640. Throughput: 0: 11625.5. Samples: 78707604. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:56:04,600][547459] Avg episode reward: [(0, '4592.538')] [2023-03-08 20:56:04,638][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000153728_78708736.pth... [2023-03-08 20:56:04,640][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000153040_78356480.pth [2023-03-08 20:56:05,976][547799] Updated weights for policy 0, policy_version 153760 (0.0005) [2023-03-08 20:56:09,274][547799] Updated weights for policy 0, policy_version 153840 (0.0005) [2023-03-08 20:56:09,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11913.1). Total num frames: 78766080. Throughput: 0: 11721.9. Samples: 78745496. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:56:09,600][547459] Avg episode reward: [(0, '4790.899')] [2023-03-08 20:56:12,624][547799] Updated weights for policy 0, policy_version 153920 (0.0005) [2023-03-08 20:56:14,600][547459] Fps is (10 sec: 12697.5, 60 sec: 11810.1, 300 sec: 11927.0). Total num frames: 78831616. Throughput: 0: 11833.1. Samples: 78819340. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:56:14,600][547459] Avg episode reward: [(0, '4773.440')] [2023-03-08 20:56:15,802][547799] Updated weights for policy 0, policy_version 154000 (0.0004) [2023-03-08 20:56:19,106][547799] Updated weights for policy 0, policy_version 154080 (0.0005) [2023-03-08 20:56:19,599][547459] Fps is (10 sec: 12697.6, 60 sec: 11810.2, 300 sec: 11940.9). Total num frames: 78893056. Throughput: 0: 11939.8. Samples: 78893708. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:56:19,600][547459] Avg episode reward: [(0, '4774.550')] [2023-03-08 20:56:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000154088_78893056.pth... [2023-03-08 20:56:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000153376_78528512.pth [2023-03-08 20:56:22,417][547799] Updated weights for policy 0, policy_version 154160 (0.0005) [2023-03-08 20:56:24,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 78954496. Throughput: 0: 12043.1. Samples: 78931144. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:56:24,600][547459] Avg episode reward: [(0, '4825.872')] [2023-03-08 20:56:25,960][547799] Updated weights for policy 0, policy_version 154240 (0.0005) [2023-03-08 20:56:29,222][547799] Updated weights for policy 0, policy_version 154320 (0.0005) [2023-03-08 20:56:29,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 79015936. Throughput: 0: 12089.3. Samples: 79003544. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:56:29,600][547459] Avg episode reward: [(0, '4827.262')] [2023-03-08 20:56:32,548][547799] Updated weights for policy 0, policy_version 154400 (0.0005) [2023-03-08 20:56:34,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 79077376. Throughput: 0: 12192.3. Samples: 79077232. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:56:34,600][547459] Avg episode reward: [(0, '4810.330')] [2023-03-08 20:56:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000154448_79077376.pth... [2023-03-08 20:56:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000153728_78708736.pth [2023-03-08 20:56:35,927][547799] Updated weights for policy 0, policy_version 154480 (0.0005) [2023-03-08 20:56:39,389][547799] Updated weights for policy 0, policy_version 154560 (0.0005) [2023-03-08 20:56:39,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 79134720. Throughput: 0: 12147.9. Samples: 79111920. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:56:39,600][547459] Avg episode reward: [(0, '4715.290')] [2023-03-08 20:56:42,788][547799] Updated weights for policy 0, policy_version 154640 (0.0005) [2023-03-08 20:56:44,600][547459] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 79196160. Throughput: 0: 12199.0. Samples: 79185480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:56:44,600][547459] Avg episode reward: [(0, '4670.963')] [2023-03-08 20:56:46,165][547799] Updated weights for policy 0, policy_version 154720 (0.0004) [2023-03-08 20:56:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 79253504. Throughput: 0: 12140.8. Samples: 79253940. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:56:49,600][547459] Avg episode reward: [(0, '4422.838')] [2023-03-08 20:56:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000154792_79253504.pth... [2023-03-08 20:56:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000154088_78893056.pth [2023-03-08 20:56:49,854][547799] Updated weights for policy 0, policy_version 154800 (0.0005) [2023-03-08 20:56:53,433][547799] Updated weights for policy 0, policy_version 154880 (0.0005) [2023-03-08 20:56:54,600][547459] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 79310848. Throughput: 0: 12091.4. Samples: 79289608. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:56:54,600][547459] Avg episode reward: [(0, '4478.283')] [2023-03-08 20:56:56,889][547799] Updated weights for policy 0, policy_version 154960 (0.0004) [2023-03-08 20:56:59,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 79372288. Throughput: 0: 12014.7. Samples: 79360000. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:56:59,600][547459] Avg episode reward: [(0, '4467.760')] [2023-03-08 20:57:00,068][547799] Updated weights for policy 0, policy_version 155040 (0.0005) [2023-03-08 20:57:03,429][547799] Updated weights for policy 0, policy_version 155120 (0.0005) [2023-03-08 20:57:04,599][547459] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 79433728. Throughput: 0: 12002.0. Samples: 79433800. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:04,600][547459] Avg episode reward: [(0, '4455.681')] [2023-03-08 20:57:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000155144_79433728.pth... [2023-03-08 20:57:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000154448_79077376.pth [2023-03-08 20:57:06,998][547799] Updated weights for policy 0, policy_version 155200 (0.0006) [2023-03-08 20:57:09,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 79491072. Throughput: 0: 11940.7. Samples: 79468476. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:09,600][547459] Avg episode reward: [(0, '4560.304')] [2023-03-08 20:57:10,499][547799] Updated weights for policy 0, policy_version 155280 (0.0005) [2023-03-08 20:57:14,200][547799] Updated weights for policy 0, policy_version 155360 (0.0004) [2023-03-08 20:57:14,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 79548416. Throughput: 0: 11843.9. Samples: 79536520. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:14,600][547459] Avg episode reward: [(0, '4421.792')] [2023-03-08 20:57:17,544][547799] Updated weights for policy 0, policy_version 155440 (0.0005) [2023-03-08 20:57:19,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 79605760. Throughput: 0: 11785.3. Samples: 79607568. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:19,600][547459] Avg episode reward: [(0, '4383.261')] [2023-03-08 20:57:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000155480_79605760.pth... [2023-03-08 20:57:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000154792_79253504.pth [2023-03-08 20:57:21,187][547799] Updated weights for policy 0, policy_version 155520 (0.0005) [2023-03-08 20:57:24,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 79663104. Throughput: 0: 11783.0. Samples: 79642156. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:24,600][547459] Avg episode reward: [(0, '4337.951')] [2023-03-08 20:57:24,764][547799] Updated weights for policy 0, policy_version 155600 (0.0005) [2023-03-08 20:57:28,588][547799] Updated weights for policy 0, policy_version 155680 (0.0004) [2023-03-08 20:57:29,600][547459] Fps is (10 sec: 11059.2, 60 sec: 11673.6, 300 sec: 11927.0). Total num frames: 79716352. Throughput: 0: 11614.3. Samples: 79708124. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:29,600][547459] Avg episode reward: [(0, '4208.554')] [2023-03-08 20:57:32,508][547799] Updated weights for policy 0, policy_version 155760 (0.0004) [2023-03-08 20:57:34,599][547459] Fps is (10 sec: 10649.6, 60 sec: 11537.1, 300 sec: 11899.2). Total num frames: 79769600. Throughput: 0: 11523.1. Samples: 79772480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:34,600][547459] Avg episode reward: [(0, '4480.892')] [2023-03-08 20:57:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000155800_79769600.pth... [2023-03-08 20:57:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000155144_79433728.pth [2023-03-08 20:57:36,123][547799] Updated weights for policy 0, policy_version 155840 (0.0004) [2023-03-08 20:57:39,319][547799] Updated weights for policy 0, policy_version 155920 (0.0005) [2023-03-08 20:57:39,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11605.4, 300 sec: 11885.3). Total num frames: 79831040. Throughput: 0: 11571.5. Samples: 79810324. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:39,600][547459] Avg episode reward: [(0, '4488.887')] [2023-03-08 20:57:42,721][547799] Updated weights for policy 0, policy_version 156000 (0.0005) [2023-03-08 20:57:44,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11605.3, 300 sec: 11885.3). Total num frames: 79892480. Throughput: 0: 11611.9. Samples: 79882536. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:44,600][547459] Avg episode reward: [(0, '4539.242')] [2023-03-08 20:57:46,174][547799] Updated weights for policy 0, policy_version 156080 (0.0004) [2023-03-08 20:57:49,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11871.5). Total num frames: 79949824. Throughput: 0: 11476.7. Samples: 79950252. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:49,600][547459] Avg episode reward: [(0, '4636.530')] [2023-03-08 20:57:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000156152_79949824.pth... [2023-03-08 20:57:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000155480_79605760.pth [2023-03-08 20:57:49,829][547799] Updated weights for policy 0, policy_version 156160 (0.0004) [2023-03-08 20:57:53,265][547799] Updated weights for policy 0, policy_version 156240 (0.0004) [2023-03-08 20:57:54,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11857.6). Total num frames: 80007168. Throughput: 0: 11509.0. Samples: 79986380. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:54,600][547459] Avg episode reward: [(0, '4733.309')] [2023-03-08 20:57:56,771][547799] Updated weights for policy 0, policy_version 156320 (0.0005) [2023-03-08 20:57:59,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11843.7). Total num frames: 80064512. Throughput: 0: 11551.1. Samples: 80056320. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:57:59,600][547459] Avg episode reward: [(0, '4637.706')] [2023-03-08 20:58:00,357][547799] Updated weights for policy 0, policy_version 156400 (0.0005) [2023-03-08 20:58:03,759][547799] Updated weights for policy 0, policy_version 156480 (0.0005) [2023-03-08 20:58:04,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11537.0, 300 sec: 11843.7). Total num frames: 80125952. Throughput: 0: 11521.2. Samples: 80126024. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:58:04,600][547459] Avg episode reward: [(0, '4596.535')] [2023-03-08 20:58:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000156496_80125952.pth... [2023-03-08 20:58:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000155800_79769600.pth [2023-03-08 20:58:07,325][547799] Updated weights for policy 0, policy_version 156560 (0.0005) [2023-03-08 20:58:09,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11829.8). Total num frames: 80183296. Throughput: 0: 11541.2. Samples: 80161512. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:58:09,600][547459] Avg episode reward: [(0, '4681.861')] [2023-03-08 20:58:10,730][547799] Updated weights for policy 0, policy_version 156640 (0.0006) [2023-03-08 20:58:14,198][547799] Updated weights for policy 0, policy_version 156720 (0.0005) [2023-03-08 20:58:14,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 80244736. Throughput: 0: 11661.0. Samples: 80232868. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:58:14,600][547459] Avg episode reward: [(0, '4512.123')] [2023-03-08 20:58:17,703][547799] Updated weights for policy 0, policy_version 156800 (0.0005) [2023-03-08 20:58:19,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 80302080. Throughput: 0: 11806.1. Samples: 80303756. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 20:58:19,600][547459] Avg episode reward: [(0, '4592.893')] [2023-03-08 20:58:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000156840_80302080.pth... [2023-03-08 20:58:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000156152_79949824.pth [2023-03-08 20:58:21,084][547799] Updated weights for policy 0, policy_version 156880 (0.0005) [2023-03-08 20:58:24,594][547799] Updated weights for policy 0, policy_version 156960 (0.0005) [2023-03-08 20:58:24,599][547459] Fps is (10 sec: 11878.6, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 80363520. Throughput: 0: 11747.1. Samples: 80338944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:58:24,600][547459] Avg episode reward: [(0, '4734.557')] [2023-03-08 20:58:27,856][547799] Updated weights for policy 0, policy_version 157040 (0.0005) [2023-03-08 20:58:29,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 80420864. Throughput: 0: 11777.1. Samples: 80412504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:58:29,600][547459] Avg episode reward: [(0, '4831.036')] [2023-03-08 20:58:31,310][547799] Updated weights for policy 0, policy_version 157120 (0.0004) [2023-03-08 20:58:34,600][547459] Fps is (10 sec: 11878.2, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 80482304. Throughput: 0: 11836.2. Samples: 80482884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:58:34,600][547459] Avg episode reward: [(0, '4786.197')] [2023-03-08 20:58:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000157192_80482304.pth... [2023-03-08 20:58:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000156496_80125952.pth [2023-03-08 20:58:34,845][547799] Updated weights for policy 0, policy_version 157200 (0.0005) [2023-03-08 20:58:38,363][547799] Updated weights for policy 0, policy_version 157280 (0.0005) [2023-03-08 20:58:39,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11829.8). Total num frames: 80539648. Throughput: 0: 11834.7. Samples: 80518944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:58:39,600][547459] Avg episode reward: [(0, '4825.281')] [2023-03-08 20:58:41,882][547799] Updated weights for policy 0, policy_version 157360 (0.0005) [2023-03-08 20:58:44,599][547459] Fps is (10 sec: 11469.0, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 80596992. Throughput: 0: 11823.4. Samples: 80588372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:58:44,600][547459] Avg episode reward: [(0, '4644.081')] [2023-03-08 20:58:45,437][547799] Updated weights for policy 0, policy_version 157440 (0.0005) [2023-03-08 20:58:48,803][547799] Updated weights for policy 0, policy_version 157520 (0.0004) [2023-03-08 20:58:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 80658432. Throughput: 0: 11900.5. Samples: 80661548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:58:49,600][547459] Avg episode reward: [(0, '4479.749')] [2023-03-08 20:58:49,622][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000157544_80662528.pth... [2023-03-08 20:58:49,624][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000156840_80302080.pth [2023-03-08 20:58:52,004][547799] Updated weights for policy 0, policy_version 157600 (0.0005) [2023-03-08 20:58:54,599][547459] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 80719872. Throughput: 0: 11897.7. Samples: 80696908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:58:54,600][547459] Avg episode reward: [(0, '4829.294')] [2023-03-08 20:58:55,308][547799] Updated weights for policy 0, policy_version 157680 (0.0005) [2023-03-08 20:58:58,673][547799] Updated weights for policy 0, policy_version 157760 (0.0005) [2023-03-08 20:58:59,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 80781312. Throughput: 0: 11983.5. Samples: 80772124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:58:59,600][547459] Avg episode reward: [(0, '4785.863')] [2023-03-08 20:59:02,264][547799] Updated weights for policy 0, policy_version 157840 (0.0005) [2023-03-08 20:59:04,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 80838656. Throughput: 0: 11924.8. Samples: 80840372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:59:04,600][547459] Avg episode reward: [(0, '4822.978')] [2023-03-08 20:59:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000157888_80838656.pth... [2023-03-08 20:59:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000157192_80482304.pth [2023-03-08 20:59:05,709][547799] Updated weights for policy 0, policy_version 157920 (0.0005) [2023-03-08 20:59:09,150][547799] Updated weights for policy 0, policy_version 158000 (0.0005) [2023-03-08 20:59:09,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11788.1). Total num frames: 80900096. Throughput: 0: 11955.2. Samples: 80876928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:59:09,600][547459] Avg episode reward: [(0, '4826.347')] [2023-03-08 20:59:12,556][547799] Updated weights for policy 0, policy_version 158080 (0.0005) [2023-03-08 20:59:14,600][547459] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11788.1). Total num frames: 80961536. Throughput: 0: 11925.5. Samples: 80949152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:59:14,600][547459] Avg episode reward: [(0, '4827.784')] [2023-03-08 20:59:15,982][547799] Updated weights for policy 0, policy_version 158160 (0.0005) [2023-03-08 20:59:19,376][547799] Updated weights for policy 0, policy_version 158240 (0.0005) [2023-03-08 20:59:19,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 81018880. Throughput: 0: 11951.3. Samples: 81020692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:59:19,600][547459] Avg episode reward: [(0, '4830.540')] [2023-03-08 20:59:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000158240_81018880.pth... [2023-03-08 20:59:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000157544_80662528.pth [2023-03-08 20:59:22,633][547799] Updated weights for policy 0, policy_version 158320 (0.0004) [2023-03-08 20:59:24,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11774.3). Total num frames: 81080320. Throughput: 0: 12008.4. Samples: 81059324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 20:59:24,600][547459] Avg episode reward: [(0, '4826.813')] [2023-03-08 20:59:26,206][547799] Updated weights for policy 0, policy_version 158400 (0.0004) [2023-03-08 20:59:29,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 81137664. Throughput: 0: 12021.5. Samples: 81129340. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:59:29,600][547459] Avg episode reward: [(0, '4824.112')] [2023-03-08 20:59:29,634][547799] Updated weights for policy 0, policy_version 158480 (0.0005) [2023-03-08 20:59:33,084][547799] Updated weights for policy 0, policy_version 158560 (0.0005) [2023-03-08 20:59:34,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11788.1). Total num frames: 81199104. Throughput: 0: 12004.4. Samples: 81201744. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:59:34,600][547459] Avg episode reward: [(0, '4823.927')] [2023-03-08 20:59:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000158592_81199104.pth... [2023-03-08 20:59:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000157888_80838656.pth [2023-03-08 20:59:36,505][547799] Updated weights for policy 0, policy_version 158640 (0.0005) [2023-03-08 20:59:39,599][547459] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 81260544. Throughput: 0: 12012.0. Samples: 81237448. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:59:39,600][547459] Avg episode reward: [(0, '4775.733')] [2023-03-08 20:59:39,916][547799] Updated weights for policy 0, policy_version 158720 (0.0005) [2023-03-08 20:59:43,267][547799] Updated weights for policy 0, policy_version 158800 (0.0005) [2023-03-08 20:59:44,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 81317888. Throughput: 0: 11937.7. Samples: 81309320. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:59:44,600][547459] Avg episode reward: [(0, '4823.836')] [2023-03-08 20:59:46,628][547799] Updated weights for policy 0, policy_version 158880 (0.0005) [2023-03-08 20:59:49,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11815.9). Total num frames: 81379328. Throughput: 0: 12060.6. Samples: 81383096. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:59:49,600][547459] Avg episode reward: [(0, '4822.704')] [2023-03-08 20:59:49,630][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000158952_81383424.pth... [2023-03-08 20:59:49,632][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000158240_81018880.pth [2023-03-08 20:59:49,966][547799] Updated weights for policy 0, policy_version 158960 (0.0005) [2023-03-08 20:59:53,098][547799] Updated weights for policy 0, policy_version 159040 (0.0004) [2023-03-08 20:59:54,599][547459] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 11857.6). Total num frames: 81444864. Throughput: 0: 12102.4. Samples: 81421536. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:59:54,600][547459] Avg episode reward: [(0, '4792.357')] [2023-03-08 20:59:56,418][547799] Updated weights for policy 0, policy_version 159120 (0.0005) [2023-03-08 20:59:59,600][547459] Fps is (10 sec: 12697.4, 60 sec: 12083.2, 300 sec: 11871.5). Total num frames: 81506304. Throughput: 0: 12140.4. Samples: 81495472. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 20:59:59,600][547459] Avg episode reward: [(0, '4782.927')] [2023-03-08 20:59:59,758][547799] Updated weights for policy 0, policy_version 159200 (0.0005) [2023-03-08 21:00:03,348][547799] Updated weights for policy 0, policy_version 159280 (0.0005) [2023-03-08 21:00:04,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11857.6). Total num frames: 81563648. Throughput: 0: 12085.3. Samples: 81564532. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:00:04,600][547459] Avg episode reward: [(0, '4728.452')] [2023-03-08 21:00:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000159304_81563648.pth... [2023-03-08 21:00:04,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000158592_81199104.pth [2023-03-08 21:00:06,740][547799] Updated weights for policy 0, policy_version 159360 (0.0005) [2023-03-08 21:00:09,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11871.5). Total num frames: 81625088. Throughput: 0: 12039.0. Samples: 81601080. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:00:09,600][547459] Avg episode reward: [(0, '4651.234')] [2023-03-08 21:00:10,228][547799] Updated weights for policy 0, policy_version 159440 (0.0005) [2023-03-08 21:00:13,644][547799] Updated weights for policy 0, policy_version 159520 (0.0004) [2023-03-08 21:00:14,599][547459] Fps is (10 sec: 11878.6, 60 sec: 12014.9, 300 sec: 11857.6). Total num frames: 81682432. Throughput: 0: 12062.3. Samples: 81672144. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:00:14,600][547459] Avg episode reward: [(0, '4436.858')] [2023-03-08 21:00:17,079][547799] Updated weights for policy 0, policy_version 159600 (0.0005) [2023-03-08 21:00:19,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11885.3). Total num frames: 81743872. Throughput: 0: 12118.4. Samples: 81747072. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:00:19,600][547459] Avg episode reward: [(0, '4736.019')] [2023-03-08 21:00:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000159656_81743872.pth... [2023-03-08 21:00:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000158952_81383424.pth [2023-03-08 21:00:20,320][547799] Updated weights for policy 0, policy_version 159680 (0.0005) [2023-03-08 21:00:23,863][547799] Updated weights for policy 0, policy_version 159760 (0.0005) [2023-03-08 21:00:24,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11899.2). Total num frames: 81805312. Throughput: 0: 12109.4. Samples: 81782372. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:00:24,600][547459] Avg episode reward: [(0, '4484.798')] [2023-03-08 21:00:27,352][547799] Updated weights for policy 0, policy_version 159840 (0.0005) [2023-03-08 21:00:29,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11885.3). Total num frames: 81862656. Throughput: 0: 12049.5. Samples: 81851548. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:00:29,600][547459] Avg episode reward: [(0, '4441.338')] [2023-03-08 21:00:30,755][547799] Updated weights for policy 0, policy_version 159920 (0.0005) [2023-03-08 21:00:34,123][547799] Updated weights for policy 0, policy_version 160000 (0.0005) [2023-03-08 21:00:34,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11913.1). Total num frames: 81924096. Throughput: 0: 12023.6. Samples: 81924160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:00:34,600][547459] Avg episode reward: [(0, '4788.126')] [2023-03-08 21:00:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000160008_81924096.pth... [2023-03-08 21:00:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000159304_81563648.pth [2023-03-08 21:00:37,558][547799] Updated weights for policy 0, policy_version 160080 (0.0005) [2023-03-08 21:00:39,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11913.1). Total num frames: 81985536. Throughput: 0: 11987.2. Samples: 81960960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:00:39,600][547459] Avg episode reward: [(0, '4658.394')] [2023-03-08 21:00:41,070][547799] Updated weights for policy 0, policy_version 160160 (0.0004) [2023-03-08 21:00:44,600][547459] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 11899.2). Total num frames: 82038784. Throughput: 0: 11819.5. Samples: 82027348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:00:44,600][547459] Avg episode reward: [(0, '4407.973')] [2023-03-08 21:00:44,948][547799] Updated weights for policy 0, policy_version 160240 (0.0006) [2023-03-08 21:00:48,624][547799] Updated weights for policy 0, policy_version 160320 (0.0006) [2023-03-08 21:00:49,600][547459] Fps is (10 sec: 10649.7, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 82092032. Throughput: 0: 11752.0. Samples: 82093372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:00:49,600][547459] Avg episode reward: [(0, '4702.571')] [2023-03-08 21:00:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000160336_82092032.pth... [2023-03-08 21:00:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000159656_81743872.pth [2023-03-08 21:00:52,167][547799] Updated weights for policy 0, policy_version 160400 (0.0005) [2023-03-08 21:00:54,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 82153472. Throughput: 0: 11739.7. Samples: 82129364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:00:54,600][547459] Avg episode reward: [(0, '4738.845')] [2023-03-08 21:00:55,334][547799] Updated weights for policy 0, policy_version 160480 (0.0005) [2023-03-08 21:00:58,695][547799] Updated weights for policy 0, policy_version 160560 (0.0004) [2023-03-08 21:00:59,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 82214912. Throughput: 0: 11827.9. Samples: 82204400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:00:59,600][547459] Avg episode reward: [(0, '4780.933')] [2023-03-08 21:01:02,111][547799] Updated weights for policy 0, policy_version 160640 (0.0005) [2023-03-08 21:01:04,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 82276352. Throughput: 0: 11760.6. Samples: 82276300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:01:04,600][547459] Avg episode reward: [(0, '4790.086')] [2023-03-08 21:01:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000160696_82276352.pth... [2023-03-08 21:01:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000160008_81924096.pth [2023-03-08 21:01:05,625][547799] Updated weights for policy 0, policy_version 160720 (0.0004) [2023-03-08 21:01:09,073][547799] Updated weights for policy 0, policy_version 160800 (0.0005) [2023-03-08 21:01:09,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 82333696. Throughput: 0: 11754.2. Samples: 82311312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:01:09,600][547459] Avg episode reward: [(0, '4681.282')] [2023-03-08 21:01:12,470][547799] Updated weights for policy 0, policy_version 160880 (0.0005) [2023-03-08 21:01:14,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 82395136. Throughput: 0: 11808.1. Samples: 82382912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:01:14,600][547459] Avg episode reward: [(0, '4595.812')] [2023-03-08 21:01:16,030][547799] Updated weights for policy 0, policy_version 160960 (0.0005) [2023-03-08 21:01:18,947][547799] Updated weights for policy 0, policy_version 161040 (0.0005) [2023-03-08 21:01:19,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 82456576. Throughput: 0: 11914.8. Samples: 82460328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:01:19,600][547459] Avg episode reward: [(0, '4731.227')] [2023-03-08 21:01:19,652][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000161056_82460672.pth... [2023-03-08 21:01:19,654][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000160336_82092032.pth [2023-03-08 21:01:22,272][547799] Updated weights for policy 0, policy_version 161120 (0.0005) [2023-03-08 21:01:24,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 82518016. Throughput: 0: 11918.3. Samples: 82497280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:01:24,600][547459] Avg episode reward: [(0, '4831.392')] [2023-03-08 21:01:25,717][547799] Updated weights for policy 0, policy_version 161200 (0.0005) [2023-03-08 21:01:29,050][547799] Updated weights for policy 0, policy_version 161280 (0.0005) [2023-03-08 21:01:29,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 82579456. Throughput: 0: 12033.9. Samples: 82568872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:01:29,600][547459] Avg episode reward: [(0, '4827.286')] [2023-03-08 21:01:32,596][547799] Updated weights for policy 0, policy_version 161360 (0.0005) [2023-03-08 21:01:34,600][547459] Fps is (10 sec: 11878.2, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 82636800. Throughput: 0: 12105.1. Samples: 82638104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:01:34,600][547459] Avg episode reward: [(0, '4787.532')] [2023-03-08 21:01:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000161400_82636800.pth... [2023-03-08 21:01:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000160696_82276352.pth [2023-03-08 21:01:36,069][547799] Updated weights for policy 0, policy_version 161440 (0.0004) [2023-03-08 21:01:39,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 82694144. Throughput: 0: 12108.4. Samples: 82674244. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:01:39,600][547459] Avg episode reward: [(0, '4829.692')] [2023-03-08 21:01:39,673][547799] Updated weights for policy 0, policy_version 161520 (0.0004) [2023-03-08 21:01:43,261][547799] Updated weights for policy 0, policy_version 161600 (0.0005) [2023-03-08 21:01:44,599][547459] Fps is (10 sec: 11469.0, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 82751488. Throughput: 0: 11968.3. Samples: 82742972. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:01:44,600][547459] Avg episode reward: [(0, '4783.567')] [2023-03-08 21:01:47,033][547799] Updated weights for policy 0, policy_version 161680 (0.0004) [2023-03-08 21:01:49,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11857.6). Total num frames: 82808832. Throughput: 0: 11886.3. Samples: 82811184. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:01:49,600][547459] Avg episode reward: [(0, '4788.026')] [2023-03-08 21:01:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000161736_82808832.pth... [2023-03-08 21:01:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000161056_82460672.pth [2023-03-08 21:01:50,469][547799] Updated weights for policy 0, policy_version 161760 (0.0005) [2023-03-08 21:01:53,981][547799] Updated weights for policy 0, policy_version 161840 (0.0005) [2023-03-08 21:01:54,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 82866176. Throughput: 0: 11874.2. Samples: 82845652. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:01:54,600][547459] Avg episode reward: [(0, '4791.239')] [2023-03-08 21:01:57,450][547799] Updated weights for policy 0, policy_version 161920 (0.0006) [2023-03-08 21:01:59,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 82927616. Throughput: 0: 11831.7. Samples: 82915340. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:01:59,600][547459] Avg episode reward: [(0, '4834.336')] [2023-03-08 21:01:59,601][547754] Saving new best policy, reward=4834.336! [2023-03-08 21:02:00,818][547799] Updated weights for policy 0, policy_version 162000 (0.0005) [2023-03-08 21:02:04,245][547799] Updated weights for policy 0, policy_version 162080 (0.0006) [2023-03-08 21:02:04,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 82989056. Throughput: 0: 11737.5. Samples: 82988516. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:02:04,600][547459] Avg episode reward: [(0, '4835.235')] [2023-03-08 21:02:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000162088_82989056.pth... [2023-03-08 21:02:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000161400_82636800.pth [2023-03-08 21:02:04,606][547754] Saving new best policy, reward=4835.235! [2023-03-08 21:02:07,666][547799] Updated weights for policy 0, policy_version 162160 (0.0005) [2023-03-08 21:02:09,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 83046400. Throughput: 0: 11720.8. Samples: 83024716. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:02:09,600][547459] Avg episode reward: [(0, '4830.936')] [2023-03-08 21:02:11,007][547799] Updated weights for policy 0, policy_version 162240 (0.0005) [2023-03-08 21:02:14,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 83103744. Throughput: 0: 11725.4. Samples: 83096512. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:02:14,600][547459] Avg episode reward: [(0, '4783.329')] [2023-03-08 21:02:14,620][547799] Updated weights for policy 0, policy_version 162320 (0.0005) [2023-03-08 21:02:18,206][547799] Updated weights for policy 0, policy_version 162400 (0.0005) [2023-03-08 21:02:19,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 83161088. Throughput: 0: 11710.4. Samples: 83165072. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:02:19,600][547459] Avg episode reward: [(0, '4785.414')] [2023-03-08 21:02:19,606][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000162432_83165184.pth... [2023-03-08 21:02:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000161736_82808832.pth [2023-03-08 21:02:21,709][547799] Updated weights for policy 0, policy_version 162480 (0.0005) [2023-03-08 21:02:24,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11885.3). Total num frames: 83222528. Throughput: 0: 11686.2. Samples: 83200124. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:02:24,600][547459] Avg episode reward: [(0, '4775.952')] [2023-03-08 21:02:25,055][547799] Updated weights for policy 0, policy_version 162560 (0.0005) [2023-03-08 21:02:28,519][547799] Updated weights for policy 0, policy_version 162640 (0.0005) [2023-03-08 21:02:29,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11741.9, 300 sec: 11913.1). Total num frames: 83283968. Throughput: 0: 11750.5. Samples: 83271744. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:02:29,600][547459] Avg episode reward: [(0, '4735.403')] [2023-03-08 21:02:31,786][547799] Updated weights for policy 0, policy_version 162720 (0.0005) [2023-03-08 21:02:34,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 83345408. Throughput: 0: 11871.8. Samples: 83345416. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:02:34,600][547459] Avg episode reward: [(0, '4457.267')] [2023-03-08 21:02:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000162784_83345408.pth... [2023-03-08 21:02:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000162088_82989056.pth [2023-03-08 21:02:35,252][547799] Updated weights for policy 0, policy_version 162800 (0.0005) [2023-03-08 21:02:38,623][547799] Updated weights for policy 0, policy_version 162880 (0.0005) [2023-03-08 21:02:39,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 83402752. Throughput: 0: 11893.9. Samples: 83380876. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:02:39,600][547459] Avg episode reward: [(0, '4592.719')] [2023-03-08 21:02:42,119][547799] Updated weights for policy 0, policy_version 162960 (0.0005) [2023-03-08 21:02:44,600][547459] Fps is (10 sec: 12288.1, 60 sec: 11946.6, 300 sec: 11927.0). Total num frames: 83468288. Throughput: 0: 12014.8. Samples: 83456008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:02:44,600][547459] Avg episode reward: [(0, '4836.390')] [2023-03-08 21:02:44,601][547754] Saving new best policy, reward=4836.390! [2023-03-08 21:02:45,113][547799] Updated weights for policy 0, policy_version 163040 (0.0005) [2023-03-08 21:02:48,384][547799] Updated weights for policy 0, policy_version 163120 (0.0004) [2023-03-08 21:02:49,600][547459] Fps is (10 sec: 13107.2, 60 sec: 12083.2, 300 sec: 11954.8). Total num frames: 83533824. Throughput: 0: 12118.2. Samples: 83533836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:02:49,600][547459] Avg episode reward: [(0, '4832.069')] [2023-03-08 21:02:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000163152_83533824.pth... [2023-03-08 21:02:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000162432_83165184.pth [2023-03-08 21:02:51,697][547799] Updated weights for policy 0, policy_version 163200 (0.0005) [2023-03-08 21:02:54,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11954.8). Total num frames: 83591168. Throughput: 0: 12080.0. Samples: 83568316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:02:54,600][547459] Avg episode reward: [(0, '4826.863')] [2023-03-08 21:02:55,132][547799] Updated weights for policy 0, policy_version 163280 (0.0005) [2023-03-08 21:02:58,467][547799] Updated weights for policy 0, policy_version 163360 (0.0005) [2023-03-08 21:02:59,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11954.8). Total num frames: 83652608. Throughput: 0: 12091.2. Samples: 83640620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:02:59,600][547459] Avg episode reward: [(0, '4823.789')] [2023-03-08 21:03:01,768][547799] Updated weights for policy 0, policy_version 163440 (0.0004) [2023-03-08 21:03:04,599][547459] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 83714048. Throughput: 0: 12286.5. Samples: 83717964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:04,600][547459] Avg episode reward: [(0, '4812.367')] [2023-03-08 21:03:04,619][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000163512_83718144.pth... [2023-03-08 21:03:04,621][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000162784_83345408.pth [2023-03-08 21:03:04,960][547799] Updated weights for policy 0, policy_version 163520 (0.0005) [2023-03-08 21:03:08,181][547799] Updated weights for policy 0, policy_version 163600 (0.0005) [2023-03-08 21:03:09,600][547459] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 83775488. Throughput: 0: 12330.7. Samples: 83755008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:09,600][547459] Avg episode reward: [(0, '4832.685')] [2023-03-08 21:03:11,610][547799] Updated weights for policy 0, policy_version 163680 (0.0005) [2023-03-08 21:03:14,599][547459] Fps is (10 sec: 12697.6, 60 sec: 12288.0, 300 sec: 11996.4). Total num frames: 83841024. Throughput: 0: 12379.0. Samples: 83828800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:14,600][547459] Avg episode reward: [(0, '4836.534')] [2023-03-08 21:03:14,601][547754] Saving new best policy, reward=4836.534! [2023-03-08 21:03:14,804][547799] Updated weights for policy 0, policy_version 163760 (0.0004) [2023-03-08 21:03:18,201][547799] Updated weights for policy 0, policy_version 163840 (0.0005) [2023-03-08 21:03:19,599][547459] Fps is (10 sec: 12697.7, 60 sec: 12356.3, 300 sec: 11996.4). Total num frames: 83902464. Throughput: 0: 12372.4. Samples: 83902172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:19,600][547459] Avg episode reward: [(0, '4739.004')] [2023-03-08 21:03:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000163872_83902464.pth... [2023-03-08 21:03:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000163152_83533824.pth [2023-03-08 21:03:21,627][547799] Updated weights for policy 0, policy_version 163920 (0.0005) [2023-03-08 21:03:24,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12356.3, 300 sec: 12010.3). Total num frames: 83963904. Throughput: 0: 12395.2. Samples: 83938660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:24,600][547459] Avg episode reward: [(0, '4833.654')] [2023-03-08 21:03:24,854][547799] Updated weights for policy 0, policy_version 164000 (0.0005) [2023-03-08 21:03:28,253][547799] Updated weights for policy 0, policy_version 164080 (0.0005) [2023-03-08 21:03:29,600][547459] Fps is (10 sec: 12287.8, 60 sec: 12356.2, 300 sec: 12010.3). Total num frames: 84025344. Throughput: 0: 12372.3. Samples: 84012760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:29,600][547459] Avg episode reward: [(0, '4835.509')] [2023-03-08 21:03:31,471][547799] Updated weights for policy 0, policy_version 164160 (0.0005) [2023-03-08 21:03:34,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12356.3, 300 sec: 12024.2). Total num frames: 84086784. Throughput: 0: 12287.0. Samples: 84086752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:34,600][547459] Avg episode reward: [(0, '4833.943')] [2023-03-08 21:03:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000164232_84086784.pth... [2023-03-08 21:03:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000163512_83718144.pth [2023-03-08 21:03:34,913][547799] Updated weights for policy 0, policy_version 164240 (0.0005) [2023-03-08 21:03:38,227][547799] Updated weights for policy 0, policy_version 164320 (0.0004) [2023-03-08 21:03:39,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12424.5, 300 sec: 12038.1). Total num frames: 84148224. Throughput: 0: 12338.2. Samples: 84123536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:39,600][547459] Avg episode reward: [(0, '4835.148')] [2023-03-08 21:03:41,598][547799] Updated weights for policy 0, policy_version 164400 (0.0004) [2023-03-08 21:03:44,599][547459] Fps is (10 sec: 11878.6, 60 sec: 12288.0, 300 sec: 12024.2). Total num frames: 84205568. Throughput: 0: 12368.6. Samples: 84197204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:44,600][547459] Avg episode reward: [(0, '4824.802')] [2023-03-08 21:03:45,071][547799] Updated weights for policy 0, policy_version 164480 (0.0005) [2023-03-08 21:03:48,316][547799] Updated weights for policy 0, policy_version 164560 (0.0005) [2023-03-08 21:03:49,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12038.1). Total num frames: 84271104. Throughput: 0: 12292.0. Samples: 84271104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:49,600][547459] Avg episode reward: [(0, '4826.770')] [2023-03-08 21:03:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000164592_84271104.pth... [2023-03-08 21:03:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000163872_83902464.pth [2023-03-08 21:03:51,548][547799] Updated weights for policy 0, policy_version 164640 (0.0005) [2023-03-08 21:03:54,599][547459] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12024.2). Total num frames: 84328448. Throughput: 0: 12286.5. Samples: 84307900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:54,600][547459] Avg episode reward: [(0, '4834.693')] [2023-03-08 21:03:55,124][547799] Updated weights for policy 0, policy_version 164720 (0.0004) [2023-03-08 21:03:58,561][547799] Updated weights for policy 0, policy_version 164800 (0.0005) [2023-03-08 21:03:59,599][547459] Fps is (10 sec: 11469.0, 60 sec: 12219.8, 300 sec: 12024.2). Total num frames: 84385792. Throughput: 0: 12195.6. Samples: 84377600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:03:59,600][547459] Avg episode reward: [(0, '4832.063')] [2023-03-08 21:04:02,115][547799] Updated weights for policy 0, policy_version 164880 (0.0005) [2023-03-08 21:04:04,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12038.1). Total num frames: 84451328. Throughput: 0: 12204.9. Samples: 84451392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:04,600][547459] Avg episode reward: [(0, '4835.859')] [2023-03-08 21:04:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000164944_84451328.pth... [2023-03-08 21:04:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000164232_84086784.pth [2023-03-08 21:04:05,164][547799] Updated weights for policy 0, policy_version 164960 (0.0005) [2023-03-08 21:04:08,560][547799] Updated weights for policy 0, policy_version 165040 (0.0005) [2023-03-08 21:04:09,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 84508672. Throughput: 0: 12213.2. Samples: 84488256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:09,600][547459] Avg episode reward: [(0, '4835.128')] [2023-03-08 21:04:12,018][547799] Updated weights for policy 0, policy_version 165120 (0.0005) [2023-03-08 21:04:14,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 84570112. Throughput: 0: 12130.8. Samples: 84558644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:14,600][547459] Avg episode reward: [(0, '4834.686')] [2023-03-08 21:04:15,572][547799] Updated weights for policy 0, policy_version 165200 (0.0005) [2023-03-08 21:04:19,190][547799] Updated weights for policy 0, policy_version 165280 (0.0006) [2023-03-08 21:04:19,600][547459] Fps is (10 sec: 11878.1, 60 sec: 12083.1, 300 sec: 12024.2). Total num frames: 84627456. Throughput: 0: 12014.9. Samples: 84627424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:19,600][547459] Avg episode reward: [(0, '4833.333')] [2023-03-08 21:04:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000165288_84627456.pth... [2023-03-08 21:04:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000164592_84271104.pth [2023-03-08 21:04:22,627][547799] Updated weights for policy 0, policy_version 165360 (0.0005) [2023-03-08 21:04:24,599][547459] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 84684800. Throughput: 0: 12001.4. Samples: 84663596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:24,600][547459] Avg episode reward: [(0, '4831.116')] [2023-03-08 21:04:26,117][547799] Updated weights for policy 0, policy_version 165440 (0.0005) [2023-03-08 21:04:29,567][547799] Updated weights for policy 0, policy_version 165520 (0.0005) [2023-03-08 21:04:29,599][547459] Fps is (10 sec: 11878.7, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 84746240. Throughput: 0: 11940.3. Samples: 84734516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:29,600][547459] Avg episode reward: [(0, '4832.222')] [2023-03-08 21:04:32,941][547799] Updated weights for policy 0, policy_version 165600 (0.0005) [2023-03-08 21:04:34,600][547459] Fps is (10 sec: 12287.8, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 84807680. Throughput: 0: 11919.8. Samples: 84807496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:34,600][547459] Avg episode reward: [(0, '4835.633')] [2023-03-08 21:04:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000165640_84807680.pth... [2023-03-08 21:04:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000164944_84451328.pth [2023-03-08 21:04:36,192][547799] Updated weights for policy 0, policy_version 165680 (0.0005) [2023-03-08 21:04:39,501][547799] Updated weights for policy 0, policy_version 165760 (0.0006) [2023-03-08 21:04:39,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 84869120. Throughput: 0: 11934.8. Samples: 84844968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:39,600][547459] Avg episode reward: [(0, '4829.580')] [2023-03-08 21:04:43,060][547799] Updated weights for policy 0, policy_version 165840 (0.0005) [2023-03-08 21:04:44,600][547459] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 84926464. Throughput: 0: 11962.6. Samples: 84915916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:44,600][547459] Avg episode reward: [(0, '4830.997')] [2023-03-08 21:04:46,347][547799] Updated weights for policy 0, policy_version 165920 (0.0005) [2023-03-08 21:04:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 84987904. Throughput: 0: 12007.0. Samples: 84991708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:49,600][547459] Avg episode reward: [(0, '4745.130')] [2023-03-08 21:04:49,624][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000166000_84992000.pth... [2023-03-08 21:04:49,625][547799] Updated weights for policy 0, policy_version 166000 (0.0005) [2023-03-08 21:04:49,626][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000165288_84627456.pth [2023-03-08 21:04:53,173][547799] Updated weights for policy 0, policy_version 166080 (0.0005) [2023-03-08 21:04:54,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 85045248. Throughput: 0: 11925.8. Samples: 85024916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:54,600][547459] Avg episode reward: [(0, '4834.112')] [2023-03-08 21:04:56,619][547799] Updated weights for policy 0, policy_version 166160 (0.0005) [2023-03-08 21:04:59,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 85110784. Throughput: 0: 11987.0. Samples: 85098060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:04:59,600][547459] Avg episode reward: [(0, '4833.366')] [2023-03-08 21:04:59,914][547799] Updated weights for policy 0, policy_version 166240 (0.0005) [2023-03-08 21:05:03,273][547799] Updated weights for policy 0, policy_version 166320 (0.0006) [2023-03-08 21:05:04,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 85168128. Throughput: 0: 12089.7. Samples: 85171460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:04,600][547459] Avg episode reward: [(0, '4780.862')] [2023-03-08 21:05:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000166344_85168128.pth... [2023-03-08 21:05:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000165640_84807680.pth [2023-03-08 21:05:06,743][547799] Updated weights for policy 0, policy_version 166400 (0.0005) [2023-03-08 21:05:09,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 85225472. Throughput: 0: 12060.8. Samples: 85206332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:09,600][547459] Avg episode reward: [(0, '4788.113')] [2023-03-08 21:05:10,534][547799] Updated weights for policy 0, policy_version 166480 (0.0006) [2023-03-08 21:05:14,242][547799] Updated weights for policy 0, policy_version 166560 (0.0006) [2023-03-08 21:05:14,599][547459] Fps is (10 sec: 11059.3, 60 sec: 11810.2, 300 sec: 11982.5). Total num frames: 85278720. Throughput: 0: 11910.9. Samples: 85270504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:14,600][547459] Avg episode reward: [(0, '4835.299')] [2023-03-08 21:05:17,760][547799] Updated weights for policy 0, policy_version 166640 (0.0005) [2023-03-08 21:05:19,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 85340160. Throughput: 0: 11878.7. Samples: 85342036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:19,600][547459] Avg episode reward: [(0, '4831.307')] [2023-03-08 21:05:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000166680_85340160.pth... [2023-03-08 21:05:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000166000_84992000.pth [2023-03-08 21:05:21,005][547799] Updated weights for policy 0, policy_version 166720 (0.0005) [2023-03-08 21:05:24,459][547799] Updated weights for policy 0, policy_version 166800 (0.0005) [2023-03-08 21:05:24,599][547459] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 85401600. Throughput: 0: 11862.2. Samples: 85378768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:24,611][547459] Avg episode reward: [(0, '4835.812')] [2023-03-08 21:05:28,050][547799] Updated weights for policy 0, policy_version 166880 (0.0005) [2023-03-08 21:05:29,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 85458944. Throughput: 0: 11855.6. Samples: 85449420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:29,607][547459] Avg episode reward: [(0, '4831.403')] [2023-03-08 21:05:31,314][547799] Updated weights for policy 0, policy_version 166960 (0.0005) [2023-03-08 21:05:34,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 85520384. Throughput: 0: 11802.4. Samples: 85522816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:34,611][547459] Avg episode reward: [(0, '4833.350')] [2023-03-08 21:05:34,613][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000167032_85520384.pth... [2023-03-08 21:05:34,615][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000166344_85168128.pth [2023-03-08 21:05:34,699][547799] Updated weights for policy 0, policy_version 167040 (0.0005) [2023-03-08 21:05:38,128][547799] Updated weights for policy 0, policy_version 167120 (0.0005) [2023-03-08 21:05:39,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 85581824. Throughput: 0: 11836.8. Samples: 85557572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:39,600][547459] Avg episode reward: [(0, '4737.968')] [2023-03-08 21:05:41,573][547799] Updated weights for policy 0, policy_version 167200 (0.0004) [2023-03-08 21:05:44,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 85639168. Throughput: 0: 11759.9. Samples: 85627256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:44,600][547459] Avg episode reward: [(0, '4829.452')] [2023-03-08 21:05:45,215][547799] Updated weights for policy 0, policy_version 167280 (0.0005) [2023-03-08 21:05:48,660][547799] Updated weights for policy 0, policy_version 167360 (0.0005) [2023-03-08 21:05:49,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 12010.3). Total num frames: 85696512. Throughput: 0: 11746.1. Samples: 85700036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:49,600][547459] Avg episode reward: [(0, '4825.914')] [2023-03-08 21:05:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000167376_85696512.pth... [2023-03-08 21:05:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000166680_85340160.pth [2023-03-08 21:05:51,957][547799] Updated weights for policy 0, policy_version 167440 (0.0005) [2023-03-08 21:05:54,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11996.4). Total num frames: 85753856. Throughput: 0: 11786.2. Samples: 85736712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:54,600][547459] Avg episode reward: [(0, '4785.500')] [2023-03-08 21:05:55,796][547799] Updated weights for policy 0, policy_version 167520 (0.0004) [2023-03-08 21:05:59,044][547799] Updated weights for policy 0, policy_version 167600 (0.0005) [2023-03-08 21:05:59,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11996.4). Total num frames: 85815296. Throughput: 0: 11875.8. Samples: 85804916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:05:59,600][547459] Avg episode reward: [(0, '4738.138')] [2023-03-08 21:06:02,476][547799] Updated weights for policy 0, policy_version 167680 (0.0005) [2023-03-08 21:06:04,600][547459] Fps is (10 sec: 12287.8, 60 sec: 11810.1, 300 sec: 12010.3). Total num frames: 85876736. Throughput: 0: 11882.5. Samples: 85876748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:04,600][547459] Avg episode reward: [(0, '4632.334')] [2023-03-08 21:06:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000167728_85876736.pth... [2023-03-08 21:06:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000167032_85520384.pth [2023-03-08 21:06:05,917][547799] Updated weights for policy 0, policy_version 167760 (0.0005) [2023-03-08 21:06:09,376][547799] Updated weights for policy 0, policy_version 167840 (0.0005) [2023-03-08 21:06:09,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11996.4). Total num frames: 85934080. Throughput: 0: 11853.2. Samples: 85912164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:09,600][547459] Avg episode reward: [(0, '4741.186')] [2023-03-08 21:06:13,051][547799] Updated weights for policy 0, policy_version 167920 (0.0004) [2023-03-08 21:06:14,600][547459] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 85991424. Throughput: 0: 11783.8. Samples: 85979692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:14,600][547459] Avg episode reward: [(0, '4705.686')] [2023-03-08 21:06:16,668][547799] Updated weights for policy 0, policy_version 168000 (0.0004) [2023-03-08 21:06:19,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11968.6). Total num frames: 86048768. Throughput: 0: 11689.2. Samples: 86048832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:19,600][547459] Avg episode reward: [(0, '4661.690')] [2023-03-08 21:06:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000168064_86048768.pth... [2023-03-08 21:06:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000167376_85696512.pth [2023-03-08 21:06:20,237][547799] Updated weights for policy 0, policy_version 168080 (0.0005) [2023-03-08 21:06:23,521][547799] Updated weights for policy 0, policy_version 168160 (0.0004) [2023-03-08 21:06:24,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11968.6). Total num frames: 86110208. Throughput: 0: 11726.7. Samples: 86085272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:24,600][547459] Avg episode reward: [(0, '4755.904')] [2023-03-08 21:06:26,854][547799] Updated weights for policy 0, policy_version 168240 (0.0004) [2023-03-08 21:06:29,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11968.7). Total num frames: 86167552. Throughput: 0: 11816.2. Samples: 86158984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:29,600][547459] Avg episode reward: [(0, '4702.501')] [2023-03-08 21:06:30,466][547799] Updated weights for policy 0, policy_version 168320 (0.0005) [2023-03-08 21:06:33,942][547799] Updated weights for policy 0, policy_version 168400 (0.0004) [2023-03-08 21:06:34,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 86228992. Throughput: 0: 11752.8. Samples: 86228912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:34,600][547459] Avg episode reward: [(0, '4829.006')] [2023-03-08 21:06:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000168416_86228992.pth... [2023-03-08 21:06:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000167728_85876736.pth [2023-03-08 21:06:37,488][547799] Updated weights for policy 0, policy_version 168480 (0.0004) [2023-03-08 21:06:39,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11968.6). Total num frames: 86282240. Throughput: 0: 11669.3. Samples: 86261832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:39,600][547459] Avg episode reward: [(0, '4619.910')] [2023-03-08 21:06:40,999][547799] Updated weights for policy 0, policy_version 168560 (0.0004) [2023-03-08 21:06:44,456][547799] Updated weights for policy 0, policy_version 168640 (0.0004) [2023-03-08 21:06:44,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11982.5). Total num frames: 86343680. Throughput: 0: 11713.6. Samples: 86332028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:44,600][547459] Avg episode reward: [(0, '4650.906')] [2023-03-08 21:06:47,795][547799] Updated weights for policy 0, policy_version 168720 (0.0004) [2023-03-08 21:06:49,600][547459] Fps is (10 sec: 12287.8, 60 sec: 11810.1, 300 sec: 11996.4). Total num frames: 86405120. Throughput: 0: 11741.8. Samples: 86405128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:49,600][547459] Avg episode reward: [(0, '4582.537')] [2023-03-08 21:06:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000168760_86405120.pth... [2023-03-08 21:06:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000168064_86048768.pth [2023-03-08 21:06:51,286][547799] Updated weights for policy 0, policy_version 168800 (0.0005) [2023-03-08 21:06:54,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 86462464. Throughput: 0: 11772.0. Samples: 86441904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:54,600][547459] Avg episode reward: [(0, '4236.559')] [2023-03-08 21:06:54,887][547799] Updated weights for policy 0, policy_version 168880 (0.0004) [2023-03-08 21:06:58,355][547799] Updated weights for policy 0, policy_version 168960 (0.0005) [2023-03-08 21:06:59,600][547459] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11968.7). Total num frames: 86519808. Throughput: 0: 11775.3. Samples: 86509580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:06:59,600][547459] Avg episode reward: [(0, '4452.364')] [2023-03-08 21:07:01,984][547799] Updated weights for policy 0, policy_version 169040 (0.0005) [2023-03-08 21:07:04,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11968.6). Total num frames: 86577152. Throughput: 0: 11778.3. Samples: 86578856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:04,600][547459] Avg episode reward: [(0, '4574.654')] [2023-03-08 21:07:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000169096_86577152.pth... [2023-03-08 21:07:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000168416_86228992.pth [2023-03-08 21:07:05,515][547799] Updated weights for policy 0, policy_version 169120 (0.0005) [2023-03-08 21:07:09,028][547799] Updated weights for policy 0, policy_version 169200 (0.0005) [2023-03-08 21:07:09,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11968.7). Total num frames: 86634496. Throughput: 0: 11746.1. Samples: 86613844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:09,600][547459] Avg episode reward: [(0, '4646.416')] [2023-03-08 21:07:12,513][547799] Updated weights for policy 0, policy_version 169280 (0.0005) [2023-03-08 21:07:14,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11982.5). Total num frames: 86695936. Throughput: 0: 11665.7. Samples: 86683940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:14,600][547459] Avg episode reward: [(0, '4681.269')] [2023-03-08 21:07:15,513][547799] Updated weights for policy 0, policy_version 169360 (0.0004) [2023-03-08 21:07:18,968][547799] Updated weights for policy 0, policy_version 169440 (0.0004) [2023-03-08 21:07:19,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 86757376. Throughput: 0: 11792.9. Samples: 86759592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:19,600][547459] Avg episode reward: [(0, '4662.320')] [2023-03-08 21:07:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000169448_86757376.pth... [2023-03-08 21:07:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000168760_86405120.pth [2023-03-08 21:07:22,322][547799] Updated weights for policy 0, policy_version 169520 (0.0005) [2023-03-08 21:07:24,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 86818816. Throughput: 0: 11893.6. Samples: 86797044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:24,600][547459] Avg episode reward: [(0, '4658.692')] [2023-03-08 21:07:25,691][547799] Updated weights for policy 0, policy_version 169600 (0.0005) [2023-03-08 21:07:29,109][547799] Updated weights for policy 0, policy_version 169680 (0.0004) [2023-03-08 21:07:29,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 86880256. Throughput: 0: 11943.8. Samples: 86869496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:29,600][547459] Avg episode reward: [(0, '4793.625')] [2023-03-08 21:07:32,385][547799] Updated weights for policy 0, policy_version 169760 (0.0005) [2023-03-08 21:07:34,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 86941696. Throughput: 0: 11932.2. Samples: 86942076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:34,600][547459] Avg episode reward: [(0, '4744.440')] [2023-03-08 21:07:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000169808_86941696.pth... [2023-03-08 21:07:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000169096_86577152.pth [2023-03-08 21:07:35,866][547799] Updated weights for policy 0, policy_version 169840 (0.0005) [2023-03-08 21:07:39,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 86994944. Throughput: 0: 11884.6. Samples: 86976712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:39,600][547459] Avg episode reward: [(0, '4716.955')] [2023-03-08 21:07:39,661][547799] Updated weights for policy 0, policy_version 169920 (0.0006) [2023-03-08 21:07:43,274][547799] Updated weights for policy 0, policy_version 170000 (0.0005) [2023-03-08 21:07:44,600][547459] Fps is (10 sec: 11059.2, 60 sec: 11810.1, 300 sec: 11927.0). Total num frames: 87052288. Throughput: 0: 11870.9. Samples: 87043772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:44,600][547459] Avg episode reward: [(0, '4768.459')] [2023-03-08 21:07:46,860][547799] Updated weights for policy 0, policy_version 170080 (0.0005) [2023-03-08 21:07:49,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11927.0). Total num frames: 87109632. Throughput: 0: 11871.6. Samples: 87113076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:49,600][547459] Avg episode reward: [(0, '4759.403')] [2023-03-08 21:07:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000170136_87109632.pth... [2023-03-08 21:07:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000169448_86757376.pth [2023-03-08 21:07:50,379][547799] Updated weights for policy 0, policy_version 170160 (0.0005) [2023-03-08 21:07:53,913][547799] Updated weights for policy 0, policy_version 170240 (0.0005) [2023-03-08 21:07:54,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11913.1). Total num frames: 87166976. Throughput: 0: 11852.8. Samples: 87147220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:54,600][547459] Avg episode reward: [(0, '4771.102')] [2023-03-08 21:07:57,433][547799] Updated weights for policy 0, policy_version 170320 (0.0004) [2023-03-08 21:07:59,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 87228416. Throughput: 0: 11837.1. Samples: 87216612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:07:59,600][547459] Avg episode reward: [(0, '4807.235')] [2023-03-08 21:08:00,932][547799] Updated weights for policy 0, policy_version 170400 (0.0005) [2023-03-08 21:08:04,559][547799] Updated weights for policy 0, policy_version 170480 (0.0005) [2023-03-08 21:08:04,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11810.2, 300 sec: 11899.2). Total num frames: 87285760. Throughput: 0: 11692.6. Samples: 87285756. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:04,600][547459] Avg episode reward: [(0, '4809.145')] [2023-03-08 21:08:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000170480_87285760.pth... [2023-03-08 21:08:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000169808_86941696.pth [2023-03-08 21:08:08,003][547799] Updated weights for policy 0, policy_version 170560 (0.0004) [2023-03-08 21:08:09,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 87343104. Throughput: 0: 11671.0. Samples: 87322240. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:09,600][547459] Avg episode reward: [(0, '4821.256')] [2023-03-08 21:08:11,791][547799] Updated weights for policy 0, policy_version 170640 (0.0005) [2023-03-08 21:08:14,600][547459] Fps is (10 sec: 11059.1, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 87396352. Throughput: 0: 11524.4. Samples: 87388096. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:14,600][547459] Avg episode reward: [(0, '4583.155')] [2023-03-08 21:08:15,331][547799] Updated weights for policy 0, policy_version 170720 (0.0005) [2023-03-08 21:08:18,917][547799] Updated weights for policy 0, policy_version 170800 (0.0005) [2023-03-08 21:08:19,599][547459] Fps is (10 sec: 11059.2, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 87453696. Throughput: 0: 11455.8. Samples: 87457588. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:19,600][547459] Avg episode reward: [(0, '4774.995')] [2023-03-08 21:08:19,619][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000170816_87457792.pth... [2023-03-08 21:08:19,620][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000170136_87109632.pth [2023-03-08 21:08:22,562][547799] Updated weights for policy 0, policy_version 170880 (0.0005) [2023-03-08 21:08:24,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 87515136. Throughput: 0: 11418.9. Samples: 87490560. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:24,600][547459] Avg episode reward: [(0, '4783.040')] [2023-03-08 21:08:25,960][547799] Updated weights for policy 0, policy_version 170960 (0.0005) [2023-03-08 21:08:29,430][547799] Updated weights for policy 0, policy_version 171040 (0.0005) [2023-03-08 21:08:29,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11537.0, 300 sec: 11815.9). Total num frames: 87572480. Throughput: 0: 11534.8. Samples: 87562840. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:29,600][547459] Avg episode reward: [(0, '4689.063')] [2023-03-08 21:08:32,671][547799] Updated weights for policy 0, policy_version 171120 (0.0005) [2023-03-08 21:08:34,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11815.9). Total num frames: 87633920. Throughput: 0: 11586.4. Samples: 87634464. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:34,600][547459] Avg episode reward: [(0, '4777.718')] [2023-03-08 21:08:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000171160_87633920.pth... [2023-03-08 21:08:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000170480_87285760.pth [2023-03-08 21:08:36,140][547799] Updated weights for policy 0, policy_version 171200 (0.0004) [2023-03-08 21:08:39,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11605.4, 300 sec: 11815.9). Total num frames: 87691264. Throughput: 0: 11631.5. Samples: 87670636. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:39,600][547459] Avg episode reward: [(0, '4773.757')] [2023-03-08 21:08:39,693][547799] Updated weights for policy 0, policy_version 171280 (0.0005) [2023-03-08 21:08:43,190][547799] Updated weights for policy 0, policy_version 171360 (0.0005) [2023-03-08 21:08:44,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 87752704. Throughput: 0: 11640.3. Samples: 87740424. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:44,600][547459] Avg episode reward: [(0, '4598.190')] [2023-03-08 21:08:46,765][547799] Updated weights for policy 0, policy_version 171440 (0.0005) [2023-03-08 21:08:49,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 87810048. Throughput: 0: 11648.9. Samples: 87809956. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:49,600][547459] Avg episode reward: [(0, '4826.807')] [2023-03-08 21:08:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000171504_87810048.pth... [2023-03-08 21:08:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000170816_87457792.pth [2023-03-08 21:08:50,212][547799] Updated weights for policy 0, policy_version 171520 (0.0005) [2023-03-08 21:08:53,799][547799] Updated weights for policy 0, policy_version 171600 (0.0004) [2023-03-08 21:08:54,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 87867392. Throughput: 0: 11604.6. Samples: 87844448. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:54,600][547459] Avg episode reward: [(0, '4790.282')] [2023-03-08 21:08:57,127][547799] Updated weights for policy 0, policy_version 171680 (0.0005) [2023-03-08 21:08:59,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11788.1). Total num frames: 87928832. Throughput: 0: 11744.7. Samples: 87916608. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:08:59,600][547459] Avg episode reward: [(0, '4793.573')] [2023-03-08 21:09:00,562][547799] Updated weights for policy 0, policy_version 171760 (0.0004) [2023-03-08 21:09:04,301][547799] Updated weights for policy 0, policy_version 171840 (0.0004) [2023-03-08 21:09:04,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11774.3). Total num frames: 87982080. Throughput: 0: 11724.5. Samples: 87985188. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:09:04,600][547459] Avg episode reward: [(0, '4534.586')] [2023-03-08 21:09:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000171840_87982080.pth... [2023-03-08 21:09:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000171160_87633920.pth [2023-03-08 21:09:07,758][547799] Updated weights for policy 0, policy_version 171920 (0.0005) [2023-03-08 21:09:09,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11774.3). Total num frames: 88043520. Throughput: 0: 11780.8. Samples: 88020696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:09,600][547459] Avg episode reward: [(0, '4778.461')] [2023-03-08 21:09:11,115][547799] Updated weights for policy 0, policy_version 172000 (0.0004) [2023-03-08 21:09:14,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 88100864. Throughput: 0: 11761.5. Samples: 88092108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:14,600][547459] Avg episode reward: [(0, '4697.066')] [2023-03-08 21:09:14,608][547799] Updated weights for policy 0, policy_version 172080 (0.0005) [2023-03-08 21:09:18,054][547799] Updated weights for policy 0, policy_version 172160 (0.0005) [2023-03-08 21:09:19,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 88162304. Throughput: 0: 11755.3. Samples: 88163452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:19,600][547459] Avg episode reward: [(0, '4814.247')] [2023-03-08 21:09:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000172192_88162304.pth... [2023-03-08 21:09:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000171504_87810048.pth [2023-03-08 21:09:21,688][547799] Updated weights for policy 0, policy_version 172240 (0.0004) [2023-03-08 21:09:24,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 88219648. Throughput: 0: 11697.3. Samples: 88197016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:24,600][547459] Avg episode reward: [(0, '4556.104')] [2023-03-08 21:09:25,151][547799] Updated weights for policy 0, policy_version 172320 (0.0005) [2023-03-08 21:09:28,617][547799] Updated weights for policy 0, policy_version 172400 (0.0004) [2023-03-08 21:09:29,599][547459] Fps is (10 sec: 11469.0, 60 sec: 11741.9, 300 sec: 11760.4). Total num frames: 88276992. Throughput: 0: 11738.8. Samples: 88268668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:29,600][547459] Avg episode reward: [(0, '4825.133')] [2023-03-08 21:09:31,986][547799] Updated weights for policy 0, policy_version 172480 (0.0004) [2023-03-08 21:09:34,600][547459] Fps is (10 sec: 12287.8, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 88342528. Throughput: 0: 11836.3. Samples: 88342592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:34,600][547459] Avg episode reward: [(0, '4831.027')] [2023-03-08 21:09:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000172544_88342528.pth... [2023-03-08 21:09:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000171840_87982080.pth [2023-03-08 21:09:35,216][547799] Updated weights for policy 0, policy_version 172560 (0.0005) [2023-03-08 21:09:38,615][547799] Updated weights for policy 0, policy_version 172640 (0.0004) [2023-03-08 21:09:39,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 88399872. Throughput: 0: 11886.1. Samples: 88379320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:39,600][547459] Avg episode reward: [(0, '4823.558')] [2023-03-08 21:09:42,047][547799] Updated weights for policy 0, policy_version 172720 (0.0005) [2023-03-08 21:09:44,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 88461312. Throughput: 0: 11868.8. Samples: 88450704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:44,600][547459] Avg episode reward: [(0, '4781.333')] [2023-03-08 21:09:45,440][547799] Updated weights for policy 0, policy_version 172800 (0.0005) [2023-03-08 21:09:48,889][547799] Updated weights for policy 0, policy_version 172880 (0.0004) [2023-03-08 21:09:49,600][547459] Fps is (10 sec: 12287.8, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 88522752. Throughput: 0: 11945.5. Samples: 88522736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:49,600][547459] Avg episode reward: [(0, '4816.653')] [2023-03-08 21:09:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000172896_88522752.pth... [2023-03-08 21:09:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000172192_88162304.pth [2023-03-08 21:09:52,394][547799] Updated weights for policy 0, policy_version 172960 (0.0005) [2023-03-08 21:09:54,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 88580096. Throughput: 0: 11919.1. Samples: 88557056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:54,600][547459] Avg episode reward: [(0, '4791.279')] [2023-03-08 21:09:55,825][547799] Updated weights for policy 0, policy_version 173040 (0.0005) [2023-03-08 21:09:59,192][547799] Updated weights for policy 0, policy_version 173120 (0.0005) [2023-03-08 21:09:59,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11774.3). Total num frames: 88641536. Throughput: 0: 11937.9. Samples: 88629312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:09:59,600][547459] Avg episode reward: [(0, '4779.664')] [2023-03-08 21:10:02,426][547799] Updated weights for policy 0, policy_version 173200 (0.0005) [2023-03-08 21:10:04,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11788.1). Total num frames: 88702976. Throughput: 0: 12029.7. Samples: 88704788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:04,600][547459] Avg episode reward: [(0, '4782.467')] [2023-03-08 21:10:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000173248_88702976.pth... [2023-03-08 21:10:04,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000172544_88342528.pth [2023-03-08 21:10:05,812][547799] Updated weights for policy 0, policy_version 173280 (0.0005) [2023-03-08 21:10:09,286][547799] Updated weights for policy 0, policy_version 173360 (0.0005) [2023-03-08 21:10:09,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 88760320. Throughput: 0: 12062.1. Samples: 88739812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:09,600][547459] Avg episode reward: [(0, '4728.607')] [2023-03-08 21:10:12,632][547799] Updated weights for policy 0, policy_version 173440 (0.0005) [2023-03-08 21:10:14,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 88821760. Throughput: 0: 12040.6. Samples: 88810496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:14,602][547459] Avg episode reward: [(0, '4790.120')] [2023-03-08 21:10:16,129][547799] Updated weights for policy 0, policy_version 173520 (0.0004) [2023-03-08 21:10:19,534][547799] Updated weights for policy 0, policy_version 173600 (0.0005) [2023-03-08 21:10:19,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 88883200. Throughput: 0: 12013.9. Samples: 88883216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:19,611][547459] Avg episode reward: [(0, '4825.491')] [2023-03-08 21:10:19,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000173600_88883200.pth... [2023-03-08 21:10:19,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000172896_88522752.pth [2023-03-08 21:10:23,013][547799] Updated weights for policy 0, policy_version 173680 (0.0005) [2023-03-08 21:10:24,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 88940544. Throughput: 0: 12010.2. Samples: 88919780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:24,610][547459] Avg episode reward: [(0, '4824.904')] [2023-03-08 21:10:26,423][547799] Updated weights for policy 0, policy_version 173760 (0.0005) [2023-03-08 21:10:29,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11802.0). Total num frames: 89001984. Throughput: 0: 12001.2. Samples: 88990756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:29,610][547459] Avg episode reward: [(0, '4831.722')] [2023-03-08 21:10:29,682][547799] Updated weights for policy 0, policy_version 173840 (0.0004) [2023-03-08 21:10:33,033][547799] Updated weights for policy 0, policy_version 173920 (0.0005) [2023-03-08 21:10:34,600][547459] Fps is (10 sec: 12287.8, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 89063424. Throughput: 0: 12018.7. Samples: 89063576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:34,611][547459] Avg episode reward: [(0, '4826.893')] [2023-03-08 21:10:34,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000173952_89063424.pth... [2023-03-08 21:10:34,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000173248_88702976.pth [2023-03-08 21:10:36,503][547799] Updated weights for policy 0, policy_version 174000 (0.0005) [2023-03-08 21:10:39,599][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 89120768. Throughput: 0: 12071.4. Samples: 89100268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:39,600][547459] Avg episode reward: [(0, '4785.641')] [2023-03-08 21:10:40,015][547799] Updated weights for policy 0, policy_version 174080 (0.0005) [2023-03-08 21:10:43,391][547799] Updated weights for policy 0, policy_version 174160 (0.0005) [2023-03-08 21:10:44,600][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11815.9). Total num frames: 89182208. Throughput: 0: 12047.9. Samples: 89171468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:44,600][547459] Avg episode reward: [(0, '4829.685')] [2023-03-08 21:10:47,084][547799] Updated weights for policy 0, policy_version 174240 (0.0005) [2023-03-08 21:10:49,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11815.9). Total num frames: 89239552. Throughput: 0: 11885.1. Samples: 89239616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:49,600][547459] Avg episode reward: [(0, '4665.172')] [2023-03-08 21:10:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000174296_89239552.pth... [2023-03-08 21:10:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000173600_88883200.pth [2023-03-08 21:10:50,610][547799] Updated weights for policy 0, policy_version 174320 (0.0004) [2023-03-08 21:10:54,394][547799] Updated weights for policy 0, policy_version 174400 (0.0005) [2023-03-08 21:10:54,600][547459] Fps is (10 sec: 11059.2, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 89292800. Throughput: 0: 11833.5. Samples: 89272320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:54,600][547459] Avg episode reward: [(0, '4563.810')] [2023-03-08 21:10:57,692][547799] Updated weights for policy 0, policy_version 174480 (0.0005) [2023-03-08 21:10:59,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11788.2). Total num frames: 89354240. Throughput: 0: 11887.5. Samples: 89345432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:10:59,600][547459] Avg episode reward: [(0, '4584.092')] [2023-03-08 21:11:01,144][547799] Updated weights for policy 0, policy_version 174560 (0.0005) [2023-03-08 21:11:04,595][547799] Updated weights for policy 0, policy_version 174640 (0.0004) [2023-03-08 21:11:04,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 89415680. Throughput: 0: 11828.1. Samples: 89415480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:11:04,600][547459] Avg episode reward: [(0, '4628.742')] [2023-03-08 21:11:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000174640_89415680.pth... [2023-03-08 21:11:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000173952_89063424.pth [2023-03-08 21:11:08,273][547799] Updated weights for policy 0, policy_version 174720 (0.0005) [2023-03-08 21:11:09,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 89473024. Throughput: 0: 11749.7. Samples: 89448520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:11:09,600][547459] Avg episode reward: [(0, '4758.002')] [2023-03-08 21:11:11,749][547799] Updated weights for policy 0, policy_version 174800 (0.0005) [2023-03-08 21:11:14,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 89530368. Throughput: 0: 11732.3. Samples: 89518712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:11:14,600][547459] Avg episode reward: [(0, '4686.305')] [2023-03-08 21:11:15,301][547799] Updated weights for policy 0, policy_version 174880 (0.0004) [2023-03-08 21:11:18,911][547799] Updated weights for policy 0, policy_version 174960 (0.0004) [2023-03-08 21:11:19,599][547459] Fps is (10 sec: 11059.3, 60 sec: 11673.6, 300 sec: 11774.3). Total num frames: 89583616. Throughput: 0: 11644.7. Samples: 89587588. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:11:19,600][547459] Avg episode reward: [(0, '4733.240')] [2023-03-08 21:11:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000174968_89583616.pth... [2023-03-08 21:11:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000174296_89239552.pth [2023-03-08 21:11:22,467][547799] Updated weights for policy 0, policy_version 175040 (0.0005) [2023-03-08 21:11:24,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11788.2). Total num frames: 89645056. Throughput: 0: 11567.1. Samples: 89620788. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:11:24,600][547459] Avg episode reward: [(0, '4788.018')] [2023-03-08 21:11:25,643][547799] Updated weights for policy 0, policy_version 175120 (0.0005) [2023-03-08 21:11:29,001][547799] Updated weights for policy 0, policy_version 175200 (0.0005) [2023-03-08 21:11:29,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11741.9, 300 sec: 11788.2). Total num frames: 89706496. Throughput: 0: 11688.4. Samples: 89697444. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:11:29,600][547459] Avg episode reward: [(0, '4833.423')] [2023-03-08 21:11:32,517][547799] Updated weights for policy 0, policy_version 175280 (0.0005) [2023-03-08 21:11:34,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 89767936. Throughput: 0: 11743.5. Samples: 89768072. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:11:34,600][547459] Avg episode reward: [(0, '4834.077')] [2023-03-08 21:11:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000175328_89767936.pth... [2023-03-08 21:11:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000174640_89415680.pth [2023-03-08 21:11:35,758][547799] Updated weights for policy 0, policy_version 175360 (0.0004) [2023-03-08 21:11:39,167][547799] Updated weights for policy 0, policy_version 175440 (0.0004) [2023-03-08 21:11:39,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 89829376. Throughput: 0: 11860.6. Samples: 89806044. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:11:39,600][547459] Avg episode reward: [(0, '4831.904')] [2023-03-08 21:11:42,641][547799] Updated weights for policy 0, policy_version 175520 (0.0005) [2023-03-08 21:11:44,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 89886720. Throughput: 0: 11827.3. Samples: 89877660. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:11:44,600][547459] Avg episode reward: [(0, '4784.988')] [2023-03-08 21:11:46,147][547799] Updated weights for policy 0, policy_version 175600 (0.0005) [2023-03-08 21:11:49,473][547799] Updated weights for policy 0, policy_version 175680 (0.0005) [2023-03-08 21:11:49,600][547459] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 89948160. Throughput: 0: 11841.9. Samples: 89948368. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:11:49,600][547459] Avg episode reward: [(0, '4765.288')] [2023-03-08 21:11:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000175680_89948160.pth... [2023-03-08 21:11:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000174968_89583616.pth [2023-03-08 21:11:53,130][547799] Updated weights for policy 0, policy_version 175760 (0.0005) [2023-03-08 21:11:54,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11815.9). Total num frames: 90005504. Throughput: 0: 11876.9. Samples: 89982980. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:11:54,600][547459] Avg episode reward: [(0, '4738.905')] [2023-03-08 21:11:56,702][547799] Updated weights for policy 0, policy_version 175840 (0.0005) [2023-03-08 21:11:59,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 90062848. Throughput: 0: 11825.8. Samples: 90050876. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:11:59,600][547459] Avg episode reward: [(0, '4739.297')] [2023-03-08 21:12:00,248][547799] Updated weights for policy 0, policy_version 175920 (0.0005) [2023-03-08 21:12:03,745][547799] Updated weights for policy 0, policy_version 176000 (0.0005) [2023-03-08 21:12:04,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 90120192. Throughput: 0: 11864.2. Samples: 90121476. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:12:04,600][547459] Avg episode reward: [(0, '4837.224')] [2023-03-08 21:12:04,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000176016_90120192.pth... [2023-03-08 21:12:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000175328_89767936.pth [2023-03-08 21:12:04,605][547754] Saving new best policy, reward=4837.224! [2023-03-08 21:12:07,159][547799] Updated weights for policy 0, policy_version 176080 (0.0005) [2023-03-08 21:12:09,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 90181632. Throughput: 0: 11917.1. Samples: 90157056. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:12:09,611][547459] Avg episode reward: [(0, '4683.866')] [2023-03-08 21:12:10,550][547799] Updated weights for policy 0, policy_version 176160 (0.0005) [2023-03-08 21:12:14,044][547799] Updated weights for policy 0, policy_version 176240 (0.0005) [2023-03-08 21:12:14,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 90238976. Throughput: 0: 11821.8. Samples: 90229424. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:12:14,611][547459] Avg episode reward: [(0, '4788.507')] [2023-03-08 21:12:17,663][547799] Updated weights for policy 0, policy_version 176320 (0.0005) [2023-03-08 21:12:19,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 90296320. Throughput: 0: 11740.4. Samples: 90296392. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:12:19,611][547459] Avg episode reward: [(0, '4836.622')] [2023-03-08 21:12:19,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000176360_90296320.pth... [2023-03-08 21:12:19,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000175680_89948160.pth [2023-03-08 21:12:21,216][547799] Updated weights for policy 0, policy_version 176400 (0.0005) [2023-03-08 21:12:24,583][547799] Updated weights for policy 0, policy_version 176480 (0.0005) [2023-03-08 21:12:24,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 90357760. Throughput: 0: 11710.6. Samples: 90333020. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:12:24,611][547459] Avg episode reward: [(0, '4832.002')] [2023-03-08 21:12:28,015][547799] Updated weights for policy 0, policy_version 176560 (0.0005) [2023-03-08 21:12:29,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 90415104. Throughput: 0: 11726.1. Samples: 90405332. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:12:29,610][547459] Avg episode reward: [(0, '4831.854')] [2023-03-08 21:12:31,476][547799] Updated weights for policy 0, policy_version 176640 (0.0005) [2023-03-08 21:12:34,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11788.2). Total num frames: 90472448. Throughput: 0: 11716.3. Samples: 90475600. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:12:34,610][547459] Avg episode reward: [(0, '4673.066')] [2023-03-08 21:12:34,626][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000176712_90476544.pth... [2023-03-08 21:12:34,628][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000176016_90120192.pth [2023-03-08 21:12:35,000][547799] Updated weights for policy 0, policy_version 176720 (0.0005) [2023-03-08 21:12:38,445][547799] Updated weights for policy 0, policy_version 176800 (0.0005) [2023-03-08 21:12:39,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11741.8, 300 sec: 11802.0). Total num frames: 90533888. Throughput: 0: 11712.1. Samples: 90510024. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:12:39,611][547459] Avg episode reward: [(0, '4753.716')] [2023-03-08 21:12:41,800][547799] Updated weights for policy 0, policy_version 176880 (0.0005) [2023-03-08 21:12:44,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 90591232. Throughput: 0: 11825.3. Samples: 90583012. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:12:44,610][547459] Avg episode reward: [(0, '4720.234')] [2023-03-08 21:12:45,448][547799] Updated weights for policy 0, policy_version 176960 (0.0006) [2023-03-08 21:12:48,972][547799] Updated weights for policy 0, policy_version 177040 (0.0005) [2023-03-08 21:12:49,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 90648576. Throughput: 0: 11774.8. Samples: 90651344. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:12:49,610][547459] Avg episode reward: [(0, '4656.823')] [2023-03-08 21:12:49,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000177048_90648576.pth... [2023-03-08 21:12:49,616][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000176360_90296320.pth [2023-03-08 21:12:52,483][547799] Updated weights for policy 0, policy_version 177120 (0.0005) [2023-03-08 21:12:54,599][547459] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 90710016. Throughput: 0: 11743.5. Samples: 90685512. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:12:54,610][547459] Avg episode reward: [(0, '4703.653')] [2023-03-08 21:12:55,912][547799] Updated weights for policy 0, policy_version 177200 (0.0005) [2023-03-08 21:12:59,319][547799] Updated weights for policy 0, policy_version 177280 (0.0005) [2023-03-08 21:12:59,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 90767360. Throughput: 0: 11734.2. Samples: 90757464. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:12:59,610][547459] Avg episode reward: [(0, '4745.744')] [2023-03-08 21:13:02,690][547799] Updated weights for policy 0, policy_version 177360 (0.0005) [2023-03-08 21:13:04,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 90828800. Throughput: 0: 11833.9. Samples: 90828916. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:13:04,611][547459] Avg episode reward: [(0, '4832.420')] [2023-03-08 21:13:04,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000177400_90828800.pth... [2023-03-08 21:13:04,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000176712_90476544.pth [2023-03-08 21:13:06,168][547799] Updated weights for policy 0, policy_version 177440 (0.0005) [2023-03-08 21:13:09,458][547799] Updated weights for policy 0, policy_version 177520 (0.0005) [2023-03-08 21:13:09,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 90890240. Throughput: 0: 11838.1. Samples: 90865736. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:13:09,611][547459] Avg episode reward: [(0, '4823.372')] [2023-03-08 21:13:12,911][547799] Updated weights for policy 0, policy_version 177600 (0.0005) [2023-03-08 21:13:14,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 90947584. Throughput: 0: 11857.3. Samples: 90938912. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:13:14,600][547459] Avg episode reward: [(0, '4834.021')] [2023-03-08 21:13:16,419][547799] Updated weights for policy 0, policy_version 177680 (0.0005) [2023-03-08 21:13:19,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 91009024. Throughput: 0: 11883.5. Samples: 91010356. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:13:19,600][547459] Avg episode reward: [(0, '4834.291')] [2023-03-08 21:13:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000177752_91009024.pth... [2023-03-08 21:13:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000177048_90648576.pth [2023-03-08 21:13:19,799][547799] Updated weights for policy 0, policy_version 177760 (0.0006) [2023-03-08 21:13:23,220][547799] Updated weights for policy 0, policy_version 177840 (0.0005) [2023-03-08 21:13:24,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 91070464. Throughput: 0: 11907.9. Samples: 91045880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:13:24,600][547459] Avg episode reward: [(0, '4782.985')] [2023-03-08 21:13:26,612][547799] Updated weights for policy 0, policy_version 177920 (0.0006) [2023-03-08 21:13:29,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 91127808. Throughput: 0: 11871.8. Samples: 91117244. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:13:29,600][547459] Avg episode reward: [(0, '4779.932')] [2023-03-08 21:13:30,094][547799] Updated weights for policy 0, policy_version 178000 (0.0005) [2023-03-08 21:13:33,612][547799] Updated weights for policy 0, policy_version 178080 (0.0005) [2023-03-08 21:13:34,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 91185152. Throughput: 0: 11951.7. Samples: 91189172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:13:34,600][547459] Avg episode reward: [(0, '4832.256')] [2023-03-08 21:13:34,634][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000178104_91189248.pth... [2023-03-08 21:13:34,636][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000177400_90828800.pth [2023-03-08 21:13:37,011][547799] Updated weights for policy 0, policy_version 178160 (0.0005) [2023-03-08 21:13:39,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 91246592. Throughput: 0: 11956.2. Samples: 91223540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:13:39,600][547459] Avg episode reward: [(0, '4826.363')] [2023-03-08 21:13:40,747][547799] Updated weights for policy 0, policy_version 178240 (0.0006) [2023-03-08 21:13:44,599][547459] Fps is (10 sec: 11059.3, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 91295744. Throughput: 0: 11809.2. Samples: 91288876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:13:44,600][547459] Avg episode reward: [(0, '4831.019')] [2023-03-08 21:13:44,654][547799] Updated weights for policy 0, policy_version 178320 (0.0006) [2023-03-08 21:13:48,336][547799] Updated weights for policy 0, policy_version 178400 (0.0004) [2023-03-08 21:13:49,599][547459] Fps is (10 sec: 10649.7, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 91353088. Throughput: 0: 11676.6. Samples: 91354360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:13:49,600][547459] Avg episode reward: [(0, '4789.660')] [2023-03-08 21:13:49,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000178424_91353088.pth... [2023-03-08 21:13:49,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000177752_91009024.pth [2023-03-08 21:13:51,828][547799] Updated weights for policy 0, policy_version 178480 (0.0004) [2023-03-08 21:13:54,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 91410432. Throughput: 0: 11647.0. Samples: 91389848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:13:54,600][547459] Avg episode reward: [(0, '4742.139')] [2023-03-08 21:13:55,579][547799] Updated weights for policy 0, policy_version 178560 (0.0006) [2023-03-08 21:13:58,829][547799] Updated weights for policy 0, policy_version 178640 (0.0005) [2023-03-08 21:13:59,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 91471872. Throughput: 0: 11570.7. Samples: 91459592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:13:59,600][547459] Avg episode reward: [(0, '4740.518')] [2023-03-08 21:14:02,181][547799] Updated weights for policy 0, policy_version 178720 (0.0004) [2023-03-08 21:14:04,600][547459] Fps is (10 sec: 12287.8, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 91533312. Throughput: 0: 11622.6. Samples: 91533376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:04,600][547459] Avg episode reward: [(0, '4741.215')] [2023-03-08 21:14:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000178776_91533312.pth... [2023-03-08 21:14:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000178104_91189248.pth [2023-03-08 21:14:05,455][547799] Updated weights for policy 0, policy_version 178800 (0.0005) [2023-03-08 21:14:08,975][547799] Updated weights for policy 0, policy_version 178880 (0.0004) [2023-03-08 21:14:09,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 91590656. Throughput: 0: 11644.6. Samples: 91569888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:09,600][547459] Avg episode reward: [(0, '4747.024')] [2023-03-08 21:14:12,539][547799] Updated weights for policy 0, policy_version 178960 (0.0004) [2023-03-08 21:14:14,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.8, 300 sec: 11829.8). Total num frames: 91652096. Throughput: 0: 11612.1. Samples: 91639788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:14,600][547459] Avg episode reward: [(0, '4833.184')] [2023-03-08 21:14:15,830][547799] Updated weights for policy 0, policy_version 179040 (0.0005) [2023-03-08 21:14:19,326][547799] Updated weights for policy 0, policy_version 179120 (0.0005) [2023-03-08 21:14:19,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 91709440. Throughput: 0: 11632.1. Samples: 91712616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:19,600][547459] Avg episode reward: [(0, '4830.942')] [2023-03-08 21:14:19,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000179128_91713536.pth... [2023-03-08 21:14:19,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000178424_91353088.pth [2023-03-08 21:14:22,903][547799] Updated weights for policy 0, policy_version 179200 (0.0005) [2023-03-08 21:14:24,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 91770880. Throughput: 0: 11615.9. Samples: 91746256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:24,600][547459] Avg episode reward: [(0, '4743.284')] [2023-03-08 21:14:26,375][547799] Updated weights for policy 0, policy_version 179280 (0.0004) [2023-03-08 21:14:29,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 91828224. Throughput: 0: 11713.8. Samples: 91816000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:29,600][547459] Avg episode reward: [(0, '4678.355')] [2023-03-08 21:14:29,779][547799] Updated weights for policy 0, policy_version 179360 (0.0005) [2023-03-08 21:14:33,380][547799] Updated weights for policy 0, policy_version 179440 (0.0004) [2023-03-08 21:14:34,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 91885568. Throughput: 0: 11805.0. Samples: 91885588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:34,600][547459] Avg episode reward: [(0, '4699.916')] [2023-03-08 21:14:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000179464_91885568.pth... [2023-03-08 21:14:34,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000178776_91533312.pth [2023-03-08 21:14:37,076][547799] Updated weights for policy 0, policy_version 179520 (0.0005) [2023-03-08 21:14:39,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11802.0). Total num frames: 91942912. Throughput: 0: 11786.2. Samples: 91920228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:39,600][547459] Avg episode reward: [(0, '4741.841')] [2023-03-08 21:14:40,462][547799] Updated weights for policy 0, policy_version 179600 (0.0005) [2023-03-08 21:14:44,064][547799] Updated weights for policy 0, policy_version 179680 (0.0005) [2023-03-08 21:14:44,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.8, 300 sec: 11788.1). Total num frames: 92000256. Throughput: 0: 11816.3. Samples: 91991328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:44,600][547459] Avg episode reward: [(0, '4689.305')] [2023-03-08 21:14:47,453][547799] Updated weights for policy 0, policy_version 179760 (0.0005) [2023-03-08 21:14:49,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.8, 300 sec: 11788.1). Total num frames: 92057600. Throughput: 0: 11725.6. Samples: 92061028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:49,600][547459] Avg episode reward: [(0, '4392.550')] [2023-03-08 21:14:49,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000179808_92061696.pth... [2023-03-08 21:14:49,615][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000179128_91713536.pth [2023-03-08 21:14:51,171][547799] Updated weights for policy 0, policy_version 179840 (0.0005) [2023-03-08 21:14:54,600][547459] Fps is (10 sec: 11468.9, 60 sec: 11741.8, 300 sec: 11774.3). Total num frames: 92114944. Throughput: 0: 11631.4. Samples: 92093300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:54,600][547459] Avg episode reward: [(0, '4790.757')] [2023-03-08 21:14:54,687][547799] Updated weights for policy 0, policy_version 179920 (0.0004) [2023-03-08 21:14:58,302][547799] Updated weights for policy 0, policy_version 180000 (0.0004) [2023-03-08 21:14:59,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11760.4). Total num frames: 92172288. Throughput: 0: 11631.2. Samples: 92163188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:14:59,600][547459] Avg episode reward: [(0, '4758.905')] [2023-03-08 21:15:01,785][547799] Updated weights for policy 0, policy_version 180080 (0.0004) [2023-03-08 21:15:04,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11605.4, 300 sec: 11760.4). Total num frames: 92229632. Throughput: 0: 11576.2. Samples: 92233544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:04,600][547459] Avg episode reward: [(0, '4496.734')] [2023-03-08 21:15:04,649][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000180144_92233728.pth... [2023-03-08 21:15:04,651][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000179464_91885568.pth [2023-03-08 21:15:05,245][547799] Updated weights for policy 0, policy_version 180160 (0.0005) [2023-03-08 21:15:08,422][547799] Updated weights for policy 0, policy_version 180240 (0.0005) [2023-03-08 21:15:09,600][547459] Fps is (10 sec: 12287.8, 60 sec: 11741.8, 300 sec: 11774.3). Total num frames: 92295168. Throughput: 0: 11697.3. Samples: 92272636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:09,600][547459] Avg episode reward: [(0, '4795.042')] [2023-03-08 21:15:12,084][547799] Updated weights for policy 0, policy_version 180320 (0.0004) [2023-03-08 21:15:14,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11605.4, 300 sec: 11746.5). Total num frames: 92348416. Throughput: 0: 11650.9. Samples: 92340288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:14,600][547459] Avg episode reward: [(0, '4824.763')] [2023-03-08 21:15:15,708][547799] Updated weights for policy 0, policy_version 180400 (0.0004) [2023-03-08 21:15:19,208][547799] Updated weights for policy 0, policy_version 180480 (0.0004) [2023-03-08 21:15:19,600][547459] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11760.4). Total num frames: 92409856. Throughput: 0: 11647.4. Samples: 92409720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:19,600][547459] Avg episode reward: [(0, '4773.582')] [2023-03-08 21:15:19,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000180488_92409856.pth... [2023-03-08 21:15:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000179808_92061696.pth [2023-03-08 21:15:22,767][547799] Updated weights for policy 0, policy_version 180560 (0.0004) [2023-03-08 21:15:24,600][547459] Fps is (10 sec: 11878.2, 60 sec: 11605.3, 300 sec: 11746.5). Total num frames: 92467200. Throughput: 0: 11648.4. Samples: 92444408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:24,600][547459] Avg episode reward: [(0, '4833.784')] [2023-03-08 21:15:26,293][547799] Updated weights for policy 0, policy_version 180640 (0.0005) [2023-03-08 21:15:29,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11605.4, 300 sec: 11732.6). Total num frames: 92524544. Throughput: 0: 11649.4. Samples: 92515548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:29,600][547459] Avg episode reward: [(0, '4837.172')] [2023-03-08 21:15:29,704][547799] Updated weights for policy 0, policy_version 180720 (0.0005) [2023-03-08 21:15:33,150][547799] Updated weights for policy 0, policy_version 180800 (0.0005) [2023-03-08 21:15:34,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11746.5). Total num frames: 92585984. Throughput: 0: 11665.9. Samples: 92585992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:34,600][547459] Avg episode reward: [(0, '4698.495')] [2023-03-08 21:15:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000180832_92585984.pth... [2023-03-08 21:15:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000180144_92233728.pth [2023-03-08 21:15:36,638][547799] Updated weights for policy 0, policy_version 180880 (0.0005) [2023-03-08 21:15:39,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11741.9, 300 sec: 11746.5). Total num frames: 92647424. Throughput: 0: 11762.6. Samples: 92622616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:39,600][547459] Avg episode reward: [(0, '4785.388')] [2023-03-08 21:15:39,916][547799] Updated weights for policy 0, policy_version 180960 (0.0005) [2023-03-08 21:15:43,215][547799] Updated weights for policy 0, policy_version 181040 (0.0005) [2023-03-08 21:15:44,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11746.5). Total num frames: 92704768. Throughput: 0: 11850.2. Samples: 92696448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:44,600][547459] Avg episode reward: [(0, '4735.395')] [2023-03-08 21:15:46,804][547799] Updated weights for policy 0, policy_version 181120 (0.0004) [2023-03-08 21:15:49,600][547459] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 92766208. Throughput: 0: 11871.8. Samples: 92767776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:49,600][547459] Avg episode reward: [(0, '4695.425')] [2023-03-08 21:15:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000181184_92766208.pth... [2023-03-08 21:15:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000180488_92409856.pth [2023-03-08 21:15:50,042][547799] Updated weights for policy 0, policy_version 181200 (0.0005) [2023-03-08 21:15:53,464][547799] Updated weights for policy 0, policy_version 181280 (0.0005) [2023-03-08 21:15:54,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11774.3). Total num frames: 92827648. Throughput: 0: 11795.3. Samples: 92803424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:54,600][547459] Avg episode reward: [(0, '4745.140')] [2023-03-08 21:15:56,988][547799] Updated weights for policy 0, policy_version 181360 (0.0005) [2023-03-08 21:15:59,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 92884992. Throughput: 0: 11889.6. Samples: 92875320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:15:59,600][547459] Avg episode reward: [(0, '4693.802')] [2023-03-08 21:16:00,416][547799] Updated weights for policy 0, policy_version 181440 (0.0005) [2023-03-08 21:16:03,961][547799] Updated weights for policy 0, policy_version 181520 (0.0005) [2023-03-08 21:16:04,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 92942336. Throughput: 0: 11898.1. Samples: 92945136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:04,600][547459] Avg episode reward: [(0, '4837.913')] [2023-03-08 21:16:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000181528_92942336.pth... [2023-03-08 21:16:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000180832_92585984.pth [2023-03-08 21:16:04,606][547754] Saving new best policy, reward=4837.913! [2023-03-08 21:16:07,635][547799] Updated weights for policy 0, policy_version 181600 (0.0005) [2023-03-08 21:16:09,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11760.4). Total num frames: 92999680. Throughput: 0: 11867.9. Samples: 92978464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:09,600][547459] Avg episode reward: [(0, '4605.600')] [2023-03-08 21:16:11,335][547799] Updated weights for policy 0, policy_version 181680 (0.0005) [2023-03-08 21:16:14,599][547459] Fps is (10 sec: 11059.3, 60 sec: 11741.9, 300 sec: 11760.4). Total num frames: 93052928. Throughput: 0: 11732.1. Samples: 93043492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:14,600][547459] Avg episode reward: [(0, '4792.493')] [2023-03-08 21:16:14,943][547799] Updated weights for policy 0, policy_version 181760 (0.0004) [2023-03-08 21:16:18,312][547799] Updated weights for policy 0, policy_version 181840 (0.0004) [2023-03-08 21:16:19,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11760.4). Total num frames: 93114368. Throughput: 0: 11790.7. Samples: 93116576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:19,600][547459] Avg episode reward: [(0, '4742.100')] [2023-03-08 21:16:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000181864_93114368.pth... [2023-03-08 21:16:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000181184_92766208.pth [2023-03-08 21:16:21,767][547799] Updated weights for policy 0, policy_version 181920 (0.0005) [2023-03-08 21:16:24,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11760.4). Total num frames: 93175808. Throughput: 0: 11755.0. Samples: 93151592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:24,600][547459] Avg episode reward: [(0, '4742.320')] [2023-03-08 21:16:25,214][547799] Updated weights for policy 0, policy_version 182000 (0.0005) [2023-03-08 21:16:28,761][547799] Updated weights for policy 0, policy_version 182080 (0.0005) [2023-03-08 21:16:29,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11746.5). Total num frames: 93233152. Throughput: 0: 11663.5. Samples: 93221304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:29,600][547459] Avg episode reward: [(0, '4753.093')] [2023-03-08 21:16:32,260][547799] Updated weights for policy 0, policy_version 182160 (0.0004) [2023-03-08 21:16:34,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11732.6). Total num frames: 93290496. Throughput: 0: 11659.0. Samples: 93292428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:34,600][547459] Avg episode reward: [(0, '4749.478')] [2023-03-08 21:16:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000182208_93290496.pth... [2023-03-08 21:16:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000181528_92942336.pth [2023-03-08 21:16:35,660][547799] Updated weights for policy 0, policy_version 182240 (0.0004) [2023-03-08 21:16:39,219][547799] Updated weights for policy 0, policy_version 182320 (0.0004) [2023-03-08 21:16:39,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11746.5). Total num frames: 93351936. Throughput: 0: 11695.1. Samples: 93329704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:39,600][547459] Avg episode reward: [(0, '4783.690')] [2023-03-08 21:16:42,654][547799] Updated weights for policy 0, policy_version 182400 (0.0005) [2023-03-08 21:16:44,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11718.7). Total num frames: 93405184. Throughput: 0: 11594.3. Samples: 93397064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:44,600][547459] Avg episode reward: [(0, '4832.830')] [2023-03-08 21:16:46,439][547799] Updated weights for policy 0, policy_version 182480 (0.0005) [2023-03-08 21:16:49,599][547459] Fps is (10 sec: 11059.3, 60 sec: 11605.4, 300 sec: 11718.7). Total num frames: 93462528. Throughput: 0: 11582.1. Samples: 93466332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:49,600][547459] Avg episode reward: [(0, '4710.655')] [2023-03-08 21:16:49,632][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000182552_93466624.pth... [2023-03-08 21:16:49,633][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000181864_93114368.pth [2023-03-08 21:16:49,956][547799] Updated weights for policy 0, policy_version 182560 (0.0005) [2023-03-08 21:16:53,386][547799] Updated weights for policy 0, policy_version 182640 (0.0004) [2023-03-08 21:16:54,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11732.6). Total num frames: 93523968. Throughput: 0: 11633.6. Samples: 93501976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:54,600][547459] Avg episode reward: [(0, '4794.828')] [2023-03-08 21:16:56,711][547799] Updated weights for policy 0, policy_version 182720 (0.0005) [2023-03-08 21:16:59,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11673.6, 300 sec: 11746.5). Total num frames: 93585408. Throughput: 0: 11779.9. Samples: 93573588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:16:59,600][547459] Avg episode reward: [(0, '4668.744')] [2023-03-08 21:17:00,158][547799] Updated weights for policy 0, policy_version 182800 (0.0005) [2023-03-08 21:17:03,472][547799] Updated weights for policy 0, policy_version 182880 (0.0004) [2023-03-08 21:17:04,600][547459] Fps is (10 sec: 12697.5, 60 sec: 11810.1, 300 sec: 11760.4). Total num frames: 93650944. Throughput: 0: 11872.4. Samples: 93650836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:04,600][547459] Avg episode reward: [(0, '4557.605')] [2023-03-08 21:17:04,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000182912_93650944.pth... [2023-03-08 21:17:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000182208_93290496.pth [2023-03-08 21:17:06,560][547799] Updated weights for policy 0, policy_version 182960 (0.0005) [2023-03-08 21:17:09,600][547459] Fps is (10 sec: 12697.6, 60 sec: 11878.4, 300 sec: 11774.3). Total num frames: 93712384. Throughput: 0: 11955.2. Samples: 93689576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:09,600][547459] Avg episode reward: [(0, '4808.500')] [2023-03-08 21:17:09,727][547799] Updated weights for policy 0, policy_version 183040 (0.0005) [2023-03-08 21:17:13,153][547799] Updated weights for policy 0, policy_version 183120 (0.0005) [2023-03-08 21:17:14,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11788.1). Total num frames: 93773824. Throughput: 0: 12019.2. Samples: 93762168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:14,600][547459] Avg episode reward: [(0, '4792.968')] [2023-03-08 21:17:16,613][547799] Updated weights for policy 0, policy_version 183200 (0.0004) [2023-03-08 21:17:19,600][547459] Fps is (10 sec: 12287.7, 60 sec: 12014.9, 300 sec: 11788.1). Total num frames: 93835264. Throughput: 0: 12062.5. Samples: 93835244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:19,601][547459] Avg episode reward: [(0, '4709.991')] [2023-03-08 21:17:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000183272_93835264.pth... [2023-03-08 21:17:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000182552_93466624.pth [2023-03-08 21:17:19,950][547799] Updated weights for policy 0, policy_version 183280 (0.0006) [2023-03-08 21:17:23,523][547799] Updated weights for policy 0, policy_version 183360 (0.0006) [2023-03-08 21:17:24,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11788.1). Total num frames: 93892608. Throughput: 0: 11968.4. Samples: 93868280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:24,600][547459] Avg episode reward: [(0, '4834.780')] [2023-03-08 21:17:27,113][547799] Updated weights for policy 0, policy_version 183440 (0.0005) [2023-03-08 21:17:29,600][547459] Fps is (10 sec: 11469.0, 60 sec: 11946.7, 300 sec: 11788.1). Total num frames: 93949952. Throughput: 0: 12014.9. Samples: 93937736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:29,600][547459] Avg episode reward: [(0, '4790.563')] [2023-03-08 21:17:30,494][547799] Updated weights for policy 0, policy_version 183520 (0.0005) [2023-03-08 21:17:34,178][547799] Updated weights for policy 0, policy_version 183600 (0.0005) [2023-03-08 21:17:34,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 94007296. Throughput: 0: 12022.8. Samples: 94007360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:34,600][547459] Avg episode reward: [(0, '4791.386')] [2023-03-08 21:17:34,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000183608_94007296.pth... [2023-03-08 21:17:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000182912_93650944.pth [2023-03-08 21:17:37,640][547799] Updated weights for policy 0, policy_version 183680 (0.0005) [2023-03-08 21:17:39,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11774.3). Total num frames: 94064640. Throughput: 0: 12046.5. Samples: 94044068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:39,600][547459] Avg episode reward: [(0, '4791.410')] [2023-03-08 21:17:41,052][547799] Updated weights for policy 0, policy_version 183760 (0.0004) [2023-03-08 21:17:44,388][547799] Updated weights for policy 0, policy_version 183840 (0.0005) [2023-03-08 21:17:44,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11788.1). Total num frames: 94126080. Throughput: 0: 12056.1. Samples: 94116112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:44,600][547459] Avg episode reward: [(0, '4834.729')] [2023-03-08 21:17:47,752][547799] Updated weights for policy 0, policy_version 183920 (0.0005) [2023-03-08 21:17:49,600][547459] Fps is (10 sec: 12287.8, 60 sec: 12083.2, 300 sec: 11788.1). Total num frames: 94187520. Throughput: 0: 11927.7. Samples: 94187584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:49,600][547459] Avg episode reward: [(0, '4794.380')] [2023-03-08 21:17:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000183960_94187520.pth... [2023-03-08 21:17:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000183272_93835264.pth [2023-03-08 21:17:51,187][547799] Updated weights for policy 0, policy_version 184000 (0.0005) [2023-03-08 21:17:54,360][547799] Updated weights for policy 0, policy_version 184080 (0.0005) [2023-03-08 21:17:54,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11802.0). Total num frames: 94248960. Throughput: 0: 11886.0. Samples: 94224448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:54,600][547459] Avg episode reward: [(0, '4837.041')] [2023-03-08 21:17:57,864][547799] Updated weights for policy 0, policy_version 184160 (0.0005) [2023-03-08 21:17:59,600][547459] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11802.0). Total num frames: 94310400. Throughput: 0: 11909.9. Samples: 94298112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:17:59,600][547459] Avg episode reward: [(0, '4799.081')] [2023-03-08 21:18:01,236][547799] Updated weights for policy 0, policy_version 184240 (0.0004) [2023-03-08 21:18:04,422][547799] Updated weights for policy 0, policy_version 184320 (0.0004) [2023-03-08 21:18:04,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 94371840. Throughput: 0: 11944.3. Samples: 94372736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:04,600][547459] Avg episode reward: [(0, '4755.481')] [2023-03-08 21:18:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000184320_94371840.pth... [2023-03-08 21:18:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000183608_94007296.pth [2023-03-08 21:18:07,840][547799] Updated weights for policy 0, policy_version 184400 (0.0005) [2023-03-08 21:18:09,600][547459] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 11829.8). Total num frames: 94437376. Throughput: 0: 12009.7. Samples: 94408716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:09,600][547459] Avg episode reward: [(0, '4739.442')] [2023-03-08 21:18:11,138][547799] Updated weights for policy 0, policy_version 184480 (0.0005) [2023-03-08 21:18:14,599][547459] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 94490624. Throughput: 0: 12101.6. Samples: 94482308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:14,600][547459] Avg episode reward: [(0, '4603.518')] [2023-03-08 21:18:14,688][547799] Updated weights for policy 0, policy_version 184560 (0.0005) [2023-03-08 21:18:18,003][547799] Updated weights for policy 0, policy_version 184640 (0.0005) [2023-03-08 21:18:19,600][547459] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 94552064. Throughput: 0: 12118.2. Samples: 94552680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:19,600][547459] Avg episode reward: [(0, '4819.231')] [2023-03-08 21:18:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000184672_94552064.pth... [2023-03-08 21:18:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000183960_94187520.pth [2023-03-08 21:18:21,604][547799] Updated weights for policy 0, policy_version 184720 (0.0005) [2023-03-08 21:18:24,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 94609408. Throughput: 0: 12081.7. Samples: 94587744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:24,600][547459] Avg episode reward: [(0, '4755.872')] [2023-03-08 21:18:24,996][547799] Updated weights for policy 0, policy_version 184800 (0.0005) [2023-03-08 21:18:28,439][547799] Updated weights for policy 0, policy_version 184880 (0.0004) [2023-03-08 21:18:29,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11815.9). Total num frames: 94670848. Throughput: 0: 12070.3. Samples: 94659276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:29,600][547459] Avg episode reward: [(0, '4795.086')] [2023-03-08 21:18:31,948][547799] Updated weights for policy 0, policy_version 184960 (0.0004) [2023-03-08 21:18:34,600][547459] Fps is (10 sec: 12287.8, 60 sec: 12083.2, 300 sec: 11815.9). Total num frames: 94732288. Throughput: 0: 12132.1. Samples: 94733528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:34,600][547459] Avg episode reward: [(0, '4836.085')] [2023-03-08 21:18:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000185024_94732288.pth... [2023-03-08 21:18:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000184320_94371840.pth [2023-03-08 21:18:35,013][547799] Updated weights for policy 0, policy_version 185040 (0.0005) [2023-03-08 21:18:38,542][547799] Updated weights for policy 0, policy_version 185120 (0.0005) [2023-03-08 21:18:39,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 11857.6). Total num frames: 94793728. Throughput: 0: 12119.2. Samples: 94769812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:39,600][547459] Avg episode reward: [(0, '4743.994')] [2023-03-08 21:18:42,134][547799] Updated weights for policy 0, policy_version 185200 (0.0005) [2023-03-08 21:18:44,599][547459] Fps is (10 sec: 11878.6, 60 sec: 12083.2, 300 sec: 11857.6). Total num frames: 94851072. Throughput: 0: 12034.3. Samples: 94839652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:44,600][547459] Avg episode reward: [(0, '4747.383')] [2023-03-08 21:18:45,384][547799] Updated weights for policy 0, policy_version 185280 (0.0005) [2023-03-08 21:18:49,054][547799] Updated weights for policy 0, policy_version 185360 (0.0004) [2023-03-08 21:18:49,600][547459] Fps is (10 sec: 11468.7, 60 sec: 12014.9, 300 sec: 11857.6). Total num frames: 94908416. Throughput: 0: 11925.0. Samples: 94909360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:49,600][547459] Avg episode reward: [(0, '4749.262')] [2023-03-08 21:18:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000185368_94908416.pth... [2023-03-08 21:18:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000184672_94552064.pth [2023-03-08 21:18:52,542][547799] Updated weights for policy 0, policy_version 185440 (0.0005) [2023-03-08 21:18:54,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11857.6). Total num frames: 94969856. Throughput: 0: 11923.3. Samples: 94945264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:54,600][547459] Avg episode reward: [(0, '4781.371')] [2023-03-08 21:18:55,995][547799] Updated weights for policy 0, policy_version 185520 (0.0004) [2023-03-08 21:18:59,408][547799] Updated weights for policy 0, policy_version 185600 (0.0004) [2023-03-08 21:18:59,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 95027200. Throughput: 0: 11850.4. Samples: 95015576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:18:59,600][547459] Avg episode reward: [(0, '4667.411')] [2023-03-08 21:19:02,674][547799] Updated weights for policy 0, policy_version 185680 (0.0005) [2023-03-08 21:19:04,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11857.6). Total num frames: 95088640. Throughput: 0: 11933.4. Samples: 95089684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:04,600][547459] Avg episode reward: [(0, '4589.807')] [2023-03-08 21:19:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000185720_95088640.pth... [2023-03-08 21:19:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000185024_94732288.pth [2023-03-08 21:19:06,185][547799] Updated weights for policy 0, policy_version 185760 (0.0004) [2023-03-08 21:19:09,599][547799] Updated weights for policy 0, policy_version 185840 (0.0004) [2023-03-08 21:19:09,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 95150080. Throughput: 0: 11949.5. Samples: 95125472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:09,600][547459] Avg episode reward: [(0, '4682.965')] [2023-03-08 21:19:13,332][547799] Updated weights for policy 0, policy_version 185920 (0.0004) [2023-03-08 21:19:14,599][547459] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 95203328. Throughput: 0: 11873.7. Samples: 95193592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:14,600][547459] Avg episode reward: [(0, '4574.335')] [2023-03-08 21:19:16,701][547799] Updated weights for policy 0, policy_version 186000 (0.0005) [2023-03-08 21:19:19,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 95264768. Throughput: 0: 11822.7. Samples: 95265548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:19,600][547459] Avg episode reward: [(0, '4768.182')] [2023-03-08 21:19:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000186064_95264768.pth... [2023-03-08 21:19:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000185368_94908416.pth [2023-03-08 21:19:20,126][547799] Updated weights for policy 0, policy_version 186080 (0.0005) [2023-03-08 21:19:23,474][547799] Updated weights for policy 0, policy_version 186160 (0.0004) [2023-03-08 21:19:24,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11857.6). Total num frames: 95326208. Throughput: 0: 11846.8. Samples: 95302920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:24,600][547459] Avg episode reward: [(0, '4700.124')] [2023-03-08 21:19:27,309][547799] Updated weights for policy 0, policy_version 186240 (0.0004) [2023-03-08 21:19:29,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 95379456. Throughput: 0: 11789.2. Samples: 95370168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:29,600][547459] Avg episode reward: [(0, '4727.579')] [2023-03-08 21:19:30,808][547799] Updated weights for policy 0, policy_version 186320 (0.0005) [2023-03-08 21:19:34,293][547799] Updated weights for policy 0, policy_version 186400 (0.0005) [2023-03-08 21:19:34,600][547459] Fps is (10 sec: 11059.1, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 95436800. Throughput: 0: 11801.3. Samples: 95440420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:34,600][547459] Avg episode reward: [(0, '4788.419')] [2023-03-08 21:19:34,613][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000186408_95440896.pth... [2023-03-08 21:19:34,615][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000185720_95088640.pth [2023-03-08 21:19:37,817][547799] Updated weights for policy 0, policy_version 186480 (0.0004) [2023-03-08 21:19:39,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 95498240. Throughput: 0: 11760.0. Samples: 95474464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:39,600][547459] Avg episode reward: [(0, '4793.037')] [2023-03-08 21:19:41,190][547799] Updated weights for policy 0, policy_version 186560 (0.0004) [2023-03-08 21:19:44,599][547459] Fps is (10 sec: 11878.6, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 95555584. Throughput: 0: 11818.2. Samples: 95547392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:44,600][547459] Avg episode reward: [(0, '4700.675')] [2023-03-08 21:19:44,608][547799] Updated weights for policy 0, policy_version 186640 (0.0004) [2023-03-08 21:19:47,833][547799] Updated weights for policy 0, policy_version 186720 (0.0005) [2023-03-08 21:19:49,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 95621120. Throughput: 0: 11811.3. Samples: 95621192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:49,600][547459] Avg episode reward: [(0, '4787.261')] [2023-03-08 21:19:49,604][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000186760_95621120.pth... [2023-03-08 21:19:49,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000186064_95264768.pth [2023-03-08 21:19:51,402][547799] Updated weights for policy 0, policy_version 186800 (0.0005) [2023-03-08 21:19:54,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 95678464. Throughput: 0: 11753.9. Samples: 95654396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:54,600][547459] Avg episode reward: [(0, '4740.211')] [2023-03-08 21:19:54,878][547799] Updated weights for policy 0, policy_version 186880 (0.0005) [2023-03-08 21:19:58,402][547799] Updated weights for policy 0, policy_version 186960 (0.0005) [2023-03-08 21:19:59,600][547459] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 95735808. Throughput: 0: 11808.2. Samples: 95724964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:19:59,600][547459] Avg episode reward: [(0, '4783.093')] [2023-03-08 21:20:01,867][547799] Updated weights for policy 0, policy_version 187040 (0.0005) [2023-03-08 21:20:04,600][547459] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 95797248. Throughput: 0: 11816.9. Samples: 95797312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:04,600][547459] Avg episode reward: [(0, '4782.923')] [2023-03-08 21:20:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000187104_95797248.pth... [2023-03-08 21:20:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000186408_95440896.pth [2023-03-08 21:20:05,183][547799] Updated weights for policy 0, policy_version 187120 (0.0005) [2023-03-08 21:20:08,600][547799] Updated weights for policy 0, policy_version 187200 (0.0005) [2023-03-08 21:20:09,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11885.3). Total num frames: 95854592. Throughput: 0: 11798.7. Samples: 95833860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:09,600][547459] Avg episode reward: [(0, '4782.275')] [2023-03-08 21:20:12,047][547799] Updated weights for policy 0, policy_version 187280 (0.0005) [2023-03-08 21:20:14,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 95916032. Throughput: 0: 11889.5. Samples: 95905196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:14,600][547459] Avg episode reward: [(0, '4784.690')] [2023-03-08 21:20:15,440][547799] Updated weights for policy 0, policy_version 187360 (0.0005) [2023-03-08 21:20:18,924][547799] Updated weights for policy 0, policy_version 187440 (0.0005) [2023-03-08 21:20:19,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 95973376. Throughput: 0: 11901.3. Samples: 95975976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:19,600][547459] Avg episode reward: [(0, '4833.666')] [2023-03-08 21:20:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000187448_95973376.pth... [2023-03-08 21:20:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000186760_95621120.pth [2023-03-08 21:20:22,279][547799] Updated weights for policy 0, policy_version 187520 (0.0004) [2023-03-08 21:20:24,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 96034816. Throughput: 0: 11994.0. Samples: 96014192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:24,600][547459] Avg episode reward: [(0, '4827.826')] [2023-03-08 21:20:25,683][547799] Updated weights for policy 0, policy_version 187600 (0.0004) [2023-03-08 21:20:29,018][547799] Updated weights for policy 0, policy_version 187680 (0.0005) [2023-03-08 21:20:29,599][547459] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 96096256. Throughput: 0: 11985.9. Samples: 96086756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:29,600][547459] Avg episode reward: [(0, '4831.920')] [2023-03-08 21:20:32,503][547799] Updated weights for policy 0, policy_version 187760 (0.0005) [2023-03-08 21:20:34,600][547459] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11899.2). Total num frames: 96157696. Throughput: 0: 11923.7. Samples: 96157760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:34,600][547459] Avg episode reward: [(0, '4777.820')] [2023-03-08 21:20:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000187808_96157696.pth... [2023-03-08 21:20:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000187104_95797248.pth [2023-03-08 21:20:35,764][547799] Updated weights for policy 0, policy_version 187840 (0.0005) [2023-03-08 21:20:39,017][547799] Updated weights for policy 0, policy_version 187920 (0.0005) [2023-03-08 21:20:39,600][547459] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11913.1). Total num frames: 96219136. Throughput: 0: 12059.8. Samples: 96197088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:39,600][547459] Avg episode reward: [(0, '4645.000')] [2023-03-08 21:20:42,437][547799] Updated weights for policy 0, policy_version 188000 (0.0005) [2023-03-08 21:20:44,599][547459] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11913.1). Total num frames: 96280576. Throughput: 0: 12079.2. Samples: 96268528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:44,600][547459] Avg episode reward: [(0, '4832.986')] [2023-03-08 21:20:45,999][547799] Updated weights for policy 0, policy_version 188080 (0.0005) [2023-03-08 21:20:49,269][547799] Updated weights for policy 0, policy_version 188160 (0.0005) [2023-03-08 21:20:49,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 96337920. Throughput: 0: 12097.2. Samples: 96341684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:49,600][547459] Avg episode reward: [(0, '4783.952')] [2023-03-08 21:20:49,626][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000188168_96342016.pth... [2023-03-08 21:20:49,627][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000187448_95973376.pth [2023-03-08 21:20:52,687][547799] Updated weights for policy 0, policy_version 188240 (0.0005) [2023-03-08 21:20:54,600][547459] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11913.1). Total num frames: 96399360. Throughput: 0: 12079.7. Samples: 96377448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:54,600][547459] Avg episode reward: [(0, '4833.858')] [2023-03-08 21:20:56,333][547799] Updated weights for policy 0, policy_version 188320 (0.0004) [2023-03-08 21:20:59,599][547459] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11913.1). Total num frames: 96456704. Throughput: 0: 11982.4. Samples: 96444404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:20:59,600][547459] Avg episode reward: [(0, '4551.637')] [2023-03-08 21:20:59,949][547799] Updated weights for policy 0, policy_version 188400 (0.0005) [2023-03-08 21:21:03,258][547799] Updated weights for policy 0, policy_version 188480 (0.0005) [2023-03-08 21:21:04,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 96514048. Throughput: 0: 12034.8. Samples: 96517540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:04,600][547459] Avg episode reward: [(0, '4731.688')] [2023-03-08 21:21:04,629][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000188512_96518144.pth... [2023-03-08 21:21:04,631][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000187808_96157696.pth [2023-03-08 21:21:06,751][547799] Updated weights for policy 0, policy_version 188560 (0.0005) [2023-03-08 21:21:09,599][547459] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 96575488. Throughput: 0: 11956.6. Samples: 96552240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:09,600][547459] Avg episode reward: [(0, '4648.484')] [2023-03-08 21:21:10,285][547799] Updated weights for policy 0, policy_version 188640 (0.0004) [2023-03-08 21:21:13,746][547799] Updated weights for policy 0, policy_version 188720 (0.0005) [2023-03-08 21:21:14,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 96632832. Throughput: 0: 11904.1. Samples: 96622440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:14,600][547459] Avg episode reward: [(0, '4837.521')] [2023-03-08 21:21:17,240][547799] Updated weights for policy 0, policy_version 188800 (0.0005) [2023-03-08 21:21:19,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 96690176. Throughput: 0: 11889.5. Samples: 96692788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:19,600][547459] Avg episode reward: [(0, '4787.085')] [2023-03-08 21:21:19,642][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000188856_96694272.pth... [2023-03-08 21:21:19,644][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000188168_96342016.pth [2023-03-08 21:21:20,632][547799] Updated weights for policy 0, policy_version 188880 (0.0005) [2023-03-08 21:21:24,083][547799] Updated weights for policy 0, policy_version 188960 (0.0005) [2023-03-08 21:21:24,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 96751616. Throughput: 0: 11847.3. Samples: 96730216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:24,611][547459] Avg episode reward: [(0, '4779.791')] [2023-03-08 21:21:27,552][547799] Updated weights for policy 0, policy_version 189040 (0.0005) [2023-03-08 21:21:29,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11940.9). Total num frames: 96813056. Throughput: 0: 11833.8. Samples: 96801048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:29,611][547459] Avg episode reward: [(0, '4783.806')] [2023-03-08 21:21:30,935][547799] Updated weights for policy 0, policy_version 189120 (0.0005) [2023-03-08 21:21:34,456][547799] Updated weights for policy 0, policy_version 189200 (0.0004) [2023-03-08 21:21:34,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11927.0). Total num frames: 96870400. Throughput: 0: 11758.6. Samples: 96870824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:34,611][547459] Avg episode reward: [(0, '4790.569')] [2023-03-08 21:21:34,614][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000189200_96870400.pth... [2023-03-08 21:21:34,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000188512_96518144.pth [2023-03-08 21:21:37,921][547799] Updated weights for policy 0, policy_version 189280 (0.0005) [2023-03-08 21:21:39,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 96931840. Throughput: 0: 11767.8. Samples: 96907000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:39,600][547459] Avg episode reward: [(0, '4745.506')] [2023-03-08 21:21:41,271][547799] Updated weights for policy 0, policy_version 189360 (0.0005) [2023-03-08 21:21:44,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11954.8). Total num frames: 96989184. Throughput: 0: 11897.0. Samples: 96979768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:44,600][547459] Avg episode reward: [(0, '4751.598')] [2023-03-08 21:21:44,680][547799] Updated weights for policy 0, policy_version 189440 (0.0005) [2023-03-08 21:21:47,939][547799] Updated weights for policy 0, policy_version 189520 (0.0005) [2023-03-08 21:21:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 97050624. Throughput: 0: 11912.1. Samples: 97053584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:49,600][547459] Avg episode reward: [(0, '4837.889')] [2023-03-08 21:21:49,627][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000189560_97054720.pth... [2023-03-08 21:21:49,628][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000188856_96694272.pth [2023-03-08 21:21:51,297][547799] Updated weights for policy 0, policy_version 189600 (0.0004) [2023-03-08 21:21:54,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 97112064. Throughput: 0: 11979.0. Samples: 97091296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:54,600][547459] Avg episode reward: [(0, '4700.886')] [2023-03-08 21:21:54,785][547799] Updated weights for policy 0, policy_version 189680 (0.0005) [2023-03-08 21:21:58,291][547799] Updated weights for policy 0, policy_version 189760 (0.0005) [2023-03-08 21:21:59,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11927.0). Total num frames: 97169408. Throughput: 0: 11959.9. Samples: 97160636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:21:59,600][547459] Avg episode reward: [(0, '4790.997')] [2023-03-08 21:22:01,763][547799] Updated weights for policy 0, policy_version 189840 (0.0005) [2023-03-08 21:22:04,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 97230848. Throughput: 0: 11956.9. Samples: 97230848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:04,600][547459] Avg episode reward: [(0, '4747.088')] [2023-03-08 21:22:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000189904_97230848.pth... [2023-03-08 21:22:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000189200_96870400.pth [2023-03-08 21:22:05,140][547799] Updated weights for policy 0, policy_version 189920 (0.0005) [2023-03-08 21:22:08,591][547799] Updated weights for policy 0, policy_version 190000 (0.0005) [2023-03-08 21:22:09,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 97288192. Throughput: 0: 11944.6. Samples: 97267720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:09,600][547459] Avg episode reward: [(0, '4759.319')] [2023-03-08 21:22:11,991][547799] Updated weights for policy 0, policy_version 190080 (0.0005) [2023-03-08 21:22:14,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 97349632. Throughput: 0: 11968.1. Samples: 97339612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:14,600][547459] Avg episode reward: [(0, '4660.878')] [2023-03-08 21:22:15,372][547799] Updated weights for policy 0, policy_version 190160 (0.0005) [2023-03-08 21:22:18,807][547799] Updated weights for policy 0, policy_version 190240 (0.0004) [2023-03-08 21:22:19,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 97406976. Throughput: 0: 12002.2. Samples: 97410920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:19,600][547459] Avg episode reward: [(0, '4567.705')] [2023-03-08 21:22:19,606][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000190256_97411072.pth... [2023-03-08 21:22:19,608][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000189560_97054720.pth [2023-03-08 21:22:22,484][547799] Updated weights for policy 0, policy_version 190320 (0.0005) [2023-03-08 21:22:24,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 97468416. Throughput: 0: 11931.4. Samples: 97443912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:24,600][547459] Avg episode reward: [(0, '4612.716')] [2023-03-08 21:22:26,131][547799] Updated weights for policy 0, policy_version 190400 (0.0004) [2023-03-08 21:22:29,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 97521664. Throughput: 0: 11809.0. Samples: 97511172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:29,600][547459] Avg episode reward: [(0, '4836.282')] [2023-03-08 21:22:29,893][547799] Updated weights for policy 0, policy_version 190480 (0.0005) [2023-03-08 21:22:33,410][547799] Updated weights for policy 0, policy_version 190560 (0.0005) [2023-03-08 21:22:34,600][547459] Fps is (10 sec: 11059.2, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 97579008. Throughput: 0: 11677.7. Samples: 97579080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:34,600][547459] Avg episode reward: [(0, '4703.464')] [2023-03-08 21:22:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000190584_97579008.pth... [2023-03-08 21:22:34,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000189904_97230848.pth [2023-03-08 21:22:36,956][547799] Updated weights for policy 0, policy_version 190640 (0.0005) [2023-03-08 21:22:39,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11899.2). Total num frames: 97636352. Throughput: 0: 11651.9. Samples: 97615632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:39,600][547459] Avg episode reward: [(0, '4662.969')] [2023-03-08 21:22:40,347][547799] Updated weights for policy 0, policy_version 190720 (0.0004) [2023-03-08 21:22:43,802][547799] Updated weights for policy 0, policy_version 190800 (0.0005) [2023-03-08 21:22:44,600][547459] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 97697792. Throughput: 0: 11683.2. Samples: 97686380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:44,611][547459] Avg episode reward: [(0, '4511.569')] [2023-03-08 21:22:47,215][547799] Updated weights for policy 0, policy_version 190880 (0.0004) [2023-03-08 21:22:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11885.3). Total num frames: 97755136. Throughput: 0: 11649.0. Samples: 97755052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:49,610][547459] Avg episode reward: [(0, '4528.775')] [2023-03-08 21:22:49,613][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000190928_97755136.pth... [2023-03-08 21:22:49,617][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000190256_97411072.pth [2023-03-08 21:22:51,041][547799] Updated weights for policy 0, policy_version 190960 (0.0004) [2023-03-08 21:22:54,583][547799] Updated weights for policy 0, policy_version 191040 (0.0005) [2023-03-08 21:22:54,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11871.5). Total num frames: 97812480. Throughput: 0: 11561.0. Samples: 97787968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:54,600][547459] Avg episode reward: [(0, '4572.739')] [2023-03-08 21:22:57,856][547799] Updated weights for policy 0, policy_version 191120 (0.0004) [2023-03-08 21:22:59,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11871.5). Total num frames: 97873920. Throughput: 0: 11599.5. Samples: 97861588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:22:59,600][547459] Avg episode reward: [(0, '4269.797')] [2023-03-08 21:23:01,271][547799] Updated weights for policy 0, policy_version 191200 (0.0005) [2023-03-08 21:23:04,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 97931264. Throughput: 0: 11643.5. Samples: 97934880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:23:04,600][547459] Avg episode reward: [(0, '4774.892')] [2023-03-08 21:23:04,650][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000191280_97935360.pth... [2023-03-08 21:23:04,650][547799] Updated weights for policy 0, policy_version 191280 (0.0005) [2023-03-08 21:23:04,652][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000190584_97579008.pth [2023-03-08 21:23:08,012][547799] Updated weights for policy 0, policy_version 191360 (0.0005) [2023-03-08 21:23:09,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11871.5). Total num frames: 97992704. Throughput: 0: 11727.8. Samples: 97971664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:23:09,600][547459] Avg episode reward: [(0, '4575.759')] [2023-03-08 21:23:11,359][547799] Updated weights for policy 0, policy_version 191440 (0.0005) [2023-03-08 21:23:14,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11741.8, 300 sec: 11871.5). Total num frames: 98054144. Throughput: 0: 11836.5. Samples: 98043816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:23:14,600][547459] Avg episode reward: [(0, '4699.478')] [2023-03-08 21:23:14,803][547799] Updated weights for policy 0, policy_version 191520 (0.0005) [2023-03-08 21:23:18,242][547799] Updated weights for policy 0, policy_version 191600 (0.0005) [2023-03-08 21:23:19,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 98115584. Throughput: 0: 11919.1. Samples: 98115440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:23:19,600][547459] Avg episode reward: [(0, '4638.852')] [2023-03-08 21:23:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000191632_98115584.pth... [2023-03-08 21:23:19,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000190928_97755136.pth [2023-03-08 21:23:21,591][547799] Updated weights for policy 0, policy_version 191680 (0.0005) [2023-03-08 21:23:24,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11871.5). Total num frames: 98172928. Throughput: 0: 11921.8. Samples: 98152112. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:23:24,600][547459] Avg episode reward: [(0, '4678.754')] [2023-03-08 21:23:25,226][547799] Updated weights for policy 0, policy_version 191760 (0.0005) [2023-03-08 21:23:28,822][547799] Updated weights for policy 0, policy_version 191840 (0.0005) [2023-03-08 21:23:29,600][547459] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 98230272. Throughput: 0: 11815.0. Samples: 98218056. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:23:29,600][547459] Avg episode reward: [(0, '4725.771')] [2023-03-08 21:23:32,393][547799] Updated weights for policy 0, policy_version 191920 (0.0005) [2023-03-08 21:23:34,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 98287616. Throughput: 0: 11836.2. Samples: 98287680. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:23:34,600][547459] Avg episode reward: [(0, '4624.834')] [2023-03-08 21:23:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000191968_98287616.pth... [2023-03-08 21:23:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000191280_97935360.pth [2023-03-08 21:23:35,979][547799] Updated weights for policy 0, policy_version 192000 (0.0005) [2023-03-08 21:23:39,545][547799] Updated weights for policy 0, policy_version 192080 (0.0005) [2023-03-08 21:23:39,600][547459] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 98344960. Throughput: 0: 11849.8. Samples: 98321208. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:23:39,600][547459] Avg episode reward: [(0, '4608.960')] [2023-03-08 21:23:42,963][547799] Updated weights for policy 0, policy_version 192160 (0.0005) [2023-03-08 21:23:44,600][547459] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 98402304. Throughput: 0: 11829.7. Samples: 98393924. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:23:44,600][547459] Avg episode reward: [(0, '4657.774')] [2023-03-08 21:23:46,441][547799] Updated weights for policy 0, policy_version 192240 (0.0005) [2023-03-08 21:23:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 98463744. Throughput: 0: 11752.3. Samples: 98463732. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:23:49,600][547459] Avg episode reward: [(0, '4550.244')] [2023-03-08 21:23:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000192312_98463744.pth... [2023-03-08 21:23:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000191632_98115584.pth [2023-03-08 21:23:49,939][547799] Updated weights for policy 0, policy_version 192320 (0.0004) [2023-03-08 21:23:53,097][547799] Updated weights for policy 0, policy_version 192400 (0.0005) [2023-03-08 21:23:54,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 98525184. Throughput: 0: 11756.4. Samples: 98500704. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:23:54,600][547459] Avg episode reward: [(0, '4711.753')] [2023-03-08 21:23:56,712][547799] Updated weights for policy 0, policy_version 192480 (0.0005) [2023-03-08 21:23:59,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 98582528. Throughput: 0: 11732.9. Samples: 98571796. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:23:59,600][547459] Avg episode reward: [(0, '4768.479')] [2023-03-08 21:24:00,101][547799] Updated weights for policy 0, policy_version 192560 (0.0004) [2023-03-08 21:24:03,345][547799] Updated weights for policy 0, policy_version 192640 (0.0005) [2023-03-08 21:24:04,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 98643968. Throughput: 0: 11779.2. Samples: 98645504. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:24:04,600][547459] Avg episode reward: [(0, '4704.876')] [2023-03-08 21:24:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000192664_98643968.pth... [2023-03-08 21:24:04,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000191968_98287616.pth [2023-03-08 21:24:06,715][547799] Updated weights for policy 0, policy_version 192720 (0.0005) [2023-03-08 21:24:09,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 98705408. Throughput: 0: 11767.0. Samples: 98681628. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:24:09,600][547459] Avg episode reward: [(0, '4749.229')] [2023-03-08 21:24:10,137][547799] Updated weights for policy 0, policy_version 192800 (0.0005) [2023-03-08 21:24:13,585][547799] Updated weights for policy 0, policy_version 192880 (0.0005) [2023-03-08 21:24:14,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 98762752. Throughput: 0: 11920.8. Samples: 98754492. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:24:14,600][547459] Avg episode reward: [(0, '4790.619')] [2023-03-08 21:24:17,284][547799] Updated weights for policy 0, policy_version 192960 (0.0005) [2023-03-08 21:24:19,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 98820096. Throughput: 0: 11888.1. Samples: 98822644. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:24:19,600][547459] Avg episode reward: [(0, '4589.978')] [2023-03-08 21:24:19,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000193008_98820096.pth... [2023-03-08 21:24:19,605][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000192312_98463744.pth [2023-03-08 21:24:20,751][547799] Updated weights for policy 0, policy_version 193040 (0.0005) [2023-03-08 21:24:24,002][547799] Updated weights for policy 0, policy_version 193120 (0.0005) [2023-03-08 21:24:24,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 98881536. Throughput: 0: 11969.3. Samples: 98859828. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:24:24,600][547459] Avg episode reward: [(0, '4755.331')] [2023-03-08 21:24:27,511][547799] Updated weights for policy 0, policy_version 193200 (0.0005) [2023-03-08 21:24:29,600][547459] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 98942976. Throughput: 0: 11955.0. Samples: 98931900. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:24:29,600][547459] Avg episode reward: [(0, '4752.666')] [2023-03-08 21:24:30,800][547799] Updated weights for policy 0, policy_version 193280 (0.0005) [2023-03-08 21:24:34,363][547799] Updated weights for policy 0, policy_version 193360 (0.0004) [2023-03-08 21:24:34,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 99000320. Throughput: 0: 11967.1. Samples: 99002252. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:24:34,600][547459] Avg episode reward: [(0, '4620.788')] [2023-03-08 21:24:34,602][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000193360_99000320.pth... [2023-03-08 21:24:34,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000192664_98643968.pth [2023-03-08 21:24:38,021][547799] Updated weights for policy 0, policy_version 193440 (0.0004) [2023-03-08 21:24:39,600][547459] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 99057664. Throughput: 0: 11868.6. Samples: 99034792. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:24:39,600][547459] Avg episode reward: [(0, '4685.739')] [2023-03-08 21:24:41,508][547799] Updated weights for policy 0, policy_version 193520 (0.0005) [2023-03-08 21:24:44,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 99115008. Throughput: 0: 11886.4. Samples: 99106684. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:24:44,600][547459] Avg episode reward: [(0, '4735.006')] [2023-03-08 21:24:45,036][547799] Updated weights for policy 0, policy_version 193600 (0.0005) [2023-03-08 21:24:48,541][547799] Updated weights for policy 0, policy_version 193680 (0.0004) [2023-03-08 21:24:49,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 99176448. Throughput: 0: 11798.8. Samples: 99176448. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:24:49,600][547459] Avg episode reward: [(0, '4691.161')] [2023-03-08 21:24:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000193704_99176448.pth... [2023-03-08 21:24:49,604][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000193008_98820096.pth [2023-03-08 21:24:51,968][547799] Updated weights for policy 0, policy_version 193760 (0.0004) [2023-03-08 21:24:54,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 99233792. Throughput: 0: 11790.3. Samples: 99212192. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:24:54,600][547459] Avg episode reward: [(0, '4830.509')] [2023-03-08 21:24:55,559][547799] Updated weights for policy 0, policy_version 193840 (0.0004) [2023-03-08 21:24:59,008][547799] Updated weights for policy 0, policy_version 193920 (0.0005) [2023-03-08 21:24:59,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 99291136. Throughput: 0: 11684.6. Samples: 99280296. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:24:59,600][547459] Avg episode reward: [(0, '4720.205')] [2023-03-08 21:25:02,322][547799] Updated weights for policy 0, policy_version 194000 (0.0005) [2023-03-08 21:25:04,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 99352576. Throughput: 0: 11807.6. Samples: 99353988. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:25:04,600][547459] Avg episode reward: [(0, '4734.894')] [2023-03-08 21:25:04,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000194048_99352576.pth... [2023-03-08 21:25:04,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000193360_99000320.pth [2023-03-08 21:25:05,808][547799] Updated weights for policy 0, policy_version 194080 (0.0004) [2023-03-08 21:25:09,197][547799] Updated weights for policy 0, policy_version 194160 (0.0005) [2023-03-08 21:25:09,600][547459] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 99414016. Throughput: 0: 11784.6. Samples: 99390136. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:25:09,600][547459] Avg episode reward: [(0, '4749.956')] [2023-03-08 21:25:12,666][547799] Updated weights for policy 0, policy_version 194240 (0.0005) [2023-03-08 21:25:14,599][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 99471360. Throughput: 0: 11760.5. Samples: 99461124. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:25:14,600][547459] Avg episode reward: [(0, '4703.140')] [2023-03-08 21:25:16,413][547799] Updated weights for policy 0, policy_version 194320 (0.0005) [2023-03-08 21:25:19,600][547459] Fps is (10 sec: 11468.6, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 99528704. Throughput: 0: 11698.9. Samples: 99528704. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:25:19,601][547459] Avg episode reward: [(0, '4740.521')] [2023-03-08 21:25:19,605][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000194392_99528704.pth... [2023-03-08 21:25:19,607][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000193704_99176448.pth [2023-03-08 21:25:19,922][547799] Updated weights for policy 0, policy_version 194400 (0.0004) [2023-03-08 21:25:23,158][547799] Updated weights for policy 0, policy_version 194480 (0.0004) [2023-03-08 21:25:24,599][547459] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 99586048. Throughput: 0: 11819.7. Samples: 99566680. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:25:24,600][547459] Avg episode reward: [(0, '4749.633')] [2023-03-08 21:25:26,618][547799] Updated weights for policy 0, policy_version 194560 (0.0005) [2023-03-08 21:25:29,600][547459] Fps is (10 sec: 11878.7, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 99647488. Throughput: 0: 11774.6. Samples: 99636540. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:25:29,600][547459] Avg episode reward: [(0, '4791.600')] [2023-03-08 21:25:30,146][547799] Updated weights for policy 0, policy_version 194640 (0.0004) [2023-03-08 21:25:33,746][547799] Updated weights for policy 0, policy_version 194720 (0.0005) [2023-03-08 21:25:34,600][547459] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 99704832. Throughput: 0: 11764.4. Samples: 99705848. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:25:34,600][547459] Avg episode reward: [(0, '4743.126')] [2023-03-08 21:25:34,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000194736_99704832.pth... [2023-03-08 21:25:34,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000194048_99352576.pth [2023-03-08 21:25:37,220][547799] Updated weights for policy 0, policy_version 194800 (0.0005) [2023-03-08 21:25:39,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 99766272. Throughput: 0: 11766.9. Samples: 99741704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:25:39,600][547459] Avg episode reward: [(0, '4750.490')] [2023-03-08 21:25:40,703][547799] Updated weights for policy 0, policy_version 194880 (0.0005) [2023-03-08 21:25:44,134][547799] Updated weights for policy 0, policy_version 194960 (0.0004) [2023-03-08 21:25:44,599][547459] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 99823616. Throughput: 0: 11829.7. Samples: 99812632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:25:44,600][547459] Avg episode reward: [(0, '4575.340')] [2023-03-08 21:25:47,492][547799] Updated weights for policy 0, policy_version 195040 (0.0005) [2023-03-08 21:25:49,600][547459] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 99885056. Throughput: 0: 11801.0. Samples: 99885032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:25:49,600][547459] Avg episode reward: [(0, '4737.514')] [2023-03-08 21:25:49,603][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000195088_99885056.pth... [2023-03-08 21:25:49,606][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000194392_99528704.pth [2023-03-08 21:25:50,954][547799] Updated weights for policy 0, policy_version 195120 (0.0005) [2023-03-08 21:25:54,137][547799] Updated weights for policy 0, policy_version 195200 (0.0004) [2023-03-08 21:25:54,599][547459] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 99946496. Throughput: 0: 11818.9. Samples: 99921984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:25:54,600][547459] Avg episode reward: [(0, '4835.496')] [2023-03-08 21:25:57,570][547799] Updated weights for policy 0, policy_version 195280 (0.0004) [2023-03-08 21:25:59,277][547754] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-08 21:25:59,599][547459] Fps is (10 sec: 11878.6, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 100003840. Throughput: 0: 11873.9. Samples: 99995448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:25:59,600][547459] Avg episode reward: [(0, '4729.340')] [2023-03-08 21:25:59,627][547754] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-08 21:25:59,628][547754] Stopping Batcher_0... [2023-03-08 21:25:59,628][547834] Stopping RolloutWorker_w3... [2023-03-08 21:25:59,628][547801] Stopping RolloutWorker_w1... [2023-03-08 21:25:59,628][547899] Stopping RolloutWorker_w7... [2023-03-08 21:25:59,628][547931] Stopping RolloutWorker_w5... [2023-03-08 21:25:59,628][547835] Stopping RolloutWorker_w4... [2023-03-08 21:25:59,629][547834] Loop rollout_proc3_evt_loop terminating... [2023-03-08 21:25:59,629][547801] Loop rollout_proc1_evt_loop terminating... [2023-03-08 21:25:59,629][547899] Loop rollout_proc7_evt_loop terminating... [2023-03-08 21:25:59,629][547931] Loop rollout_proc5_evt_loop terminating... [2023-03-08 21:25:59,629][547835] Loop rollout_proc4_evt_loop terminating... [2023-03-08 21:25:59,629][547800] Stopping RolloutWorker_w0... [2023-03-08 21:25:59,629][547802] Stopping RolloutWorker_w2... [2023-03-08 21:25:59,629][547754] Loop batcher_evt_loop terminating... [2023-03-08 21:25:59,629][547800] Loop rollout_proc0_evt_loop terminating... [2023-03-08 21:25:59,629][547867] Stopping RolloutWorker_w6... [2023-03-08 21:25:59,628][547459] Component Batcher_0 stopped! [2023-03-08 21:25:59,629][547802] Loop rollout_proc2_evt_loop terminating... [2023-03-08 21:25:59,629][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-08 21:25:59,629][547867] Loop rollout_proc6_evt_loop terminating... [2023-03-08 21:25:59,629][547459] Component RolloutWorker_w3 stopped! [2023-03-08 21:25:59,630][547459] Component RolloutWorker_w1 stopped! [2023-03-08 21:25:59,630][547459] Component RolloutWorker_w7 stopped! [2023-03-08 21:25:59,630][547754] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000194736_99704832.pth [2023-03-08 21:25:59,630][547459] Component RolloutWorker_w4 stopped! [2023-03-08 21:25:59,631][547754] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-side-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-08 21:25:59,631][547459] Component RolloutWorker_w5 stopped! [2023-03-08 21:25:59,631][547459] Component RolloutWorker_w0 stopped! [2023-03-08 21:25:59,632][547459] Component RolloutWorker_w2 stopped! [2023-03-08 21:25:59,632][547459] Component RolloutWorker_w6 stopped! [2023-03-08 21:25:59,632][547754] Stopping LearnerWorker_p0... [2023-03-08 21:25:59,633][547754] Loop learner_proc0_evt_loop terminating... [2023-03-08 21:25:59,633][547459] Component LearnerWorker_p0 stopped! [2023-03-08 21:25:59,651][547799] Weights refcount: 2 0 [2023-03-08 21:25:59,652][547799] Stopping InferenceWorker_p0-w0... [2023-03-08 21:25:59,653][547799] Loop inference_proc0-0_evt_loop terminating... [2023-03-08 21:25:59,653][547459] Component InferenceWorker_p0-w0 stopped! [2023-03-08 21:25:59,654][547459] Waiting for process learner_proc0 to stop... [2023-03-08 21:26:00,105][547459] Waiting for process inference_proc0-0 to join... [2023-03-08 21:26:00,121][547459] Waiting for process rollout_proc0 to join... [2023-03-08 21:26:00,121][547459] Waiting for process rollout_proc1 to join... [2023-03-08 21:26:00,121][547459] Waiting for process rollout_proc2 to join... [2023-03-08 21:26:00,158][547459] Waiting for process rollout_proc3 to join... [2023-03-08 21:26:00,159][547459] Waiting for process rollout_proc4 to join... [2023-03-08 21:26:00,166][547459] Waiting for process rollout_proc5 to join... [2023-03-08 21:26:00,167][547459] Waiting for process rollout_proc6 to join... [2023-03-08 21:26:00,167][547459] Waiting for process rollout_proc7 to join... [2023-03-08 21:26:00,167][547459] Batcher 0 profile tree view: batching: 19.3837, releasing_batches: 15.5674 [2023-03-08 21:26:00,167][547459] InferenceWorker_p0-w0 profile tree view: wait_policy: 0.0001 wait_policy_total: 2963.6865 update_model: 108.7637 weight_update: 0.0005 one_step: 0.0010 handle_policy_step: 5467.2052 deserialize: 229.9944, stack: 56.1650, obs_to_device_normalize: 966.2667, forward: 2722.2210, send_messages: 448.7057 prepare_outputs: 583.0406 to_cpu: 91.3786 [2023-03-08 21:26:00,167][547459] Learner 0 profile tree view: misc: 0.1333, prepare_batch: 81.5062 train: 1150.1176 epoch_init: 0.3763, minibatch_init: 12.5460, losses_postprocess: 14.6110, kl_divergence: 4.3955, after_optimizer: 6.6743 calculate_losses: 464.2102 losses_init: 0.3238, forward_head: 220.4556, bptt_initial: 1.4380, bptt: 1.5509, tail: 117.8736, advantages_returns: 9.6018, losses: 98.4397 update: 630.8422 clip: 59.7350 [2023-03-08 21:26:00,168][547459] RolloutWorker_w0 profile tree view: wait_for_trajectories: 4.6808, enqueue_policy_requests: 191.8114, env_step: 4967.9091, overhead: 382.8192, complete_rollouts: 4.5588 save_policy_outputs: 436.4269 split_output_tensors: 212.6095 [2023-03-08 21:26:00,168][547459] RolloutWorker_w7 profile tree view: wait_for_trajectories: 4.5678, enqueue_policy_requests: 193.2829, env_step: 4955.6420, overhead: 380.5268, complete_rollouts: 4.6447 save_policy_outputs: 435.9587 split_output_tensors: 213.6757 [2023-03-08 21:26:00,168][547459] Loop Runner_EvtLoop terminating... [2023-03-08 21:26:00,168][547459] Runner profile tree view: main_loop: 9202.8671 [2023-03-08 21:26:00,168][547459] Collected {0: 100007936}, FPS: 10867.0