[2023-03-08 21:26:50,094][569624] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/config.json...
[2023-03-08 21:26:50,109][569624] Rollout worker 0 uses device cpu
[2023-03-08 21:26:50,110][569624] Rollout worker 1 uses device cpu
[2023-03-08 21:26:50,110][569624] Rollout worker 2 uses device cpu
[2023-03-08 21:26:50,110][569624] Rollout worker 3 uses device cpu
[2023-03-08 21:26:50,110][569624] Rollout worker 4 uses device cpu
[2023-03-08 21:26:50,110][569624] Rollout worker 5 uses device cpu
[2023-03-08 21:26:50,111][569624] Rollout worker 6 uses device cpu
[2023-03-08 21:26:50,111][569624] Rollout worker 7 uses device cpu
[2023-03-08 21:26:50,111][569624] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1
[2023-03-08 21:26:50,123][569624] InferenceWorker_p0-w0: min num requests: 2
[2023-03-08 21:26:50,142][569624] Starting all processes...
[2023-03-08 21:26:50,142][569624] Starting process learner_proc0
[2023-03-08 21:26:50,192][569624] Starting all processes...
[2023-03-08 21:26:50,228][569624] Starting process inference_proc0-0
[2023-03-08 21:26:50,239][569624] Starting process rollout_proc0
[2023-03-08 21:26:50,239][569624] Starting process rollout_proc1
[2023-03-08 21:26:50,240][569624] Starting process rollout_proc2
[2023-03-08 21:26:50,240][569624] Starting process rollout_proc3
[2023-03-08 21:26:50,240][569624] Starting process rollout_proc4
[2023-03-08 21:26:50,240][569624] Starting process rollout_proc5
[2023-03-08 21:26:50,240][569624] Starting process rollout_proc6
[2023-03-08 21:26:50,240][569624] Starting process rollout_proc7
[2023-03-08 21:26:51,748][569867] Starting seed is not provided
[2023-03-08 21:26:51,749][569867] Initializing actor-critic model on device cpu
[2023-03-08 21:26:51,749][569867] RunningMeanStd input shape: (39,)
[2023-03-08 21:26:51,749][569867] RunningMeanStd input shape: (1,)
[2023-03-08 21:26:51,819][569867] Created Actor Critic model with architecture:
[2023-03-08 21:26:51,819][569867] ActorCriticSharedWeights(
  (obs_normalizer): ObservationNormalizer(
    (running_mean_std): RunningMeanStdDictInPlace(
      (running_mean_std): ModuleDict(
        (obs): RunningMeanStdInPlace()
      )
    )
  )
  (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace)
  (encoder): MultiInputEncoder(
    (encoders): ModuleDict(
      (obs): MlpEncoder(
        (mlp_head): RecursiveScriptModule(
          original_name=Sequential
          (0): RecursiveScriptModule(original_name=Linear)
          (1): RecursiveScriptModule(original_name=Tanh)
          (2): RecursiveScriptModule(original_name=Linear)
          (3): RecursiveScriptModule(original_name=Tanh)
        )
      )
    )
  )
  (core): ModelCoreIdentity()
  (decoder): MlpDecoder(
    (mlp): Identity()
  )
  (critic_linear): Linear(in_features=64, out_features=1, bias=True)
  (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev(
    (distribution_linear): Linear(in_features=64, out_features=4, bias=True)
  )
)
[2023-03-08 21:26:51,835][569912] Worker 1 uses CPU cores [4, 5, 6, 7]
[2023-03-08 21:26:51,953][569914] Worker 2 uses CPU cores [8, 9, 10, 11]
[2023-03-08 21:26:51,961][569913] Worker 0 uses CPU cores [0, 1, 2, 3]
[2023-03-08 21:26:52,145][569867] Using optimizer
[2023-03-08 21:26:52,145][569867] No checkpoints found
[2023-03-08 21:26:52,146][569867] Did not load from checkpoint, starting from scratch!
[2023-03-08 21:26:52,146][569867] Initialized policy 0 weights for model version 0
[2023-03-08 21:26:52,146][569867] LearnerWorker_p0 finished initialization!
[2023-03-08 21:26:52,183][570012] Worker 7 uses CPU cores [28, 29, 30, 31]
[2023-03-08 21:26:52,311][569916] Worker 4 uses CPU cores [16, 17, 18, 19]
[2023-03-08 21:26:52,321][569949] Worker 5 uses CPU cores [20, 21, 22, 23]
[2023-03-08 21:26:52,353][569915] Worker 3 uses CPU cores [12, 13, 14, 15]
[2023-03-08 21:26:52,424][569911] RunningMeanStd input shape: (39,)
[2023-03-08 21:26:52,425][569911] RunningMeanStd input shape: (1,)
[2023-03-08 21:26:52,463][569928] Worker 6 uses CPU cores [24, 25, 26, 27]
[2023-03-08 21:26:52,491][569624] Inference worker 0-0 is ready!
[2023-03-08 21:26:52,492][569624] All inference workers are ready! Signal rollout workers to start!
[2023-03-08 21:26:52,789][569624] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2023-03-08 21:26:56,511][569916] Decorrelating experience for 0 frames...
[2023-03-08 21:26:56,516][570012] Decorrelating experience for 0 frames...
[2023-03-08 21:26:56,525][569916] Decorrelating experience for 64 frames...
[2023-03-08 21:26:56,529][570012] Decorrelating experience for 64 frames...
[2023-03-08 21:26:56,531][569913] Decorrelating experience for 0 frames...
[2023-03-08 21:26:56,542][569915] Decorrelating experience for 0 frames...
[2023-03-08 21:26:56,544][569913] Decorrelating experience for 64 frames...
[2023-03-08 21:26:56,550][569914] Decorrelating experience for 0 frames...
[2023-03-08 21:26:56,556][569915] Decorrelating experience for 64 frames...
[2023-03-08 21:26:56,559][569916] Decorrelating experience for 128 frames... [2023-03-08 21:26:56,564][570012] Decorrelating experience for 128 frames... [2023-03-08 21:26:56,564][569914] Decorrelating experience for 64 frames... [2023-03-08 21:26:56,576][569928] Decorrelating experience for 0 frames... [2023-03-08 21:26:56,578][569913] Decorrelating experience for 128 frames... [2023-03-08 21:26:56,590][569928] Decorrelating experience for 64 frames... [2023-03-08 21:26:56,591][569915] Decorrelating experience for 128 frames... [2023-03-08 21:26:56,592][569949] Decorrelating experience for 0 frames... [2023-03-08 21:26:56,598][569914] Decorrelating experience for 128 frames... [2023-03-08 21:26:56,605][569949] Decorrelating experience for 64 frames... [2023-03-08 21:26:56,614][569916] Decorrelating experience for 192 frames... [2023-03-08 21:26:56,617][570012] Decorrelating experience for 192 frames... [2023-03-08 21:26:56,624][569928] Decorrelating experience for 128 frames... [2023-03-08 21:26:56,633][569913] Decorrelating experience for 192 frames... [2023-03-08 21:26:56,640][569949] Decorrelating experience for 128 frames... [2023-03-08 21:26:56,645][569915] Decorrelating experience for 192 frames... [2023-03-08 21:26:56,654][569914] Decorrelating experience for 192 frames... [2023-03-08 21:26:56,679][569928] Decorrelating experience for 192 frames... [2023-03-08 21:26:56,696][569949] Decorrelating experience for 192 frames... [2023-03-08 21:26:56,869][569912] Decorrelating experience for 0 frames... [2023-03-08 21:26:56,882][569912] Decorrelating experience for 64 frames... [2023-03-08 21:26:56,917][569912] Decorrelating experience for 128 frames... [2023-03-08 21:26:56,973][569912] Decorrelating experience for 192 frames... [2023-03-08 21:26:57,789][569624] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. 
Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-08 21:27:00,625][570012] Decorrelating experience for 256 frames... [2023-03-08 21:27:00,637][569915] Decorrelating experience for 256 frames... [2023-03-08 21:27:00,637][569916] Decorrelating experience for 256 frames... [2023-03-08 21:27:00,641][569913] Decorrelating experience for 256 frames... [2023-03-08 21:27:00,655][569914] Decorrelating experience for 256 frames... [2023-03-08 21:27:00,689][569928] Decorrelating experience for 256 frames... [2023-03-08 21:27:00,719][570012] Decorrelating experience for 320 frames... [2023-03-08 21:27:00,730][569915] Decorrelating experience for 320 frames... [2023-03-08 21:27:00,737][569913] Decorrelating experience for 320 frames... [2023-03-08 21:27:00,738][569916] Decorrelating experience for 320 frames... [2023-03-08 21:27:00,750][569914] Decorrelating experience for 320 frames... [2023-03-08 21:27:00,783][569928] Decorrelating experience for 320 frames... [2023-03-08 21:27:00,836][570012] Decorrelating experience for 384 frames... [2023-03-08 21:27:00,855][569916] Decorrelating experience for 384 frames... [2023-03-08 21:27:00,864][569913] Decorrelating experience for 384 frames... [2023-03-08 21:27:00,865][569914] Decorrelating experience for 384 frames... [2023-03-08 21:27:00,873][569915] Decorrelating experience for 384 frames... [2023-03-08 21:27:00,894][569928] Decorrelating experience for 384 frames... [2023-03-08 21:27:00,931][569949] Decorrelating experience for 256 frames... [2023-03-08 21:27:00,967][570012] Decorrelating experience for 448 frames... [2023-03-08 21:27:00,987][569916] Decorrelating experience for 448 frames... [2023-03-08 21:27:00,995][569914] Decorrelating experience for 448 frames... [2023-03-08 21:27:00,997][569913] Decorrelating experience for 448 frames... [2023-03-08 21:27:01,006][569915] Decorrelating experience for 448 frames... [2023-03-08 21:27:01,025][569912] Decorrelating experience for 256 frames... 
[2023-03-08 21:27:01,026][569928] Decorrelating experience for 448 frames... [2023-03-08 21:27:01,087][569949] Decorrelating experience for 320 frames... [2023-03-08 21:27:01,119][569912] Decorrelating experience for 320 frames... [2023-03-08 21:27:01,236][569912] Decorrelating experience for 384 frames... [2023-03-08 21:27:01,267][569949] Decorrelating experience for 384 frames... [2023-03-08 21:27:01,368][569912] Decorrelating experience for 448 frames... [2023-03-08 21:27:01,424][569949] Decorrelating experience for 448 frames... [2023-03-08 21:27:02,789][569624] Fps is (10 sec: 819.2, 60 sec: 819.2, 300 sec: 819.2). Total num frames: 8192. Throughput: 0: 53.2. Samples: 532. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:27:02,789][569624] Avg episode reward: [(0, '104.454')] [2023-03-08 21:27:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000000016_8192.pth... [2023-03-08 21:27:05,408][569911] Updated weights for policy 0, policy_version 80 (0.0005) [2023-03-08 21:27:07,789][569624] Fps is (10 sec: 6553.7, 60 sec: 4369.1, 300 sec: 4369.1). Total num frames: 65536. Throughput: 0: 4309.1. Samples: 64636. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:27:07,789][569624] Avg episode reward: [(0, '910.722')] [2023-03-08 21:27:09,069][569911] Updated weights for policy 0, policy_version 160 (0.0005) [2023-03-08 21:27:10,118][569624] Heartbeat connected on Batcher_0 [2023-03-08 21:27:10,120][569624] Heartbeat connected on LearnerWorker_p0 [2023-03-08 21:27:10,126][569624] Heartbeat connected on RolloutWorker_w0 [2023-03-08 21:27:10,128][569624] Heartbeat connected on RolloutWorker_w1 [2023-03-08 21:27:10,131][569624] Heartbeat connected on RolloutWorker_w2 [2023-03-08 21:27:10,133][569624] Heartbeat connected on RolloutWorker_w3 [2023-03-08 21:27:10,135][569624] Heartbeat connected on RolloutWorker_w4 [2023-03-08 21:27:10,137][569624] Heartbeat connected on RolloutWorker_w5 [2023-03-08 21:27:10,139][569624] Heartbeat connected on RolloutWorker_w6 [2023-03-08 21:27:10,142][569624] Heartbeat connected on RolloutWorker_w7 [2023-03-08 21:27:10,174][569624] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-08 21:27:12,705][569911] Updated weights for policy 0, policy_version 240 (0.0005) [2023-03-08 21:27:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 6144.0, 300 sec: 6144.0). Total num frames: 122880. Throughput: 0: 4918.4. Samples: 98368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:27:12,789][569624] Avg episode reward: [(0, '2817.541')] [2023-03-08 21:27:12,790][569867] Saving new best policy, reward=2772.887! [2023-03-08 21:27:16,284][569911] Updated weights for policy 0, policy_version 320 (0.0005) [2023-03-08 21:27:17,789][569624] Fps is (10 sec: 11059.2, 60 sec: 7045.1, 300 sec: 7045.1). Total num frames: 176128. Throughput: 0: 6692.9. Samples: 167324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:27:17,789][569624] Avg episode reward: [(0, '3228.183')] [2023-03-08 21:27:17,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000000352_180224.pth... 
[2023-03-08 21:27:17,806][569867] Saving new best policy, reward=3228.183! [2023-03-08 21:27:20,085][569911] Updated weights for policy 0, policy_version 400 (0.0004) [2023-03-08 21:27:22,789][569624] Fps is (10 sec: 11059.1, 60 sec: 7782.4, 300 sec: 7782.4). Total num frames: 233472. Throughput: 0: 7782.6. Samples: 233480. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:27:22,789][569624] Avg episode reward: [(0, '1839.200')] [2023-03-08 21:27:23,584][569911] Updated weights for policy 0, policy_version 480 (0.0004) [2023-03-08 21:27:27,015][569911] Updated weights for policy 0, policy_version 560 (0.0005) [2023-03-08 21:27:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 8426.1, 300 sec: 8426.1). Total num frames: 294912. Throughput: 0: 7723.9. Samples: 270336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:27:27,789][569624] Avg episode reward: [(0, '2316.394')] [2023-03-08 21:27:30,513][569911] Updated weights for policy 0, policy_version 640 (0.0005) [2023-03-08 21:27:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 8806.4, 300 sec: 8806.4). Total num frames: 352256. Throughput: 0: 8501.0. Samples: 340040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:27:32,789][569624] Avg episode reward: [(0, '2725.378')] [2023-03-08 21:27:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000000688_352256.pth... [2023-03-08 21:27:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000000016_8192.pth [2023-03-08 21:27:34,033][569911] Updated weights for policy 0, policy_version 720 (0.0006) [2023-03-08 21:27:37,737][569911] Updated weights for policy 0, policy_version 800 (0.0006) [2023-03-08 21:27:37,789][569624] Fps is (10 sec: 11468.6, 60 sec: 9102.2, 300 sec: 9102.2). Total num frames: 409600. Throughput: 0: 9059.4. Samples: 407676. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:27:37,789][569624] Avg episode reward: [(0, '3065.903')] [2023-03-08 21:27:41,484][569911] Updated weights for policy 0, policy_version 880 (0.0006) [2023-03-08 21:27:42,789][569624] Fps is (10 sec: 11059.3, 60 sec: 9257.0, 300 sec: 9257.0). Total num frames: 462848. Throughput: 0: 9799.1. Samples: 440960. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:27:42,789][569624] Avg episode reward: [(0, '2168.589')] [2023-03-08 21:27:44,943][569911] Updated weights for policy 0, policy_version 960 (0.0005) [2023-03-08 21:27:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 9532.5, 300 sec: 9532.5). Total num frames: 524288. Throughput: 0: 11316.3. Samples: 509768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:27:47,789][569624] Avg episode reward: [(0, '2273.715')] [2023-03-08 21:27:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000001024_524288.pth... [2023-03-08 21:27:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000000352_180224.pth [2023-03-08 21:27:48,508][569911] Updated weights for policy 0, policy_version 1040 (0.0005) [2023-03-08 21:27:51,989][569911] Updated weights for policy 0, policy_version 1120 (0.0005) [2023-03-08 21:27:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 9693.8, 300 sec: 9693.8). Total num frames: 581632. Throughput: 0: 11481.5. Samples: 581304. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:27:52,789][569624] Avg episode reward: [(0, '2041.416')] [2023-03-08 21:27:55,493][569911] Updated weights for policy 0, policy_version 1200 (0.0005) [2023-03-08 21:27:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 10649.6, 300 sec: 9830.4). Total num frames: 638976. Throughput: 0: 11469.0. Samples: 614472. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:27:57,789][569624] Avg episode reward: [(0, '1634.106')] [2023-03-08 21:27:58,996][569911] Updated weights for policy 0, policy_version 1280 (0.0006) [2023-03-08 21:28:02,613][569911] Updated weights for policy 0, policy_version 1360 (0.0005) [2023-03-08 21:28:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 9947.4). Total num frames: 696320. Throughput: 0: 11483.8. Samples: 684096. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:28:02,789][569624] Avg episode reward: [(0, '2680.517')] [2023-03-08 21:28:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000001360_696320.pth... [2023-03-08 21:28:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000000688_352256.pth [2023-03-08 21:28:06,397][569911] Updated weights for policy 0, policy_version 1440 (0.0006) [2023-03-08 21:28:07,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 9994.2). Total num frames: 749568. Throughput: 0: 11468.6. Samples: 749568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:28:07,789][569624] Avg episode reward: [(0, '1643.529')] [2023-03-08 21:28:10,042][569911] Updated weights for policy 0, policy_version 1520 (0.0006) [2023-03-08 21:28:12,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 10086.4). Total num frames: 806912. Throughput: 0: 11391.9. Samples: 782972. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:28:12,789][569624] Avg episode reward: [(0, '1262.380')] [2023-03-08 21:28:13,608][569911] Updated weights for policy 0, policy_version 1600 (0.0005) [2023-03-08 21:28:17,256][569911] Updated weights for policy 0, policy_version 1680 (0.0005) [2023-03-08 21:28:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 10167.7). Total num frames: 864256. Throughput: 0: 11376.4. Samples: 851980. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:28:17,789][569624] Avg episode reward: [(0, '1392.525')] [2023-03-08 21:28:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000001688_864256.pth... [2023-03-08 21:28:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000001024_524288.pth [2023-03-08 21:28:20,827][569911] Updated weights for policy 0, policy_version 1760 (0.0005) [2023-03-08 21:28:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 10240.0). Total num frames: 921600. Throughput: 0: 11410.6. Samples: 921152. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:28:22,789][569624] Avg episode reward: [(0, '1840.188')] [2023-03-08 21:28:24,441][569911] Updated weights for policy 0, policy_version 1840 (0.0005) [2023-03-08 21:28:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 10304.7). Total num frames: 978944. Throughput: 0: 11409.3. Samples: 954380. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:28:27,789][569624] Avg episode reward: [(0, '1896.739')] [2023-03-08 21:28:27,977][569911] Updated weights for policy 0, policy_version 1920 (0.0005) [2023-03-08 21:28:31,501][569911] Updated weights for policy 0, policy_version 2000 (0.0005) [2023-03-08 21:28:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 10362.9). Total num frames: 1036288. Throughput: 0: 11429.0. Samples: 1024072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:28:32,789][569624] Avg episode reward: [(0, '1687.305')] [2023-03-08 21:28:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000002024_1036288.pth... 
[2023-03-08 21:28:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000001360_696320.pth [2023-03-08 21:28:35,024][569911] Updated weights for policy 0, policy_version 2080 (0.0005) [2023-03-08 21:28:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 10454.5). Total num frames: 1097728. Throughput: 0: 11452.8. Samples: 1096680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:28:37,789][569624] Avg episode reward: [(0, '2278.299')] [2023-03-08 21:28:38,348][569911] Updated weights for policy 0, policy_version 2160 (0.0005) [2023-03-08 21:28:41,847][569911] Updated weights for policy 0, policy_version 2240 (0.0005) [2023-03-08 21:28:42,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11537.1, 300 sec: 10500.7). Total num frames: 1155072. Throughput: 0: 11500.0. Samples: 1131972. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:28:42,789][569624] Avg episode reward: [(0, '1988.625')] [2023-03-08 21:28:45,162][569911] Updated weights for policy 0, policy_version 2320 (0.0005) [2023-03-08 21:28:47,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11537.1, 300 sec: 10578.4). Total num frames: 1216512. Throughput: 0: 11559.9. Samples: 1204288. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:28:47,789][569624] Avg episode reward: [(0, '2205.152')] [2023-03-08 21:28:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000002376_1216512.pth... [2023-03-08 21:28:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000001688_864256.pth [2023-03-08 21:28:48,607][569911] Updated weights for policy 0, policy_version 2400 (0.0005) [2023-03-08 21:28:52,052][569911] Updated weights for policy 0, policy_version 2480 (0.0005) [2023-03-08 21:28:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11605.3, 300 sec: 10649.6). Total num frames: 1277952. 
Throughput: 0: 11682.5. Samples: 1275280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:28:52,789][569624] Avg episode reward: [(0, '2369.938')] [2023-03-08 21:28:55,256][569911] Updated weights for policy 0, policy_version 2560 (0.0005) [2023-03-08 21:28:57,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11673.6, 300 sec: 10715.1). Total num frames: 1339392. Throughput: 0: 11809.0. Samples: 1314376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:28:57,789][569624] Avg episode reward: [(0, '3042.053')] [2023-03-08 21:28:58,601][569911] Updated weights for policy 0, policy_version 2640 (0.0005) [2023-03-08 21:29:01,926][569911] Updated weights for policy 0, policy_version 2720 (0.0005) [2023-03-08 21:29:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11741.9, 300 sec: 10775.6). Total num frames: 1400832. Throughput: 0: 11923.9. Samples: 1388556. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:29:02,789][569624] Avg episode reward: [(0, '3331.665')] [2023-03-08 21:29:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000002736_1400832.pth... [2023-03-08 21:29:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000002024_1036288.pth [2023-03-08 21:29:02,795][569867] Saving new best policy, reward=3331.665! [2023-03-08 21:29:05,273][569911] Updated weights for policy 0, policy_version 2800 (0.0004) [2023-03-08 21:29:07,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 10831.6). Total num frames: 1462272. Throughput: 0: 12018.9. Samples: 1462000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:29:07,800][569624] Avg episode reward: [(0, '3635.050')] [2023-03-08 21:29:07,801][569867] Saving new best policy, reward=3635.050! 
[2023-03-08 21:29:08,634][569911] Updated weights for policy 0, policy_version 2880 (0.0005) [2023-03-08 21:29:11,861][569911] Updated weights for policy 0, policy_version 2960 (0.0005) [2023-03-08 21:29:12,789][569624] Fps is (10 sec: 12288.2, 60 sec: 11946.7, 300 sec: 10883.7). Total num frames: 1523712. Throughput: 0: 12090.4. Samples: 1498444. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:29:12,789][569624] Avg episode reward: [(0, '2886.365')] [2023-03-08 21:29:15,132][569911] Updated weights for policy 0, policy_version 3040 (0.0005) [2023-03-08 21:29:17,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 10960.3). Total num frames: 1589248. Throughput: 0: 12235.5. Samples: 1574668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:29:17,800][569624] Avg episode reward: [(0, '3720.842')] [2023-03-08 21:29:17,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000003104_1589248.pth... [2023-03-08 21:29:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000002376_1216512.pth [2023-03-08 21:29:17,806][569867] Saving new best policy, reward=3720.842! [2023-03-08 21:29:18,489][569911] Updated weights for policy 0, policy_version 3120 (0.0004) [2023-03-08 21:29:21,992][569911] Updated weights for policy 0, policy_version 3200 (0.0005) [2023-03-08 21:29:22,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 10977.3). Total num frames: 1646592. Throughput: 0: 12192.6. Samples: 1645348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:29:22,800][569624] Avg episode reward: [(0, '3100.080')] [2023-03-08 21:29:25,348][569911] Updated weights for policy 0, policy_version 3280 (0.0005) [2023-03-08 21:29:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11019.6). Total num frames: 1708032. Throughput: 0: 12216.8. Samples: 1681728. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:29:27,789][569624] Avg episode reward: [(0, '3807.421')] [2023-03-08 21:29:27,790][569867] Saving new best policy, reward=3807.421! [2023-03-08 21:29:28,640][569911] Updated weights for policy 0, policy_version 3360 (0.0005) [2023-03-08 21:29:32,039][569911] Updated weights for policy 0, policy_version 3440 (0.0005) [2023-03-08 21:29:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 11059.2). Total num frames: 1769472. Throughput: 0: 12260.0. Samples: 1755988. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:29:32,789][569624] Avg episode reward: [(0, '4064.464')] [2023-03-08 21:29:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000003456_1769472.pth... [2023-03-08 21:29:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000002736_1400832.pth [2023-03-08 21:29:32,796][569867] Saving new best policy, reward=4064.464! [2023-03-08 21:29:35,441][569911] Updated weights for policy 0, policy_version 3520 (0.0005) [2023-03-08 21:29:37,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12219.8, 300 sec: 11096.4). Total num frames: 1830912. Throughput: 0: 12274.8. Samples: 1827644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:29:37,789][569624] Avg episode reward: [(0, '3402.166')] [2023-03-08 21:29:38,776][569911] Updated weights for policy 0, policy_version 3600 (0.0005) [2023-03-08 21:29:42,229][569911] Updated weights for policy 0, policy_version 3680 (0.0004) [2023-03-08 21:29:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 11107.4). Total num frames: 1888256. Throughput: 0: 12206.6. Samples: 1863672. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:29:42,789][569624] Avg episode reward: [(0, '2934.992')] [2023-03-08 21:29:45,989][569911] Updated weights for policy 0, policy_version 3760 (0.0004) [2023-03-08 21:29:47,789][569624] Fps is (10 sec: 11059.1, 60 sec: 12083.2, 300 sec: 11094.3). Total num frames: 1941504. Throughput: 0: 12061.7. Samples: 1931332. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:29:47,789][569624] Avg episode reward: [(0, '2723.006')] [2023-03-08 21:29:47,826][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000003800_1945600.pth... [2023-03-08 21:29:47,827][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000003104_1589248.pth [2023-03-08 21:29:49,474][569911] Updated weights for policy 0, policy_version 3840 (0.0005) [2023-03-08 21:29:52,764][569911] Updated weights for policy 0, policy_version 3920 (0.0005) [2023-03-08 21:29:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 11150.2). Total num frames: 2007040. Throughput: 0: 12038.5. Samples: 2003732. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:29:52,789][569624] Avg episode reward: [(0, '3274.487')] [2023-03-08 21:29:55,887][569911] Updated weights for policy 0, policy_version 4000 (0.0005) [2023-03-08 21:29:57,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 11181.0). Total num frames: 2068480. Throughput: 0: 12119.4. Samples: 2043820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:29:57,789][569624] Avg episode reward: [(0, '3512.804')] [2023-03-08 21:29:59,581][569911] Updated weights for policy 0, policy_version 4080 (0.0004) [2023-03-08 21:30:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11188.5). Total num frames: 2125824. Throughput: 0: 11938.9. Samples: 2111920. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:02,800][569624] Avg episode reward: [(0, '3367.128')] [2023-03-08 21:30:02,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000004152_2125824.pth... [2023-03-08 21:30:02,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000003456_1769472.pth [2023-03-08 21:30:03,025][569911] Updated weights for policy 0, policy_version 4160 (0.0005) [2023-03-08 21:30:06,405][569911] Updated weights for policy 0, policy_version 4240 (0.0005) [2023-03-08 21:30:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11216.7). Total num frames: 2187264. Throughput: 0: 11974.8. Samples: 2184216. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:30:07,800][569624] Avg episode reward: [(0, '3373.555')] [2023-03-08 21:30:09,981][569911] Updated weights for policy 0, policy_version 4320 (0.0005) [2023-03-08 21:30:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11223.0). Total num frames: 2244608. Throughput: 0: 11941.5. Samples: 2219096. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:30:12,800][569624] Avg episode reward: [(0, '3912.490')] [2023-03-08 21:30:13,441][569911] Updated weights for policy 0, policy_version 4400 (0.0004) [2023-03-08 21:30:17,094][569911] Updated weights for policy 0, policy_version 4480 (0.0005) [2023-03-08 21:30:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11229.0). Total num frames: 2301952. Throughput: 0: 11795.2. Samples: 2286772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:17,800][569624] Avg episode reward: [(0, '4054.403')] [2023-03-08 21:30:17,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000004496_2301952.pth... 
[2023-03-08 21:30:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000003800_1945600.pth [2023-03-08 21:30:20,526][569911] Updated weights for policy 0, policy_version 4560 (0.0004) [2023-03-08 21:30:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11234.7). Total num frames: 2359296. Throughput: 0: 11814.5. Samples: 2359296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:22,789][569624] Avg episode reward: [(0, '4142.654')] [2023-03-08 21:30:22,789][569867] Saving new best policy, reward=4142.654! [2023-03-08 21:30:24,037][569911] Updated weights for policy 0, policy_version 4640 (0.0005) [2023-03-08 21:30:27,415][569911] Updated weights for policy 0, policy_version 4720 (0.0005) [2023-03-08 21:30:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11259.2). Total num frames: 2420736. Throughput: 0: 11780.4. Samples: 2393792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:27,789][569624] Avg episode reward: [(0, '3810.320')] [2023-03-08 21:30:30,972][569911] Updated weights for policy 0, policy_version 4800 (0.0004) [2023-03-08 21:30:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11264.0). Total num frames: 2478080. Throughput: 0: 11853.6. Samples: 2464744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:32,789][569624] Avg episode reward: [(0, '3797.812')] [2023-03-08 21:30:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000004840_2478080.pth... 
[2023-03-08 21:30:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000004152_2125824.pth [2023-03-08 21:30:34,260][569911] Updated weights for policy 0, policy_version 4880 (0.0005) [2023-03-08 21:30:37,584][569911] Updated weights for policy 0, policy_version 4960 (0.0005) [2023-03-08 21:30:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11286.8). Total num frames: 2539520. Throughput: 0: 11905.6. Samples: 2539484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:37,789][569624] Avg episode reward: [(0, '4401.180')] [2023-03-08 21:30:37,790][569867] Saving new best policy, reward=4401.180! [2023-03-08 21:30:41,222][569911] Updated weights for policy 0, policy_version 5040 (0.0005) [2023-03-08 21:30:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11290.7). Total num frames: 2596864. Throughput: 0: 11750.9. Samples: 2572612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:42,789][569624] Avg episode reward: [(0, '4403.969')] [2023-03-08 21:30:42,790][569867] Saving new best policy, reward=4403.969! [2023-03-08 21:30:44,530][569911] Updated weights for policy 0, policy_version 5120 (0.0006) [2023-03-08 21:30:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11311.9). Total num frames: 2658304. Throughput: 0: 11869.0. Samples: 2646024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:47,789][569624] Avg episode reward: [(0, '4407.740')] [2023-03-08 21:30:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000005192_2658304.pth... [2023-03-08 21:30:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000004496_2301952.pth [2023-03-08 21:30:47,794][569867] Saving new best policy, reward=4407.740! 
[2023-03-08 21:30:47,919][569911] Updated weights for policy 0, policy_version 5200 (0.0005) [2023-03-08 21:30:51,322][569911] Updated weights for policy 0, policy_version 5280 (0.0005) [2023-03-08 21:30:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11332.3). Total num frames: 2719744. Throughput: 0: 11893.0. Samples: 2719400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:52,789][569624] Avg episode reward: [(0, '4099.216')] [2023-03-08 21:30:54,693][569911] Updated weights for policy 0, policy_version 5360 (0.0005) [2023-03-08 21:30:57,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11351.8). Total num frames: 2781184. Throughput: 0: 11920.2. Samples: 2755504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:30:57,789][569624] Avg episode reward: [(0, '4105.272')] [2023-03-08 21:30:58,130][569911] Updated weights for policy 0, policy_version 5440 (0.0005) [2023-03-08 21:31:01,520][569911] Updated weights for policy 0, policy_version 5520 (0.0005) [2023-03-08 21:31:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11354.1). Total num frames: 2838528. Throughput: 0: 11988.4. Samples: 2826248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:31:02,789][569624] Avg episode reward: [(0, '4232.837')] [2023-03-08 21:31:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000005544_2838528.pth... [2023-03-08 21:31:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000004840_2478080.pth [2023-03-08 21:31:04,929][569911] Updated weights for policy 0, policy_version 5600 (0.0005) [2023-03-08 21:31:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11356.4). Total num frames: 2895872. Throughput: 0: 11923.9. Samples: 2895872. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:31:07,800][569624] Avg episode reward: [(0, '4287.703')] [2023-03-08 21:31:08,682][569911] Updated weights for policy 0, policy_version 5680 (0.0006) [2023-03-08 21:31:12,016][569911] Updated weights for policy 0, policy_version 5760 (0.0005) [2023-03-08 21:31:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11374.3). Total num frames: 2957312. Throughput: 0: 11953.2. Samples: 2931688. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:31:12,789][569624] Avg episode reward: [(0, '4380.670')] [2023-03-08 21:31:15,359][569911] Updated weights for policy 0, policy_version 5840 (0.0005) [2023-03-08 21:31:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11391.5). Total num frames: 3018752. Throughput: 0: 12001.5. Samples: 3004812. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:31:17,789][569624] Avg episode reward: [(0, '4202.073')] [2023-03-08 21:31:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000005896_3018752.pth... [2023-03-08 21:31:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000005192_2658304.pth [2023-03-08 21:31:18,649][569911] Updated weights for policy 0, policy_version 5920 (0.0005) [2023-03-08 21:31:22,079][569911] Updated weights for policy 0, policy_version 6000 (0.0005) [2023-03-08 21:31:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11408.1). Total num frames: 3080192. Throughput: 0: 11980.4. Samples: 3078604. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:31:22,789][569624] Avg episode reward: [(0, '4509.446')] [2023-03-08 21:31:22,790][569867] Saving new best policy, reward=4509.446! 
[2023-03-08 21:31:25,324][569911] Updated weights for policy 0, policy_version 6080 (0.0005) [2023-03-08 21:31:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11424.1). Total num frames: 3141632. Throughput: 0: 12072.4. Samples: 3115868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:31:27,789][569624] Avg episode reward: [(0, '4548.482')] [2023-03-08 21:31:27,790][569867] Saving new best policy, reward=4548.482! [2023-03-08 21:31:28,661][569911] Updated weights for policy 0, policy_version 6160 (0.0004) [2023-03-08 21:31:32,098][569911] Updated weights for policy 0, policy_version 6240 (0.0005) [2023-03-08 21:31:32,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11439.5). Total num frames: 3203072. Throughput: 0: 12057.8. Samples: 3188624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:31:32,789][569624] Avg episode reward: [(0, '4325.251')] [2023-03-08 21:31:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000006256_3203072.pth... [2023-03-08 21:31:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000005544_2838528.pth [2023-03-08 21:31:35,485][569911] Updated weights for policy 0, policy_version 6320 (0.0005) [2023-03-08 21:31:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11440.1). Total num frames: 3260416. Throughput: 0: 12022.8. Samples: 3260428. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:31:37,789][569624] Avg episode reward: [(0, '4194.831')] [2023-03-08 21:31:39,076][569911] Updated weights for policy 0, policy_version 6400 (0.0005) [2023-03-08 21:31:42,431][569911] Updated weights for policy 0, policy_version 6480 (0.0005) [2023-03-08 21:31:42,789][569624] Fps is (10 sec: 11878.6, 60 sec: 12083.2, 300 sec: 11454.7). Total num frames: 3321856. Throughput: 0: 11969.0. Samples: 3294108. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:31:42,789][569624] Avg episode reward: [(0, '4168.552')] [2023-03-08 21:31:45,594][569911] Updated weights for policy 0, policy_version 6560 (0.0004) [2023-03-08 21:31:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11468.8). Total num frames: 3383296. Throughput: 0: 12101.5. Samples: 3370816. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:31:47,789][569624] Avg episode reward: [(0, '4557.312')] [2023-03-08 21:31:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000006608_3383296.pth... [2023-03-08 21:31:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000005896_3018752.pth [2023-03-08 21:31:47,796][569867] Saving new best policy, reward=4557.312! [2023-03-08 21:31:49,109][569911] Updated weights for policy 0, policy_version 6640 (0.0005) [2023-03-08 21:31:52,287][569911] Updated weights for policy 0, policy_version 6720 (0.0005) [2023-03-08 21:31:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11677.1). Total num frames: 3444736. Throughput: 0: 12190.1. Samples: 3444424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:31:52,789][569624] Avg episode reward: [(0, '4474.312')] [2023-03-08 21:31:55,546][569911] Updated weights for policy 0, policy_version 6800 (0.0005) [2023-03-08 21:31:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11857.6). Total num frames: 3506176. Throughput: 0: 12217.9. Samples: 3481492. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:31:57,789][569624] Avg episode reward: [(0, '4139.133')] [2023-03-08 21:31:59,000][569911] Updated weights for policy 0, policy_version 6880 (0.0005) [2023-03-08 21:32:02,209][569911] Updated weights for policy 0, policy_version 6960 (0.0005) [2023-03-08 21:32:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11871.5). Total num frames: 3567616. Throughput: 0: 12235.3. Samples: 3555400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:32:02,789][569624] Avg episode reward: [(0, '4611.633')] [2023-03-08 21:32:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000006968_3567616.pth... [2023-03-08 21:32:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000006256_3203072.pth [2023-03-08 21:32:02,793][569867] Saving new best policy, reward=4611.633! [2023-03-08 21:32:05,613][569911] Updated weights for policy 0, policy_version 7040 (0.0005) [2023-03-08 21:32:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 11885.3). Total num frames: 3629056. Throughput: 0: 12172.7. Samples: 3626376. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:32:07,789][569624] Avg episode reward: [(0, '4422.198')] [2023-03-08 21:32:09,106][569911] Updated weights for policy 0, policy_version 7120 (0.0005) [2023-03-08 21:32:12,521][569911] Updated weights for policy 0, policy_version 7200 (0.0005) [2023-03-08 21:32:12,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 11899.2). Total num frames: 3686400. Throughput: 0: 12133.8. Samples: 3661888. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:32:12,789][569624] Avg episode reward: [(0, '4622.659')] [2023-03-08 21:32:12,789][569867] Saving new best policy, reward=4622.659! 
[2023-03-08 21:32:15,913][569911] Updated weights for policy 0, policy_version 7280 (0.0005) [2023-03-08 21:32:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11913.1). Total num frames: 3747840. Throughput: 0: 12154.4. Samples: 3735572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:32:17,789][569624] Avg episode reward: [(0, '4413.409')] [2023-03-08 21:32:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000007320_3747840.pth... [2023-03-08 21:32:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000006608_3383296.pth [2023-03-08 21:32:19,247][569911] Updated weights for policy 0, policy_version 7360 (0.0005) [2023-03-08 21:32:22,529][569911] Updated weights for policy 0, policy_version 7440 (0.0005) [2023-03-08 21:32:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11913.1). Total num frames: 3809280. Throughput: 0: 12198.1. Samples: 3809344. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:32:22,789][569624] Avg episode reward: [(0, '4353.800')] [2023-03-08 21:32:25,765][569911] Updated weights for policy 0, policy_version 7520 (0.0006) [2023-03-08 21:32:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 11927.0). Total num frames: 3870720. Throughput: 0: 12277.3. Samples: 3846588. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:32:27,789][569624] Avg episode reward: [(0, '4308.780')] [2023-03-08 21:32:29,257][569911] Updated weights for policy 0, policy_version 7600 (0.0005) [2023-03-08 21:32:32,719][569911] Updated weights for policy 0, policy_version 7680 (0.0005) [2023-03-08 21:32:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11940.9). Total num frames: 3932160. Throughput: 0: 12181.5. Samples: 3918980. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:32:32,789][569624] Avg episode reward: [(0, '4401.594')] [2023-03-08 21:32:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000007680_3932160.pth... [2023-03-08 21:32:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000006968_3567616.pth [2023-03-08 21:32:36,051][569911] Updated weights for policy 0, policy_version 7760 (0.0005) [2023-03-08 21:32:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 11954.8). Total num frames: 3989504. Throughput: 0: 12114.3. Samples: 3989568. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:32:37,789][569624] Avg episode reward: [(0, '4259.857')] [2023-03-08 21:32:39,581][569911] Updated weights for policy 0, policy_version 7840 (0.0005) [2023-03-08 21:32:42,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12151.4, 300 sec: 11954.8). Total num frames: 4050944. Throughput: 0: 12108.4. Samples: 4026368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:32:42,789][569624] Avg episode reward: [(0, '4251.197')] [2023-03-08 21:32:43,001][569911] Updated weights for policy 0, policy_version 7920 (0.0005) [2023-03-08 21:32:46,463][569911] Updated weights for policy 0, policy_version 8000 (0.0006) [2023-03-08 21:32:47,789][569624] Fps is (10 sec: 12287.7, 60 sec: 12151.5, 300 sec: 11968.6). Total num frames: 4112384. Throughput: 0: 12023.0. Samples: 4096436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:32:47,789][569624] Avg episode reward: [(0, '4149.348')] [2023-03-08 21:32:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000008032_4112384.pth... 
[2023-03-08 21:32:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000007320_3747840.pth [2023-03-08 21:32:49,772][569911] Updated weights for policy 0, policy_version 8080 (0.0006) [2023-03-08 21:32:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 4169728. Throughput: 0: 12064.6. Samples: 4169284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:32:52,789][569624] Avg episode reward: [(0, '4458.649')] [2023-03-08 21:32:53,301][569911] Updated weights for policy 0, policy_version 8160 (0.0006) [2023-03-08 21:32:56,773][569911] Updated weights for policy 0, policy_version 8240 (0.0006) [2023-03-08 21:32:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 4231168. Throughput: 0: 12056.5. Samples: 4204432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:32:57,790][569624] Avg episode reward: [(0, '4439.869')] [2023-03-08 21:32:59,896][569911] Updated weights for policy 0, policy_version 8320 (0.0005) [2023-03-08 21:33:02,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 4292608. Throughput: 0: 12105.8. Samples: 4280332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:33:02,789][569624] Avg episode reward: [(0, '4487.330')] [2023-03-08 21:33:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000008384_4292608.pth... [2023-03-08 21:33:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000007680_3932160.pth [2023-03-08 21:33:03,092][569911] Updated weights for policy 0, policy_version 8400 (0.0004) [2023-03-08 21:33:06,548][569911] Updated weights for policy 0, policy_version 8480 (0.0005) [2023-03-08 21:33:07,789][569624] Fps is (10 sec: 12288.3, 60 sec: 12083.2, 300 sec: 12024.2). 
Total num frames: 4354048. Throughput: 0: 12098.0. Samples: 4353752. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:33:07,789][569624] Avg episode reward: [(0, '4672.169')] [2023-03-08 21:33:07,789][569867] Saving new best policy, reward=4672.169! [2023-03-08 21:33:09,935][569911] Updated weights for policy 0, policy_version 8560 (0.0004) [2023-03-08 21:33:12,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 4415488. Throughput: 0: 12096.1. Samples: 4390912. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:33:12,789][569624] Avg episode reward: [(0, '4410.813')] [2023-03-08 21:33:13,416][569911] Updated weights for policy 0, policy_version 8640 (0.0005) [2023-03-08 21:33:16,803][569911] Updated weights for policy 0, policy_version 8720 (0.0005) [2023-03-08 21:33:17,789][569624] Fps is (10 sec: 11878.2, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 4472832. Throughput: 0: 12052.1. Samples: 4461328. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:33:17,789][569624] Avg episode reward: [(0, '4371.861')] [2023-03-08 21:33:17,843][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000008744_4476928.pth... [2023-03-08 21:33:17,845][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000008032_4112384.pth [2023-03-08 21:33:20,325][569911] Updated weights for policy 0, policy_version 8800 (0.0005) [2023-03-08 21:33:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 4534272. Throughput: 0: 12062.4. Samples: 4532376. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:33:22,789][569624] Avg episode reward: [(0, '4208.102')] [2023-03-08 21:33:23,711][569911] Updated weights for policy 0, policy_version 8880 (0.0005) [2023-03-08 21:33:26,952][569911] Updated weights for policy 0, policy_version 8960 (0.0005) [2023-03-08 21:33:27,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 4595712. Throughput: 0: 12035.4. Samples: 4567960. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:33:27,789][569624] Avg episode reward: [(0, '4491.639')] [2023-03-08 21:33:30,509][569911] Updated weights for policy 0, policy_version 9040 (0.0004) [2023-03-08 21:33:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 4648960. Throughput: 0: 12078.5. Samples: 4639968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:33:32,789][569624] Avg episode reward: [(0, '4443.205')] [2023-03-08 21:33:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000009080_4648960.pth... [2023-03-08 21:33:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000008384_4292608.pth [2023-03-08 21:33:34,105][569911] Updated weights for policy 0, policy_version 9120 (0.0005) [2023-03-08 21:33:37,464][569911] Updated weights for policy 0, policy_version 9200 (0.0005) [2023-03-08 21:33:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 4714496. Throughput: 0: 12036.9. Samples: 4710944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:33:37,789][569624] Avg episode reward: [(0, '4476.594')] [2023-03-08 21:33:40,779][569911] Updated weights for policy 0, policy_version 9280 (0.0005) [2023-03-08 21:33:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 4771840. Throughput: 0: 12064.6. 
Samples: 4747336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:33:42,789][569624] Avg episode reward: [(0, '4624.389')] [2023-03-08 21:33:44,156][569911] Updated weights for policy 0, policy_version 9360 (0.0005) [2023-03-08 21:33:47,456][569911] Updated weights for policy 0, policy_version 9440 (0.0004) [2023-03-08 21:33:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 12052.0). Total num frames: 4833280. Throughput: 0: 12026.7. Samples: 4821536. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:33:47,789][569624] Avg episode reward: [(0, '4648.083')] [2023-03-08 21:33:47,833][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000009448_4837376.pth... [2023-03-08 21:33:47,835][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000008744_4476928.pth [2023-03-08 21:33:51,085][569911] Updated weights for policy 0, policy_version 9520 (0.0005) [2023-03-08 21:33:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 4894720. Throughput: 0: 11937.9. Samples: 4890960. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:33:52,789][569624] Avg episode reward: [(0, '4648.818')] [2023-03-08 21:33:54,401][569911] Updated weights for policy 0, policy_version 9600 (0.0004) [2023-03-08 21:33:57,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 4952064. Throughput: 0: 11950.0. Samples: 4928664. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:33:57,789][569624] Avg episode reward: [(0, '4655.736')] [2023-03-08 21:33:57,963][569911] Updated weights for policy 0, policy_version 9680 (0.0005) [2023-03-08 21:34:01,397][569911] Updated weights for policy 0, policy_version 9760 (0.0004) [2023-03-08 21:34:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 5013504. 
Throughput: 0: 11943.7. Samples: 4998796. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:34:02,789][569624] Avg episode reward: [(0, '4744.738')] [2023-03-08 21:34:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000009792_5013504.pth... [2023-03-08 21:34:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000009080_4648960.pth [2023-03-08 21:34:02,796][569867] Saving new best policy, reward=4744.738! [2023-03-08 21:34:04,929][569911] Updated weights for policy 0, policy_version 9840 (0.0005) [2023-03-08 21:34:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 12024.2). Total num frames: 5070848. Throughput: 0: 11903.7. Samples: 5068044. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:34:07,789][569624] Avg episode reward: [(0, '4728.787')] [2023-03-08 21:34:08,381][569911] Updated weights for policy 0, policy_version 9920 (0.0005) [2023-03-08 21:34:11,765][569911] Updated weights for policy 0, policy_version 10000 (0.0005) [2023-03-08 21:34:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 5128192. Throughput: 0: 11926.4. Samples: 5104648. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:34:12,789][569624] Avg episode reward: [(0, '4815.287')] [2023-03-08 21:34:12,795][569867] Saving new best policy, reward=4815.287! [2023-03-08 21:34:15,170][569911] Updated weights for policy 0, policy_version 10080 (0.0005) [2023-03-08 21:34:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 5189632. Throughput: 0: 11936.0. Samples: 5177088. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:34:17,789][569624] Avg episode reward: [(0, '4673.824')] [2023-03-08 21:34:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000010136_5189632.pth... [2023-03-08 21:34:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000009448_4837376.pth [2023-03-08 21:34:18,658][569911] Updated weights for policy 0, policy_version 10160 (0.0005) [2023-03-08 21:34:22,290][569911] Updated weights for policy 0, policy_version 10240 (0.0005) [2023-03-08 21:34:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 5246976. Throughput: 0: 11896.4. Samples: 5246280. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:34:22,789][569624] Avg episode reward: [(0, '4510.963')] [2023-03-08 21:34:25,967][569911] Updated weights for policy 0, policy_version 10320 (0.0005) [2023-03-08 21:34:27,788][569624] Fps is (10 sec: 11469.0, 60 sec: 11810.2, 300 sec: 11982.5). Total num frames: 5304320. Throughput: 0: 11830.8. Samples: 5279720. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:34:27,789][569624] Avg episode reward: [(0, '3749.192')] [2023-03-08 21:34:29,630][569911] Updated weights for policy 0, policy_version 10400 (0.0004) [2023-03-08 21:34:32,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11878.4, 300 sec: 11968.6). Total num frames: 5361664. Throughput: 0: 11713.2. Samples: 5348632. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:34:32,789][569624] Avg episode reward: [(0, '3823.958')] [2023-03-08 21:34:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000010472_5361664.pth... 
[2023-03-08 21:34:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000009792_5013504.pth [2023-03-08 21:34:33,036][569911] Updated weights for policy 0, policy_version 10480 (0.0005) [2023-03-08 21:34:36,432][569911] Updated weights for policy 0, policy_version 10560 (0.0005) [2023-03-08 21:34:37,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 5423104. Throughput: 0: 11751.3. Samples: 5419768. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:34:37,789][569624] Avg episode reward: [(0, '4322.985')] [2023-03-08 21:34:39,804][569911] Updated weights for policy 0, policy_version 10640 (0.0005) [2023-03-08 21:34:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11996.4). Total num frames: 5480448. Throughput: 0: 11717.2. Samples: 5455936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:34:42,789][569624] Avg episode reward: [(0, '4612.416')] [2023-03-08 21:34:43,158][569911] Updated weights for policy 0, policy_version 10720 (0.0005) [2023-03-08 21:34:46,600][569911] Updated weights for policy 0, policy_version 10800 (0.0004) [2023-03-08 21:34:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 5541888. Throughput: 0: 11789.3. Samples: 5529312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:34:47,789][569624] Avg episode reward: [(0, '4567.179')] [2023-03-08 21:34:47,799][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000010832_5545984.pth... [2023-03-08 21:34:47,801][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000010136_5189632.pth [2023-03-08 21:34:49,743][569911] Updated weights for policy 0, policy_version 10880 (0.0006) [2023-03-08 21:34:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11982.5). 
Total num frames: 5603328. Throughput: 0: 11902.1. Samples: 5603636. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:34:52,789][569624] Avg episode reward: [(0, '4478.425')] [2023-03-08 21:34:53,208][569911] Updated weights for policy 0, policy_version 10960 (0.0005) [2023-03-08 21:34:56,503][569911] Updated weights for policy 0, policy_version 11040 (0.0005) [2023-03-08 21:34:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 5664768. Throughput: 0: 11888.7. Samples: 5639640. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:34:57,789][569624] Avg episode reward: [(0, '4696.645')] [2023-03-08 21:34:59,850][569911] Updated weights for policy 0, policy_version 11120 (0.0005) [2023-03-08 21:35:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 5726208. Throughput: 0: 11927.5. Samples: 5713824. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:35:02,789][569624] Avg episode reward: [(0, '4531.277')] [2023-03-08 21:35:02,839][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000011192_5730304.pth... [2023-03-08 21:35:02,840][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000010472_5361664.pth [2023-03-08 21:35:03,198][569911] Updated weights for policy 0, policy_version 11200 (0.0005) [2023-03-08 21:35:06,437][569911] Updated weights for policy 0, policy_version 11280 (0.0005) [2023-03-08 21:35:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 5787648. Throughput: 0: 12051.5. Samples: 5788600. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:35:07,789][569624] Avg episode reward: [(0, '4516.471')] [2023-03-08 21:35:09,664][569911] Updated weights for policy 0, policy_version 11360 (0.0005) [2023-03-08 21:35:12,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 5853184. Throughput: 0: 12174.5. Samples: 5827576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:35:12,789][569624] Avg episode reward: [(0, '4584.188')] [2023-03-08 21:35:12,944][569911] Updated weights for policy 0, policy_version 11440 (0.0004) [2023-03-08 21:35:16,164][569911] Updated weights for policy 0, policy_version 11520 (0.0005) [2023-03-08 21:35:17,789][569624] Fps is (10 sec: 13107.1, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 5918720. Throughput: 0: 12306.0. Samples: 5902400. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:35:17,789][569624] Avg episode reward: [(0, '4598.387')] [2023-03-08 21:35:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000011560_5918720.pth... [2023-03-08 21:35:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000010832_5545984.pth [2023-03-08 21:35:19,430][569911] Updated weights for policy 0, policy_version 11600 (0.0005) [2023-03-08 21:35:22,741][569911] Updated weights for policy 0, policy_version 11680 (0.0005) [2023-03-08 21:35:22,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 5980160. Throughput: 0: 12392.0. Samples: 5977408. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:35:22,789][569624] Avg episode reward: [(0, '4452.173')] [2023-03-08 21:35:26,098][569911] Updated weights for policy 0, policy_version 11760 (0.0005) [2023-03-08 21:35:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 6037504. Throughput: 0: 12437.3. 
Samples: 6015616. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:35:27,789][569624] Avg episode reward: [(0, '4489.879')] [2023-03-08 21:35:29,563][569911] Updated weights for policy 0, policy_version 11840 (0.0005) [2023-03-08 21:35:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 6098944. Throughput: 0: 12379.6. Samples: 6086392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:35:32,789][569624] Avg episode reward: [(0, '4641.876')] [2023-03-08 21:35:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000011912_6098944.pth... [2023-03-08 21:35:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000011192_5730304.pth [2023-03-08 21:35:32,952][569911] Updated weights for policy 0, policy_version 11920 (0.0005) [2023-03-08 21:35:36,319][569911] Updated weights for policy 0, policy_version 12000 (0.0005) [2023-03-08 21:35:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 6160384. Throughput: 0: 12330.0. Samples: 6158488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:35:37,789][569624] Avg episode reward: [(0, '4492.004')] [2023-03-08 21:35:39,720][569911] Updated weights for policy 0, policy_version 12080 (0.0005) [2023-03-08 21:35:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 6217728. Throughput: 0: 12341.4. Samples: 6195004. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:35:42,789][569624] Avg episode reward: [(0, '4686.736')] [2023-03-08 21:35:43,180][569911] Updated weights for policy 0, policy_version 12160 (0.0005) [2023-03-08 21:35:46,539][569911] Updated weights for policy 0, policy_version 12240 (0.0005) [2023-03-08 21:35:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 6279168. 
Throughput: 0: 12290.3. Samples: 6266888. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:35:47,789][569624] Avg episode reward: [(0, '4492.916')] [2023-03-08 21:35:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000012264_6279168.pth... [2023-03-08 21:35:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000011560_5918720.pth [2023-03-08 21:35:50,079][569911] Updated weights for policy 0, policy_version 12320 (0.0005) [2023-03-08 21:35:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 12052.0). Total num frames: 6336512. Throughput: 0: 12172.8. Samples: 6336376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:35:52,790][569624] Avg episode reward: [(0, '4205.598')] [2023-03-08 21:35:53,656][569911] Updated weights for policy 0, policy_version 12400 (0.0005) [2023-03-08 21:35:57,157][569911] Updated weights for policy 0, policy_version 12480 (0.0005) [2023-03-08 21:35:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 6393856. Throughput: 0: 12121.4. Samples: 6373036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:35:57,789][569624] Avg episode reward: [(0, '4406.999')] [2023-03-08 21:36:00,689][569911] Updated weights for policy 0, policy_version 12560 (0.0005) [2023-03-08 21:36:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 6455296. Throughput: 0: 11949.7. Samples: 6440136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:36:02,789][569624] Avg episode reward: [(0, '4227.951')] [2023-03-08 21:36:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000012608_6455296.pth... 
[2023-03-08 21:36:02,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000011912_6098944.pth [2023-03-08 21:36:04,229][569911] Updated weights for policy 0, policy_version 12640 (0.0005) [2023-03-08 21:36:07,689][569911] Updated weights for policy 0, policy_version 12720 (0.0005) [2023-03-08 21:36:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 6512640. Throughput: 0: 11872.4. Samples: 6511664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:36:07,789][569624] Avg episode reward: [(0, '4543.993')] [2023-03-08 21:36:11,214][569911] Updated weights for policy 0, policy_version 12800 (0.0004) [2023-03-08 21:36:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 6569984. Throughput: 0: 11773.2. Samples: 6545408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:36:12,789][569624] Avg episode reward: [(0, '4575.826')] [2023-03-08 21:36:14,394][569911] Updated weights for policy 0, policy_version 12880 (0.0005) [2023-03-08 21:36:17,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 6631424. Throughput: 0: 11830.0. Samples: 6618744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:36:17,789][569624] Avg episode reward: [(0, '4416.811')] [2023-03-08 21:36:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000012952_6631424.pth... [2023-03-08 21:36:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000012264_6279168.pth [2023-03-08 21:36:18,040][569911] Updated weights for policy 0, policy_version 12960 (0.0005) [2023-03-08 21:36:21,387][569911] Updated weights for policy 0, policy_version 13040 (0.0005) [2023-03-08 21:36:22,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11878.4, 300 sec: 12038.1). 
Total num frames: 6692864. Throughput: 0: 11836.3. Samples: 6691120. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:36:22,789][569624] Avg episode reward: [(0, '4614.844')] [2023-03-08 21:36:24,759][569911] Updated weights for policy 0, policy_version 13120 (0.0004) [2023-03-08 21:36:27,789][569624] Fps is (10 sec: 12288.2, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 6754304. Throughput: 0: 11848.0. Samples: 6728164. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:36:27,799][569624] Avg episode reward: [(0, '4757.294')] [2023-03-08 21:36:27,908][569911] Updated weights for policy 0, policy_version 13200 (0.0005) [2023-03-08 21:36:31,172][569911] Updated weights for policy 0, policy_version 13280 (0.0005) [2023-03-08 21:36:32,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 6819840. Throughput: 0: 11930.0. Samples: 6803736. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:36:32,800][569624] Avg episode reward: [(0, '4736.863')] [2023-03-08 21:36:32,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000013320_6819840.pth... [2023-03-08 21:36:32,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000012608_6455296.pth [2023-03-08 21:36:34,470][569911] Updated weights for policy 0, policy_version 13360 (0.0005) [2023-03-08 21:36:37,732][569911] Updated weights for policy 0, policy_version 13440 (0.0005) [2023-03-08 21:36:37,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 6881280. Throughput: 0: 12074.3. Samples: 6879720. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:36:37,800][569624] Avg episode reward: [(0, '4728.302')] [2023-03-08 21:36:40,882][569911] Updated weights for policy 0, policy_version 13520 (0.0005) [2023-03-08 21:36:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 6942720. Throughput: 0: 12111.2. Samples: 6918040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:36:42,789][569624] Avg episode reward: [(0, '4551.486')] [2023-03-08 21:36:44,273][569911] Updated weights for policy 0, policy_version 13600 (0.0005) [2023-03-08 21:36:47,599][569911] Updated weights for policy 0, policy_version 13680 (0.0005) [2023-03-08 21:36:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 7004160. Throughput: 0: 12262.2. Samples: 6991936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:36:47,789][569624] Avg episode reward: [(0, '4213.507')] [2023-03-08 21:36:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000013680_7004160.pth... [2023-03-08 21:36:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000012952_6631424.pth [2023-03-08 21:36:51,246][569911] Updated weights for policy 0, policy_version 13760 (0.0005) [2023-03-08 21:36:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 7061504. Throughput: 0: 12205.9. Samples: 7060932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:36:52,800][569624] Avg episode reward: [(0, '4255.466')] [2023-03-08 21:36:54,832][569911] Updated weights for policy 0, policy_version 13840 (0.0005) [2023-03-08 21:36:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 7118848. Throughput: 0: 12196.5. Samples: 7094252. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:36:57,800][569624] Avg episode reward: [(0, '4390.916')] [2023-03-08 21:36:58,419][569911] Updated weights for policy 0, policy_version 13920 (0.0004) [2023-03-08 21:37:01,845][569911] Updated weights for policy 0, policy_version 14000 (0.0006) [2023-03-08 21:37:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 7176192. Throughput: 0: 12117.7. Samples: 7164040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:37:02,800][569624] Avg episode reward: [(0, '4760.887')] [2023-03-08 21:37:02,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000014016_7176192.pth... [2023-03-08 21:37:02,807][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000013320_6819840.pth [2023-03-08 21:37:05,380][569911] Updated weights for policy 0, policy_version 14080 (0.0005) [2023-03-08 21:37:07,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 7233536. Throughput: 0: 12052.3. Samples: 7233472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:37:07,800][569624] Avg episode reward: [(0, '4669.274')] [2023-03-08 21:37:09,000][569911] Updated weights for policy 0, policy_version 14160 (0.0005) [2023-03-08 21:37:12,481][569911] Updated weights for policy 0, policy_version 14240 (0.0005) [2023-03-08 21:37:12,789][569624] Fps is (10 sec: 11468.7, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 7290880. Throughput: 0: 11982.1. Samples: 7267360. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:37:12,800][569624] Avg episode reward: [(0, '4617.526')] [2023-03-08 21:37:15,797][569911] Updated weights for policy 0, policy_version 14320 (0.0005) [2023-03-08 21:37:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 12010.3). Total num frames: 7352320. Throughput: 0: 11919.3. 
Samples: 7340104. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:37:17,789][569624] Avg episode reward: [(0, '4602.996')] [2023-03-08 21:37:17,818][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000014368_7356416.pth... [2023-03-08 21:37:17,819][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000013680_7004160.pth [2023-03-08 21:37:19,159][569911] Updated weights for policy 0, policy_version 14400 (0.0005) [2023-03-08 21:37:22,630][569911] Updated weights for policy 0, policy_version 14480 (0.0005) [2023-03-08 21:37:22,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 7413760. Throughput: 0: 11846.9. Samples: 7412832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:37:22,789][569624] Avg episode reward: [(0, '4527.468')] [2023-03-08 21:37:25,813][569911] Updated weights for policy 0, policy_version 14560 (0.0005) [2023-03-08 21:37:27,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 7475200. Throughput: 0: 11860.3. Samples: 7451752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:37:27,789][569624] Avg episode reward: [(0, '4615.618')] [2023-03-08 21:37:29,246][569911] Updated weights for policy 0, policy_version 14640 (0.0005) [2023-03-08 21:37:32,674][569911] Updated weights for policy 0, policy_version 14720 (0.0004) [2023-03-08 21:37:32,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 7536640. Throughput: 0: 11807.9. Samples: 7523292. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:37:32,789][569624] Avg episode reward: [(0, '4557.081')] [2023-03-08 21:37:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000014720_7536640.pth... 
[2023-03-08 21:37:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000014016_7176192.pth [2023-03-08 21:37:36,012][569911] Updated weights for policy 0, policy_version 14800 (0.0004) [2023-03-08 21:37:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 7593984. Throughput: 0: 11860.2. Samples: 7594640. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:37:37,789][569624] Avg episode reward: [(0, '4468.766')] [2023-03-08 21:37:39,494][569911] Updated weights for policy 0, policy_version 14880 (0.0005) [2023-03-08 21:37:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 7655424. Throughput: 0: 11926.8. Samples: 7630960. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:37:42,789][569624] Avg episode reward: [(0, '3946.852')] [2023-03-08 21:37:43,006][569911] Updated weights for policy 0, policy_version 14960 (0.0004) [2023-03-08 21:37:46,391][569911] Updated weights for policy 0, policy_version 15040 (0.0005) [2023-03-08 21:37:47,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 12010.3). Total num frames: 7712768. Throughput: 0: 11960.2. Samples: 7702252. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:37:47,789][569624] Avg episode reward: [(0, '4267.625')] [2023-03-08 21:37:47,807][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000015072_7716864.pth... [2023-03-08 21:37:47,808][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000014368_7356416.pth [2023-03-08 21:37:49,838][569911] Updated weights for policy 0, policy_version 15120 (0.0005) [2023-03-08 21:37:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 7778304. Throughput: 0: 12086.2. Samples: 7777352. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:37:52,789][569624] Avg episode reward: [(0, '4696.588')] [2023-03-08 21:37:52,983][569911] Updated weights for policy 0, policy_version 15200 (0.0004) [2023-03-08 21:37:56,475][569911] Updated weights for policy 0, policy_version 15280 (0.0005) [2023-03-08 21:37:57,789][569624] Fps is (10 sec: 12288.2, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 7835648. Throughput: 0: 12089.4. Samples: 7811384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:37:57,789][569624] Avg episode reward: [(0, '4651.627')] [2023-03-08 21:37:59,884][569911] Updated weights for policy 0, policy_version 15360 (0.0005) [2023-03-08 21:38:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 7897088. Throughput: 0: 12101.3. Samples: 7884664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:38:02,789][569624] Avg episode reward: [(0, '4570.213')] [2023-03-08 21:38:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000015424_7897088.pth... [2023-03-08 21:38:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000014720_7536640.pth [2023-03-08 21:38:03,359][569911] Updated weights for policy 0, policy_version 15440 (0.0005) [2023-03-08 21:38:06,865][569911] Updated weights for policy 0, policy_version 15520 (0.0005) [2023-03-08 21:38:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 7954432. Throughput: 0: 12030.4. Samples: 7954200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:38:07,789][569624] Avg episode reward: [(0, '4594.619')] [2023-03-08 21:38:10,208][569911] Updated weights for policy 0, policy_version 15600 (0.0005) [2023-03-08 21:38:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 8011776. Throughput: 0: 11989.6. 
Samples: 7991284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:38:12,789][569624] Avg episode reward: [(0, '4680.761')] [2023-03-08 21:38:13,827][569911] Updated weights for policy 0, policy_version 15680 (0.0005) [2023-03-08 21:38:17,248][569911] Updated weights for policy 0, policy_version 15760 (0.0004) [2023-03-08 21:38:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 8073216. Throughput: 0: 11970.6. Samples: 8061968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:38:17,789][569624] Avg episode reward: [(0, '4789.298')] [2023-03-08 21:38:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000015768_8073216.pth... [2023-03-08 21:38:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000015072_7716864.pth [2023-03-08 21:38:20,778][569911] Updated weights for policy 0, policy_version 15840 (0.0004) [2023-03-08 21:38:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 8130560. Throughput: 0: 11910.9. Samples: 8130632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:38:22,789][569624] Avg episode reward: [(0, '4636.171')] [2023-03-08 21:38:24,320][569911] Updated weights for policy 0, policy_version 15920 (0.0004) [2023-03-08 21:38:27,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 8187904. Throughput: 0: 11899.8. Samples: 8166448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:38:27,789][569624] Avg episode reward: [(0, '4121.258')] [2023-03-08 21:38:27,881][569911] Updated weights for policy 0, policy_version 16000 (0.0004) [2023-03-08 21:38:31,292][569911] Updated weights for policy 0, policy_version 16080 (0.0005) [2023-03-08 21:38:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 8249344. 
Throughput: 0: 11871.9. Samples: 8236488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:38:32,789][569624] Avg episode reward: [(0, '3829.192')] [2023-03-08 21:38:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000016112_8249344.pth... [2023-03-08 21:38:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000015424_7897088.pth [2023-03-08 21:38:34,720][569911] Updated weights for policy 0, policy_version 16160 (0.0004) [2023-03-08 21:38:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 8306688. Throughput: 0: 11756.4. Samples: 8306392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:38:37,789][569624] Avg episode reward: [(0, '4419.922')] [2023-03-08 21:38:38,304][569911] Updated weights for policy 0, policy_version 16240 (0.0004) [2023-03-08 21:38:41,754][569911] Updated weights for policy 0, policy_version 16320 (0.0004) [2023-03-08 21:38:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11968.7). Total num frames: 8364032. Throughput: 0: 11757.3. Samples: 8340464. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:38:42,789][569624] Avg episode reward: [(0, '4406.153')] [2023-03-08 21:38:45,374][569911] Updated weights for policy 0, policy_version 16400 (0.0005) [2023-03-08 21:38:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.2, 300 sec: 11954.8). Total num frames: 8421376. Throughput: 0: 11673.4. Samples: 8409964. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:38:47,789][569624] Avg episode reward: [(0, '4566.058')] [2023-03-08 21:38:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000016448_8421376.pth... 
[2023-03-08 21:38:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000015768_8073216.pth [2023-03-08 21:38:48,982][569911] Updated weights for policy 0, policy_version 16480 (0.0005) [2023-03-08 21:38:52,529][569911] Updated weights for policy 0, policy_version 16560 (0.0005) [2023-03-08 21:38:52,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11673.6, 300 sec: 11954.8). Total num frames: 8478720. Throughput: 0: 11657.4. Samples: 8478784. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:38:52,790][569624] Avg episode reward: [(0, '4482.568')] [2023-03-08 21:38:55,910][569911] Updated weights for policy 0, policy_version 16640 (0.0005) [2023-03-08 21:38:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11954.8). Total num frames: 8540160. Throughput: 0: 11651.0. Samples: 8515580. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:38:57,789][569624] Avg episode reward: [(0, '4431.320')] [2023-03-08 21:38:59,389][569911] Updated weights for policy 0, policy_version 16720 (0.0004) [2023-03-08 21:39:02,717][569911] Updated weights for policy 0, policy_version 16800 (0.0005) [2023-03-08 21:39:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11741.9, 300 sec: 11968.6). Total num frames: 8601600. Throughput: 0: 11659.5. Samples: 8586648. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:39:02,789][569624] Avg episode reward: [(0, '4467.223')] [2023-03-08 21:39:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000016800_8601600.pth... [2023-03-08 21:39:02,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000016112_8249344.pth [2023-03-08 21:39:06,092][569911] Updated weights for policy 0, policy_version 16880 (0.0004) [2023-03-08 21:39:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11968.7). 
Total num frames: 8658944. Throughput: 0: 11741.9. Samples: 8659016. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:39:07,789][569624] Avg episode reward: [(0, '4073.764')] [2023-03-08 21:39:09,493][569911] Updated weights for policy 0, policy_version 16960 (0.0005) [2023-03-08 21:39:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11968.7). Total num frames: 8720384. Throughput: 0: 11765.4. Samples: 8695892. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:39:12,789][569624] Avg episode reward: [(0, '4475.325')] [2023-03-08 21:39:12,884][569911] Updated weights for policy 0, policy_version 17040 (0.0005) [2023-03-08 21:39:16,286][569911] Updated weights for policy 0, policy_version 17120 (0.0005) [2023-03-08 21:39:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 8781824. Throughput: 0: 11831.9. Samples: 8768924. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:39:17,789][569624] Avg episode reward: [(0, '4598.624')] [2023-03-08 21:39:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000017152_8781824.pth... [2023-03-08 21:39:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000016448_8421376.pth [2023-03-08 21:39:19,900][569911] Updated weights for policy 0, policy_version 17200 (0.0005) [2023-03-08 21:39:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11968.6). Total num frames: 8835072. Throughput: 0: 11750.1. Samples: 8835144. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:39:22,789][569624] Avg episode reward: [(0, '4212.377')] [2023-03-08 21:39:23,514][569911] Updated weights for policy 0, policy_version 17280 (0.0004) [2023-03-08 21:39:26,874][569911] Updated weights for policy 0, policy_version 17360 (0.0004) [2023-03-08 21:39:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 8896512. Throughput: 0: 11817.1. Samples: 8872232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:39:27,789][569624] Avg episode reward: [(0, '4498.097')] [2023-03-08 21:39:30,050][569911] Updated weights for policy 0, policy_version 17440 (0.0005) [2023-03-08 21:39:32,789][569624] Fps is (10 sec: 12697.3, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 8962048. Throughput: 0: 11970.1. Samples: 8948620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:39:32,790][569624] Avg episode reward: [(0, '4726.544')] [2023-03-08 21:39:32,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000017504_8962048.pth... [2023-03-08 21:39:32,797][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000016800_8601600.pth [2023-03-08 21:39:33,435][569911] Updated weights for policy 0, policy_version 17520 (0.0005) [2023-03-08 21:39:36,889][569911] Updated weights for policy 0, policy_version 17600 (0.0004) [2023-03-08 21:39:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 9019392. Throughput: 0: 12012.5. Samples: 9019344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:39:37,789][569624] Avg episode reward: [(0, '4693.124')] [2023-03-08 21:39:40,422][569911] Updated weights for policy 0, policy_version 17680 (0.0004) [2023-03-08 21:39:42,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 9076736. Throughput: 0: 11950.9. 
Samples: 9053372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:39:42,789][569624] Avg episode reward: [(0, '4573.575')] [2023-03-08 21:39:43,943][569911] Updated weights for policy 0, policy_version 17760 (0.0005) [2023-03-08 21:39:47,313][569911] Updated weights for policy 0, policy_version 17840 (0.0005) [2023-03-08 21:39:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11982.5). Total num frames: 9138176. Throughput: 0: 11982.0. Samples: 9125836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:39:47,789][569624] Avg episode reward: [(0, '3923.015')] [2023-03-08 21:39:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000017848_9138176.pth... [2023-03-08 21:39:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000017152_8781824.pth [2023-03-08 21:39:50,799][569911] Updated weights for policy 0, policy_version 17920 (0.0005) [2023-03-08 21:39:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12015.0, 300 sec: 11982.5). Total num frames: 9199616. Throughput: 0: 11954.7. Samples: 9196976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:39:52,789][569624] Avg episode reward: [(0, '4351.997')] [2023-03-08 21:39:54,076][569911] Updated weights for policy 0, policy_version 18000 (0.0005) [2023-03-08 21:39:57,461][569911] Updated weights for policy 0, policy_version 18080 (0.0005) [2023-03-08 21:39:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 9256960. Throughput: 0: 11939.4. Samples: 9233164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:39:57,789][569624] Avg episode reward: [(0, '4551.839')] [2023-03-08 21:40:01,051][569911] Updated weights for policy 0, policy_version 18160 (0.0005) [2023-03-08 21:40:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 9314304. 
Throughput: 0: 11860.1. Samples: 9302628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:40:02,789][569624] Avg episode reward: [(0, '4375.070')] [2023-03-08 21:40:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000018192_9314304.pth... [2023-03-08 21:40:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000017504_8962048.pth [2023-03-08 21:40:04,736][569911] Updated weights for policy 0, policy_version 18240 (0.0005) [2023-03-08 21:40:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11940.9). Total num frames: 9375744. Throughput: 0: 11961.8. Samples: 9373424. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:40:07,789][569624] Avg episode reward: [(0, '4525.329')] [2023-03-08 21:40:08,131][569911] Updated weights for policy 0, policy_version 18320 (0.0005) [2023-03-08 21:40:11,641][569911] Updated weights for policy 0, policy_version 18400 (0.0004) [2023-03-08 21:40:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 9433088. Throughput: 0: 11905.5. Samples: 9407980. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:40:12,789][569624] Avg episode reward: [(0, '4734.856')] [2023-03-08 21:40:15,093][569911] Updated weights for policy 0, policy_version 18480 (0.0004) [2023-03-08 21:40:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 9490432. Throughput: 0: 11768.7. Samples: 9478208. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:40:17,789][569624] Avg episode reward: [(0, '4379.756')] [2023-03-08 21:40:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000018536_9490432.pth... 
[2023-03-08 21:40:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000017848_9138176.pth [2023-03-08 21:40:18,697][569911] Updated weights for policy 0, policy_version 18560 (0.0005) [2023-03-08 21:40:22,138][569911] Updated weights for policy 0, policy_version 18640 (0.0004) [2023-03-08 21:40:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 9547776. Throughput: 0: 11746.5. Samples: 9547936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:40:22,789][569624] Avg episode reward: [(0, '4548.271')] [2023-03-08 21:40:25,661][569911] Updated weights for policy 0, policy_version 18720 (0.0004) [2023-03-08 21:40:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 9609216. Throughput: 0: 11762.7. Samples: 9582696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:40:27,789][569624] Avg episode reward: [(0, '4605.705')] [2023-03-08 21:40:29,111][569911] Updated weights for policy 0, policy_version 18800 (0.0004) [2023-03-08 21:40:32,510][569911] Updated weights for policy 0, policy_version 18880 (0.0005) [2023-03-08 21:40:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11885.3). Total num frames: 9666560. Throughput: 0: 11744.4. Samples: 9654336. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:40:32,789][569624] Avg episode reward: [(0, '4359.350')] [2023-03-08 21:40:32,813][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000018888_9670656.pth... [2023-03-08 21:40:32,815][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000018192_9314304.pth [2023-03-08 21:40:36,117][569911] Updated weights for policy 0, policy_version 18960 (0.0005) [2023-03-08 21:40:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11899.2). 
Total num frames: 9728000. Throughput: 0: 11754.7. Samples: 9725936. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:40:37,789][569624] Avg episode reward: [(0, '4460.838')] [2023-03-08 21:40:39,457][569911] Updated weights for policy 0, policy_version 19040 (0.0005) [2023-03-08 21:40:42,563][569911] Updated weights for policy 0, policy_version 19120 (0.0004) [2023-03-08 21:40:42,789][569624] Fps is (10 sec: 12288.2, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 9789440. Throughput: 0: 11804.0. Samples: 9764344. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:40:42,789][569624] Avg episode reward: [(0, '4566.222')] [2023-03-08 21:40:45,976][569911] Updated weights for policy 0, policy_version 19200 (0.0005) [2023-03-08 21:40:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 9850880. Throughput: 0: 11908.4. Samples: 9838504. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:40:47,789][569624] Avg episode reward: [(0, '4241.288')] [2023-03-08 21:40:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000019240_9850880.pth... [2023-03-08 21:40:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000018536_9490432.pth [2023-03-08 21:40:49,135][569911] Updated weights for policy 0, policy_version 19280 (0.0005) [2023-03-08 21:40:52,631][569911] Updated weights for policy 0, policy_version 19360 (0.0005) [2023-03-08 21:40:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11927.0). Total num frames: 9912320. Throughput: 0: 11963.0. Samples: 9911760. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:40:52,800][569624] Avg episode reward: [(0, '4358.822')] [2023-03-08 21:40:55,951][569911] Updated weights for policy 0, policy_version 19440 (0.0005) [2023-03-08 21:40:57,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 9973760. Throughput: 0: 12006.6. Samples: 9948276. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:40:57,789][569624] Avg episode reward: [(0, '4265.543')] [2023-03-08 21:40:59,316][569911] Updated weights for policy 0, policy_version 19520 (0.0005) [2023-03-08 21:41:02,713][569911] Updated weights for policy 0, policy_version 19600 (0.0005) [2023-03-08 21:41:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 10035200. Throughput: 0: 12073.0. Samples: 10021492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:41:02,789][569624] Avg episode reward: [(0, '4497.026')] [2023-03-08 21:41:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000019600_10035200.pth... [2023-03-08 21:41:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000018888_9670656.pth [2023-03-08 21:41:06,143][569911] Updated weights for policy 0, policy_version 19680 (0.0005) [2023-03-08 21:41:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11940.9). Total num frames: 10092544. Throughput: 0: 12105.0. Samples: 10092664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:41:07,789][569624] Avg episode reward: [(0, '4653.389')] [2023-03-08 21:41:09,422][569911] Updated weights for policy 0, policy_version 19760 (0.0005) [2023-03-08 21:41:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 10153984. Throughput: 0: 12185.4. Samples: 10131040. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:41:12,789][569624] Avg episode reward: [(0, '4151.132')] [2023-03-08 21:41:12,806][569911] Updated weights for policy 0, policy_version 19840 (0.0005) [2023-03-08 21:41:15,899][569911] Updated weights for policy 0, policy_version 19920 (0.0005) [2023-03-08 21:41:17,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 11954.8). Total num frames: 10219520. Throughput: 0: 12278.8. Samples: 10206884. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:41:17,789][569624] Avg episode reward: [(0, '4667.505')] [2023-03-08 21:41:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000019960_10219520.pth... [2023-03-08 21:41:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000019240_9850880.pth [2023-03-08 21:41:19,291][569911] Updated weights for policy 0, policy_version 20000 (0.0004) [2023-03-08 21:41:22,609][569911] Updated weights for policy 0, policy_version 20080 (0.0005) [2023-03-08 21:41:22,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 11954.8). Total num frames: 10280960. Throughput: 0: 12317.8. Samples: 10280240. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:41:22,789][569624] Avg episode reward: [(0, '4657.901')] [2023-03-08 21:41:25,955][569911] Updated weights for policy 0, policy_version 20160 (0.0005) [2023-03-08 21:41:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 11940.9). Total num frames: 10342400. Throughput: 0: 12283.0. Samples: 10317080. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:41:27,789][569624] Avg episode reward: [(0, '4550.758')] [2023-03-08 21:41:29,281][569911] Updated weights for policy 0, policy_version 20240 (0.0005) [2023-03-08 21:41:32,675][569911] Updated weights for policy 0, policy_version 20320 (0.0005) [2023-03-08 21:41:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 11940.9). Total num frames: 10403840. Throughput: 0: 12237.0. Samples: 10389168. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:41:32,789][569624] Avg episode reward: [(0, '4563.049')] [2023-03-08 21:41:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000020320_10403840.pth... [2023-03-08 21:41:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000019600_10035200.pth [2023-03-08 21:41:36,020][569911] Updated weights for policy 0, policy_version 20400 (0.0004) [2023-03-08 21:41:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 11940.9). Total num frames: 10465280. Throughput: 0: 12219.0. Samples: 10461616. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:41:37,789][569624] Avg episode reward: [(0, '4585.025')] [2023-03-08 21:41:39,391][569911] Updated weights for policy 0, policy_version 20480 (0.0005) [2023-03-08 21:41:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 11927.0). Total num frames: 10522624. Throughput: 0: 12267.2. Samples: 10500300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:41:42,789][569624] Avg episode reward: [(0, '4616.182')] [2023-03-08 21:41:42,909][569911] Updated weights for policy 0, policy_version 20560 (0.0005) [2023-03-08 21:41:46,547][569911] Updated weights for policy 0, policy_version 20640 (0.0006) [2023-03-08 21:41:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12151.5, 300 sec: 11927.0). Total num frames: 10579968. Throughput: 0: 12137.5. 
Samples: 10567680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:41:47,789][569624] Avg episode reward: [(0, '4556.755')] [2023-03-08 21:41:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000020664_10579968.pth... [2023-03-08 21:41:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000019960_10219520.pth [2023-03-08 21:41:50,061][569911] Updated weights for policy 0, policy_version 20720 (0.0005) [2023-03-08 21:41:52,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 11927.0). Total num frames: 10637312. Throughput: 0: 12104.7. Samples: 10637376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:41:52,789][569624] Avg episode reward: [(0, '4696.077')] [2023-03-08 21:41:53,609][569911] Updated weights for policy 0, policy_version 20800 (0.0005) [2023-03-08 21:41:56,956][569911] Updated weights for policy 0, policy_version 20880 (0.0005) [2023-03-08 21:41:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11940.9). Total num frames: 10698752. Throughput: 0: 12069.2. Samples: 10674156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:41:57,789][569624] Avg episode reward: [(0, '4719.096')] [2023-03-08 21:42:00,378][569911] Updated weights for policy 0, policy_version 20960 (0.0004) [2023-03-08 21:42:02,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11940.9). Total num frames: 10756096. Throughput: 0: 11971.5. Samples: 10745600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:42:02,789][569624] Avg episode reward: [(0, '4458.087')] [2023-03-08 21:42:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000021016_10760192.pth... 
[2023-03-08 21:42:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000020320_10403840.pth [2023-03-08 21:42:03,862][569911] Updated weights for policy 0, policy_version 21040 (0.0005) [2023-03-08 21:42:07,406][569911] Updated weights for policy 0, policy_version 21120 (0.0005) [2023-03-08 21:42:07,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12015.0, 300 sec: 11940.9). Total num frames: 10813440. Throughput: 0: 11880.7. Samples: 10814872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:42:07,789][569624] Avg episode reward: [(0, '4530.388')] [2023-03-08 21:42:10,914][569911] Updated weights for policy 0, policy_version 21200 (0.0005) [2023-03-08 21:42:12,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 10874880. Throughput: 0: 11849.1. Samples: 10850288. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:42:12,789][569624] Avg episode reward: [(0, '4625.538')] [2023-03-08 21:42:14,272][569911] Updated weights for policy 0, policy_version 21280 (0.0004) [2023-03-08 21:42:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11927.0). Total num frames: 10932224. Throughput: 0: 11804.8. Samples: 10920384. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:42:17,789][569624] Avg episode reward: [(0, '4732.553')] [2023-03-08 21:42:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000021352_10932224.pth... [2023-03-08 21:42:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000020664_10579968.pth [2023-03-08 21:42:17,902][569911] Updated weights for policy 0, policy_version 21360 (0.0005) [2023-03-08 21:42:21,172][569911] Updated weights for policy 0, policy_version 21440 (0.0005) [2023-03-08 21:42:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11927.0). 
Total num frames: 10993664. Throughput: 0: 11830.8. Samples: 10994000. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:42:22,789][569624] Avg episode reward: [(0, '4498.266')] [2023-03-08 21:42:24,505][569911] Updated weights for policy 0, policy_version 21520 (0.0005) [2023-03-08 21:42:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11927.0). Total num frames: 11055104. Throughput: 0: 11783.1. Samples: 11030536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:42:27,789][569624] Avg episode reward: [(0, '4739.208')] [2023-03-08 21:42:28,130][569911] Updated weights for policy 0, policy_version 21600 (0.0005) [2023-03-08 21:42:31,706][569911] Updated weights for policy 0, policy_version 21680 (0.0005) [2023-03-08 21:42:32,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.2, 300 sec: 11927.0). Total num frames: 11112448. Throughput: 0: 11809.6. Samples: 11099112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:42:32,789][569624] Avg episode reward: [(0, '4498.094')] [2023-03-08 21:42:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000021704_11112448.pth... [2023-03-08 21:42:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000021016_10760192.pth [2023-03-08 21:42:35,170][569911] Updated weights for policy 0, policy_version 21760 (0.0004) [2023-03-08 21:42:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 11913.1). Total num frames: 11169792. Throughput: 0: 11826.2. Samples: 11169556. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:42:37,789][569624] Avg episode reward: [(0, '4395.404')] [2023-03-08 21:42:38,615][569911] Updated weights for policy 0, policy_version 21840 (0.0005) [2023-03-08 21:42:41,807][569911] Updated weights for policy 0, policy_version 21920 (0.0005) [2023-03-08 21:42:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.2, 300 sec: 11927.0). Total num frames: 11231232. Throughput: 0: 11831.0. Samples: 11206548. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:42:42,789][569624] Avg episode reward: [(0, '4349.235')] [2023-03-08 21:42:45,075][569911] Updated weights for policy 0, policy_version 22000 (0.0005) [2023-03-08 21:42:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 11292672. Throughput: 0: 11904.5. Samples: 11281304. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:42:47,789][569624] Avg episode reward: [(0, '4267.329')] [2023-03-08 21:42:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000022056_11292672.pth... [2023-03-08 21:42:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000021352_10932224.pth [2023-03-08 21:42:48,740][569911] Updated weights for policy 0, policy_version 22080 (0.0005) [2023-03-08 21:42:52,169][569911] Updated weights for policy 0, policy_version 22160 (0.0005) [2023-03-08 21:42:52,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 11350016. Throughput: 0: 11893.9. Samples: 11350100. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:42:52,789][569624] Avg episode reward: [(0, '4045.429')] [2023-03-08 21:42:55,591][569911] Updated weights for policy 0, policy_version 22240 (0.0005) [2023-03-08 21:42:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 11411456. Throughput: 0: 11920.8. 
Samples: 11386724. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:42:57,789][569624] Avg episode reward: [(0, '4548.036')] [2023-03-08 21:42:59,151][569911] Updated weights for policy 0, policy_version 22320 (0.0004) [2023-03-08 21:43:02,566][569911] Updated weights for policy 0, policy_version 22400 (0.0005) [2023-03-08 21:43:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 11468800. Throughput: 0: 11915.5. Samples: 11456584. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:43:02,789][569624] Avg episode reward: [(0, '4235.602')] [2023-03-08 21:43:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000022400_11468800.pth... [2023-03-08 21:43:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000021704_11112448.pth [2023-03-08 21:43:05,938][569911] Updated weights for policy 0, policy_version 22480 (0.0005) [2023-03-08 21:43:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.6, 300 sec: 11927.0). Total num frames: 11530240. Throughput: 0: 11883.9. Samples: 11528776. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 21:43:07,789][569624] Avg episode reward: [(0, '4472.406')] [2023-03-08 21:43:09,449][569911] Updated weights for policy 0, policy_version 22560 (0.0005) [2023-03-08 21:43:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 11587584. Throughput: 0: 11845.1. Samples: 11563568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:43:12,789][569624] Avg episode reward: [(0, '4419.008')] [2023-03-08 21:43:12,978][569911] Updated weights for policy 0, policy_version 22640 (0.0005) [2023-03-08 21:43:16,512][569911] Updated weights for policy 0, policy_version 22720 (0.0005) [2023-03-08 21:43:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 11644928. 
Throughput: 0: 11857.6. Samples: 11632704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:43:17,789][569624] Avg episode reward: [(0, '4449.138')] [2023-03-08 21:43:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000022744_11644928.pth... [2023-03-08 21:43:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000022056_11292672.pth [2023-03-08 21:43:19,981][569911] Updated weights for policy 0, policy_version 22800 (0.0005) [2023-03-08 21:43:22,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 11702272. Throughput: 0: 11834.8. Samples: 11702120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:43:22,789][569624] Avg episode reward: [(0, '4606.284')] [2023-03-08 21:43:23,693][569911] Updated weights for policy 0, policy_version 22880 (0.0005) [2023-03-08 21:43:27,048][569911] Updated weights for policy 0, policy_version 22960 (0.0005) [2023-03-08 21:43:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 11763712. Throughput: 0: 11799.6. Samples: 11737532. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:43:27,789][569624] Avg episode reward: [(0, '4517.601')] [2023-03-08 21:43:30,626][569911] Updated weights for policy 0, policy_version 23040 (0.0004) [2023-03-08 21:43:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.8, 300 sec: 11899.2). Total num frames: 11816960. Throughput: 0: 11669.5. Samples: 11806432. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:43:32,789][569624] Avg episode reward: [(0, '4397.985')] [2023-03-08 21:43:32,820][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000023088_11821056.pth... 
[2023-03-08 21:43:32,822][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000022400_11468800.pth [2023-03-08 21:43:34,297][569911] Updated weights for policy 0, policy_version 23120 (0.0005) [2023-03-08 21:43:37,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11741.9, 300 sec: 11899.2). Total num frames: 11874304. Throughput: 0: 11646.7. Samples: 11874204. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:43:37,790][569624] Avg episode reward: [(0, '4300.330')] [2023-03-08 21:43:37,995][569911] Updated weights for policy 0, policy_version 23200 (0.0005) [2023-03-08 21:43:41,652][569911] Updated weights for policy 0, policy_version 23280 (0.0005) [2023-03-08 21:43:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11899.2). Total num frames: 11931648. Throughput: 0: 11563.3. Samples: 11907072. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:43:42,789][569624] Avg episode reward: [(0, '4221.558')] [2023-03-08 21:43:45,283][569911] Updated weights for policy 0, policy_version 23360 (0.0005) [2023-03-08 21:43:47,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11537.1, 300 sec: 11885.3). Total num frames: 11984896. Throughput: 0: 11479.2. Samples: 11973148. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:43:47,789][569624] Avg episode reward: [(0, '3590.810')] [2023-03-08 21:43:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000023408_11984896.pth... [2023-03-08 21:43:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000022744_11644928.pth [2023-03-08 21:43:48,996][569911] Updated weights for policy 0, policy_version 23440 (0.0005) [2023-03-08 21:43:52,478][569911] Updated weights for policy 0, policy_version 23520 (0.0005) [2023-03-08 21:43:52,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11537.1, 300 sec: 11871.5). 
Total num frames: 12042240. Throughput: 0: 11416.7. Samples: 12042528. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:43:52,789][569624] Avg episode reward: [(0, '3917.791')] [2023-03-08 21:43:55,903][569911] Updated weights for policy 0, policy_version 23600 (0.0004) [2023-03-08 21:43:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11537.1, 300 sec: 11871.5). Total num frames: 12103680. Throughput: 0: 11455.6. Samples: 12079072. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:43:57,789][569624] Avg episode reward: [(0, '4168.185')] [2023-03-08 21:43:59,282][569911] Updated weights for policy 0, policy_version 23680 (0.0005) [2023-03-08 21:44:02,624][569911] Updated weights for policy 0, policy_version 23760 (0.0004) [2023-03-08 21:44:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11605.3, 300 sec: 11885.3). Total num frames: 12165120. Throughput: 0: 11543.7. Samples: 12152168. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:44:02,789][569624] Avg episode reward: [(0, '4509.500')] [2023-03-08 21:44:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000023760_12165120.pth... [2023-03-08 21:44:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000023088_11821056.pth [2023-03-08 21:44:06,005][569911] Updated weights for policy 0, policy_version 23840 (0.0004) [2023-03-08 21:44:07,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11605.3, 300 sec: 11885.3). Total num frames: 12226560. Throughput: 0: 11590.7. Samples: 12223704. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:44:07,790][569624] Avg episode reward: [(0, '3996.177')] [2023-03-08 21:44:09,450][569911] Updated weights for policy 0, policy_version 23920 (0.0005) [2023-03-08 21:44:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11871.5). Total num frames: 12283904. Throughput: 0: 11608.7. Samples: 12259924. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:44:12,789][569624] Avg episode reward: [(0, '4181.906')] [2023-03-08 21:44:12,901][569911] Updated weights for policy 0, policy_version 24000 (0.0004) [2023-03-08 21:44:16,549][569911] Updated weights for policy 0, policy_version 24080 (0.0005) [2023-03-08 21:44:17,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11885.3). Total num frames: 12341248. Throughput: 0: 11612.0. Samples: 12328972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:44:17,789][569624] Avg episode reward: [(0, '4123.046')] [2023-03-08 21:44:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000024104_12341248.pth... [2023-03-08 21:44:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000023408_11984896.pth [2023-03-08 21:44:20,112][569911] Updated weights for policy 0, policy_version 24160 (0.0005) [2023-03-08 21:44:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11871.5). Total num frames: 12398592. Throughput: 0: 11654.7. Samples: 12398664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:44:22,789][569624] Avg episode reward: [(0, '4130.692')] [2023-03-08 21:44:23,581][569911] Updated weights for policy 0, policy_version 24240 (0.0005) [2023-03-08 21:44:27,228][569911] Updated weights for policy 0, policy_version 24320 (0.0005) [2023-03-08 21:44:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11843.7). Total num frames: 12455936. Throughput: 0: 11658.5. Samples: 12431704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:44:27,789][569624] Avg episode reward: [(0, '3781.695')] [2023-03-08 21:44:30,686][569911] Updated weights for policy 0, policy_version 24400 (0.0005) [2023-03-08 21:44:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11857.6). Total num frames: 12517376. Throughput: 0: 11768.9. 
Samples: 12502748. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:44:32,789][569624] Avg episode reward: [(0, '4289.668')] [2023-03-08 21:44:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000024448_12517376.pth... [2023-03-08 21:44:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000023760_12165120.pth [2023-03-08 21:44:34,265][569911] Updated weights for policy 0, policy_version 24480 (0.0005) [2023-03-08 21:44:37,584][569911] Updated weights for policy 0, policy_version 24560 (0.0005) [2023-03-08 21:44:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11857.6). Total num frames: 12574720. Throughput: 0: 11807.0. Samples: 12573844. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:44:37,789][569624] Avg episode reward: [(0, '4017.755')] [2023-03-08 21:44:41,038][569911] Updated weights for policy 0, policy_version 24640 (0.0005) [2023-03-08 21:44:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 12636160. Throughput: 0: 11825.9. Samples: 12611240. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:44:42,789][569624] Avg episode reward: [(0, '4235.820')] [2023-03-08 21:44:44,638][569911] Updated weights for policy 0, policy_version 24720 (0.0005) [2023-03-08 21:44:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 12693504. Throughput: 0: 11700.7. Samples: 12678700. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:44:47,789][569624] Avg episode reward: [(0, '4167.497')] [2023-03-08 21:44:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000024792_12693504.pth... 
[2023-03-08 21:44:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000024104_12341248.pth [2023-03-08 21:44:48,157][569911] Updated weights for policy 0, policy_version 24800 (0.0005) [2023-03-08 21:44:51,550][569911] Updated weights for policy 0, policy_version 24880 (0.0005) [2023-03-08 21:44:52,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 12750848. Throughput: 0: 11692.2. Samples: 12749852. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:44:52,789][569624] Avg episode reward: [(0, '4026.241')] [2023-03-08 21:44:55,361][569911] Updated weights for policy 0, policy_version 24960 (0.0005) [2023-03-08 21:44:57,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 12804096. Throughput: 0: 11603.0. Samples: 12782060. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:44:57,789][569624] Avg episode reward: [(0, '4030.050')] [2023-03-08 21:44:58,959][569911] Updated weights for policy 0, policy_version 25040 (0.0005) [2023-03-08 21:45:02,537][569911] Updated weights for policy 0, policy_version 25120 (0.0004) [2023-03-08 21:45:02,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11605.4, 300 sec: 11815.9). Total num frames: 12861440. Throughput: 0: 11559.8. Samples: 12849160. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:45:02,789][569624] Avg episode reward: [(0, '3842.988')] [2023-03-08 21:45:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000025120_12861440.pth... [2023-03-08 21:45:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000024448_12517376.pth [2023-03-08 21:45:06,006][569911] Updated weights for policy 0, policy_version 25200 (0.0005) [2023-03-08 21:45:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.4, 300 sec: 11829.8). 
Total num frames: 12922880. Throughput: 0: 11584.0. Samples: 12919944. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:45:07,789][569624] Avg episode reward: [(0, '4017.806')] [2023-03-08 21:45:09,472][569911] Updated weights for policy 0, policy_version 25280 (0.0005) [2023-03-08 21:45:12,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 12980224. Throughput: 0: 11662.1. Samples: 12956500. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:45:12,789][569624] Avg episode reward: [(0, '4468.067')] [2023-03-08 21:45:12,893][569911] Updated weights for policy 0, policy_version 25360 (0.0005) [2023-03-08 21:45:16,275][569911] Updated weights for policy 0, policy_version 25440 (0.0005) [2023-03-08 21:45:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 13041664. Throughput: 0: 11696.5. Samples: 13029092. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:45:17,789][569624] Avg episode reward: [(0, '4716.080')] [2023-03-08 21:45:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000025472_13041664.pth... [2023-03-08 21:45:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000024792_12693504.pth [2023-03-08 21:45:19,859][569911] Updated weights for policy 0, policy_version 25520 (0.0004) [2023-03-08 21:45:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11815.9). Total num frames: 13094912. Throughput: 0: 11599.8. Samples: 13095836. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:45:22,789][569624] Avg episode reward: [(0, '4530.089')] [2023-03-08 21:45:23,499][569911] Updated weights for policy 0, policy_version 25600 (0.0005) [2023-03-08 21:45:26,963][569911] Updated weights for policy 0, policy_version 25680 (0.0004) [2023-03-08 21:45:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 13156352. Throughput: 0: 11568.9. Samples: 13131840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:45:27,789][569624] Avg episode reward: [(0, '4461.542')] [2023-03-08 21:45:30,384][569911] Updated weights for policy 0, policy_version 25760 (0.0005) [2023-03-08 21:45:32,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 13217792. Throughput: 0: 11629.3. Samples: 13202020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:45:32,789][569624] Avg episode reward: [(0, '4655.256')] [2023-03-08 21:45:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000025816_13217792.pth... [2023-03-08 21:45:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000025120_12861440.pth [2023-03-08 21:45:33,838][569911] Updated weights for policy 0, policy_version 25840 (0.0005) [2023-03-08 21:45:37,355][569911] Updated weights for policy 0, policy_version 25920 (0.0004) [2023-03-08 21:45:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 13275136. Throughput: 0: 11631.4. Samples: 13273264. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:45:37,789][569624] Avg episode reward: [(0, '4503.750')] [2023-03-08 21:45:40,772][569911] Updated weights for policy 0, policy_version 26000 (0.0005) [2023-03-08 21:45:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 13336576. Throughput: 0: 11705.1. 
Samples: 13308788. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:45:42,789][569624] Avg episode reward: [(0, '4619.892')] [2023-03-08 21:45:44,096][569911] Updated weights for policy 0, policy_version 26080 (0.0005) [2023-03-08 21:45:47,652][569911] Updated weights for policy 0, policy_version 26160 (0.0005) [2023-03-08 21:45:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 13393920. Throughput: 0: 11831.7. Samples: 13381588. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:45:47,789][569624] Avg episode reward: [(0, '4427.991')] [2023-03-08 21:45:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000026160_13393920.pth... [2023-03-08 21:45:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000025472_13041664.pth [2023-03-08 21:45:51,036][569911] Updated weights for policy 0, policy_version 26240 (0.0004) [2023-03-08 21:45:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 13455360. Throughput: 0: 11818.6. Samples: 13451780. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:45:52,800][569624] Avg episode reward: [(0, '4371.791')] [2023-03-08 21:45:54,464][569911] Updated weights for policy 0, policy_version 26320 (0.0005) [2023-03-08 21:45:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 13512704. Throughput: 0: 11815.6. Samples: 13488200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:45:57,789][569624] Avg episode reward: [(0, '3991.565')] [2023-03-08 21:45:57,954][569911] Updated weights for policy 0, policy_version 26400 (0.0005) [2023-03-08 21:46:01,340][569911] Updated weights for policy 0, policy_version 26480 (0.0004) [2023-03-08 21:46:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 13570048. 
Throughput: 0: 11809.2. Samples: 13560504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:46:02,800][569624] Avg episode reward: [(0, '4271.000')] [2023-03-08 21:46:02,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000026512_13574144.pth... [2023-03-08 21:46:02,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000025816_13217792.pth [2023-03-08 21:46:04,811][569911] Updated weights for policy 0, policy_version 26560 (0.0004) [2023-03-08 21:46:07,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.2, 300 sec: 11788.2). Total num frames: 13631488. Throughput: 0: 11904.8. Samples: 13631552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:46:07,789][569624] Avg episode reward: [(0, '4457.992')] [2023-03-08 21:46:08,308][569911] Updated weights for policy 0, policy_version 26640 (0.0004) [2023-03-08 21:46:11,957][569911] Updated weights for policy 0, policy_version 26720 (0.0005) [2023-03-08 21:46:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11760.4). Total num frames: 13688832. Throughput: 0: 11832.9. Samples: 13664320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:46:12,800][569624] Avg episode reward: [(0, '3719.059')] [2023-03-08 21:46:15,791][569911] Updated weights for policy 0, policy_version 26800 (0.0005) [2023-03-08 21:46:17,789][569624] Fps is (10 sec: 11059.0, 60 sec: 11673.6, 300 sec: 11732.6). Total num frames: 13742080. Throughput: 0: 11740.3. Samples: 13730336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:46:17,789][569624] Avg episode reward: [(0, '3695.049')] [2023-03-08 21:46:17,814][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000026848_13746176.pth... 
[2023-03-08 21:46:17,816][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000026160_13393920.pth [2023-03-08 21:46:19,159][569911] Updated weights for policy 0, policy_version 26880 (0.0005) [2023-03-08 21:46:22,610][569911] Updated weights for policy 0, policy_version 26960 (0.0005) [2023-03-08 21:46:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 13803520. Throughput: 0: 11781.2. Samples: 13803420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:46:22,800][569624] Avg episode reward: [(0, '4311.031')] [2023-03-08 21:46:25,935][569911] Updated weights for policy 0, policy_version 27040 (0.0005) [2023-03-08 21:46:27,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 13864960. Throughput: 0: 11813.4. Samples: 13840392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:46:27,800][569624] Avg episode reward: [(0, '4403.039')] [2023-03-08 21:46:29,424][569911] Updated weights for policy 0, policy_version 27120 (0.0005) [2023-03-08 21:46:32,728][569911] Updated weights for policy 0, policy_version 27200 (0.0005) [2023-03-08 21:46:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 13926400. Throughput: 0: 11780.2. Samples: 13911700. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:46:32,800][569624] Avg episode reward: [(0, '4469.916')] [2023-03-08 21:46:32,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000027200_13926400.pth... [2023-03-08 21:46:32,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000026512_13574144.pth [2023-03-08 21:46:36,068][569911] Updated weights for policy 0, policy_version 27280 (0.0005) [2023-03-08 21:46:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11732.6). 
Total num frames: 13983744. Throughput: 0: 11837.9. Samples: 13984488. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:46:37,800][569624] Avg episode reward: [(0, '4390.801')] [2023-03-08 21:46:39,467][569911] Updated weights for policy 0, policy_version 27360 (0.0005) [2023-03-08 21:46:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11746.5). Total num frames: 14045184. Throughput: 0: 11844.8. Samples: 14021216. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:46:42,789][569624] Avg episode reward: [(0, '4382.386')] [2023-03-08 21:46:42,847][569911] Updated weights for policy 0, policy_version 27440 (0.0004) [2023-03-08 21:46:46,182][569911] Updated weights for policy 0, policy_version 27520 (0.0004) [2023-03-08 21:46:47,789][569624] Fps is (10 sec: 12697.6, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 14110720. Throughput: 0: 11878.3. Samples: 14095028. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:46:47,789][569624] Avg episode reward: [(0, '4412.105')] [2023-03-08 21:46:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000027560_14110720.pth... [2023-03-08 21:46:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000026848_13746176.pth [2023-03-08 21:46:49,427][569911] Updated weights for policy 0, policy_version 27600 (0.0005) [2023-03-08 21:46:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 14168064. Throughput: 0: 11924.1. Samples: 14168136. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:46:52,789][569624] Avg episode reward: [(0, '4497.006')] [2023-03-08 21:46:52,810][569911] Updated weights for policy 0, policy_version 27680 (0.0005) [2023-03-08 21:46:56,143][569911] Updated weights for policy 0, policy_version 27760 (0.0005) [2023-03-08 21:46:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 14229504. Throughput: 0: 12032.8. Samples: 14205796. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:46:57,789][569624] Avg episode reward: [(0, '4385.972')] [2023-03-08 21:46:59,645][569911] Updated weights for policy 0, policy_version 27840 (0.0005) [2023-03-08 21:47:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11788.1). Total num frames: 14290944. Throughput: 0: 12178.5. Samples: 14278368. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:47:02,789][569624] Avg episode reward: [(0, '4462.572')] [2023-03-08 21:47:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000027912_14290944.pth... [2023-03-08 21:47:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000027200_13926400.pth [2023-03-08 21:47:02,988][569911] Updated weights for policy 0, policy_version 27920 (0.0005) [2023-03-08 21:47:06,229][569911] Updated weights for policy 0, policy_version 28000 (0.0005) [2023-03-08 21:47:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11788.2). Total num frames: 14352384. Throughput: 0: 12200.8. Samples: 14352456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:07,789][569624] Avg episode reward: [(0, '4434.023')] [2023-03-08 21:47:09,442][569911] Updated weights for policy 0, policy_version 28080 (0.0004) [2023-03-08 21:47:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11802.0). Total num frames: 14413824. Throughput: 0: 12198.2. 
Samples: 14389312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:12,789][569624] Avg episode reward: [(0, '4172.868')] [2023-03-08 21:47:12,867][569911] Updated weights for policy 0, policy_version 28160 (0.0005) [2023-03-08 21:47:16,061][569911] Updated weights for policy 0, policy_version 28240 (0.0004) [2023-03-08 21:47:17,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12288.0, 300 sec: 11815.9). Total num frames: 14479360. Throughput: 0: 12279.7. Samples: 14464284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:17,789][569624] Avg episode reward: [(0, '4224.515')] [2023-03-08 21:47:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000028280_14479360.pth... [2023-03-08 21:47:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000027560_14110720.pth [2023-03-08 21:47:19,498][569911] Updated weights for policy 0, policy_version 28320 (0.0005) [2023-03-08 21:47:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 11802.0). Total num frames: 14536704. Throughput: 0: 12246.9. Samples: 14535600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:22,789][569624] Avg episode reward: [(0, '4484.152')] [2023-03-08 21:47:23,025][569911] Updated weights for policy 0, policy_version 28400 (0.0005) [2023-03-08 21:47:26,531][569911] Updated weights for policy 0, policy_version 28480 (0.0005) [2023-03-08 21:47:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12151.5, 300 sec: 11802.0). Total num frames: 14594048. Throughput: 0: 12186.4. Samples: 14569604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:27,789][569624] Avg episode reward: [(0, '4435.849')] [2023-03-08 21:47:29,833][569911] Updated weights for policy 0, policy_version 28560 (0.0005) [2023-03-08 21:47:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 12083.2, 300 sec: 11802.0). Total num frames: 14651392. 
Throughput: 0: 12103.4. Samples: 14639680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:32,789][569624] Avg episode reward: [(0, '4286.137')] [2023-03-08 21:47:32,836][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000028624_14655488.pth... [2023-03-08 21:47:32,838][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000027912_14290944.pth [2023-03-08 21:47:33,457][569911] Updated weights for policy 0, policy_version 28640 (0.0005) [2023-03-08 21:47:36,812][569911] Updated weights for policy 0, policy_version 28720 (0.0005) [2023-03-08 21:47:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11802.0). Total num frames: 14712832. Throughput: 0: 12106.0. Samples: 14712908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:37,789][569624] Avg episode reward: [(0, '3884.046')] [2023-03-08 21:47:40,164][569911] Updated weights for policy 0, policy_version 28800 (0.0005) [2023-03-08 21:47:42,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 11815.9). Total num frames: 14778368. Throughput: 0: 12097.9. Samples: 14750200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:42,789][569624] Avg episode reward: [(0, '4419.001')] [2023-03-08 21:47:43,407][569911] Updated weights for policy 0, policy_version 28880 (0.0005) [2023-03-08 21:47:46,749][569911] Updated weights for policy 0, policy_version 28960 (0.0005) [2023-03-08 21:47:47,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11815.9). Total num frames: 14835712. Throughput: 0: 12152.0. Samples: 14825208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:47,789][569624] Avg episode reward: [(0, '4587.590')] [2023-03-08 21:47:47,812][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000028984_14839808.pth... 
[2023-03-08 21:47:47,813][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000028280_14479360.pth [2023-03-08 21:47:50,172][569911] Updated weights for policy 0, policy_version 29040 (0.0005) [2023-03-08 21:47:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11815.9). Total num frames: 14897152. Throughput: 0: 12078.0. Samples: 14895968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:52,789][569624] Avg episode reward: [(0, '4665.600')] [2023-03-08 21:47:53,818][569911] Updated weights for policy 0, policy_version 29120 (0.0005) [2023-03-08 21:47:57,156][569911] Updated weights for policy 0, policy_version 29200 (0.0005) [2023-03-08 21:47:57,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11815.9). Total num frames: 14954496. Throughput: 0: 12017.2. Samples: 14930088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:47:57,789][569624] Avg episode reward: [(0, '4394.091')] [2023-03-08 21:48:00,641][569911] Updated weights for policy 0, policy_version 29280 (0.0005) [2023-03-08 21:48:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 15011840. Throughput: 0: 11908.0. Samples: 15000144. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:48:02,789][569624] Avg episode reward: [(0, '4644.011')] [2023-03-08 21:48:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000029320_15011840.pth... [2023-03-08 21:48:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000028624_14655488.pth [2023-03-08 21:48:04,191][569911] Updated weights for policy 0, policy_version 29360 (0.0004) [2023-03-08 21:48:07,587][569911] Updated weights for policy 0, policy_version 29440 (0.0004) [2023-03-08 21:48:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11815.9). 
Total num frames: 15073280. Throughput: 0: 11945.9. Samples: 15073164. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:48:07,789][569624] Avg episode reward: [(0, '4490.211')] [2023-03-08 21:48:11,010][569911] Updated weights for policy 0, policy_version 29520 (0.0005) [2023-03-08 21:48:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11829.8). Total num frames: 15134720. Throughput: 0: 12006.6. Samples: 15109900. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:48:12,789][569624] Avg episode reward: [(0, '4777.146')] [2023-03-08 21:48:14,498][569911] Updated weights for policy 0, policy_version 29600 (0.0004) [2023-03-08 21:48:17,745][569911] Updated weights for policy 0, policy_version 29680 (0.0004) [2023-03-08 21:48:17,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 15196160. Throughput: 0: 12002.2. Samples: 15179776. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:48:17,789][569624] Avg episode reward: [(0, '4627.493')] [2023-03-08 21:48:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000029680_15196160.pth... [2023-03-08 21:48:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000028984_14839808.pth [2023-03-08 21:48:21,034][569911] Updated weights for policy 0, policy_version 29760 (0.0005) [2023-03-08 21:48:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11843.7). Total num frames: 15257600. Throughput: 0: 12104.4. Samples: 15257608. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:48:22,789][569624] Avg episode reward: [(0, '4510.332')] [2023-03-08 21:48:24,225][569911] Updated weights for policy 0, policy_version 29840 (0.0005) [2023-03-08 21:48:27,546][569911] Updated weights for policy 0, policy_version 29920 (0.0005) [2023-03-08 21:48:27,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11871.5). Total num frames: 15319040. Throughput: 0: 12096.2. Samples: 15294528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:48:27,789][569624] Avg episode reward: [(0, '4468.300')] [2023-03-08 21:48:30,992][569911] Updated weights for policy 0, policy_version 30000 (0.0005) [2023-03-08 21:48:32,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 11885.3). Total num frames: 15380480. Throughput: 0: 12063.8. Samples: 15368080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:48:32,789][569624] Avg episode reward: [(0, '4438.296')] [2023-03-08 21:48:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000030040_15380480.pth... [2023-03-08 21:48:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000029320_15011840.pth [2023-03-08 21:48:34,286][569911] Updated weights for policy 0, policy_version 30080 (0.0005) [2023-03-08 21:48:37,560][569911] Updated weights for policy 0, policy_version 30160 (0.0004) [2023-03-08 21:48:37,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 11899.2). Total num frames: 15441920. Throughput: 0: 12132.3. Samples: 15441920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:48:37,789][569624] Avg episode reward: [(0, '4547.102')] [2023-03-08 21:48:40,983][569911] Updated weights for policy 0, policy_version 30240 (0.0005) [2023-03-08 21:48:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11927.0). Total num frames: 15503360. Throughput: 0: 12172.9. 
Samples: 15477868. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:48:42,789][569624] Avg episode reward: [(0, '4326.567')] [2023-03-08 21:48:44,370][569911] Updated weights for policy 0, policy_version 30320 (0.0005) [2023-03-08 21:48:47,695][569911] Updated weights for policy 0, policy_version 30400 (0.0004) [2023-03-08 21:48:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 11940.9). Total num frames: 15564800. Throughput: 0: 12262.7. Samples: 15551968. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:48:47,789][569624] Avg episode reward: [(0, '4376.338')] [2023-03-08 21:48:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000030400_15564800.pth... [2023-03-08 21:48:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000029680_15196160.pth [2023-03-08 21:48:50,963][569911] Updated weights for policy 0, policy_version 30480 (0.0005) [2023-03-08 21:48:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 11940.9). Total num frames: 15626240. Throughput: 0: 12237.1. Samples: 15623832. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:48:52,789][569624] Avg episode reward: [(0, '4358.174')] [2023-03-08 21:48:54,353][569911] Updated weights for policy 0, policy_version 30560 (0.0004) [2023-03-08 21:48:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11927.0). Total num frames: 15683584. Throughput: 0: 12220.9. Samples: 15659840. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:48:57,789][569624] Avg episode reward: [(0, '4069.663')] [2023-03-08 21:48:57,936][569911] Updated weights for policy 0, policy_version 30640 (0.0005) [2023-03-08 21:49:01,207][569911] Updated weights for policy 0, policy_version 30720 (0.0004) [2023-03-08 21:49:02,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12288.0, 300 sec: 11940.9). Total num frames: 15749120. 
Throughput: 0: 12288.0. Samples: 15732736. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:49:02,789][569624] Avg episode reward: [(0, '3679.983')] [2023-03-08 21:49:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000030760_15749120.pth... [2023-03-08 21:49:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000030040_15380480.pth [2023-03-08 21:49:04,289][569911] Updated weights for policy 0, policy_version 30800 (0.0005) [2023-03-08 21:49:07,566][569911] Updated weights for policy 0, policy_version 30880 (0.0005) [2023-03-08 21:49:07,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12288.0, 300 sec: 11954.8). Total num frames: 15810560. Throughput: 0: 12286.9. Samples: 15810520. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:49:07,789][569624] Avg episode reward: [(0, '3907.050')] [2023-03-08 21:49:11,188][569911] Updated weights for policy 0, policy_version 30960 (0.0004) [2023-03-08 21:49:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 11954.8). Total num frames: 15867904. Throughput: 0: 12198.9. Samples: 15843476. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:49:12,789][569624] Avg episode reward: [(0, '3552.539')] [2023-03-08 21:49:14,611][569911] Updated weights for policy 0, policy_version 31040 (0.0005) [2023-03-08 21:49:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 11968.6). Total num frames: 15929344. Throughput: 0: 12188.7. Samples: 15916572. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 21:49:17,789][569624] Avg episode reward: [(0, '3633.471')] [2023-03-08 21:49:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000031112_15929344.pth... 
[2023-03-08 21:49:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000030400_15564800.pth [2023-03-08 21:49:18,072][569911] Updated weights for policy 0, policy_version 31120 (0.0005) [2023-03-08 21:49:21,470][569911] Updated weights for policy 0, policy_version 31200 (0.0005) [2023-03-08 21:49:22,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 11968.6). Total num frames: 15986688. Throughput: 0: 12104.7. Samples: 15986632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:49:22,789][569624] Avg episode reward: [(0, '3644.290')] [2023-03-08 21:49:25,017][569911] Updated weights for policy 0, policy_version 31280 (0.0005) [2023-03-08 21:49:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11968.6). Total num frames: 16048128. Throughput: 0: 12080.7. Samples: 16021500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:49:27,789][569624] Avg episode reward: [(0, '3971.369')] [2023-03-08 21:49:28,545][569911] Updated weights for policy 0, policy_version 31360 (0.0005) [2023-03-08 21:49:32,280][569911] Updated weights for policy 0, policy_version 31440 (0.0005) [2023-03-08 21:49:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 16101376. Throughput: 0: 11931.0. Samples: 16088864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:49:32,789][569624] Avg episode reward: [(0, '4165.348')] [2023-03-08 21:49:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000031448_16101376.pth... [2023-03-08 21:49:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000030760_15749120.pth [2023-03-08 21:49:35,783][569911] Updated weights for policy 0, policy_version 31520 (0.0005) [2023-03-08 21:49:37,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11946.7, 300 sec: 11940.9). 
Total num frames: 16158720. Throughput: 0: 11822.5. Samples: 16155844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:49:37,789][569624] Avg episode reward: [(0, '4017.152')] [2023-03-08 21:49:39,402][569911] Updated weights for policy 0, policy_version 31600 (0.0005) [2023-03-08 21:49:42,618][569911] Updated weights for policy 0, policy_version 31680 (0.0004) [2023-03-08 21:49:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 16220160. Throughput: 0: 11857.1. Samples: 16193412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:49:42,789][569624] Avg episode reward: [(0, '4481.106')] [2023-03-08 21:49:46,025][569911] Updated weights for policy 0, policy_version 31760 (0.0005) [2023-03-08 21:49:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 16281600. Throughput: 0: 11908.3. Samples: 16268612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:49:47,789][569624] Avg episode reward: [(0, '4440.627')] [2023-03-08 21:49:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000031800_16281600.pth... [2023-03-08 21:49:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000031112_15929344.pth [2023-03-08 21:49:49,473][569911] Updated weights for policy 0, policy_version 31840 (0.0005) [2023-03-08 21:49:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 16338944. Throughput: 0: 11732.9. Samples: 16338500. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:49:52,789][569624] Avg episode reward: [(0, '4432.282')] [2023-03-08 21:49:53,026][569911] Updated weights for policy 0, policy_version 31920 (0.0005) [2023-03-08 21:49:56,435][569911] Updated weights for policy 0, policy_version 32000 (0.0005) [2023-03-08 21:49:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 16396288. Throughput: 0: 11783.6. Samples: 16373740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:49:57,789][569624] Avg episode reward: [(0, '4012.735')] [2023-03-08 21:49:59,775][569911] Updated weights for policy 0, policy_version 32080 (0.0005) [2023-03-08 21:50:02,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 16457728. Throughput: 0: 11793.0. Samples: 16447256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:50:02,789][569624] Avg episode reward: [(0, '4402.372')] [2023-03-08 21:50:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000032152_16461824.pth... [2023-03-08 21:50:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000031448_16101376.pth [2023-03-08 21:50:03,183][569911] Updated weights for policy 0, policy_version 32160 (0.0004) [2023-03-08 21:50:06,556][569911] Updated weights for policy 0, policy_version 32240 (0.0004) [2023-03-08 21:50:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11996.4). Total num frames: 16519168. Throughput: 0: 11833.5. Samples: 16519140. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:50:07,789][569624] Avg episode reward: [(0, '4132.529')] [2023-03-08 21:50:10,027][569911] Updated weights for policy 0, policy_version 32320 (0.0005) [2023-03-08 21:50:12,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 16580608. Throughput: 0: 11854.5. 
Samples: 16554952. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:50:12,789][569624] Avg episode reward: [(0, '4256.938')] [2023-03-08 21:50:13,462][569911] Updated weights for policy 0, policy_version 32400 (0.0005) [2023-03-08 21:50:16,902][569911] Updated weights for policy 0, policy_version 32480 (0.0005) [2023-03-08 21:50:17,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 12010.3). Total num frames: 16637952. Throughput: 0: 11926.2. Samples: 16625544. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:50:17,789][569624] Avg episode reward: [(0, '4061.029')] [2023-03-08 21:50:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000032496_16637952.pth... [2023-03-08 21:50:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000031800_16281600.pth [2023-03-08 21:50:20,276][569911] Updated weights for policy 0, policy_version 32560 (0.0005) [2023-03-08 21:50:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 16699392. Throughput: 0: 12073.9. Samples: 16699168. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:50:22,789][569624] Avg episode reward: [(0, '4504.645')] [2023-03-08 21:50:23,614][569911] Updated weights for policy 0, policy_version 32640 (0.0004) [2023-03-08 21:50:26,915][569911] Updated weights for policy 0, policy_version 32720 (0.0005) [2023-03-08 21:50:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 16760832. Throughput: 0: 12054.1. Samples: 16735844. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:50:27,789][569624] Avg episode reward: [(0, '4394.534')] [2023-03-08 21:50:30,361][569911] Updated weights for policy 0, policy_version 32800 (0.0005) [2023-03-08 21:50:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 16822272. 
Throughput: 0: 11965.6. Samples: 16807064. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:50:32,789][569624] Avg episode reward: [(0, '4549.080')] [2023-03-08 21:50:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000032856_16822272.pth... [2023-03-08 21:50:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000032152_16461824.pth [2023-03-08 21:50:33,702][569911] Updated weights for policy 0, policy_version 32880 (0.0005) [2023-03-08 21:50:37,176][569911] Updated weights for policy 0, policy_version 32960 (0.0005) [2023-03-08 21:50:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 16883712. Throughput: 0: 12049.4. Samples: 16880724. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:50:37,789][569624] Avg episode reward: [(0, '4321.674')] [2023-03-08 21:50:40,488][569911] Updated weights for policy 0, policy_version 33040 (0.0005) [2023-03-08 21:50:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 16941056. Throughput: 0: 12063.1. Samples: 16916580. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:50:42,789][569624] Avg episode reward: [(0, '4563.583')] [2023-03-08 21:50:43,888][569911] Updated weights for policy 0, policy_version 33120 (0.0005) [2023-03-08 21:50:47,128][569911] Updated weights for policy 0, policy_version 33200 (0.0005) [2023-03-08 21:50:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 17006592. Throughput: 0: 12097.9. Samples: 16991664. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:50:47,789][569624] Avg episode reward: [(0, '4362.107')] [2023-03-08 21:50:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000033216_17006592.pth... 
[2023-03-08 21:50:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000032496_16637952.pth [2023-03-08 21:50:50,469][569911] Updated weights for policy 0, policy_version 33280 (0.0005) [2023-03-08 21:50:52,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 17063936. Throughput: 0: 12108.2. Samples: 17064008. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:50:52,799][569624] Avg episode reward: [(0, '4431.839')] [2023-03-08 21:50:53,846][569911] Updated weights for policy 0, policy_version 33360 (0.0004) [2023-03-08 21:50:57,334][569911] Updated weights for policy 0, policy_version 33440 (0.0005) [2023-03-08 21:50:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 17125376. Throughput: 0: 12113.0. Samples: 17100036. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:50:57,802][569624] Avg episode reward: [(0, '4610.895')] [2023-03-08 21:51:00,670][569911] Updated weights for policy 0, policy_version 33520 (0.0005) [2023-03-08 21:51:02,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12151.4, 300 sec: 12052.0). Total num frames: 17186816. Throughput: 0: 12174.0. Samples: 17173376. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:51:02,789][569624] Avg episode reward: [(0, '4425.550')] [2023-03-08 21:51:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000033568_17186816.pth... [2023-03-08 21:51:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000032856_16822272.pth [2023-03-08 21:51:03,902][569911] Updated weights for policy 0, policy_version 33600 (0.0004) [2023-03-08 21:51:07,209][569911] Updated weights for policy 0, policy_version 33680 (0.0005) [2023-03-08 21:51:07,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12065.8). 
Total num frames: 17248256. Throughput: 0: 12202.0. Samples: 17248256. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:51:07,789][569624] Avg episode reward: [(0, '4446.694')] [2023-03-08 21:51:10,644][569911] Updated weights for policy 0, policy_version 33760 (0.0005) [2023-03-08 21:51:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 17309696. Throughput: 0: 12198.6. Samples: 17284780. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:51:12,789][569624] Avg episode reward: [(0, '4184.361')] [2023-03-08 21:51:14,044][569911] Updated weights for policy 0, policy_version 33840 (0.0005) [2023-03-08 21:51:17,399][569911] Updated weights for policy 0, policy_version 33920 (0.0005) [2023-03-08 21:51:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 17371136. Throughput: 0: 12220.6. Samples: 17356992. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:51:17,789][569624] Avg episode reward: [(0, '4570.925')] [2023-03-08 21:51:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000033928_17371136.pth... [2023-03-08 21:51:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000033216_17006592.pth [2023-03-08 21:51:20,761][569911] Updated weights for policy 0, policy_version 34000 (0.0005) [2023-03-08 21:51:22,789][569624] Fps is (10 sec: 11878.6, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 17428480. Throughput: 0: 12173.8. Samples: 17428544. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:51:22,789][569624] Avg episode reward: [(0, '4510.722')] [2023-03-08 21:51:24,286][569911] Updated weights for policy 0, policy_version 34080 (0.0005) [2023-03-08 21:51:27,614][569911] Updated weights for policy 0, policy_version 34160 (0.0005) [2023-03-08 21:51:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.4, 300 sec: 12079.7). Total num frames: 17489920. Throughput: 0: 12194.7. Samples: 17465344. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:51:27,789][569624] Avg episode reward: [(0, '4402.602')] [2023-03-08 21:51:30,904][569911] Updated weights for policy 0, policy_version 34240 (0.0005) [2023-03-08 21:51:32,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 17551360. Throughput: 0: 12166.3. Samples: 17539148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:51:32,789][569624] Avg episode reward: [(0, '4637.819')] [2023-03-08 21:51:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000034288_17555456.pth... [2023-03-08 21:51:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000033568_17186816.pth [2023-03-08 21:51:34,138][569911] Updated weights for policy 0, policy_version 34320 (0.0004) [2023-03-08 21:51:37,299][569911] Updated weights for policy 0, policy_version 34400 (0.0005) [2023-03-08 21:51:37,789][569624] Fps is (10 sec: 12697.8, 60 sec: 12219.8, 300 sec: 12107.5). Total num frames: 17616896. Throughput: 0: 12280.8. Samples: 17616644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:51:37,789][569624] Avg episode reward: [(0, '4503.675')] [2023-03-08 21:51:40,573][569911] Updated weights for policy 0, policy_version 34480 (0.0005) [2023-03-08 21:51:42,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12288.0, 300 sec: 12093.6). Total num frames: 17678336. Throughput: 0: 12303.2. 
Samples: 17653680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:51:42,789][569624] Avg episode reward: [(0, '4719.794')] [2023-03-08 21:51:43,859][569911] Updated weights for policy 0, policy_version 34560 (0.0004) [2023-03-08 21:51:47,155][569911] Updated weights for policy 0, policy_version 34640 (0.0005) [2023-03-08 21:51:47,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 17739776. Throughput: 0: 12349.7. Samples: 17729112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:51:47,789][569624] Avg episode reward: [(0, '4636.485')] [2023-03-08 21:51:47,808][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000034656_17743872.pth... [2023-03-08 21:51:47,809][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000033928_17371136.pth [2023-03-08 21:51:50,844][569911] Updated weights for policy 0, policy_version 34720 (0.0005) [2023-03-08 21:51:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 17797120. Throughput: 0: 12214.2. Samples: 17797896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:51:52,789][569624] Avg episode reward: [(0, '4412.019')] [2023-03-08 21:51:54,108][569911] Updated weights for policy 0, policy_version 34800 (0.0004) [2023-03-08 21:51:57,368][569911] Updated weights for policy 0, policy_version 34880 (0.0005) [2023-03-08 21:51:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12288.0, 300 sec: 12107.5). Total num frames: 17862656. Throughput: 0: 12248.9. Samples: 17835980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:51:57,789][569624] Avg episode reward: [(0, '4563.935')] [2023-03-08 21:52:00,807][569911] Updated weights for policy 0, policy_version 34960 (0.0005) [2023-03-08 21:52:02,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12288.0, 300 sec: 12107.5). Total num frames: 17924096. 
Throughput: 0: 12245.8. Samples: 17908052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:02,789][569624] Avg episode reward: [(0, '4334.140')] [2023-03-08 21:52:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000035008_17924096.pth... [2023-03-08 21:52:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000034288_17555456.pth [2023-03-08 21:52:04,217][569911] Updated weights for policy 0, policy_version 35040 (0.0005) [2023-03-08 21:52:07,775][569911] Updated weights for policy 0, policy_version 35120 (0.0005) [2023-03-08 21:52:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 17981440. Throughput: 0: 12242.4. Samples: 17979452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:07,789][569624] Avg episode reward: [(0, '4263.172')] [2023-03-08 21:52:11,240][569911] Updated weights for policy 0, policy_version 35200 (0.0005) [2023-03-08 21:52:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 18038784. Throughput: 0: 12198.6. Samples: 18014280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:12,789][569624] Avg episode reward: [(0, '4277.969')] [2023-03-08 21:52:14,785][569911] Updated weights for policy 0, policy_version 35280 (0.0004) [2023-03-08 21:52:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 18100224. Throughput: 0: 12106.0. Samples: 18083920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:17,789][569624] Avg episode reward: [(0, '4184.458')] [2023-03-08 21:52:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000035352_18100224.pth... 
[2023-03-08 21:52:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000034656_17743872.pth [2023-03-08 21:52:18,107][569911] Updated weights for policy 0, policy_version 35360 (0.0005) [2023-03-08 21:52:21,341][569911] Updated weights for policy 0, policy_version 35440 (0.0004) [2023-03-08 21:52:22,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 18161664. Throughput: 0: 12112.9. Samples: 18161728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:22,789][569624] Avg episode reward: [(0, '4467.173')] [2023-03-08 21:52:24,644][569911] Updated weights for policy 0, policy_version 35520 (0.0005) [2023-03-08 21:52:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 18223104. Throughput: 0: 12052.3. Samples: 18196036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:27,789][569624] Avg episode reward: [(0, '4729.688')] [2023-03-08 21:52:28,092][569911] Updated weights for policy 0, policy_version 35600 (0.0005) [2023-03-08 21:52:31,366][569911] Updated weights for policy 0, policy_version 35680 (0.0005) [2023-03-08 21:52:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 18280448. Throughput: 0: 12029.2. Samples: 18270428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:32,789][569624] Avg episode reward: [(0, '4583.353')] [2023-03-08 21:52:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000035704_18280448.pth... [2023-03-08 21:52:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000035008_17924096.pth [2023-03-08 21:52:34,893][569911] Updated weights for policy 0, policy_version 35760 (0.0004) [2023-03-08 21:52:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12079.7). 
Total num frames: 18341888. Throughput: 0: 12064.9. Samples: 18340816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:37,789][569624] Avg episode reward: [(0, '4632.454')] [2023-03-08 21:52:38,460][569911] Updated weights for policy 0, policy_version 35840 (0.0005) [2023-03-08 21:52:41,903][569911] Updated weights for policy 0, policy_version 35920 (0.0004) [2023-03-08 21:52:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 18399232. Throughput: 0: 11972.0. Samples: 18374720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:42,789][569624] Avg episode reward: [(0, '4694.495')] [2023-03-08 21:52:45,078][569911] Updated weights for policy 0, policy_version 36000 (0.0005) [2023-03-08 21:52:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 18464768. Throughput: 0: 12033.7. Samples: 18449568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:47,789][569624] Avg episode reward: [(0, '4728.091')] [2023-03-08 21:52:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000036064_18464768.pth... [2023-03-08 21:52:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000035352_18100224.pth [2023-03-08 21:52:48,423][569911] Updated weights for policy 0, policy_version 36080 (0.0005) [2023-03-08 21:52:51,855][569911] Updated weights for policy 0, policy_version 36160 (0.0005) [2023-03-08 21:52:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 18522112. Throughput: 0: 12058.6. Samples: 18522092. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:52,789][569624] Avg episode reward: [(0, '4625.285')] [2023-03-08 21:52:55,198][569911] Updated weights for policy 0, policy_version 36240 (0.0004) [2023-03-08 21:52:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 18587648. Throughput: 0: 12104.4. Samples: 18558976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:52:57,789][569624] Avg episode reward: [(0, '4405.963')] [2023-03-08 21:52:58,342][569911] Updated weights for policy 0, policy_version 36320 (0.0005) [2023-03-08 21:53:01,742][569911] Updated weights for policy 0, policy_version 36400 (0.0005) [2023-03-08 21:53:02,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 18649088. Throughput: 0: 12243.8. Samples: 18634888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:53:02,789][569624] Avg episode reward: [(0, '4454.741')] [2023-03-08 21:53:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000036424_18649088.pth... [2023-03-08 21:53:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000035704_18280448.pth [2023-03-08 21:53:05,092][569911] Updated weights for policy 0, policy_version 36480 (0.0004) [2023-03-08 21:53:07,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 18706432. Throughput: 0: 12120.5. Samples: 18707148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:53:07,789][569624] Avg episode reward: [(0, '4336.847')] [2023-03-08 21:53:08,443][569911] Updated weights for policy 0, policy_version 36560 (0.0005) [2023-03-08 21:53:11,685][569911] Updated weights for policy 0, policy_version 36640 (0.0005) [2023-03-08 21:53:12,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 18771968. Throughput: 0: 12238.7. 
Samples: 18746776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:53:12,789][569624] Avg episode reward: [(0, '4412.886')] [2023-03-08 21:53:15,150][569911] Updated weights for policy 0, policy_version 36720 (0.0005) [2023-03-08 21:53:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 18829312. Throughput: 0: 12177.7. Samples: 18818424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:53:17,789][569624] Avg episode reward: [(0, '4438.383')] [2023-03-08 21:53:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000036776_18829312.pth... [2023-03-08 21:53:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000036064_18464768.pth [2023-03-08 21:53:18,660][569911] Updated weights for policy 0, policy_version 36800 (0.0005) [2023-03-08 21:53:21,986][569911] Updated weights for policy 0, policy_version 36880 (0.0005) [2023-03-08 21:53:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 18890752. Throughput: 0: 12207.0. Samples: 18890132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:53:22,789][569624] Avg episode reward: [(0, '4325.271')] [2023-03-08 21:53:25,351][569911] Updated weights for policy 0, policy_version 36960 (0.0005) [2023-03-08 21:53:27,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 18952192. Throughput: 0: 12260.4. Samples: 18926436. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:53:27,789][569624] Avg episode reward: [(0, '4492.777')] [2023-03-08 21:53:28,537][569911] Updated weights for policy 0, policy_version 37040 (0.0005) [2023-03-08 21:53:32,005][569911] Updated weights for policy 0, policy_version 37120 (0.0005) [2023-03-08 21:53:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 19013632. 
Throughput: 0: 12237.6. Samples: 19000260. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:53:32,789][569624] Avg episode reward: [(0, '4716.756')] [2023-03-08 21:53:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000037136_19013632.pth... [2023-03-08 21:53:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000036424_18649088.pth [2023-03-08 21:53:35,283][569911] Updated weights for policy 0, policy_version 37200 (0.0005) [2023-03-08 21:53:37,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.8, 300 sec: 12107.5). Total num frames: 19075072. Throughput: 0: 12294.0. Samples: 19075320. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:53:37,789][569624] Avg episode reward: [(0, '4475.545')] [2023-03-08 21:53:38,487][569911] Updated weights for policy 0, policy_version 37280 (0.0004) [2023-03-08 21:53:41,671][569911] Updated weights for policy 0, policy_version 37360 (0.0005) [2023-03-08 21:53:42,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12356.3, 300 sec: 12121.4). Total num frames: 19140608. Throughput: 0: 12294.9. Samples: 19112248. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:53:42,789][569624] Avg episode reward: [(0, '3633.530')] [2023-03-08 21:53:45,162][569911] Updated weights for policy 0, policy_version 37440 (0.0004) [2023-03-08 21:53:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 19197952. Throughput: 0: 12237.0. Samples: 19185556. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 21:53:47,789][569624] Avg episode reward: [(0, '3448.885')] [2023-03-08 21:53:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000037496_19197952.pth... 
[2023-03-08 21:53:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000036776_18829312.pth [2023-03-08 21:53:48,653][569911] Updated weights for policy 0, policy_version 37520 (0.0004) [2023-03-08 21:53:52,118][569911] Updated weights for policy 0, policy_version 37600 (0.0005) [2023-03-08 21:53:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 19255296. Throughput: 0: 12187.4. Samples: 19255584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:53:52,789][569624] Avg episode reward: [(0, '3823.038')] [2023-03-08 21:53:55,555][569911] Updated weights for policy 0, policy_version 37680 (0.0005) [2023-03-08 21:53:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 19316736. Throughput: 0: 12119.8. Samples: 19292168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:53:57,789][569624] Avg episode reward: [(0, '4063.518')] [2023-03-08 21:53:59,120][569911] Updated weights for policy 0, policy_version 37760 (0.0005) [2023-03-08 21:54:02,619][569911] Updated weights for policy 0, policy_version 37840 (0.0005) [2023-03-08 21:54:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 19374080. Throughput: 0: 12076.3. Samples: 19361856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:02,789][569624] Avg episode reward: [(0, '4229.412')] [2023-03-08 21:54:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000037840_19374080.pth... [2023-03-08 21:54:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000037136_19013632.pth [2023-03-08 21:54:06,039][569911] Updated weights for policy 0, policy_version 37920 (0.0005) [2023-03-08 21:54:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.4, 300 sec: 12093.6). 
Total num frames: 19435520. Throughput: 0: 12068.0. Samples: 19433192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:07,789][569624] Avg episode reward: [(0, '4433.710')] [2023-03-08 21:54:09,414][569911] Updated weights for policy 0, policy_version 38000 (0.0005) [2023-03-08 21:54:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 19492864. Throughput: 0: 12053.5. Samples: 19468844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:12,789][569624] Avg episode reward: [(0, '4440.777')] [2023-03-08 21:54:12,799][569911] Updated weights for policy 0, policy_version 38080 (0.0005) [2023-03-08 21:54:16,200][569911] Updated weights for policy 0, policy_version 38160 (0.0005) [2023-03-08 21:54:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 19554304. Throughput: 0: 12039.2. Samples: 19542024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:17,789][569624] Avg episode reward: [(0, '4563.496')] [2023-03-08 21:54:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000038192_19554304.pth... [2023-03-08 21:54:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000037496_19197952.pth [2023-03-08 21:54:19,470][569911] Updated weights for policy 0, policy_version 38240 (0.0004) [2023-03-08 21:54:22,760][569911] Updated weights for policy 0, policy_version 38320 (0.0005) [2023-03-08 21:54:22,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 19619840. Throughput: 0: 12040.9. Samples: 19617160. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:22,789][569624] Avg episode reward: [(0, '4484.297')] [2023-03-08 21:54:26,185][569911] Updated weights for policy 0, policy_version 38400 (0.0005) [2023-03-08 21:54:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 19677184. Throughput: 0: 12009.4. Samples: 19652672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:27,789][569624] Avg episode reward: [(0, '4199.664')] [2023-03-08 21:54:29,799][569911] Updated weights for policy 0, policy_version 38480 (0.0005) [2023-03-08 21:54:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12015.0, 300 sec: 12121.4). Total num frames: 19734528. Throughput: 0: 11925.4. Samples: 19722200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:32,789][569624] Avg episode reward: [(0, '4495.026')] [2023-03-08 21:54:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000038544_19734528.pth... [2023-03-08 21:54:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000037840_19374080.pth [2023-03-08 21:54:33,403][569911] Updated weights for policy 0, policy_version 38560 (0.0005) [2023-03-08 21:54:36,967][569911] Updated weights for policy 0, policy_version 38640 (0.0006) [2023-03-08 21:54:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11946.6, 300 sec: 12107.5). Total num frames: 19791872. Throughput: 0: 11911.6. Samples: 19791604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:37,789][569624] Avg episode reward: [(0, '4024.769')] [2023-03-08 21:54:40,558][569911] Updated weights for policy 0, policy_version 38720 (0.0005) [2023-03-08 21:54:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 12093.6). Total num frames: 19849216. Throughput: 0: 11832.5. Samples: 19824632. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:42,789][569624] Avg episode reward: [(0, '4264.195')] [2023-03-08 21:54:43,936][569911] Updated weights for policy 0, policy_version 38800 (0.0005) [2023-03-08 21:54:47,494][569911] Updated weights for policy 0, policy_version 38880 (0.0005) [2023-03-08 21:54:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.2, 300 sec: 12093.6). Total num frames: 19906560. Throughput: 0: 11852.9. Samples: 19895236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:47,789][569624] Avg episode reward: [(0, '4099.369')] [2023-03-08 21:54:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000038880_19906560.pth... [2023-03-08 21:54:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000038192_19554304.pth [2023-03-08 21:54:50,890][569911] Updated weights for policy 0, policy_version 38960 (0.0005) [2023-03-08 21:54:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12107.5). Total num frames: 19968000. Throughput: 0: 11886.1. Samples: 19968064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:52,789][569624] Avg episode reward: [(0, '4428.165')] [2023-03-08 21:54:54,166][569911] Updated weights for policy 0, policy_version 39040 (0.0005) [2023-03-08 21:54:57,566][569911] Updated weights for policy 0, policy_version 39120 (0.0005) [2023-03-08 21:54:57,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 12107.5). Total num frames: 20029440. Throughput: 0: 11911.7. Samples: 20004872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:54:57,789][569624] Avg episode reward: [(0, '4090.849')] [2023-03-08 21:55:00,856][569911] Updated weights for policy 0, policy_version 39200 (0.0004) [2023-03-08 21:55:02,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 20094976. Throughput: 0: 11925.4. 
Samples: 20078668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:02,789][569624] Avg episode reward: [(0, '4260.203')] [2023-03-08 21:55:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000039248_20094976.pth... [2023-03-08 21:55:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000038544_19734528.pth [2023-03-08 21:55:04,200][569911] Updated weights for policy 0, policy_version 39280 (0.0004) [2023-03-08 21:55:07,698][569911] Updated weights for policy 0, policy_version 39360 (0.0004) [2023-03-08 21:55:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12107.5). Total num frames: 20152320. Throughput: 0: 11858.7. Samples: 20150800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:07,789][569624] Avg episode reward: [(0, '4373.824')] [2023-03-08 21:55:10,941][569911] Updated weights for policy 0, policy_version 39440 (0.0005) [2023-03-08 21:55:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 20213760. Throughput: 0: 11916.4. Samples: 20188908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:12,789][569624] Avg episode reward: [(0, '4170.285')] [2023-03-08 21:55:14,268][569911] Updated weights for policy 0, policy_version 39520 (0.0005) [2023-03-08 21:55:17,589][569911] Updated weights for policy 0, policy_version 39600 (0.0005) [2023-03-08 21:55:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 20275200. Throughput: 0: 12013.0. Samples: 20262784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:17,789][569624] Avg episode reward: [(0, '4435.253')] [2023-03-08 21:55:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000039600_20275200.pth... 
[2023-03-08 21:55:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000038880_19906560.pth [2023-03-08 21:55:21,026][569911] Updated weights for policy 0, policy_version 39680 (0.0004) [2023-03-08 21:55:22,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12121.4). Total num frames: 20336640. Throughput: 0: 12056.5. Samples: 20334148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:22,789][569624] Avg episode reward: [(0, '4183.768')] [2023-03-08 21:55:24,411][569911] Updated weights for policy 0, policy_version 39760 (0.0006) [2023-03-08 21:55:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12107.5). Total num frames: 20393984. Throughput: 0: 12122.8. Samples: 20370160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:27,789][569624] Avg episode reward: [(0, '4138.549')] [2023-03-08 21:55:27,924][569911] Updated weights for policy 0, policy_version 39840 (0.0005) [2023-03-08 21:55:31,358][569911] Updated weights for policy 0, policy_version 39920 (0.0005) [2023-03-08 21:55:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 20455424. Throughput: 0: 12129.9. Samples: 20441080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:32,789][569624] Avg episode reward: [(0, '4048.394')] [2023-03-08 21:55:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000039952_20455424.pth... [2023-03-08 21:55:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000039248_20094976.pth [2023-03-08 21:55:34,573][569911] Updated weights for policy 0, policy_version 40000 (0.0005) [2023-03-08 21:55:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 20512768. Throughput: 0: 12104.5. Samples: 20512768. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:37,789][569624] Avg episode reward: [(0, '4253.636')] [2023-03-08 21:55:38,256][569911] Updated weights for policy 0, policy_version 40080 (0.0005) [2023-03-08 21:55:41,685][569911] Updated weights for policy 0, policy_version 40160 (0.0005) [2023-03-08 21:55:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 20570112. Throughput: 0: 12058.8. Samples: 20547516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:42,800][569624] Avg episode reward: [(0, '4103.994')] [2023-03-08 21:55:45,247][569911] Updated weights for policy 0, policy_version 40240 (0.0005) [2023-03-08 21:55:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 20631552. Throughput: 0: 12008.8. Samples: 20619064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:47,789][569624] Avg episode reward: [(0, '3859.249')] [2023-03-08 21:55:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000040296_20631552.pth... [2023-03-08 21:55:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000039600_20275200.pth [2023-03-08 21:55:48,653][569911] Updated weights for policy 0, policy_version 40320 (0.0005) [2023-03-08 21:55:51,916][569911] Updated weights for policy 0, policy_version 40400 (0.0005) [2023-03-08 21:55:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 20692992. Throughput: 0: 12044.4. Samples: 20692800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:55:52,800][569624] Avg episode reward: [(0, '3725.584')] [2023-03-08 21:55:55,438][569911] Updated weights for policy 0, policy_version 40480 (0.0005) [2023-03-08 21:55:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 20750336. Throughput: 0: 11949.9. 
Samples: 20726652. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:55:57,800][569624] Avg episode reward: [(0, '4034.799')] [2023-03-08 21:55:58,856][569911] Updated weights for policy 0, policy_version 40560 (0.0004) [2023-03-08 21:56:02,458][569911] Updated weights for policy 0, policy_version 40640 (0.0005) [2023-03-08 21:56:02,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 20807680. Throughput: 0: 11904.5. Samples: 20798484. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:56:02,800][569624] Avg episode reward: [(0, '4389.993')] [2023-03-08 21:56:02,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000040648_20811776.pth... [2023-03-08 21:56:02,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000039952_20455424.pth [2023-03-08 21:56:05,827][569911] Updated weights for policy 0, policy_version 40720 (0.0005) [2023-03-08 21:56:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 20873216. Throughput: 0: 11977.2. Samples: 20873124. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:56:07,789][569624] Avg episode reward: [(0, '4501.419')] [2023-03-08 21:56:09,015][569911] Updated weights for policy 0, policy_version 40800 (0.0004) [2023-03-08 21:56:12,351][569911] Updated weights for policy 0, policy_version 40880 (0.0005) [2023-03-08 21:56:12,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 20934656. Throughput: 0: 11991.0. Samples: 20909752. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:56:12,789][569624] Avg episode reward: [(0, '4416.747')] [2023-03-08 21:56:15,690][569911] Updated weights for policy 0, policy_version 40960 (0.0005) [2023-03-08 21:56:17,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 21000192. 
Throughput: 0: 12057.1. Samples: 20983648. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 21:56:17,789][569624] Avg episode reward: [(0, '4655.728')] [2023-03-08 21:56:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000041016_21000192.pth... [2023-03-08 21:56:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000040296_20631552.pth [2023-03-08 21:56:18,838][569911] Updated weights for policy 0, policy_version 41040 (0.0005) [2023-03-08 21:56:22,196][569911] Updated weights for policy 0, policy_version 41120 (0.0005) [2023-03-08 21:56:22,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 21057536. Throughput: 0: 12107.4. Samples: 21057600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:56:22,789][569624] Avg episode reward: [(0, '4722.643')] [2023-03-08 21:56:25,614][569911] Updated weights for policy 0, policy_version 41200 (0.0005) [2023-03-08 21:56:27,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 21123072. Throughput: 0: 12146.8. Samples: 21094124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:56:27,789][569624] Avg episode reward: [(0, '4658.296')] [2023-03-08 21:56:28,770][569911] Updated weights for policy 0, policy_version 41280 (0.0005) [2023-03-08 21:56:32,287][569911] Updated weights for policy 0, policy_version 41360 (0.0005) [2023-03-08 21:56:32,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 21180416. Throughput: 0: 12202.9. Samples: 21168192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:56:32,789][569624] Avg episode reward: [(0, '4614.359')] [2023-03-08 21:56:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000041368_21180416.pth... 
[2023-03-08 21:56:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000040648_20811776.pth [2023-03-08 21:56:35,618][569911] Updated weights for policy 0, policy_version 41440 (0.0004) [2023-03-08 21:56:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 21241856. Throughput: 0: 12150.3. Samples: 21239564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:56:37,789][569624] Avg episode reward: [(0, '4781.165')] [2023-03-08 21:56:39,175][569911] Updated weights for policy 0, policy_version 41520 (0.0005) [2023-03-08 21:56:42,483][569911] Updated weights for policy 0, policy_version 41600 (0.0004) [2023-03-08 21:56:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 21299200. Throughput: 0: 12177.3. Samples: 21274632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:56:42,789][569624] Avg episode reward: [(0, '4648.585')] [2023-03-08 21:56:45,866][569911] Updated weights for policy 0, policy_version 41680 (0.0005) [2023-03-08 21:56:47,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.8, 300 sec: 12093.6). Total num frames: 21364736. Throughput: 0: 12234.8. Samples: 21349052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:56:47,789][569624] Avg episode reward: [(0, '4690.176')] [2023-03-08 21:56:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000041728_21364736.pth... [2023-03-08 21:56:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000041016_21000192.pth [2023-03-08 21:56:49,137][569911] Updated weights for policy 0, policy_version 41760 (0.0005) [2023-03-08 21:56:52,536][569911] Updated weights for policy 0, policy_version 41840 (0.0005) [2023-03-08 21:56:52,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12065.8). 
Total num frames: 21422080. Throughput: 0: 12199.2. Samples: 21422088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:56:52,789][569624] Avg episode reward: [(0, '4298.024')] [2023-03-08 21:56:55,929][569911] Updated weights for policy 0, policy_version 41920 (0.0005) [2023-03-08 21:56:57,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 21483520. Throughput: 0: 12203.5. Samples: 21458912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:56:57,789][569624] Avg episode reward: [(0, '4757.262')] [2023-03-08 21:56:59,282][569911] Updated weights for policy 0, policy_version 42000 (0.0005) [2023-03-08 21:57:02,547][569911] Updated weights for policy 0, policy_version 42080 (0.0004) [2023-03-08 21:57:02,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 21544960. Throughput: 0: 12200.5. Samples: 21532672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:02,789][569624] Avg episode reward: [(0, '4423.453')] [2023-03-08 21:57:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000042080_21544960.pth... [2023-03-08 21:57:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000041368_21180416.pth [2023-03-08 21:57:05,954][569911] Updated weights for policy 0, policy_version 42160 (0.0004) [2023-03-08 21:57:07,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.8, 300 sec: 12093.6). Total num frames: 21606400. Throughput: 0: 12187.3. Samples: 21606028. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:07,789][569624] Avg episode reward: [(0, '4644.645')] [2023-03-08 21:57:09,362][569911] Updated weights for policy 0, policy_version 42240 (0.0005) [2023-03-08 21:57:12,738][569911] Updated weights for policy 0, policy_version 42320 (0.0005) [2023-03-08 21:57:12,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 21667840. Throughput: 0: 12161.9. Samples: 21641408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:12,789][569624] Avg episode reward: [(0, '4720.010')] [2023-03-08 21:57:16,151][569911] Updated weights for policy 0, policy_version 42400 (0.0004) [2023-03-08 21:57:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 21725184. Throughput: 0: 12107.9. Samples: 21713048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:17,798][569624] Avg episode reward: [(0, '4596.286')] [2023-03-08 21:57:17,801][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000042432_21725184.pth... [2023-03-08 21:57:17,802][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000041728_21364736.pth [2023-03-08 21:57:19,578][569911] Updated weights for policy 0, policy_version 42480 (0.0004) [2023-03-08 21:57:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 21786624. Throughput: 0: 12150.3. Samples: 21786328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:22,790][569624] Avg episode reward: [(0, '4746.681')] [2023-03-08 21:57:23,048][569911] Updated weights for policy 0, policy_version 42560 (0.0005) [2023-03-08 21:57:26,688][569911] Updated weights for policy 0, policy_version 42640 (0.0004) [2023-03-08 21:57:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 21843968. Throughput: 0: 12105.8. 
Samples: 21819392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:27,800][569624] Avg episode reward: [(0, '4661.064')] [2023-03-08 21:57:30,047][569911] Updated weights for policy 0, policy_version 42720 (0.0005) [2023-03-08 21:57:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 21905408. Throughput: 0: 12076.9. Samples: 21892512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:32,800][569624] Avg episode reward: [(0, '4635.805')] [2023-03-08 21:57:32,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000042784_21905408.pth... [2023-03-08 21:57:32,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000042080_21544960.pth [2023-03-08 21:57:33,258][569911] Updated weights for policy 0, policy_version 42800 (0.0004) [2023-03-08 21:57:36,677][569911] Updated weights for policy 0, policy_version 42880 (0.0005) [2023-03-08 21:57:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 21966848. Throughput: 0: 12077.8. Samples: 21965592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:37,789][569624] Avg episode reward: [(0, '4599.719')] [2023-03-08 21:57:40,097][569911] Updated weights for policy 0, policy_version 42960 (0.0005) [2023-03-08 21:57:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 22028288. Throughput: 0: 12043.5. Samples: 22000868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:42,789][569624] Avg episode reward: [(0, '4681.688')] [2023-03-08 21:57:43,342][569911] Updated weights for policy 0, policy_version 43040 (0.0005) [2023-03-08 21:57:46,687][569911] Updated weights for policy 0, policy_version 43120 (0.0005) [2023-03-08 21:57:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 22089728. 
Throughput: 0: 12058.8. Samples: 22075320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:47,789][569624] Avg episode reward: [(0, '4649.846')] [2023-03-08 21:57:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000043144_22089728.pth... [2023-03-08 21:57:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000042432_21725184.pth [2023-03-08 21:57:49,855][569911] Updated weights for policy 0, policy_version 43200 (0.0005) [2023-03-08 21:57:52,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 22155264. Throughput: 0: 12154.2. Samples: 22152968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:52,789][569624] Avg episode reward: [(0, '4132.754')] [2023-03-08 21:57:53,063][569911] Updated weights for policy 0, policy_version 43280 (0.0005) [2023-03-08 21:57:56,453][569911] Updated weights for policy 0, policy_version 43360 (0.0005) [2023-03-08 21:57:57,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 22212608. Throughput: 0: 12195.7. Samples: 22190212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:57:57,789][569624] Avg episode reward: [(0, '4572.107')] [2023-03-08 21:57:59,824][569911] Updated weights for policy 0, policy_version 43440 (0.0005) [2023-03-08 21:58:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 22278144. Throughput: 0: 12247.5. Samples: 22264184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:58:02,789][569624] Avg episode reward: [(0, '4636.139')] [2023-03-08 21:58:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000043512_22278144.pth... 
[2023-03-08 21:58:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000042784_21905408.pth [2023-03-08 21:58:03,010][569911] Updated weights for policy 0, policy_version 43520 (0.0005) [2023-03-08 21:58:06,232][569911] Updated weights for policy 0, policy_version 43600 (0.0005) [2023-03-08 21:58:07,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 22339584. Throughput: 0: 12288.7. Samples: 22339320. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:58:07,789][569624] Avg episode reward: [(0, '4547.103')] [2023-03-08 21:58:09,575][569911] Updated weights for policy 0, policy_version 43680 (0.0005) [2023-03-08 21:58:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 22401024. Throughput: 0: 12379.2. Samples: 22376456. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:58:12,789][569624] Avg episode reward: [(0, '4300.851')] [2023-03-08 21:58:12,951][569911] Updated weights for policy 0, policy_version 43760 (0.0005) [2023-03-08 21:58:16,542][569911] Updated weights for policy 0, policy_version 43840 (0.0005) [2023-03-08 21:58:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 22458368. Throughput: 0: 12301.9. Samples: 22446100. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:58:17,789][569624] Avg episode reward: [(0, '4471.665')] [2023-03-08 21:58:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000043864_22458368.pth... [2023-03-08 21:58:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000043144_22089728.pth [2023-03-08 21:58:19,963][569911] Updated weights for policy 0, policy_version 43920 (0.0005) [2023-03-08 21:58:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12093.6). 
Total num frames: 22519808. Throughput: 0: 12305.5. Samples: 22519340. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:58:22,789][569624] Avg episode reward: [(0, '4530.020')] [2023-03-08 21:58:23,389][569911] Updated weights for policy 0, policy_version 44000 (0.0004) [2023-03-08 21:58:26,841][569911] Updated weights for policy 0, policy_version 44080 (0.0005) [2023-03-08 21:58:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 22577152. Throughput: 0: 12280.4. Samples: 22553484. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:58:27,789][569624] Avg episode reward: [(0, '4185.470')] [2023-03-08 21:58:30,356][569911] Updated weights for policy 0, policy_version 44160 (0.0005) [2023-03-08 21:58:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 22634496. Throughput: 0: 12185.4. Samples: 22623664. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 21:58:32,789][569624] Avg episode reward: [(0, '3927.008')] [2023-03-08 21:58:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000044208_22634496.pth... [2023-03-08 21:58:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000043512_22278144.pth [2023-03-08 21:58:33,952][569911] Updated weights for policy 0, policy_version 44240 (0.0004) [2023-03-08 21:58:37,410][569911] Updated weights for policy 0, policy_version 44320 (0.0005) [2023-03-08 21:58:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 22695936. Throughput: 0: 12008.5. Samples: 22693352. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:58:37,789][569624] Avg episode reward: [(0, '4408.090')] [2023-03-08 21:58:41,091][569911] Updated weights for policy 0, policy_version 44400 (0.0005) [2023-03-08 21:58:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 22749184. Throughput: 0: 11944.6. Samples: 22727720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:58:42,789][569624] Avg episode reward: [(0, '4295.472')] [2023-03-08 21:58:44,527][569911] Updated weights for policy 0, policy_version 44480 (0.0005) [2023-03-08 21:58:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12015.0, 300 sec: 12052.0). Total num frames: 22810624. Throughput: 0: 11860.8. Samples: 22797920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:58:47,789][569624] Avg episode reward: [(0, '4550.970')] [2023-03-08 21:58:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000044552_22810624.pth... [2023-03-08 21:58:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000043864_22458368.pth [2023-03-08 21:58:48,058][569911] Updated weights for policy 0, policy_version 44560 (0.0005) [2023-03-08 21:58:51,609][569911] Updated weights for policy 0, policy_version 44640 (0.0005) [2023-03-08 21:58:52,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 22863872. Throughput: 0: 11679.9. Samples: 22864916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:58:52,789][569624] Avg episode reward: [(0, '4400.299')] [2023-03-08 21:58:55,150][569911] Updated weights for policy 0, policy_version 44720 (0.0005) [2023-03-08 21:58:57,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 22925312. Throughput: 0: 11652.1. Samples: 22900800. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:58:57,789][569624] Avg episode reward: [(0, '4389.158')] [2023-03-08 21:58:58,623][569911] Updated weights for policy 0, policy_version 44800 (0.0005) [2023-03-08 21:59:01,912][569911] Updated weights for policy 0, policy_version 44880 (0.0005) [2023-03-08 21:59:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 12038.1). Total num frames: 22986752. Throughput: 0: 11742.9. Samples: 22974528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 21:59:02,789][569624] Avg episode reward: [(0, '4507.515')] [2023-03-08 21:59:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000044896_22986752.pth... [2023-03-08 21:59:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000044208_22634496.pth [2023-03-08 21:59:05,397][569911] Updated weights for policy 0, policy_version 44960 (0.0004) [2023-03-08 21:59:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 12052.0). Total num frames: 23048192. Throughput: 0: 11689.1. Samples: 23045348. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:59:07,789][569624] Avg episode reward: [(0, '4490.061')] [2023-03-08 21:59:08,767][569911] Updated weights for policy 0, policy_version 45040 (0.0005) [2023-03-08 21:59:12,157][569911] Updated weights for policy 0, policy_version 45120 (0.0005) [2023-03-08 21:59:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 12038.1). Total num frames: 23105536. Throughput: 0: 11737.0. Samples: 23081652. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:59:12,789][569624] Avg episode reward: [(0, '4177.156')] [2023-03-08 21:59:15,385][569911] Updated weights for policy 0, policy_version 45200 (0.0004) [2023-03-08 21:59:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 23171072. Throughput: 0: 11817.9. 
Samples: 23155472. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:59:17,789][569624] Avg episode reward: [(0, '4148.639')] [2023-03-08 21:59:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000045256_23171072.pth... [2023-03-08 21:59:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000044552_22810624.pth [2023-03-08 21:59:18,701][569911] Updated weights for policy 0, policy_version 45280 (0.0005) [2023-03-08 21:59:22,067][569911] Updated weights for policy 0, policy_version 45360 (0.0005) [2023-03-08 21:59:22,789][569624] Fps is (10 sec: 12697.6, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 23232512. Throughput: 0: 11912.4. Samples: 23229412. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:59:22,800][569624] Avg episode reward: [(0, '4064.979')] [2023-03-08 21:59:25,544][569911] Updated weights for policy 0, policy_version 45440 (0.0006) [2023-03-08 21:59:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 23289856. Throughput: 0: 11946.1. Samples: 23265292. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 21:59:27,789][569624] Avg episode reward: [(0, '4198.526')] [2023-03-08 21:59:29,053][569911] Updated weights for policy 0, policy_version 45520 (0.0006) [2023-03-08 21:59:32,384][569911] Updated weights for policy 0, policy_version 45600 (0.0005) [2023-03-08 21:59:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 23351296. Throughput: 0: 11972.0. Samples: 23336660. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:59:32,800][569624] Avg episode reward: [(0, '4406.804')] [2023-03-08 21:59:32,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000045608_23351296.pth... 
[2023-03-08 21:59:32,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000044896_22986752.pth [2023-03-08 21:59:35,844][569911] Updated weights for policy 0, policy_version 45680 (0.0004) [2023-03-08 21:59:37,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 23412736. Throughput: 0: 12095.1. Samples: 23409196. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:59:37,800][569624] Avg episode reward: [(0, '4474.989')] [2023-03-08 21:59:39,111][569911] Updated weights for policy 0, policy_version 45760 (0.0005) [2023-03-08 21:59:42,404][569911] Updated weights for policy 0, policy_version 45840 (0.0005) [2023-03-08 21:59:42,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 23474176. Throughput: 0: 12106.2. Samples: 23445576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:59:42,789][569624] Avg episode reward: [(0, '4627.156')] [2023-03-08 21:59:45,867][569911] Updated weights for policy 0, policy_version 45920 (0.0004) [2023-03-08 21:59:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 23531520. Throughput: 0: 12109.6. Samples: 23519460. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:59:47,789][569624] Avg episode reward: [(0, '4393.149')] [2023-03-08 21:59:47,821][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000045968_23535616.pth... [2023-03-08 21:59:47,823][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000045256_23171072.pth [2023-03-08 21:59:49,120][569911] Updated weights for policy 0, policy_version 46000 (0.0005) [2023-03-08 21:59:52,421][569911] Updated weights for policy 0, policy_version 46080 (0.0005) [2023-03-08 21:59:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12093.6). 
Total num frames: 23597056. Throughput: 0: 12198.3. Samples: 23594272. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:59:52,789][569624] Avg episode reward: [(0, '4474.748')] [2023-03-08 21:59:55,959][569911] Updated weights for policy 0, policy_version 46160 (0.0005) [2023-03-08 21:59:57,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 23654400. Throughput: 0: 12181.6. Samples: 23629824. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 21:59:57,789][569624] Avg episode reward: [(0, '4710.886')] [2023-03-08 21:59:59,303][569911] Updated weights for policy 0, policy_version 46240 (0.0005) [2023-03-08 22:00:02,710][569911] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-08 22:00:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 23715840. Throughput: 0: 12104.7. Samples: 23700184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:00:02,789][569624] Avg episode reward: [(0, '4286.660')] [2023-03-08 22:00:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000046320_23715840.pth... [2023-03-08 22:00:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000045608_23351296.pth [2023-03-08 22:00:06,305][569911] Updated weights for policy 0, policy_version 46400 (0.0005) [2023-03-08 22:00:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 23773184. Throughput: 0: 12058.0. Samples: 23772024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:00:07,789][569624] Avg episode reward: [(0, '4425.945')] [2023-03-08 22:00:09,632][569911] Updated weights for policy 0, policy_version 46480 (0.0004) [2023-03-08 22:00:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 23834624. Throughput: 0: 12104.7. Samples: 23810004. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:00:12,789][569624] Avg episode reward: [(0, '4410.244')] [2023-03-08 22:00:12,903][569911] Updated weights for policy 0, policy_version 46560 (0.0005) [2023-03-08 22:00:16,366][569911] Updated weights for policy 0, policy_version 46640 (0.0005) [2023-03-08 22:00:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 23896064. Throughput: 0: 12119.2. Samples: 23882024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:00:17,789][569624] Avg episode reward: [(0, '4563.541')] [2023-03-08 22:00:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000046672_23896064.pth... [2023-03-08 22:00:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000045968_23535616.pth [2023-03-08 22:00:19,649][569911] Updated weights for policy 0, policy_version 46720 (0.0005) [2023-03-08 22:00:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 12065.8). Total num frames: 23953408. Throughput: 0: 12102.9. Samples: 23953824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:00:22,789][569624] Avg episode reward: [(0, '4387.201')] [2023-03-08 22:00:23,168][569911] Updated weights for policy 0, policy_version 46800 (0.0006) [2023-03-08 22:00:26,568][569911] Updated weights for policy 0, policy_version 46880 (0.0005) [2023-03-08 22:00:27,789][569624] Fps is (10 sec: 11878.6, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 24014848. Throughput: 0: 12104.5. Samples: 23990280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:00:27,789][569624] Avg episode reward: [(0, '4542.647')] [2023-03-08 22:00:30,029][569911] Updated weights for policy 0, policy_version 46960 (0.0005) [2023-03-08 22:00:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 12065.8). Total num frames: 24072192. Throughput: 0: 12034.5. 
Samples: 24061012. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:00:32,789][569624] Avg episode reward: [(0, '4545.124')] [2023-03-08 22:00:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000047024_24076288.pth... [2023-03-08 22:00:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000046320_23715840.pth [2023-03-08 22:00:33,404][569911] Updated weights for policy 0, policy_version 47040 (0.0005) [2023-03-08 22:00:36,867][569911] Updated weights for policy 0, policy_version 47120 (0.0005) [2023-03-08 22:00:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 24133632. Throughput: 0: 11986.0. Samples: 24133640. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:00:37,800][569624] Avg episode reward: [(0, '4410.581')] [2023-03-08 22:00:40,409][569911] Updated weights for policy 0, policy_version 47200 (0.0005) [2023-03-08 22:00:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 24195072. Throughput: 0: 11956.7. Samples: 24167876. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:00:42,789][569624] Avg episode reward: [(0, '4209.846')] [2023-03-08 22:00:43,695][569911] Updated weights for policy 0, policy_version 47280 (0.0005) [2023-03-08 22:00:47,201][569911] Updated weights for policy 0, policy_version 47360 (0.0005) [2023-03-08 22:00:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 24252416. Throughput: 0: 12004.2. Samples: 24240372. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:00:47,789][569624] Avg episode reward: [(0, '4538.710')] [2023-03-08 22:00:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000047368_24252416.pth... 
[2023-03-08 22:00:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000046672_23896064.pth [2023-03-08 22:00:50,743][569911] Updated weights for policy 0, policy_version 47440 (0.0005) [2023-03-08 22:00:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 24313856. Throughput: 0: 12013.3. Samples: 24312624. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:00:52,800][569624] Avg episode reward: [(0, '4077.661')] [2023-03-08 22:00:54,044][569911] Updated weights for policy 0, policy_version 47520 (0.0005) [2023-03-08 22:00:57,475][569911] Updated weights for policy 0, policy_version 47600 (0.0005) [2023-03-08 22:00:57,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 24371200. Throughput: 0: 11949.0. Samples: 24347708. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:00:57,789][569624] Avg episode reward: [(0, '4513.667')] [2023-03-08 22:01:00,818][569911] Updated weights for policy 0, policy_version 47680 (0.0005) [2023-03-08 22:01:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 24436736. Throughput: 0: 11970.4. Samples: 24420692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:01:02,789][569624] Avg episode reward: [(0, '4371.886')] [2023-03-08 22:01:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000047728_24436736.pth... [2023-03-08 22:01:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000047024_24076288.pth [2023-03-08 22:01:04,105][569911] Updated weights for policy 0, policy_version 47760 (0.0005) [2023-03-08 22:01:07,733][569911] Updated weights for policy 0, policy_version 47840 (0.0005) [2023-03-08 22:01:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12065.8). 
Total num frames: 24494080. Throughput: 0: 11959.4. Samples: 24491996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:01:07,789][569624] Avg episode reward: [(0, '4474.176')] [2023-03-08 22:01:11,123][569911] Updated weights for policy 0, policy_version 47920 (0.0005) [2023-03-08 22:01:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 24551424. Throughput: 0: 11946.1. Samples: 24527856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:01:12,800][569624] Avg episode reward: [(0, '4206.251')] [2023-03-08 22:01:14,545][569911] Updated weights for policy 0, policy_version 48000 (0.0004) [2023-03-08 22:01:17,785][569911] Updated weights for policy 0, policy_version 48080 (0.0005) [2023-03-08 22:01:17,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12015.0, 300 sec: 12065.8). Total num frames: 24616960. Throughput: 0: 11991.9. Samples: 24600648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:01:17,799][569624] Avg episode reward: [(0, '4255.218')] [2023-03-08 22:01:17,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000048080_24616960.pth... [2023-03-08 22:01:17,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000047368_24252416.pth [2023-03-08 22:01:21,208][569911] Updated weights for policy 0, policy_version 48160 (0.0005) [2023-03-08 22:01:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 24674304. Throughput: 0: 12016.2. Samples: 24674368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:01:22,800][569624] Avg episode reward: [(0, '4637.804')] [2023-03-08 22:01:24,555][569911] Updated weights for policy 0, policy_version 48240 (0.0005) [2023-03-08 22:01:27,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 24735744. Throughput: 0: 12073.3. Samples: 24711176. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:01:27,789][569624] Avg episode reward: [(0, '4602.056')] [2023-03-08 22:01:27,802][569911] Updated weights for policy 0, policy_version 48320 (0.0005) [2023-03-08 22:01:30,999][569911] Updated weights for policy 0, policy_version 48400 (0.0004) [2023-03-08 22:01:32,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 24801280. Throughput: 0: 12169.1. Samples: 24787980. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:01:32,800][569624] Avg episode reward: [(0, '4694.411')] [2023-03-08 22:01:32,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000048440_24801280.pth... [2023-03-08 22:01:32,807][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000047728_24436736.pth [2023-03-08 22:01:34,282][569911] Updated weights for policy 0, policy_version 48480 (0.0005) [2023-03-08 22:01:37,515][569911] Updated weights for policy 0, policy_version 48560 (0.0004) [2023-03-08 22:01:37,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 24862720. Throughput: 0: 12225.8. Samples: 24862784. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:01:37,800][569624] Avg episode reward: [(0, '4741.613')] [2023-03-08 22:01:40,819][569911] Updated weights for policy 0, policy_version 48640 (0.0005) [2023-03-08 22:01:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 24924160. Throughput: 0: 12272.5. Samples: 24899972. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:01:42,800][569624] Avg episode reward: [(0, '4601.169')] [2023-03-08 22:01:44,331][569911] Updated weights for policy 0, policy_version 48720 (0.0005) [2023-03-08 22:01:47,763][569911] Updated weights for policy 0, policy_version 48800 (0.0005) [2023-03-08 22:01:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 24985600. Throughput: 0: 12191.0. Samples: 24969288. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:01:47,789][569624] Avg episode reward: [(0, '4512.397')] [2023-03-08 22:01:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000048800_24985600.pth... [2023-03-08 22:01:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000048080_24616960.pth [2023-03-08 22:01:51,158][569911] Updated weights for policy 0, policy_version 48880 (0.0005) [2023-03-08 22:01:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 25042944. Throughput: 0: 12243.5. Samples: 25042952. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:01:52,789][569624] Avg episode reward: [(0, '4591.184')] [2023-03-08 22:01:54,646][569911] Updated weights for policy 0, policy_version 48960 (0.0005) [2023-03-08 22:01:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.8, 300 sec: 12065.8). Total num frames: 25104384. Throughput: 0: 12230.2. Samples: 25078216. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:01:57,789][569624] Avg episode reward: [(0, '4552.629')] [2023-03-08 22:01:57,984][569911] Updated weights for policy 0, policy_version 49040 (0.0005) [2023-03-08 22:02:01,329][569911] Updated weights for policy 0, policy_version 49120 (0.0004) [2023-03-08 22:02:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 25165824. Throughput: 0: 12261.0. 
Samples: 25152392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:02,789][569624] Avg episode reward: [(0, '4638.380')] [2023-03-08 22:02:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000049152_25165824.pth... [2023-03-08 22:02:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000048440_24801280.pth [2023-03-08 22:02:04,710][569911] Updated weights for policy 0, policy_version 49200 (0.0004) [2023-03-08 22:02:07,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.8, 300 sec: 12065.8). Total num frames: 25227264. Throughput: 0: 12219.0. Samples: 25224224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:07,789][569624] Avg episode reward: [(0, '4534.414')] [2023-03-08 22:02:08,058][569911] Updated weights for policy 0, policy_version 49280 (0.0005) [2023-03-08 22:02:11,691][569911] Updated weights for policy 0, policy_version 49360 (0.0005) [2023-03-08 22:02:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 25284608. Throughput: 0: 12190.4. Samples: 25259744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:12,789][569624] Avg episode reward: [(0, '4596.893')] [2023-03-08 22:02:15,172][569911] Updated weights for policy 0, policy_version 49440 (0.0005) [2023-03-08 22:02:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 25341952. Throughput: 0: 12035.6. Samples: 25329580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:17,789][569624] Avg episode reward: [(0, '4621.383')] [2023-03-08 22:02:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000049496_25341952.pth... 
[2023-03-08 22:02:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000048800_24985600.pth [2023-03-08 22:02:18,799][569911] Updated weights for policy 0, policy_version 49520 (0.0005) [2023-03-08 22:02:22,256][569911] Updated weights for policy 0, policy_version 49600 (0.0005) [2023-03-08 22:02:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 25399296. Throughput: 0: 11921.0. Samples: 25399228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:22,789][569624] Avg episode reward: [(0, '4768.632')] [2023-03-08 22:02:25,634][569911] Updated weights for policy 0, policy_version 49680 (0.0005) [2023-03-08 22:02:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 25460736. Throughput: 0: 11913.6. Samples: 25436084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:27,789][569624] Avg episode reward: [(0, '4108.013')] [2023-03-08 22:02:28,992][569911] Updated weights for policy 0, policy_version 49760 (0.0005) [2023-03-08 22:02:32,401][569911] Updated weights for policy 0, policy_version 49840 (0.0005) [2023-03-08 22:02:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 25522176. Throughput: 0: 11929.7. Samples: 25506124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:32,789][569624] Avg episode reward: [(0, '4626.669')] [2023-03-08 22:02:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000049848_25522176.pth... [2023-03-08 22:02:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000049152_25165824.pth [2023-03-08 22:02:35,692][569911] Updated weights for policy 0, policy_version 49920 (0.0004) [2023-03-08 22:02:37,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 12038.1). 
Total num frames: 25579520. Throughput: 0: 11923.8. Samples: 25579520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:37,789][569624] Avg episode reward: [(0, '4618.069')] [2023-03-08 22:02:39,200][569911] Updated weights for policy 0, policy_version 50000 (0.0005) [2023-03-08 22:02:42,689][569911] Updated weights for policy 0, policy_version 50080 (0.0005) [2023-03-08 22:02:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 25640960. Throughput: 0: 11955.1. Samples: 25616196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:42,789][569624] Avg episode reward: [(0, '4674.458')] [2023-03-08 22:02:46,176][569911] Updated weights for policy 0, policy_version 50160 (0.0005) [2023-03-08 22:02:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 25698304. Throughput: 0: 11865.2. Samples: 25686328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:47,789][569624] Avg episode reward: [(0, '4665.684')] [2023-03-08 22:02:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000050192_25698304.pth... [2023-03-08 22:02:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000049496_25341952.pth [2023-03-08 22:02:49,483][569911] Updated weights for policy 0, policy_version 50240 (0.0004) [2023-03-08 22:02:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 25759744. Throughput: 0: 11863.5. Samples: 25758084. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:52,789][569624] Avg episode reward: [(0, '4689.674')] [2023-03-08 22:02:53,087][569911] Updated weights for policy 0, policy_version 50320 (0.0004) [2023-03-08 22:02:56,510][569911] Updated weights for policy 0, policy_version 50400 (0.0005) [2023-03-08 22:02:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 25817088. Throughput: 0: 11859.8. Samples: 25793436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:02:57,789][569624] Avg episode reward: [(0, '4705.236')] [2023-03-08 22:02:59,594][569911] Updated weights for policy 0, policy_version 50480 (0.0005) [2023-03-08 22:03:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 25882624. Throughput: 0: 12012.7. Samples: 25870152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:03:02,789][569624] Avg episode reward: [(0, '4615.789')] [2023-03-08 22:03:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000050552_25882624.pth... [2023-03-08 22:03:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000049848_25522176.pth [2023-03-08 22:03:02,975][569911] Updated weights for policy 0, policy_version 50560 (0.0004) [2023-03-08 22:03:06,416][569911] Updated weights for policy 0, policy_version 50640 (0.0005) [2023-03-08 22:03:07,789][569624] Fps is (10 sec: 12697.5, 60 sec: 11946.6, 300 sec: 12010.3). Total num frames: 25944064. Throughput: 0: 12071.4. Samples: 25942440. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:03:07,789][569624] Avg episode reward: [(0, '4702.152')] [2023-03-08 22:03:09,676][569911] Updated weights for policy 0, policy_version 50720 (0.0005) [2023-03-08 22:03:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 26005504. Throughput: 0: 12084.9. 
Samples: 25979904. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:03:12,789][569624] Avg episode reward: [(0, '4647.687')] [2023-03-08 22:03:13,037][569911] Updated weights for policy 0, policy_version 50800 (0.0005) [2023-03-08 22:03:16,513][569911] Updated weights for policy 0, policy_version 50880 (0.0004) [2023-03-08 22:03:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 26062848. Throughput: 0: 12100.0. Samples: 26050624. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:03:17,789][569624] Avg episode reward: [(0, '4648.900')] [2023-03-08 22:03:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000050904_26062848.pth... [2023-03-08 22:03:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000050192_25698304.pth [2023-03-08 22:03:20,046][569911] Updated weights for policy 0, policy_version 50960 (0.0005) [2023-03-08 22:03:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 26124288. Throughput: 0: 12040.8. Samples: 26121360. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:03:22,789][569624] Avg episode reward: [(0, '4708.968')] [2023-03-08 22:03:23,425][569911] Updated weights for policy 0, policy_version 51040 (0.0004) [2023-03-08 22:03:26,891][569911] Updated weights for policy 0, policy_version 51120 (0.0005) [2023-03-08 22:03:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 26181632. Throughput: 0: 12020.7. Samples: 26157128. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:03:27,789][569624] Avg episode reward: [(0, '4577.486')] [2023-03-08 22:03:30,363][569911] Updated weights for policy 0, policy_version 51200 (0.0005) [2023-03-08 22:03:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 26247168. 
Throughput: 0: 12100.6. Samples: 26230856. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:03:32,789][569624] Avg episode reward: [(0, '4658.524')] [2023-03-08 22:03:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000051264_26247168.pth... [2023-03-08 22:03:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000050552_25882624.pth [2023-03-08 22:03:33,474][569911] Updated weights for policy 0, policy_version 51280 (0.0005) [2023-03-08 22:03:36,838][569911] Updated weights for policy 0, policy_version 51360 (0.0005) [2023-03-08 22:03:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 26304512. Throughput: 0: 12141.9. Samples: 26304472. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:03:37,789][569624] Avg episode reward: [(0, '4691.303')] [2023-03-08 22:03:40,235][569911] Updated weights for policy 0, policy_version 51440 (0.0005) [2023-03-08 22:03:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 26365952. Throughput: 0: 12176.6. Samples: 26341384. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:03:42,789][569624] Avg episode reward: [(0, '4464.810')] [2023-03-08 22:03:43,594][569911] Updated weights for policy 0, policy_version 51520 (0.0004) [2023-03-08 22:03:47,079][569911] Updated weights for policy 0, policy_version 51600 (0.0005) [2023-03-08 22:03:47,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12151.4, 300 sec: 12079.7). Total num frames: 26427392. Throughput: 0: 12055.0. Samples: 26412628. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:03:47,790][569624] Avg episode reward: [(0, '4579.471')] [2023-03-08 22:03:47,795][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000051616_26427392.pth... 
[2023-03-08 22:03:47,797][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000050904_26062848.pth [2023-03-08 22:03:50,282][569911] Updated weights for policy 0, policy_version 51680 (0.0005) [2023-03-08 22:03:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 26488832. Throughput: 0: 12113.7. Samples: 26487556. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:03:52,789][569624] Avg episode reward: [(0, '4496.501')] [2023-03-08 22:03:53,714][569911] Updated weights for policy 0, policy_version 51760 (0.0005) [2023-03-08 22:03:57,105][569911] Updated weights for policy 0, policy_version 51840 (0.0004) [2023-03-08 22:03:57,789][569624] Fps is (10 sec: 12288.3, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 26550272. Throughput: 0: 12051.0. Samples: 26522200. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:03:57,789][569624] Avg episode reward: [(0, '4591.119')] [2023-03-08 22:04:00,449][569911] Updated weights for policy 0, policy_version 51920 (0.0004) [2023-03-08 22:04:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 26611712. Throughput: 0: 12123.6. Samples: 26596184. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:04:02,789][569624] Avg episode reward: [(0, '4569.107')] [2023-03-08 22:04:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000051976_26611712.pth... [2023-03-08 22:04:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000051264_26247168.pth [2023-03-08 22:04:03,703][569911] Updated weights for policy 0, policy_version 52000 (0.0005) [2023-03-08 22:04:07,056][569911] Updated weights for policy 0, policy_version 52080 (0.0004) [2023-03-08 22:04:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12093.6). 
Total num frames: 26673152. Throughput: 0: 12235.0. Samples: 26671936. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:04:07,789][569624] Avg episode reward: [(0, '4420.771')] [2023-03-08 22:04:10,431][569911] Updated weights for policy 0, policy_version 52160 (0.0005) [2023-03-08 22:04:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 26734592. Throughput: 0: 12219.7. Samples: 26707016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:04:12,789][569624] Avg episode reward: [(0, '4124.619')] [2023-03-08 22:04:13,692][569911] Updated weights for policy 0, policy_version 52240 (0.0005) [2023-03-08 22:04:17,109][569911] Updated weights for policy 0, policy_version 52320 (0.0005) [2023-03-08 22:04:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 26791936. Throughput: 0: 12239.8. Samples: 26781648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:04:17,789][569624] Avg episode reward: [(0, '4089.383')] [2023-03-08 22:04:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000052328_26791936.pth... [2023-03-08 22:04:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000051616_26427392.pth [2023-03-08 22:04:20,647][569911] Updated weights for policy 0, policy_version 52400 (0.0005) [2023-03-08 22:04:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 26849280. Throughput: 0: 12108.5. Samples: 26849352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:04:22,789][569624] Avg episode reward: [(0, '4276.719')] [2023-03-08 22:04:24,422][569911] Updated weights for policy 0, policy_version 52480 (0.0006) [2023-03-08 22:04:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 26902528. Throughput: 0: 12014.8. Samples: 26882048. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:04:27,789][569624] Avg episode reward: [(0, '3803.598')] [2023-03-08 22:04:28,295][569911] Updated weights for policy 0, policy_version 52560 (0.0005) [2023-03-08 22:04:31,762][569911] Updated weights for policy 0, policy_version 52640 (0.0004) [2023-03-08 22:04:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 26963968. Throughput: 0: 11938.1. Samples: 26949840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:04:32,789][569624] Avg episode reward: [(0, '3873.591')] [2023-03-08 22:04:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000052664_26963968.pth... [2023-03-08 22:04:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000051976_26611712.pth [2023-03-08 22:04:35,104][569911] Updated weights for policy 0, policy_version 52720 (0.0005) [2023-03-08 22:04:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 27021312. Throughput: 0: 11878.4. Samples: 27022084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:04:37,789][569624] Avg episode reward: [(0, '4207.995')] [2023-03-08 22:04:38,529][569911] Updated weights for policy 0, policy_version 52800 (0.0004) [2023-03-08 22:04:41,974][569911] Updated weights for policy 0, policy_version 52880 (0.0005) [2023-03-08 22:04:42,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 27082752. Throughput: 0: 11910.6. Samples: 27058176. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:04:42,789][569624] Avg episode reward: [(0, '4579.146')] [2023-03-08 22:04:45,366][569911] Updated weights for policy 0, policy_version 52960 (0.0005) [2023-03-08 22:04:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 27144192. Throughput: 0: 11866.9. 
Samples: 27130196. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:04:47,789][569624] Avg episode reward: [(0, '4598.371')] [2023-03-08 22:04:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000053016_27144192.pth... [2023-03-08 22:04:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000052328_26791936.pth [2023-03-08 22:04:48,694][569911] Updated weights for policy 0, policy_version 53040 (0.0005) [2023-03-08 22:04:51,953][569911] Updated weights for policy 0, policy_version 53120 (0.0004) [2023-03-08 22:04:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 27205632. Throughput: 0: 11854.9. Samples: 27205408. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:04:52,789][569624] Avg episode reward: [(0, '4667.037')] [2023-03-08 22:04:55,396][569911] Updated weights for policy 0, policy_version 53200 (0.0005) [2023-03-08 22:04:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 27262976. Throughput: 0: 11840.3. Samples: 27239828. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:04:57,789][569624] Avg episode reward: [(0, '4206.635')] [2023-03-08 22:04:58,772][569911] Updated weights for policy 0, policy_version 53280 (0.0005) [2023-03-08 22:05:02,471][569911] Updated weights for policy 0, policy_version 53360 (0.0005) [2023-03-08 22:05:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 27324416. Throughput: 0: 11760.5. Samples: 27310872. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:05:02,789][569624] Avg episode reward: [(0, '4473.096')] [2023-03-08 22:05:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000053368_27324416.pth... 
[2023-03-08 22:05:02,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000052664_26963968.pth [2023-03-08 22:05:05,823][569911] Updated weights for policy 0, policy_version 53440 (0.0005) [2023-03-08 22:05:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 27381760. Throughput: 0: 11832.7. Samples: 27381824. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:05:07,789][569624] Avg episode reward: [(0, '4053.695')] [2023-03-08 22:05:09,257][569911] Updated weights for policy 0, policy_version 53520 (0.0005) [2023-03-08 22:05:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 12010.3). Total num frames: 27439104. Throughput: 0: 11923.9. Samples: 27418624. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:05:12,789][569624] Avg episode reward: [(0, '4568.693')] [2023-03-08 22:05:12,805][569911] Updated weights for policy 0, policy_version 53600 (0.0004) [2023-03-08 22:05:16,103][569911] Updated weights for policy 0, policy_version 53680 (0.0005) [2023-03-08 22:05:17,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 27504640. Throughput: 0: 11982.8. Samples: 27489068. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:05:17,789][569624] Avg episode reward: [(0, '4736.874')] [2023-03-08 22:05:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000053720_27504640.pth... [2023-03-08 22:05:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000053016_27144192.pth [2023-03-08 22:05:19,255][569911] Updated weights for policy 0, policy_version 53760 (0.0006) [2023-03-08 22:05:22,466][569911] Updated weights for policy 0, policy_version 53840 (0.0004) [2023-03-08 22:05:22,789][569624] Fps is (10 sec: 13107.3, 60 sec: 12015.0, 300 sec: 12052.0). 
Total num frames: 27570176. Throughput: 0: 12099.1. Samples: 27566540. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:05:22,789][569624] Avg episode reward: [(0, '4789.412')] [2023-03-08 22:05:25,969][569911] Updated weights for policy 0, policy_version 53920 (0.0005) [2023-03-08 22:05:27,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 27627520. Throughput: 0: 12105.9. Samples: 27602944. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:05:27,789][569624] Avg episode reward: [(0, '4623.013')] [2023-03-08 22:05:29,222][569911] Updated weights for policy 0, policy_version 54000 (0.0005) [2023-03-08 22:05:32,754][569911] Updated weights for policy 0, policy_version 54080 (0.0005) [2023-03-08 22:05:32,789][569624] Fps is (10 sec: 11878.1, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 27688960. Throughput: 0: 12121.4. Samples: 27675660. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:05:32,789][569624] Avg episode reward: [(0, '4689.888')] [2023-03-08 22:05:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000054080_27688960.pth... [2023-03-08 22:05:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000053368_27324416.pth [2023-03-08 22:05:36,261][569911] Updated weights for policy 0, policy_version 54160 (0.0005) [2023-03-08 22:05:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 27746304. Throughput: 0: 11967.5. Samples: 27743944. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:05:37,789][569624] Avg episode reward: [(0, '4647.022')] [2023-03-08 22:05:39,848][569911] Updated weights for policy 0, policy_version 54240 (0.0005) [2023-03-08 22:05:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 27803648. Throughput: 0: 11979.4. Samples: 27778900. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:05:42,789][569624] Avg episode reward: [(0, '4782.513')] [2023-03-08 22:05:43,391][569911] Updated weights for policy 0, policy_version 54320 (0.0004) [2023-03-08 22:05:46,778][569911] Updated weights for policy 0, policy_version 54400 (0.0005) [2023-03-08 22:05:47,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 27865088. Throughput: 0: 11975.9. Samples: 27849784. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:05:47,789][569624] Avg episode reward: [(0, '4775.979')] [2023-03-08 22:05:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000054424_27865088.pth... [2023-03-08 22:05:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000053720_27504640.pth [2023-03-08 22:05:50,090][569911] Updated weights for policy 0, policy_version 54480 (0.0005) [2023-03-08 22:05:52,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 27926528. Throughput: 0: 12079.3. Samples: 27925392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:05:52,800][569624] Avg episode reward: [(0, '4670.662')] [2023-03-08 22:05:53,362][569911] Updated weights for policy 0, policy_version 54560 (0.0004) [2023-03-08 22:05:56,480][569911] Updated weights for policy 0, policy_version 54640 (0.0004) [2023-03-08 22:05:57,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 27987968. Throughput: 0: 12107.4. Samples: 27963456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:05:57,800][569624] Avg episode reward: [(0, '4739.473')] [2023-03-08 22:05:59,706][569911] Updated weights for policy 0, policy_version 54720 (0.0005) [2023-03-08 22:06:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 28049408. Throughput: 0: 12219.7. 
Samples: 28038952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:02,800][569624] Avg episode reward: [(0, '4643.844')] [2023-03-08 22:06:02,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000054792_28053504.pth... [2023-03-08 22:06:02,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000054080_27688960.pth [2023-03-08 22:06:03,102][569911] Updated weights for policy 0, policy_version 54800 (0.0005) [2023-03-08 22:06:06,480][569911] Updated weights for policy 0, policy_version 54880 (0.0005) [2023-03-08 22:06:07,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 28110848. Throughput: 0: 12097.1. Samples: 28110912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:07,800][569624] Avg episode reward: [(0, '4641.920')] [2023-03-08 22:06:09,851][569911] Updated weights for policy 0, policy_version 54960 (0.0006) [2023-03-08 22:06:12,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 28176384. Throughput: 0: 12105.8. Samples: 28147704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:12,789][569624] Avg episode reward: [(0, '4587.939')] [2023-03-08 22:06:13,014][569911] Updated weights for policy 0, policy_version 55040 (0.0005) [2023-03-08 22:06:16,237][569911] Updated weights for policy 0, policy_version 55120 (0.0005) [2023-03-08 22:06:17,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 28237824. Throughput: 0: 12219.0. Samples: 28225512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:17,789][569624] Avg episode reward: [(0, '4455.298')] [2023-03-08 22:06:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000055152_28237824.pth... 
[2023-03-08 22:06:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000054424_27865088.pth [2023-03-08 22:06:19,683][569911] Updated weights for policy 0, policy_version 55200 (0.0005) [2023-03-08 22:06:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.4, 300 sec: 12079.7). Total num frames: 28299264. Throughput: 0: 12279.3. Samples: 28296512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:22,789][569624] Avg episode reward: [(0, '4571.259')] [2023-03-08 22:06:23,085][569911] Updated weights for policy 0, policy_version 55280 (0.0005) [2023-03-08 22:06:26,454][569911] Updated weights for policy 0, policy_version 55360 (0.0005) [2023-03-08 22:06:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 28356608. Throughput: 0: 12292.0. Samples: 28332040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:27,789][569624] Avg episode reward: [(0, '4655.504')] [2023-03-08 22:06:29,724][569911] Updated weights for policy 0, policy_version 55440 (0.0005) [2023-03-08 22:06:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 28422144. Throughput: 0: 12396.6. Samples: 28407632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:32,789][569624] Avg episode reward: [(0, '4718.917')] [2023-03-08 22:06:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000055512_28422144.pth... [2023-03-08 22:06:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000054792_28053504.pth [2023-03-08 22:06:33,089][569911] Updated weights for policy 0, policy_version 55520 (0.0005) [2023-03-08 22:06:36,326][569911] Updated weights for policy 0, policy_version 55600 (0.0005) [2023-03-08 22:06:37,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12288.0, 300 sec: 12065.8). 
Total num frames: 28483584. Throughput: 0: 12402.3. Samples: 28483496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:37,789][569624] Avg episode reward: [(0, '4603.054')] [2023-03-08 22:06:39,515][569911] Updated weights for policy 0, policy_version 55680 (0.0005) [2023-03-08 22:06:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12356.3, 300 sec: 12065.8). Total num frames: 28545024. Throughput: 0: 12379.0. Samples: 28520512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:42,789][569624] Avg episode reward: [(0, '4164.999')] [2023-03-08 22:06:42,844][569911] Updated weights for policy 0, policy_version 55760 (0.0005) [2023-03-08 22:06:45,983][569911] Updated weights for policy 0, policy_version 55840 (0.0005) [2023-03-08 22:06:47,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12424.5, 300 sec: 12093.6). Total num frames: 28610560. Throughput: 0: 12370.8. Samples: 28595640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:47,789][569624] Avg episode reward: [(0, '4358.656')] [2023-03-08 22:06:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000055880_28610560.pth... [2023-03-08 22:06:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000055152_28237824.pth [2023-03-08 22:06:49,331][569911] Updated weights for policy 0, policy_version 55920 (0.0005) [2023-03-08 22:06:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12356.3, 300 sec: 12079.7). Total num frames: 28667904. Throughput: 0: 12376.0. Samples: 28667832. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:52,789][569624] Avg episode reward: [(0, '3768.786')] [2023-03-08 22:06:52,989][569911] Updated weights for policy 0, policy_version 56000 (0.0004) [2023-03-08 22:06:56,567][569911] Updated weights for policy 0, policy_version 56080 (0.0005) [2023-03-08 22:06:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 28725248. Throughput: 0: 12320.9. Samples: 28702144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:06:57,789][569624] Avg episode reward: [(0, '3381.800')] [2023-03-08 22:07:00,108][569911] Updated weights for policy 0, policy_version 56160 (0.0005) [2023-03-08 22:07:02,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 28786688. Throughput: 0: 12152.5. Samples: 28772376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:02,789][569624] Avg episode reward: [(0, '3979.169')] [2023-03-08 22:07:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000056224_28786688.pth... [2023-03-08 22:07:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000055512_28422144.pth [2023-03-08 22:07:03,523][569911] Updated weights for policy 0, policy_version 56240 (0.0005) [2023-03-08 22:07:06,686][569911] Updated weights for policy 0, policy_version 56320 (0.0005) [2023-03-08 22:07:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 28844032. Throughput: 0: 12177.8. Samples: 28844512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:07,789][569624] Avg episode reward: [(0, '4643.793')] [2023-03-08 22:07:10,093][569911] Updated weights for policy 0, policy_version 56400 (0.0005) [2023-03-08 22:07:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 28909568. Throughput: 0: 12219.8. 
Samples: 28881932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:12,789][569624] Avg episode reward: [(0, '4537.746')] [2023-03-08 22:07:13,318][569911] Updated weights for policy 0, policy_version 56480 (0.0005) [2023-03-08 22:07:16,734][569911] Updated weights for policy 0, policy_version 56560 (0.0005) [2023-03-08 22:07:17,789][569624] Fps is (10 sec: 12697.8, 60 sec: 12219.8, 300 sec: 12107.5). Total num frames: 28971008. Throughput: 0: 12205.1. Samples: 28956860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:17,789][569624] Avg episode reward: [(0, '4665.811')] [2023-03-08 22:07:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000056584_28971008.pth... [2023-03-08 22:07:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000055880_28610560.pth [2023-03-08 22:07:20,113][569911] Updated weights for policy 0, policy_version 56640 (0.0005) [2023-03-08 22:07:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 29028352. Throughput: 0: 12109.7. Samples: 29028432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:22,789][569624] Avg episode reward: [(0, '4523.857')] [2023-03-08 22:07:23,527][569911] Updated weights for policy 0, policy_version 56720 (0.0005) [2023-03-08 22:07:27,047][569911] Updated weights for policy 0, policy_version 56800 (0.0005) [2023-03-08 22:07:27,789][569624] Fps is (10 sec: 11878.2, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 29089792. Throughput: 0: 12073.1. Samples: 29063804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:27,789][569624] Avg episode reward: [(0, '4168.510')] [2023-03-08 22:07:30,447][569911] Updated weights for policy 0, policy_version 56880 (0.0005) [2023-03-08 22:07:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 29147136. 
Throughput: 0: 11999.3. Samples: 29135608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:32,789][569624] Avg episode reward: [(0, '4524.141')] [2023-03-08 22:07:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000056928_29147136.pth... [2023-03-08 22:07:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000056224_28786688.pth [2023-03-08 22:07:33,739][569911] Updated weights for policy 0, policy_version 56960 (0.0004) [2023-03-08 22:07:36,835][569911] Updated weights for policy 0, policy_version 57040 (0.0005) [2023-03-08 22:07:37,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 29216768. Throughput: 0: 12111.1. Samples: 29212832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:37,789][569624] Avg episode reward: [(0, '4360.041')] [2023-03-08 22:07:40,083][569911] Updated weights for policy 0, policy_version 57120 (0.0004) [2023-03-08 22:07:42,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 29274112. Throughput: 0: 12171.7. Samples: 29249872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:42,789][569624] Avg episode reward: [(0, '4347.554')] [2023-03-08 22:07:43,605][569911] Updated weights for policy 0, policy_version 57200 (0.0005) [2023-03-08 22:07:47,071][569911] Updated weights for policy 0, policy_version 57280 (0.0005) [2023-03-08 22:07:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 29335552. Throughput: 0: 12159.7. Samples: 29319564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:47,789][569624] Avg episode reward: [(0, '4609.831')] [2023-03-08 22:07:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000057296_29335552.pth... 
[2023-03-08 22:07:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000056584_28971008.pth [2023-03-08 22:07:50,423][569911] Updated weights for policy 0, policy_version 57360 (0.0004) [2023-03-08 22:07:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12135.3). Total num frames: 29396992. Throughput: 0: 12267.1. Samples: 29396532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:52,789][569624] Avg episode reward: [(0, '4724.016')] [2023-03-08 22:07:53,637][569911] Updated weights for policy 0, policy_version 57440 (0.0005) [2023-03-08 22:07:57,062][569911] Updated weights for policy 0, policy_version 57520 (0.0005) [2023-03-08 22:07:57,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 29458432. Throughput: 0: 12235.2. Samples: 29432516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:07:57,789][569624] Avg episode reward: [(0, '4591.947')] [2023-03-08 22:08:00,325][569911] Updated weights for policy 0, policy_version 57600 (0.0005) [2023-03-08 22:08:02,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 29519872. Throughput: 0: 12227.2. Samples: 29507088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:02,790][569624] Avg episode reward: [(0, '4388.241')] [2023-03-08 22:08:02,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000057656_29519872.pth... [2023-03-08 22:08:02,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000056928_29147136.pth [2023-03-08 22:08:03,788][569911] Updated weights for policy 0, policy_version 57680 (0.0005) [2023-03-08 22:08:07,144][569911] Updated weights for policy 0, policy_version 57760 (0.0005) [2023-03-08 22:08:07,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12107.5). 
Total num frames: 29577216. Throughput: 0: 12196.8. Samples: 29577288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:07,789][569624] Avg episode reward: [(0, '4662.694')] [2023-03-08 22:08:10,682][569911] Updated weights for policy 0, policy_version 57840 (0.0005) [2023-03-08 22:08:12,789][569624] Fps is (10 sec: 11878.6, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 29638656. Throughput: 0: 12222.2. Samples: 29613804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:12,789][569624] Avg episode reward: [(0, '4606.589')] [2023-03-08 22:08:13,992][569911] Updated weights for policy 0, policy_version 57920 (0.0005) [2023-03-08 22:08:17,262][569911] Updated weights for policy 0, policy_version 58000 (0.0005) [2023-03-08 22:08:17,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12151.4, 300 sec: 12121.4). Total num frames: 29700096. Throughput: 0: 12252.8. Samples: 29686984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:17,789][569624] Avg episode reward: [(0, '4558.064')] [2023-03-08 22:08:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000058008_29700096.pth... [2023-03-08 22:08:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000057296_29335552.pth [2023-03-08 22:08:20,636][569911] Updated weights for policy 0, policy_version 58080 (0.0005) [2023-03-08 22:08:22,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 29761536. Throughput: 0: 12165.2. Samples: 29760264. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:22,789][569624] Avg episode reward: [(0, '4364.879')] [2023-03-08 22:08:24,151][569911] Updated weights for policy 0, policy_version 58160 (0.0005) [2023-03-08 22:08:27,649][569911] Updated weights for policy 0, policy_version 58240 (0.0005) [2023-03-08 22:08:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 29818880. Throughput: 0: 12119.8. Samples: 29795264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:27,789][569624] Avg episode reward: [(0, '4512.972')] [2023-03-08 22:08:30,995][569911] Updated weights for policy 0, policy_version 58320 (0.0005) [2023-03-08 22:08:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 29876224. Throughput: 0: 12166.2. Samples: 29867044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:32,789][569624] Avg episode reward: [(0, '4438.390')] [2023-03-08 22:08:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000058352_29876224.pth... [2023-03-08 22:08:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000057656_29519872.pth [2023-03-08 22:08:34,603][569911] Updated weights for policy 0, policy_version 58400 (0.0005) [2023-03-08 22:08:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 29937664. Throughput: 0: 12017.6. Samples: 29937324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:37,789][569624] Avg episode reward: [(0, '4533.775')] [2023-03-08 22:08:38,027][569911] Updated weights for policy 0, policy_version 58480 (0.0005) [2023-03-08 22:08:41,503][569911] Updated weights for policy 0, policy_version 58560 (0.0006) [2023-03-08 22:08:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 29995008. Throughput: 0: 11959.2. 
Samples: 29970680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:42,789][569624] Avg episode reward: [(0, '4497.373')] [2023-03-08 22:08:45,057][569911] Updated weights for policy 0, policy_version 58640 (0.0006) [2023-03-08 22:08:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 30052352. Throughput: 0: 11849.1. Samples: 30040296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:47,789][569624] Avg episode reward: [(0, '4625.086')] [2023-03-08 22:08:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000058696_30052352.pth... [2023-03-08 22:08:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000058008_29700096.pth [2023-03-08 22:08:48,686][569911] Updated weights for policy 0, policy_version 58720 (0.0005) [2023-03-08 22:08:52,041][569911] Updated weights for policy 0, policy_version 58800 (0.0005) [2023-03-08 22:08:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 30113792. Throughput: 0: 11874.3. Samples: 30111632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:52,789][569624] Avg episode reward: [(0, '4534.019')] [2023-03-08 22:08:55,568][569911] Updated weights for policy 0, policy_version 58880 (0.0005) [2023-03-08 22:08:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 30171136. Throughput: 0: 11839.0. Samples: 30146560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:08:57,789][569624] Avg episode reward: [(0, '4541.873')] [2023-03-08 22:08:59,077][569911] Updated weights for policy 0, policy_version 58960 (0.0005) [2023-03-08 22:09:02,152][569911] Updated weights for policy 0, policy_version 59040 (0.0005) [2023-03-08 22:09:02,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.5, 300 sec: 12065.8). Total num frames: 30232576. 
Throughput: 0: 11857.2. Samples: 30220556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:02,789][569624] Avg episode reward: [(0, '4609.365')] [2023-03-08 22:09:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000059048_30232576.pth... [2023-03-08 22:09:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000058352_29876224.pth [2023-03-08 22:09:05,619][569911] Updated weights for policy 0, policy_version 59120 (0.0005) [2023-03-08 22:09:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.6, 300 sec: 12065.8). Total num frames: 30294016. Throughput: 0: 11824.5. Samples: 30292368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:07,789][569624] Avg episode reward: [(0, '4549.020')] [2023-03-08 22:09:09,051][569911] Updated weights for policy 0, policy_version 59200 (0.0005) [2023-03-08 22:09:12,463][569911] Updated weights for policy 0, policy_version 59280 (0.0004) [2023-03-08 22:09:12,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 30355456. Throughput: 0: 11828.2. Samples: 30327532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:12,789][569624] Avg episode reward: [(0, '4661.030')] [2023-03-08 22:09:15,817][569911] Updated weights for policy 0, policy_version 59360 (0.0004) [2023-03-08 22:09:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12079.7). Total num frames: 30412800. Throughput: 0: 11855.0. Samples: 30400520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:17,789][569624] Avg episode reward: [(0, '4785.453')] [2023-03-08 22:09:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000059400_30412800.pth... 
[2023-03-08 22:09:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000058696_30052352.pth [2023-03-08 22:09:19,402][569911] Updated weights for policy 0, policy_version 59440 (0.0005) [2023-03-08 22:09:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 12093.6). Total num frames: 30470144. Throughput: 0: 11835.9. Samples: 30469940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:22,789][569624] Avg episode reward: [(0, '4742.453')] [2023-03-08 22:09:22,942][569911] Updated weights for policy 0, policy_version 59520 (0.0005) [2023-03-08 22:09:26,340][569911] Updated weights for policy 0, policy_version 59600 (0.0004) [2023-03-08 22:09:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12093.6). Total num frames: 30531584. Throughput: 0: 11887.5. Samples: 30505616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:27,789][569624] Avg episode reward: [(0, '4632.873')] [2023-03-08 22:09:29,688][569911] Updated weights for policy 0, policy_version 59680 (0.0005) [2023-03-08 22:09:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12107.5). Total num frames: 30593024. Throughput: 0: 11994.2. Samples: 30580036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:32,789][569624] Avg episode reward: [(0, '4691.820')] [2023-03-08 22:09:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000059752_30593024.pth... [2023-03-08 22:09:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000059048_30232576.pth [2023-03-08 22:09:33,070][569911] Updated weights for policy 0, policy_version 59760 (0.0006) [2023-03-08 22:09:36,412][569911] Updated weights for policy 0, policy_version 59840 (0.0005) [2023-03-08 22:09:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12093.6). 
Total num frames: 30650368. Throughput: 0: 11995.1. Samples: 30651412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:37,789][569624] Avg episode reward: [(0, '4668.569')] [2023-03-08 22:09:39,839][569911] Updated weights for policy 0, policy_version 59920 (0.0005) [2023-03-08 22:09:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 30715904. Throughput: 0: 12016.5. Samples: 30687304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:42,789][569624] Avg episode reward: [(0, '4723.076')] [2023-03-08 22:09:42,947][569911] Updated weights for policy 0, policy_version 60000 (0.0005) [2023-03-08 22:09:46,329][569911] Updated weights for policy 0, policy_version 60080 (0.0005) [2023-03-08 22:09:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 30773248. Throughput: 0: 12074.3. Samples: 30763900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:47,789][569624] Avg episode reward: [(0, '4589.492')] [2023-03-08 22:09:47,845][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000060112_30777344.pth... [2023-03-08 22:09:47,847][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000059400_30412800.pth [2023-03-08 22:09:49,990][569911] Updated weights for policy 0, policy_version 60160 (0.0005) [2023-03-08 22:09:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 30834688. Throughput: 0: 11976.4. Samples: 30831304. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:52,789][569624] Avg episode reward: [(0, '4365.136')] [2023-03-08 22:09:53,423][569911] Updated weights for policy 0, policy_version 60240 (0.0005) [2023-03-08 22:09:56,688][569911] Updated weights for policy 0, policy_version 60320 (0.0004) [2023-03-08 22:09:57,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 30896128. Throughput: 0: 12004.0. Samples: 30867712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:09:57,789][569624] Avg episode reward: [(0, '4546.907')] [2023-03-08 22:09:59,880][569911] Updated weights for policy 0, policy_version 60400 (0.0005) [2023-03-08 22:10:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 30957568. Throughput: 0: 12092.9. Samples: 30944700. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:10:02,789][569624] Avg episode reward: [(0, '4643.394')] [2023-03-08 22:10:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000060464_30957568.pth... [2023-03-08 22:10:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000059752_30593024.pth [2023-03-08 22:10:03,382][569911] Updated weights for policy 0, policy_version 60480 (0.0005) [2023-03-08 22:10:06,858][569911] Updated weights for policy 0, policy_version 60560 (0.0004) [2023-03-08 22:10:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 31014912. Throughput: 0: 12102.8. Samples: 31014568. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:10:07,789][569624] Avg episode reward: [(0, '4643.536')] [2023-03-08 22:10:10,500][569911] Updated weights for policy 0, policy_version 60640 (0.0005) [2023-03-08 22:10:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12093.6). Total num frames: 31072256. Throughput: 0: 12047.5. 
Samples: 31047752. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:10:12,789][569624] Avg episode reward: [(0, '4530.525')] [2023-03-08 22:10:14,084][569911] Updated weights for policy 0, policy_version 60720 (0.0004) [2023-03-08 22:10:17,351][569911] Updated weights for policy 0, policy_version 60800 (0.0005) [2023-03-08 22:10:17,789][569624] Fps is (10 sec: 11878.2, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 31133696. Throughput: 0: 11987.3. Samples: 31119468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:10:17,790][569624] Avg episode reward: [(0, '4665.784')] [2023-03-08 22:10:17,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000060808_31133696.pth... [2023-03-08 22:10:17,798][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000060112_30777344.pth [2023-03-08 22:10:20,679][569911] Updated weights for policy 0, policy_version 60880 (0.0005) [2023-03-08 22:10:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 31195136. Throughput: 0: 12050.3. Samples: 31193676. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:10:22,789][569624] Avg episode reward: [(0, '4690.082')] [2023-03-08 22:10:24,112][569911] Updated weights for policy 0, policy_version 60960 (0.0005) [2023-03-08 22:10:27,450][569911] Updated weights for policy 0, policy_version 61040 (0.0005) [2023-03-08 22:10:27,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 31256576. Throughput: 0: 12019.7. Samples: 31228192. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:10:27,789][569624] Avg episode reward: [(0, '4554.160')] [2023-03-08 22:10:30,963][569911] Updated weights for policy 0, policy_version 61120 (0.0005) [2023-03-08 22:10:32,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 31313920. 
Throughput: 0: 11946.6. Samples: 31301496. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:10:32,789][569624] Avg episode reward: [(0, '4372.262')] [2023-03-08 22:10:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000061160_31313920.pth... [2023-03-08 22:10:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000060464_30957568.pth [2023-03-08 22:10:34,012][569911] Updated weights for policy 0, policy_version 61200 (0.0005) [2023-03-08 22:10:37,551][569911] Updated weights for policy 0, policy_version 61280 (0.0004) [2023-03-08 22:10:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 31375360. Throughput: 0: 12089.8. Samples: 31375344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:10:37,789][569624] Avg episode reward: [(0, '4245.142')] [2023-03-08 22:10:40,898][569911] Updated weights for policy 0, policy_version 61360 (0.0005) [2023-03-08 22:10:42,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 31436800. Throughput: 0: 12101.6. Samples: 31412288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:10:42,789][569624] Avg episode reward: [(0, '4488.990')] [2023-03-08 22:10:44,249][569911] Updated weights for policy 0, policy_version 61440 (0.0005) [2023-03-08 22:10:47,550][569911] Updated weights for policy 0, policy_version 61520 (0.0005) [2023-03-08 22:10:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 31498240. Throughput: 0: 12027.6. Samples: 31485940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:10:47,789][569624] Avg episode reward: [(0, '4231.038')] [2023-03-08 22:10:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000061520_31498240.pth... 
[2023-03-08 22:10:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000060808_31133696.pth [2023-03-08 22:10:50,910][569911] Updated weights for policy 0, policy_version 61600 (0.0005) [2023-03-08 22:10:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 31559680. Throughput: 0: 12112.0. Samples: 31559608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:10:52,800][569624] Avg episode reward: [(0, '3932.958')] [2023-03-08 22:10:54,239][569911] Updated weights for policy 0, policy_version 61680 (0.0004) [2023-03-08 22:10:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 31617024. Throughput: 0: 12175.1. Samples: 31595632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:10:57,800][569624] Avg episode reward: [(0, '4484.790')] [2023-03-08 22:10:57,876][569911] Updated weights for policy 0, policy_version 61760 (0.0005) [2023-03-08 22:11:01,198][569911] Updated weights for policy 0, policy_version 61840 (0.0005) [2023-03-08 22:11:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 31678464. Throughput: 0: 12148.8. Samples: 31666164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:11:02,800][569624] Avg episode reward: [(0, '4532.075')] [2023-03-08 22:11:02,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000061872_31678464.pth... [2023-03-08 22:11:02,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000061160_31313920.pth [2023-03-08 22:11:04,544][569911] Updated weights for policy 0, policy_version 61920 (0.0005) [2023-03-08 22:11:07,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 31739904. Throughput: 0: 12138.3. Samples: 31739896. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:11:07,800][569624] Avg episode reward: [(0, '4671.065')] [2023-03-08 22:11:07,822][569911] Updated weights for policy 0, policy_version 62000 (0.0005) [2023-03-08 22:11:11,339][569911] Updated weights for policy 0, policy_version 62080 (0.0005) [2023-03-08 22:11:12,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 31801344. Throughput: 0: 12187.3. Samples: 31776620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:11:12,799][569624] Avg episode reward: [(0, '4699.751')] [2023-03-08 22:11:14,711][569911] Updated weights for policy 0, policy_version 62160 (0.0005) [2023-03-08 22:11:17,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 31862784. Throughput: 0: 12198.6. Samples: 31850436. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:11:17,800][569624] Avg episode reward: [(0, '4751.272')] [2023-03-08 22:11:17,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000062232_31862784.pth... [2023-03-08 22:11:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000061520_31498240.pth [2023-03-08 22:11:17,934][569911] Updated weights for policy 0, policy_version 62240 (0.0004) [2023-03-08 22:11:21,255][569911] Updated weights for policy 0, policy_version 62320 (0.0005) [2023-03-08 22:11:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 31924224. Throughput: 0: 12189.7. Samples: 31923880. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:11:22,789][569624] Avg episode reward: [(0, '4475.620')] [2023-03-08 22:11:24,735][569911] Updated weights for policy 0, policy_version 62400 (0.0005) [2023-03-08 22:11:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 31981568. Throughput: 0: 12150.5. 
Samples: 31959060. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:11:27,789][569624] Avg episode reward: [(0, '4354.966')] [2023-03-08 22:11:28,191][569911] Updated weights for policy 0, policy_version 62480 (0.0005) [2023-03-08 22:11:31,519][569911] Updated weights for policy 0, policy_version 62560 (0.0005) [2023-03-08 22:11:32,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 32047104. Throughput: 0: 12107.6. Samples: 32030784. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:11:32,789][569624] Avg episode reward: [(0, '4692.008')] [2023-03-08 22:11:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000062592_32047104.pth... [2023-03-08 22:11:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000061872_31678464.pth [2023-03-08 22:11:34,874][569911] Updated weights for policy 0, policy_version 62640 (0.0005) [2023-03-08 22:11:37,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 32104448. Throughput: 0: 12112.4. Samples: 32104664. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:11:37,789][569624] Avg episode reward: [(0, '4603.375')] [2023-03-08 22:11:38,136][569911] Updated weights for policy 0, policy_version 62720 (0.0005) [2023-03-08 22:11:41,416][569911] Updated weights for policy 0, policy_version 62800 (0.0005) [2023-03-08 22:11:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 32165888. Throughput: 0: 12181.1. Samples: 32143780. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:11:42,789][569624] Avg episode reward: [(0, '4705.104')] [2023-03-08 22:11:44,785][569911] Updated weights for policy 0, policy_version 62880 (0.0005) [2023-03-08 22:11:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 32227328. 
Throughput: 0: 12197.5. Samples: 32215048. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:11:47,789][569624] Avg episode reward: [(0, '4638.210')] [2023-03-08 22:11:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000062944_32227328.pth... [2023-03-08 22:11:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000062232_31862784.pth [2023-03-08 22:11:48,313][569911] Updated weights for policy 0, policy_version 62960 (0.0005) [2023-03-08 22:11:51,780][569911] Updated weights for policy 0, policy_version 63040 (0.0005) [2023-03-08 22:11:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 32284672. Throughput: 0: 12125.9. Samples: 32285560. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:11:52,789][569624] Avg episode reward: [(0, '4628.321')] [2023-03-08 22:11:55,231][569911] Updated weights for policy 0, policy_version 63120 (0.0005) [2023-03-08 22:11:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 32346112. Throughput: 0: 12109.2. Samples: 32321532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:11:57,789][569624] Avg episode reward: [(0, '4646.189')] [2023-03-08 22:11:58,623][569911] Updated weights for policy 0, policy_version 63200 (0.0005) [2023-03-08 22:12:02,048][569911] Updated weights for policy 0, policy_version 63280 (0.0005) [2023-03-08 22:12:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 32407552. Throughput: 0: 12057.8. Samples: 32393036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:02,789][569624] Avg episode reward: [(0, '4671.995')] [2023-03-08 22:12:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000063296_32407552.pth... 
[2023-03-08 22:12:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000062592_32047104.pth [2023-03-08 22:12:05,541][569911] Updated weights for policy 0, policy_version 63360 (0.0005) [2023-03-08 22:12:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 32464896. Throughput: 0: 12020.8. Samples: 32464816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:07,789][569624] Avg episode reward: [(0, '4357.388')] [2023-03-08 22:12:08,797][569911] Updated weights for policy 0, policy_version 63440 (0.0005) [2023-03-08 22:12:12,112][569911] Updated weights for policy 0, policy_version 63520 (0.0005) [2023-03-08 22:12:12,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 32530432. Throughput: 0: 12067.9. Samples: 32502116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:12,789][569624] Avg episode reward: [(0, '4572.206')] [2023-03-08 22:12:15,565][569911] Updated weights for policy 0, policy_version 63600 (0.0004) [2023-03-08 22:12:17,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 32587776. Throughput: 0: 12096.2. Samples: 32575112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:17,789][569624] Avg episode reward: [(0, '4460.794')] [2023-03-08 22:12:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000063648_32587776.pth... [2023-03-08 22:12:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000062944_32227328.pth [2023-03-08 22:12:19,077][569911] Updated weights for policy 0, policy_version 63680 (0.0005) [2023-03-08 22:12:22,408][569911] Updated weights for policy 0, policy_version 63760 (0.0005) [2023-03-08 22:12:22,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12065.8). 
Total num frames: 32649216. Throughput: 0: 12046.5. Samples: 32646756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:22,789][569624] Avg episode reward: [(0, '4115.122')] [2023-03-08 22:12:25,661][569911] Updated weights for policy 0, policy_version 63840 (0.0005) [2023-03-08 22:12:27,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 32710656. Throughput: 0: 12006.3. Samples: 32684064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:27,789][569624] Avg episode reward: [(0, '4462.954')] [2023-03-08 22:12:28,833][569911] Updated weights for policy 0, policy_version 63920 (0.0004) [2023-03-08 22:12:32,167][569911] Updated weights for policy 0, policy_version 64000 (0.0005) [2023-03-08 22:12:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 32772096. Throughput: 0: 12103.8. Samples: 32759720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:32,789][569624] Avg episode reward: [(0, '4564.190')] [2023-03-08 22:12:32,832][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000064016_32776192.pth... [2023-03-08 22:12:32,834][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000063296_32407552.pth [2023-03-08 22:12:35,451][569911] Updated weights for policy 0, policy_version 64080 (0.0005) [2023-03-08 22:12:37,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 32833536. Throughput: 0: 12177.4. Samples: 32833544. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:37,789][569624] Avg episode reward: [(0, '4472.946')] [2023-03-08 22:12:38,873][569911] Updated weights for policy 0, policy_version 64160 (0.0005) [2023-03-08 22:12:42,160][569911] Updated weights for policy 0, policy_version 64240 (0.0005) [2023-03-08 22:12:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 32894976. Throughput: 0: 12206.7. Samples: 32870836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:42,789][569624] Avg episode reward: [(0, '4527.913')] [2023-03-08 22:12:45,465][569911] Updated weights for policy 0, policy_version 64320 (0.0005) [2023-03-08 22:12:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 32956416. Throughput: 0: 12260.4. Samples: 32944752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:47,789][569624] Avg episode reward: [(0, '4698.435')] [2023-03-08 22:12:47,822][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000064376_32960512.pth... [2023-03-08 22:12:47,823][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000063648_32587776.pth [2023-03-08 22:12:48,681][569911] Updated weights for policy 0, policy_version 64400 (0.0005) [2023-03-08 22:12:52,173][569911] Updated weights for policy 0, policy_version 64480 (0.0005) [2023-03-08 22:12:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 33017856. Throughput: 0: 12289.9. Samples: 33017864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:52,789][569624] Avg episode reward: [(0, '4563.408')] [2023-03-08 22:12:55,447][569911] Updated weights for policy 0, policy_version 64560 (0.0005) [2023-03-08 22:12:57,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12065.9). Total num frames: 33079296. Throughput: 0: 12295.2. 
Samples: 33055400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:12:57,789][569624] Avg episode reward: [(0, '4682.985')] [2023-03-08 22:12:58,908][569911] Updated weights for policy 0, policy_version 64640 (0.0006) [2023-03-08 22:13:02,293][569911] Updated weights for policy 0, policy_version 64720 (0.0006) [2023-03-08 22:13:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 33140736. Throughput: 0: 12290.3. Samples: 33128176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:02,789][569624] Avg episode reward: [(0, '4748.531')] [2023-03-08 22:13:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000064728_33140736.pth... [2023-03-08 22:13:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000064016_32776192.pth [2023-03-08 22:13:05,609][569911] Updated weights for policy 0, policy_version 64800 (0.0005) [2023-03-08 22:13:07,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 33202176. Throughput: 0: 12342.1. Samples: 33202148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:07,789][569624] Avg episode reward: [(0, '4676.826')] [2023-03-08 22:13:08,965][569911] Updated weights for policy 0, policy_version 64880 (0.0005) [2023-03-08 22:13:12,535][569911] Updated weights for policy 0, policy_version 64960 (0.0005) [2023-03-08 22:13:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 33259520. Throughput: 0: 12268.2. Samples: 33236132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:12,789][569624] Avg episode reward: [(0, '4698.148')] [2023-03-08 22:13:15,934][569911] Updated weights for policy 0, policy_version 65040 (0.0005) [2023-03-08 22:13:17,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 33325056. 
Throughput: 0: 12199.1. Samples: 33308680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:17,790][569624] Avg episode reward: [(0, '4693.612')] [2023-03-08 22:13:17,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000065088_33325056.pth... [2023-03-08 22:13:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000064376_32960512.pth [2023-03-08 22:13:18,939][569911] Updated weights for policy 0, policy_version 65120 (0.0005) [2023-03-08 22:13:22,333][569911] Updated weights for policy 0, policy_version 65200 (0.0005) [2023-03-08 22:13:22,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12288.0, 300 sec: 12093.6). Total num frames: 33386496. Throughput: 0: 12250.1. Samples: 33384800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:22,789][569624] Avg episode reward: [(0, '4673.149')] [2023-03-08 22:13:25,843][569911] Updated weights for policy 0, policy_version 65280 (0.0005) [2023-03-08 22:13:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 33443840. Throughput: 0: 12188.9. Samples: 33419336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:27,789][569624] Avg episode reward: [(0, '4736.181')] [2023-03-08 22:13:29,164][569911] Updated weights for policy 0, policy_version 65360 (0.0005) [2023-03-08 22:13:32,645][569911] Updated weights for policy 0, policy_version 65440 (0.0004) [2023-03-08 22:13:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 33505280. Throughput: 0: 12173.6. Samples: 33492564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:32,789][569624] Avg episode reward: [(0, '4707.284')] [2023-03-08 22:13:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000065440_33505280.pth... 
[2023-03-08 22:13:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000064728_33140736.pth [2023-03-08 22:13:36,193][569911] Updated weights for policy 0, policy_version 65520 (0.0005) [2023-03-08 22:13:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 33562624. Throughput: 0: 12104.8. Samples: 33562580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:37,789][569624] Avg episode reward: [(0, '4718.045')] [2023-03-08 22:13:39,632][569911] Updated weights for policy 0, policy_version 65600 (0.0005) [2023-03-08 22:13:42,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 33624064. Throughput: 0: 12085.0. Samples: 33599224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:42,789][569624] Avg episode reward: [(0, '4601.373')] [2023-03-08 22:13:42,823][569911] Updated weights for policy 0, policy_version 65680 (0.0005) [2023-03-08 22:13:46,198][569911] Updated weights for policy 0, policy_version 65760 (0.0005) [2023-03-08 22:13:47,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 33689600. Throughput: 0: 12132.9. Samples: 33674156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:47,789][569624] Avg episode reward: [(0, '4661.112')] [2023-03-08 22:13:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000065800_33689600.pth... [2023-03-08 22:13:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000065088_33325056.pth [2023-03-08 22:13:49,180][569911] Updated weights for policy 0, policy_version 65840 (0.0004) [2023-03-08 22:13:52,440][569911] Updated weights for policy 0, policy_version 65920 (0.0005) [2023-03-08 22:13:52,789][569624] Fps is (10 sec: 12697.8, 60 sec: 12219.8, 300 sec: 12135.3). 
Total num frames: 33751040. Throughput: 0: 12215.5. Samples: 33751844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:13:52,789][569624] Avg episode reward: [(0, '4711.766')] [2023-03-08 22:13:55,946][569911] Updated weights for policy 0, policy_version 66000 (0.0005) [2023-03-08 22:13:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 33812480. Throughput: 0: 12261.5. Samples: 33787900. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:13:57,789][569624] Avg episode reward: [(0, '4461.062')] [2023-03-08 22:13:59,414][569911] Updated weights for policy 0, policy_version 66080 (0.0005) [2023-03-08 22:14:02,465][569911] Updated weights for policy 0, policy_version 66160 (0.0004) [2023-03-08 22:14:02,789][569624] Fps is (10 sec: 12697.4, 60 sec: 12288.0, 300 sec: 12149.2). Total num frames: 33878016. Throughput: 0: 12296.5. Samples: 33862024. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:14:02,789][569624] Avg episode reward: [(0, '4576.492')] [2023-03-08 22:14:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000066168_33878016.pth... [2023-03-08 22:14:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000065440_33505280.pth [2023-03-08 22:14:05,825][569911] Updated weights for policy 0, policy_version 66240 (0.0005) [2023-03-08 22:14:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 33935360. Throughput: 0: 12233.5. Samples: 33935308. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:14:07,789][569624] Avg episode reward: [(0, '4484.415')] [2023-03-08 22:14:09,391][569911] Updated weights for policy 0, policy_version 66320 (0.0005) [2023-03-08 22:14:12,789][569624] Fps is (10 sec: 11469.0, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 33992704. Throughput: 0: 12196.8. Samples: 33968192. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:14:12,789][569624] Avg episode reward: [(0, '4437.926')] [2023-03-08 22:14:12,978][569911] Updated weights for policy 0, policy_version 66400 (0.0005) [2023-03-08 22:14:16,407][569911] Updated weights for policy 0, policy_version 66480 (0.0004) [2023-03-08 22:14:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12149.2). Total num frames: 34054144. Throughput: 0: 12149.3. Samples: 34039284. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:14:17,789][569624] Avg episode reward: [(0, '4617.557')] [2023-03-08 22:14:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000066512_34054144.pth... [2023-03-08 22:14:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000065800_33689600.pth [2023-03-08 22:14:19,761][569911] Updated weights for policy 0, policy_version 66560 (0.0005) [2023-03-08 22:14:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12135.3). Total num frames: 34111488. Throughput: 0: 12198.3. Samples: 34111504. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:14:22,789][569624] Avg episode reward: [(0, '4474.338')] [2023-03-08 22:14:23,180][569911] Updated weights for policy 0, policy_version 66640 (0.0004) [2023-03-08 22:14:26,415][569911] Updated weights for policy 0, policy_version 66720 (0.0005) [2023-03-08 22:14:27,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12149.2). Total num frames: 34177024. Throughput: 0: 12237.3. Samples: 34149900. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:14:27,789][569624] Avg episode reward: [(0, '4579.413')] [2023-03-08 22:14:29,714][569911] Updated weights for policy 0, policy_version 66800 (0.0005) [2023-03-08 22:14:32,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12163.0). Total num frames: 34238464. Throughput: 0: 12268.6. 
Samples: 34226240. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:14:32,789][569624] Avg episode reward: [(0, '4443.938')] [2023-03-08 22:14:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000066872_34238464.pth... [2023-03-08 22:14:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000066168_33878016.pth [2023-03-08 22:14:32,890][569911] Updated weights for policy 0, policy_version 66880 (0.0004) [2023-03-08 22:14:36,345][569911] Updated weights for policy 0, policy_version 66960 (0.0005) [2023-03-08 22:14:37,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12149.2). Total num frames: 34299904. Throughput: 0: 12142.8. Samples: 34298272. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:14:37,789][569624] Avg episode reward: [(0, '4672.153')] [2023-03-08 22:14:39,608][569911] Updated weights for policy 0, policy_version 67040 (0.0005) [2023-03-08 22:14:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12163.0). Total num frames: 34361344. Throughput: 0: 12189.9. Samples: 34336444. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:14:42,789][569624] Avg episode reward: [(0, '4634.299')] [2023-03-08 22:14:42,996][569911] Updated weights for policy 0, policy_version 67120 (0.0005) [2023-03-08 22:14:46,371][569911] Updated weights for policy 0, policy_version 67200 (0.0004) [2023-03-08 22:14:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12163.0). Total num frames: 34422784. Throughput: 0: 12146.8. Samples: 34408628. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:14:47,789][569624] Avg episode reward: [(0, '4754.703')] [2023-03-08 22:14:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000067232_34422784.pth... 
[2023-03-08 22:14:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000066512_34054144.pth [2023-03-08 22:14:49,556][569911] Updated weights for policy 0, policy_version 67280 (0.0005) [2023-03-08 22:14:52,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12219.7, 300 sec: 12163.0). Total num frames: 34484224. Throughput: 0: 12203.4. Samples: 34484460. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:14:52,789][569624] Avg episode reward: [(0, '4707.407')] [2023-03-08 22:14:52,854][569911] Updated weights for policy 0, policy_version 67360 (0.0005) [2023-03-08 22:14:56,185][569911] Updated weights for policy 0, policy_version 67440 (0.0005) [2023-03-08 22:14:57,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12288.0, 300 sec: 12176.9). Total num frames: 34549760. Throughput: 0: 12286.7. Samples: 34521096. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:14:57,789][569624] Avg episode reward: [(0, '4399.731')] [2023-03-08 22:14:59,426][569911] Updated weights for policy 0, policy_version 67520 (0.0004) [2023-03-08 22:15:02,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12151.5, 300 sec: 12176.9). Total num frames: 34607104. Throughput: 0: 12357.8. Samples: 34595384. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:15:02,789][569624] Avg episode reward: [(0, '4544.289')] [2023-03-08 22:15:02,812][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000067600_34611200.pth... [2023-03-08 22:15:02,812][569911] Updated weights for policy 0, policy_version 67600 (0.0004) [2023-03-08 22:15:02,814][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000066872_34238464.pth [2023-03-08 22:15:06,295][569911] Updated weights for policy 0, policy_version 67680 (0.0004) [2023-03-08 22:15:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12190.8). 
Total num frames: 34668544. Throughput: 0: 12373.4. Samples: 34668308. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:15:07,789][569624] Avg episode reward: [(0, '4564.490')] [2023-03-08 22:15:09,616][569911] Updated weights for policy 0, policy_version 67760 (0.0005) [2023-03-08 22:15:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12176.9). Total num frames: 34725888. Throughput: 0: 12339.0. Samples: 34705156. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:15:12,789][569624] Avg episode reward: [(0, '4613.518')] [2023-03-08 22:15:13,267][569911] Updated weights for policy 0, policy_version 67840 (0.0005) [2023-03-08 22:15:16,597][569911] Updated weights for policy 0, policy_version 67920 (0.0006) [2023-03-08 22:15:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12176.9). Total num frames: 34787328. Throughput: 0: 12189.4. Samples: 34774764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:15:17,789][569624] Avg episode reward: [(0, '3867.296')] [2023-03-08 22:15:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000067944_34787328.pth... [2023-03-08 22:15:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000067232_34422784.pth [2023-03-08 22:15:20,012][569911] Updated weights for policy 0, policy_version 68000 (0.0005) [2023-03-08 22:15:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12176.9). Total num frames: 34848768. Throughput: 0: 12164.8. Samples: 34845688. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:15:22,789][569624] Avg episode reward: [(0, '4193.729')] [2023-03-08 22:15:23,412][569911] Updated weights for policy 0, policy_version 68080 (0.0005) [2023-03-08 22:15:26,668][569911] Updated weights for policy 0, policy_version 68160 (0.0005) [2023-03-08 22:15:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12176.9). Total num frames: 34906112. Throughput: 0: 12151.9. Samples: 34883280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:15:27,789][569624] Avg episode reward: [(0, '4067.235')] [2023-03-08 22:15:30,877][569911] Updated weights for policy 0, policy_version 68240 (0.0005) [2023-03-08 22:15:32,789][569624] Fps is (10 sec: 11059.1, 60 sec: 12014.9, 300 sec: 12149.1). Total num frames: 34959360. Throughput: 0: 11957.9. Samples: 34946736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:15:32,789][569624] Avg episode reward: [(0, '3579.478')] [2023-03-08 22:15:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000068280_34959360.pth... [2023-03-08 22:15:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000067600_34611200.pth [2023-03-08 22:15:34,464][569911] Updated weights for policy 0, policy_version 68320 (0.0005) [2023-03-08 22:15:37,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11946.7, 300 sec: 12135.3). Total num frames: 35016704. Throughput: 0: 11827.2. Samples: 35016684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:15:37,789][569624] Avg episode reward: [(0, '4297.405')] [2023-03-08 22:15:37,874][569911] Updated weights for policy 0, policy_version 68400 (0.0005) [2023-03-08 22:15:41,342][569911] Updated weights for policy 0, policy_version 68480 (0.0004) [2023-03-08 22:15:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12135.3). Total num frames: 35078144. Throughput: 0: 11808.2. 
Samples: 35052464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:15:42,790][569624] Avg episode reward: [(0, '4324.680')] [2023-03-08 22:15:44,598][569911] Updated weights for policy 0, policy_version 68560 (0.0005) [2023-03-08 22:15:47,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11946.7, 300 sec: 12135.3). Total num frames: 35139584. Throughput: 0: 11814.9. Samples: 35127052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:15:47,789][569624] Avg episode reward: [(0, '4410.758')] [2023-03-08 22:15:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000068632_35139584.pth... [2023-03-08 22:15:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000067944_34787328.pth [2023-03-08 22:15:47,949][569911] Updated weights for policy 0, policy_version 68640 (0.0005) [2023-03-08 22:15:51,496][569911] Updated weights for policy 0, policy_version 68720 (0.0005) [2023-03-08 22:15:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12135.3). Total num frames: 35196928. Throughput: 0: 11747.4. Samples: 35196940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:15:52,800][569624] Avg episode reward: [(0, '4343.643')] [2023-03-08 22:15:54,957][569911] Updated weights for policy 0, policy_version 68800 (0.0005) [2023-03-08 22:15:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 12135.3). Total num frames: 35258368. Throughput: 0: 11746.8. Samples: 35233760. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:15:57,800][569624] Avg episode reward: [(0, '4526.958')] [2023-03-08 22:15:58,331][569911] Updated weights for policy 0, policy_version 68880 (0.0005) [2023-03-08 22:16:01,770][569911] Updated weights for policy 0, policy_version 68960 (0.0005) [2023-03-08 22:16:02,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 12121.4). Total num frames: 35315712. 
Throughput: 0: 11751.3. Samples: 35303572. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:16:02,789][569624] Avg episode reward: [(0, '4657.717')] [2023-03-08 22:16:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000068976_35315712.pth... [2023-03-08 22:16:02,797][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000068280_34959360.pth [2023-03-08 22:16:05,196][569911] Updated weights for policy 0, policy_version 69040 (0.0005) [2023-03-08 22:16:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 12121.4). Total num frames: 35377152. Throughput: 0: 11810.3. Samples: 35377152. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:16:07,789][569624] Avg episode reward: [(0, '4615.606')] [2023-03-08 22:16:08,579][569911] Updated weights for policy 0, policy_version 69120 (0.0005) [2023-03-08 22:16:11,983][569911] Updated weights for policy 0, policy_version 69200 (0.0004) [2023-03-08 22:16:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 12121.4). Total num frames: 35438592. Throughput: 0: 11769.3. Samples: 35412900. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:16:12,789][569624] Avg episode reward: [(0, '4564.646')] [2023-03-08 22:16:15,289][569911] Updated weights for policy 0, policy_version 69280 (0.0004) [2023-03-08 22:16:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 12121.4). Total num frames: 35500032. Throughput: 0: 12000.5. Samples: 35486760. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:16:17,789][569624] Avg episode reward: [(0, '4393.134')] [2023-03-08 22:16:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000069336_35500032.pth... 
[2023-03-08 22:16:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000068632_35139584.pth
[2023-03-08 22:16:18,687][569911] Updated weights for policy 0, policy_version 69360 (0.0005)
[2023-03-08 22:16:22,193][569911] Updated weights for policy 0, policy_version 69440 (0.0005)
[2023-03-08 22:16:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 12121.4). Total num frames: 35557376. Throughput: 0: 12015.5. Samples: 35557384. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-03-08 22:16:22,789][569624] Avg episode reward: [(0, '4551.826')]
[2023-03-08 22:16:25,494][569911] Updated weights for policy 0, policy_version 69520 (0.0005)
[2023-03-08 22:16:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12107.5). Total num frames: 35618816. Throughput: 0: 12041.1. Samples: 35594312. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-03-08 22:16:27,789][569624] Avg episode reward: [(0, '4476.958')]
[2023-03-08 22:16:28,873][569911] Updated weights for policy 0, policy_version 69600 (0.0004)
[2023-03-08 22:16:32,211][569911] Updated weights for policy 0, policy_version 69680 (0.0004)
[2023-03-08 22:16:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12015.0, 300 sec: 12121.4). Total num frames: 35680256. Throughput: 0: 12020.2. Samples: 35667960. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-03-08 22:16:32,789][569624] Avg episode reward: [(0, '4471.985')]
[2023-03-08 22:16:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000069688_35680256.pth...
[2023-03-08 22:16:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000068976_35315712.pth
[2023-03-08 22:16:35,592][569911] Updated weights for policy 0, policy_version 69760 (0.0005)
[2023-03-08 22:16:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 35741696. Throughput: 0: 12083.5. Samples: 35740700. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
[2023-03-08 22:16:37,789][569624] Avg episode reward: [(0, '4588.839')]
[2023-03-08 22:16:39,115][569911] Updated weights for policy 0, policy_version 69840 (0.0005)
[2023-03-08 22:16:42,468][569911] Updated weights for policy 0, policy_version 69920 (0.0005)
[2023-03-08 22:16:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 35803136. Throughput: 0: 12056.3. Samples: 35776292. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-03-08 22:16:42,789][569624] Avg episode reward: [(0, '4589.108')]
[2023-03-08 22:16:45,759][569911] Updated weights for policy 0, policy_version 70000 (0.0005)
[2023-03-08 22:16:47,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 35860480. Throughput: 0: 12107.6. Samples: 35848412. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-03-08 22:16:47,789][569624] Avg episode reward: [(0, '4478.947')]
[2023-03-08 22:16:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000070040_35860480.pth...
[2023-03-08 22:16:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000069336_35500032.pth
[2023-03-08 22:16:49,215][569911] Updated weights for policy 0, policy_version 70080 (0.0005)
[2023-03-08 22:16:52,505][569911] Updated weights for policy 0, policy_version 70160 (0.0005)
[2023-03-08 22:16:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 35921920. Throughput: 0: 12107.6. Samples: 35921992. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-03-08 22:16:52,789][569624] Avg episode reward: [(0, '4582.205')]
[2023-03-08 22:16:55,977][569911] Updated weights for policy 0, policy_version 70240 (0.0005)
[2023-03-08 22:16:57,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 35983360. Throughput: 0: 12115.3. Samples: 35958088. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-03-08 22:16:57,789][569624] Avg episode reward: [(0, '4657.529')]
[2023-03-08 22:16:59,305][569911] Updated weights for policy 0, policy_version 70320 (0.0005)
[2023-03-08 22:17:02,593][569911] Updated weights for policy 0, policy_version 70400 (0.0005)
[2023-03-08 22:17:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12135.3). Total num frames: 36044800. Throughput: 0: 12129.4. Samples: 36032584. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-03-08 22:17:02,789][569624] Avg episode reward: [(0, '4675.848')]
[2023-03-08 22:17:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000070400_36044800.pth...
[2023-03-08 22:17:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000069688_35680256.pth
[2023-03-08 22:17:05,771][569911] Updated weights for policy 0, policy_version 70480 (0.0005)
[2023-03-08 22:17:07,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 36110336. Throughput: 0: 12232.5. Samples: 36107848. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-03-08 22:17:07,789][569624] Avg episode reward: [(0, '4708.183')]
[2023-03-08 22:17:09,115][569911] Updated weights for policy 0, policy_version 70560 (0.0005)
[2023-03-08 22:17:12,484][569911] Updated weights for policy 0, policy_version 70640 (0.0005)
[2023-03-08 22:17:12,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12151.5, 300 sec: 12135.3). Total num frames: 36167680. Throughput: 0: 12195.4. Samples: 36143104. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-03-08 22:17:12,789][569624] Avg episode reward: [(0, '4580.219')]
[2023-03-08 22:17:15,904][569911] Updated weights for policy 0, policy_version 70720 (0.0005)
[2023-03-08 22:17:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12135.3). Total num frames: 36229120. Throughput: 0: 12192.6. Samples: 36216628. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-03-08 22:17:17,789][569624] Avg episode reward: [(0, '4666.657')]
[2023-03-08 22:17:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000070760_36229120.pth...
[2023-03-08 22:17:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000070040_35860480.pth
[2023-03-08 22:17:19,394][569911] Updated weights for policy 0, policy_version 70800 (0.0004)
[2023-03-08 22:17:22,740][569911] Updated weights for policy 0, policy_version 70880 (0.0005)
[2023-03-08 22:17:22,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 36290560. Throughput: 0: 12147.6. Samples: 36287340. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
[2023-03-08 22:17:22,789][569624] Avg episode reward: [(0, '4656.067')]
[2023-03-08 22:17:26,164][569911] Updated weights for policy 0, policy_version 70960 (0.0005)
[2023-03-08 22:17:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 36347904. Throughput: 0: 12156.0. Samples: 36323312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:17:27,789][569624] Avg episode reward: [(0, '4660.837')]
[2023-03-08 22:17:29,582][569911] Updated weights for policy 0, policy_version 71040 (0.0004)
[2023-03-08 22:17:32,722][569911] Updated weights for policy 0, policy_version 71120 (0.0005)
[2023-03-08 22:17:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 36413440. Throughput: 0: 12244.7. Samples: 36399424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:17:32,800][569624] Avg episode reward: [(0, '4701.366')]
[2023-03-08 22:17:32,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000071120_36413440.pth...
[2023-03-08 22:17:32,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000070400_36044800.pth
[2023-03-08 22:17:36,238][569911] Updated weights for policy 0, policy_version 71200 (0.0005)
[2023-03-08 22:17:37,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 36470784. Throughput: 0: 12195.2. Samples: 36470776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:17:37,800][569624] Avg episode reward: [(0, '4723.483')]
[2023-03-08 22:17:39,595][569911] Updated weights for policy 0, policy_version 71280 (0.0005)
[2023-03-08 22:17:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 36532224. Throughput: 0: 12205.7. Samples: 36507344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:17:42,799][569624] Avg episode reward: [(0, '4436.014')]
[2023-03-08 22:17:42,846][569911] Updated weights for policy 0, policy_version 71360 (0.0005)
[2023-03-08 22:17:46,170][569911] Updated weights for policy 0, policy_version 71440 (0.0004)
[2023-03-08 22:17:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 36593664. Throughput: 0: 12196.8. Samples: 36581440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:17:47,800][569624] Avg episode reward: [(0, '4464.249')]
[2023-03-08 22:17:47,825][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000071480_36597760.pth...
[2023-03-08 22:17:47,827][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000070760_36229120.pth
[2023-03-08 22:17:49,420][569911] Updated weights for policy 0, policy_version 71520 (0.0005)
[2023-03-08 22:17:52,526][569911] Updated weights for policy 0, policy_version 71600 (0.0004)
[2023-03-08 22:17:52,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12288.0, 300 sec: 12135.3). Total num frames: 36659200. Throughput: 0: 12253.7. Samples: 36659264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:17:52,800][569624] Avg episode reward: [(0, '4510.267')]
[2023-03-08 22:17:55,892][569911] Updated weights for policy 0, policy_version 71680 (0.0005)
[2023-03-08 22:17:57,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12288.0, 300 sec: 12135.3). Total num frames: 36720640. Throughput: 0: 12286.9. Samples: 36696016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:17:57,800][569624] Avg episode reward: [(0, '4510.037')]
[2023-03-08 22:17:59,353][569911] Updated weights for policy 0, policy_version 71760 (0.0005)
[2023-03-08 22:18:02,696][569911] Updated weights for policy 0, policy_version 71840 (0.0005)
[2023-03-08 22:18:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12135.3). Total num frames: 36782080. Throughput: 0: 12270.9. Samples: 36768816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:18:02,800][569624] Avg episode reward: [(0, '4219.085')]
[2023-03-08 22:18:02,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000071840_36782080.pth...
[2023-03-08 22:18:02,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000071120_36413440.pth
[2023-03-08 22:18:06,008][569911] Updated weights for policy 0, policy_version 71920 (0.0005)
[2023-03-08 22:18:07,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12149.1). Total num frames: 36843520. Throughput: 0: 12358.2. Samples: 36843460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:18:07,790][569624] Avg episode reward: [(0, '4265.939')]
[2023-03-08 22:18:09,221][569911] Updated weights for policy 0, policy_version 72000 (0.0005)
[2023-03-08 22:18:12,428][569911] Updated weights for policy 0, policy_version 72080 (0.0005)
[2023-03-08 22:18:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12135.3). Total num frames: 36904960. Throughput: 0: 12380.8. Samples: 36880448. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-03-08 22:18:12,800][569624] Avg episode reward: [(0, '4488.001')]
[2023-03-08 22:18:15,896][569911] Updated weights for policy 0, policy_version 72160 (0.0005)
[2023-03-08 22:18:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12135.3). Total num frames: 36966400. Throughput: 0: 12320.7. Samples: 36953856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-03-08 22:18:17,800][569624] Avg episode reward: [(0, '4584.659')]
[2023-03-08 22:18:17,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000072200_36966400.pth...
[2023-03-08 22:18:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000071480_36597760.pth
[2023-03-08 22:18:19,431][569911] Updated weights for policy 0, policy_version 72240 (0.0005)
[2023-03-08 22:18:22,464][569911] Updated weights for policy 0, policy_version 72320 (0.0005)
[2023-03-08 22:18:22,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12288.0, 300 sec: 12149.2). Total num frames: 37027840. Throughput: 0: 12391.0. Samples: 37028372. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-03-08 22:18:22,789][569624] Avg episode reward: [(0, '4644.798')]
[2023-03-08 22:18:25,752][569911] Updated weights for policy 0, policy_version 72400 (0.0005)
[2023-03-08 22:18:27,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12424.5, 300 sec: 12163.0). Total num frames: 37093376. Throughput: 0: 12387.4. Samples: 37064776. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-03-08 22:18:27,789][569624] Avg episode reward: [(0, '4474.352')]
[2023-03-08 22:18:29,187][569911] Updated weights for policy 0, policy_version 72480 (0.0005)
[2023-03-08 22:18:32,584][569911] Updated weights for policy 0, policy_version 72560 (0.0005)
[2023-03-08 22:18:32,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12288.0, 300 sec: 12163.0). Total num frames: 37150720. Throughput: 0: 12377.8. Samples: 37138440. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-03-08 22:18:32,789][569624] Avg episode reward: [(0, '4494.487')]
[2023-03-08 22:18:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000072560_37150720.pth...
[2023-03-08 22:18:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000071840_36782080.pth
[2023-03-08 22:18:36,102][569911] Updated weights for policy 0, policy_version 72640 (0.0005)
[2023-03-08 22:18:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 12288.0, 300 sec: 12149.2). Total num frames: 37208064. Throughput: 0: 12197.0. Samples: 37208128. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-03-08 22:18:37,789][569624] Avg episode reward: [(0, '4724.935')]
[2023-03-08 22:18:39,619][569911] Updated weights for policy 0, policy_version 72720 (0.0005)
[2023-03-08 22:18:42,789][569624] Fps is (10 sec: 11469.0, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 37265408. Throughput: 0: 12180.4. Samples: 37244132. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-03-08 22:18:42,789][569624] Avg episode reward: [(0, '4548.185')]
[2023-03-08 22:18:43,144][569911] Updated weights for policy 0, policy_version 72800 (0.0004)
[2023-03-08 22:18:46,275][569911] Updated weights for policy 0, policy_version 72880 (0.0005)
[2023-03-08 22:18:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12135.3). Total num frames: 37330944. Throughput: 0: 12215.3. Samples: 37318504. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
[2023-03-08 22:18:47,789][569624] Avg episode reward: [(0, '4612.114')]
[2023-03-08 22:18:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000072912_37330944.pth...
[2023-03-08 22:18:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000072200_36966400.pth
[2023-03-08 22:18:49,704][569911] Updated weights for policy 0, policy_version 72960 (0.0005)
[2023-03-08 22:18:52,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 37392384. Throughput: 0: 12176.7. Samples: 37391412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:18:52,789][569624] Avg episode reward: [(0, '4630.244')]
[2023-03-08 22:18:52,966][569911] Updated weights for policy 0, policy_version 73040 (0.0005)
[2023-03-08 22:18:56,360][569911] Updated weights for policy 0, policy_version 73120 (0.0004)
[2023-03-08 22:18:57,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 37453824. Throughput: 0: 12150.6. Samples: 37427224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:18:57,789][569624] Avg episode reward: [(0, '4470.871')]
[2023-03-08 22:18:59,883][569911] Updated weights for policy 0, policy_version 73200 (0.0005)
[2023-03-08 22:19:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 37507072. Throughput: 0: 12068.9. Samples: 37496956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:02,789][569624] Avg episode reward: [(0, '4351.976')]
[2023-03-08 22:19:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000073264_37511168.pth...
[2023-03-08 22:19:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000072560_37150720.pth
[2023-03-08 22:19:03,487][569911] Updated weights for policy 0, policy_version 73280 (0.0005)
[2023-03-08 22:19:06,840][569911] Updated weights for policy 0, policy_version 73360 (0.0004)
[2023-03-08 22:19:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 37568512. Throughput: 0: 12003.1. Samples: 37568512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:07,789][569624] Avg episode reward: [(0, '4582.825')]
[2023-03-08 22:19:10,569][569911] Updated weights for policy 0, policy_version 73440 (0.0005)
[2023-03-08 22:19:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 37625856. Throughput: 0: 11922.3. Samples: 37601280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:12,789][569624] Avg episode reward: [(0, '4218.244')]
[2023-03-08 22:19:13,938][569911] Updated weights for policy 0, policy_version 73520 (0.0004)
[2023-03-08 22:19:17,123][569911] Updated weights for policy 0, policy_version 73600 (0.0005)
[2023-03-08 22:19:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12135.3). Total num frames: 37691392. Throughput: 0: 11925.2. Samples: 37675072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:17,789][569624] Avg episode reward: [(0, '4514.760')]
[2023-03-08 22:19:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000073616_37691392.pth...
[2023-03-08 22:19:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000072912_37330944.pth
[2023-03-08 22:19:20,457][569911] Updated weights for policy 0, policy_version 73680 (0.0004)
[2023-03-08 22:19:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 37748736. Throughput: 0: 12016.4. Samples: 37748864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:22,789][569624] Avg episode reward: [(0, '4264.366')]
[2023-03-08 22:19:23,875][569911] Updated weights for policy 0, policy_version 73760 (0.0005)
[2023-03-08 22:19:26,991][569911] Updated weights for policy 0, policy_version 73840 (0.0005)
[2023-03-08 22:19:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 37814272. Throughput: 0: 12083.8. Samples: 37787904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:27,789][569624] Avg episode reward: [(0, '4247.434')]
[2023-03-08 22:19:30,457][569911] Updated weights for policy 0, policy_version 73920 (0.0004)
[2023-03-08 22:19:32,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 37871616. Throughput: 0: 12018.1. Samples: 37859320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:32,789][569624] Avg episode reward: [(0, '4324.800')]
[2023-03-08 22:19:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000073968_37871616.pth...
[2023-03-08 22:19:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000073264_37511168.pth
[2023-03-08 22:19:33,993][569911] Updated weights for policy 0, policy_version 74000 (0.0005)
[2023-03-08 22:19:37,195][569911] Updated weights for policy 0, policy_version 74080 (0.0005)
[2023-03-08 22:19:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 37933056. Throughput: 0: 12035.5. Samples: 37933012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:37,789][569624] Avg episode reward: [(0, '4491.359')]
[2023-03-08 22:19:40,662][569911] Updated weights for policy 0, policy_version 74160 (0.0005)
[2023-03-08 22:19:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 37994496. Throughput: 0: 12046.7. Samples: 37969324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:42,789][569624] Avg episode reward: [(0, '4668.895')]
[2023-03-08 22:19:44,016][569911] Updated weights for policy 0, policy_version 74240 (0.0005)
[2023-03-08 22:19:47,512][569911] Updated weights for policy 0, policy_version 74320 (0.0005)
[2023-03-08 22:19:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 38051840. Throughput: 0: 12077.8. Samples: 38040456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:47,789][569624] Avg episode reward: [(0, '4651.490')]
[2023-03-08 22:19:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000074320_38051840.pth...
[2023-03-08 22:19:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000073616_37691392.pth
[2023-03-08 22:19:50,870][569911] Updated weights for policy 0, policy_version 74400 (0.0005)
[2023-03-08 22:19:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 38113280. Throughput: 0: 12113.2. Samples: 38113604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:52,789][569624] Avg episode reward: [(0, '4630.740')]
[2023-03-08 22:19:54,180][569911] Updated weights for policy 0, policy_version 74480 (0.0005)
[2023-03-08 22:19:57,541][569911] Updated weights for policy 0, policy_version 74560 (0.0005)
[2023-03-08 22:19:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 38174720. Throughput: 0: 12196.8. Samples: 38150136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:19:57,789][569624] Avg episode reward: [(0, '4603.333')]
[2023-03-08 22:20:00,940][569911] Updated weights for policy 0, policy_version 74640 (0.0005)
[2023-03-08 22:20:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 38236160. Throughput: 0: 12186.9. Samples: 38223480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:20:02,789][569624] Avg episode reward: [(0, '4648.813')]
[2023-03-08 22:20:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000074680_38236160.pth...
[2023-03-08 22:20:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000073968_37871616.pth
[2023-03-08 22:20:04,200][569911] Updated weights for policy 0, policy_version 74720 (0.0005)
[2023-03-08 22:20:07,579][569911] Updated weights for policy 0, policy_version 74800 (0.0004)
[2023-03-08 22:20:07,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 38297600. Throughput: 0: 12193.1. Samples: 38297552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:20:07,789][569624] Avg episode reward: [(0, '4633.488')]
[2023-03-08 22:20:11,050][569911] Updated weights for policy 0, policy_version 74880 (0.0005)
[2023-03-08 22:20:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 38359040. Throughput: 0: 12094.3. Samples: 38332148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:20:12,789][569624] Avg episode reward: [(0, '4715.627')]
[2023-03-08 22:20:14,423][569911] Updated weights for policy 0, policy_version 74960 (0.0005)
[2023-03-08 22:20:17,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 38416384. Throughput: 0: 12107.8. Samples: 38404168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:20:17,789][569624] Avg episode reward: [(0, '4650.985')]
[2023-03-08 22:20:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000075032_38416384.pth...
[2023-03-08 22:20:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000074320_38051840.pth
[2023-03-08 22:20:17,958][569911] Updated weights for policy 0, policy_version 75040 (0.0005)
[2023-03-08 22:20:21,020][569911] Updated weights for policy 0, policy_version 75120 (0.0005)
[2023-03-08 22:20:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 38481920. Throughput: 0: 12147.5. Samples: 38479648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:20:22,789][569624] Avg episode reward: [(0, '4619.768')]
[2023-03-08 22:20:24,255][569911] Updated weights for policy 0, policy_version 75200 (0.0005)
[2023-03-08 22:20:27,432][569911] Updated weights for policy 0, policy_version 75280 (0.0005)
[2023-03-08 22:20:27,789][569624] Fps is (10 sec: 13107.2, 60 sec: 12219.8, 300 sec: 12163.0). Total num frames: 38547456. Throughput: 0: 12211.7. Samples: 38518848. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-03-08 22:20:27,789][569624] Avg episode reward: [(0, '4610.848')]
[2023-03-08 22:20:30,842][569911] Updated weights for policy 0, policy_version 75360 (0.0005)
[2023-03-08 22:20:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12163.0). Total num frames: 38604800. Throughput: 0: 12268.1. Samples: 38592520. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-03-08 22:20:32,789][569624] Avg episode reward: [(0, '4646.166')]
[2023-03-08 22:20:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000075400_38604800.pth...
[2023-03-08 22:20:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000074680_38236160.pth
[2023-03-08 22:20:34,336][569911] Updated weights for policy 0, policy_version 75440 (0.0005)
[2023-03-08 22:20:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 12151.5, 300 sec: 12149.2). Total num frames: 38662144. Throughput: 0: 12201.1. Samples: 38662656. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-03-08 22:20:37,789][569624] Avg episode reward: [(0, '4370.989')]
[2023-03-08 22:20:37,821][569911] Updated weights for policy 0, policy_version 75520 (0.0005)
[2023-03-08 22:20:41,078][569911] Updated weights for policy 0, policy_version 75600 (0.0005)
[2023-03-08 22:20:42,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12219.7, 300 sec: 12163.0). Total num frames: 38727680. Throughput: 0: 12216.6. Samples: 38699884. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-03-08 22:20:42,790][569624] Avg episode reward: [(0, '4620.848')]
[2023-03-08 22:20:44,460][569911] Updated weights for policy 0, policy_version 75680 (0.0005)
[2023-03-08 22:20:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12163.0). Total num frames: 38785024. Throughput: 0: 12206.0. Samples: 38772748. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-03-08 22:20:47,789][569624] Avg episode reward: [(0, '4192.355')]
[2023-03-08 22:20:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000075752_38785024.pth...
[2023-03-08 22:20:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000075032_38416384.pth
[2023-03-08 22:20:47,906][569911] Updated weights for policy 0, policy_version 75760 (0.0005)
[2023-03-08 22:20:51,266][569911] Updated weights for policy 0, policy_version 75840 (0.0005)
[2023-03-08 22:20:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12163.0). Total num frames: 38846464. Throughput: 0: 12153.0. Samples: 38844436. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-03-08 22:20:52,789][569624] Avg episode reward: [(0, '4471.285')]
[2023-03-08 22:20:54,619][569911] Updated weights for policy 0, policy_version 75920 (0.0005)
[2023-03-08 22:20:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12176.9). Total num frames: 38907904. Throughput: 0: 12219.9. Samples: 38882044. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-03-08 22:20:57,789][569624] Avg episode reward: [(0, '4425.145')]
[2023-03-08 22:20:58,089][569911] Updated weights for policy 0, policy_version 76000 (0.0005)
[2023-03-08 22:21:01,499][569911] Updated weights for policy 0, policy_version 76080 (0.0005)
[2023-03-08 22:21:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12163.0). Total num frames: 38965248. Throughput: 0: 12197.0. Samples: 38953032. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-03-08 22:21:02,789][569624] Avg episode reward: [(0, '4540.096')]
[2023-03-08 22:21:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000076104_38965248.pth...
[2023-03-08 22:21:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000075400_38604800.pth
[2023-03-08 22:21:04,846][569911] Updated weights for policy 0, policy_version 76160 (0.0004)
[2023-03-08 22:21:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12163.0). Total num frames: 39026688. Throughput: 0: 12156.4. Samples: 39026688. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
[2023-03-08 22:21:07,789][569624] Avg episode reward: [(0, '4721.769')]
[2023-03-08 22:21:08,338][569911] Updated weights for policy 0, policy_version 76240 (0.0005)
[2023-03-08 22:21:11,651][569911] Updated weights for policy 0, policy_version 76320 (0.0004)
[2023-03-08 22:21:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12163.0). Total num frames: 39088128. Throughput: 0: 12095.0. Samples: 39063124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:12,789][569624] Avg episode reward: [(0, '4411.760')]
[2023-03-08 22:21:15,096][569911] Updated weights for policy 0, policy_version 76400 (0.0005)
[2023-03-08 22:21:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12176.9). Total num frames: 39149568. Throughput: 0: 12032.0. Samples: 39133960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:17,799][569624] Avg episode reward: [(0, '4586.129')]
[2023-03-08 22:21:17,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000076464_39149568.pth...
[2023-03-08 22:21:17,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000075752_38785024.pth
[2023-03-08 22:21:18,437][569911] Updated weights for policy 0, policy_version 76480 (0.0004)
[2023-03-08 22:21:22,066][569911] Updated weights for policy 0, policy_version 76560 (0.0004)
[2023-03-08 22:21:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 12149.2). Total num frames: 39202816. Throughput: 0: 12015.7. Samples: 39203364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:22,800][569624] Avg episode reward: [(0, '4408.975')]
[2023-03-08 22:21:25,775][569911] Updated weights for policy 0, policy_version 76640 (0.0005)
[2023-03-08 22:21:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11878.4, 300 sec: 12135.3). Total num frames: 39260160. Throughput: 0: 11949.2. Samples: 39237596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:27,800][569624] Avg episode reward: [(0, '4228.029')]
[2023-03-08 22:21:29,202][569911] Updated weights for policy 0, policy_version 76720 (0.0005)
[2023-03-08 22:21:32,477][569911] Updated weights for policy 0, policy_version 76800 (0.0005)
[2023-03-08 22:21:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12135.3). Total num frames: 39321600. Throughput: 0: 11925.1. Samples: 39309376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:32,800][569624] Avg episode reward: [(0, '4268.466')]
[2023-03-08 22:21:32,842][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000076808_39325696.pth...
[2023-03-08 22:21:32,844][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000076104_38965248.pth
[2023-03-08 22:21:36,031][569911] Updated weights for policy 0, policy_version 76880 (0.0005)
[2023-03-08 22:21:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12121.4). Total num frames: 39378944. Throughput: 0: 11884.2. Samples: 39379224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:37,800][569624] Avg episode reward: [(0, '4130.374')]
[2023-03-08 22:21:39,583][569911] Updated weights for policy 0, policy_version 76960 (0.0005)
[2023-03-08 22:21:42,668][569911] Updated weights for policy 0, policy_version 77040 (0.0005)
[2023-03-08 22:21:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12149.2). Total num frames: 39444480. Throughput: 0: 11879.2. Samples: 39416608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:42,789][569624] Avg episode reward: [(0, '4287.114')]
[2023-03-08 22:21:46,083][569911] Updated weights for policy 0, policy_version 77120 (0.0006)
[2023-03-08 22:21:47,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12015.0, 300 sec: 12149.2). Total num frames: 39505920. Throughput: 0: 11961.7. Samples: 39491308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:47,799][569624] Avg episode reward: [(0, '4669.492')]
[2023-03-08 22:21:47,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000077160_39505920.pth...
[2023-03-08 22:21:47,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000076464_39149568.pth
[2023-03-08 22:21:49,346][569911] Updated weights for policy 0, policy_version 77200 (0.0005)
[2023-03-08 22:21:52,659][569911] Updated weights for policy 0, policy_version 77280 (0.0005)
[2023-03-08 22:21:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12149.2). Total num frames: 39567360. Throughput: 0: 11985.2. Samples: 39566024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:52,789][569624] Avg episode reward: [(0, '4719.760')]
[2023-03-08 22:21:56,118][569911] Updated weights for policy 0, policy_version 77360 (0.0005)
[2023-03-08 22:21:57,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12014.9, 300 sec: 12149.2). Total num frames: 39628800. Throughput: 0: 11932.5. Samples: 39600088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:21:57,789][569624] Avg episode reward: [(0, '4555.995')]
[2023-03-08 22:21:59,275][569911] Updated weights for policy 0, policy_version 77440 (0.0006)
[2023-03-08 22:22:02,687][569911] Updated weights for policy 0, policy_version 77520 (0.0005)
[2023-03-08 22:22:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12135.3). Total num frames: 39690240. Throughput: 0: 12087.6. Samples: 39677900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:22:02,789][569624] Avg episode reward: [(0, '4653.231')]
[2023-03-08 22:22:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000077520_39690240.pth...
[2023-03-08 22:22:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000076808_39325696.pth
[2023-03-08 22:22:05,812][569911] Updated weights for policy 0, policy_version 77600 (0.0005)
[2023-03-08 22:22:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12149.1). Total num frames: 39751680. Throughput: 0: 12186.2. Samples: 39751744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:22:07,789][569624] Avg episode reward: [(0, '4657.387')]
[2023-03-08 22:22:09,166][569911] Updated weights for policy 0, policy_version 77680 (0.0005)
[2023-03-08 22:22:12,587][569911] Updated weights for policy 0, policy_version 77760 (0.0005)
[2023-03-08 22:22:12,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12083.2, 300 sec: 12149.1). Total num frames: 39813120. Throughput: 0: 12243.1. Samples: 39788536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:22:12,790][569624] Avg episode reward: [(0, '4698.338')]
[2023-03-08 22:22:16,006][569911] Updated weights for policy 0, policy_version 77840 (0.0005)
[2023-03-08 22:22:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12135.3). Total num frames: 39870464. Throughput: 0: 12226.4. Samples: 39859564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
[2023-03-08 22:22:17,789][569624] Avg episode reward: [(0, '4613.285')]
[2023-03-08 22:22:17,837][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000077880_39874560.pth...
[2023-03-08 22:22:17,839][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000077160_39505920.pth [2023-03-08 22:22:19,707][569911] Updated weights for policy 0, policy_version 77920 (0.0005) [2023-03-08 22:22:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12149.2). Total num frames: 39931904. Throughput: 0: 12261.3. Samples: 39930984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:22:22,789][569624] Avg episode reward: [(0, '4406.296')] [2023-03-08 22:22:22,944][569911] Updated weights for policy 0, policy_version 78000 (0.0006) [2023-03-08 22:22:26,336][569911] Updated weights for policy 0, policy_version 78080 (0.0005) [2023-03-08 22:22:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 39989248. Throughput: 0: 12244.1. Samples: 39967592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:22:27,789][569624] Avg episode reward: [(0, '4482.367')] [2023-03-08 22:22:29,858][569911] Updated weights for policy 0, policy_version 78160 (0.0005) [2023-03-08 22:22:32,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12219.7, 300 sec: 12149.1). Total num frames: 40054784. Throughput: 0: 12180.4. Samples: 40039428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:22:32,790][569624] Avg episode reward: [(0, '4607.837')] [2023-03-08 22:22:32,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000078232_40054784.pth... [2023-03-08 22:22:32,797][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000077520_39690240.pth [2023-03-08 22:22:33,156][569911] Updated weights for policy 0, policy_version 78240 (0.0005) [2023-03-08 22:22:36,443][569911] Updated weights for policy 0, policy_version 78320 (0.0005) [2023-03-08 22:22:37,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12135.3). 
Total num frames: 40112128. Throughput: 0: 12144.0. Samples: 40112504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:22:37,789][569624] Avg episode reward: [(0, '4677.646')] [2023-03-08 22:22:39,821][569911] Updated weights for policy 0, policy_version 78400 (0.0006) [2023-03-08 22:22:42,789][569624] Fps is (10 sec: 11878.7, 60 sec: 12151.5, 300 sec: 12135.3). Total num frames: 40173568. Throughput: 0: 12204.0. Samples: 40149268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:22:42,789][569624] Avg episode reward: [(0, '4568.628')] [2023-03-08 22:22:43,087][569911] Updated weights for policy 0, policy_version 78480 (0.0005) [2023-03-08 22:22:46,362][569911] Updated weights for policy 0, policy_version 78560 (0.0005) [2023-03-08 22:22:47,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 40239104. Throughput: 0: 12150.8. Samples: 40224688. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:22:47,789][569624] Avg episode reward: [(0, '4510.167')] [2023-03-08 22:22:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000078592_40239104.pth... [2023-03-08 22:22:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000077880_39874560.pth [2023-03-08 22:22:49,596][569911] Updated weights for policy 0, policy_version 78640 (0.0005) [2023-03-08 22:22:52,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 40300544. Throughput: 0: 12170.3. Samples: 40299408. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:22:52,789][569624] Avg episode reward: [(0, '4639.233')] [2023-03-08 22:22:52,934][569911] Updated weights for policy 0, policy_version 78720 (0.0005) [2023-03-08 22:22:56,509][569911] Updated weights for policy 0, policy_version 78800 (0.0005) [2023-03-08 22:22:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 40357888. Throughput: 0: 12138.4. Samples: 40334764. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:22:57,789][569624] Avg episode reward: [(0, '4386.339')] [2023-03-08 22:23:00,057][569911] Updated weights for policy 0, policy_version 78880 (0.0005) [2023-03-08 22:23:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 40415232. Throughput: 0: 12068.7. Samples: 40402656. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:02,789][569624] Avg episode reward: [(0, '4444.800')] [2023-03-08 22:23:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000078936_40415232.pth... [2023-03-08 22:23:02,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000078232_40054784.pth [2023-03-08 22:23:03,783][569911] Updated weights for policy 0, policy_version 78960 (0.0005) [2023-03-08 22:23:07,472][569911] Updated weights for policy 0, policy_version 79040 (0.0005) [2023-03-08 22:23:07,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 40468480. Throughput: 0: 11952.1. Samples: 40468828. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:07,789][569624] Avg episode reward: [(0, '4252.498')] [2023-03-08 22:23:11,074][569911] Updated weights for policy 0, policy_version 79120 (0.0005) [2023-03-08 22:23:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 40525824. Throughput: 0: 11919.1. 
Samples: 40503952. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:12,789][569624] Avg episode reward: [(0, '4163.653')] [2023-03-08 22:23:14,606][569911] Updated weights for policy 0, policy_version 79200 (0.0005) [2023-03-08 22:23:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 40587264. Throughput: 0: 11899.1. Samples: 40574888. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:17,789][569624] Avg episode reward: [(0, '4518.349')] [2023-03-08 22:23:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000079272_40587264.pth... [2023-03-08 22:23:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000078592_40239104.pth [2023-03-08 22:23:17,930][569911] Updated weights for policy 0, policy_version 79280 (0.0005) [2023-03-08 22:23:21,574][569911] Updated weights for policy 0, policy_version 79360 (0.0005) [2023-03-08 22:23:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 40644608. Throughput: 0: 11826.0. Samples: 40644672. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:22,800][569624] Avg episode reward: [(0, '4739.609')] [2023-03-08 22:23:24,793][569911] Updated weights for policy 0, policy_version 79440 (0.0004) [2023-03-08 22:23:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 40706048. Throughput: 0: 11826.7. Samples: 40681468. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:27,800][569624] Avg episode reward: [(0, '4333.410')] [2023-03-08 22:23:28,196][569911] Updated weights for policy 0, policy_version 79520 (0.0005) [2023-03-08 22:23:31,641][569911] Updated weights for policy 0, policy_version 79600 (0.0005) [2023-03-08 22:23:32,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 40767488. 
Throughput: 0: 11776.7. Samples: 40754640. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:32,800][569624] Avg episode reward: [(0, '4529.059')] [2023-03-08 22:23:32,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000079624_40767488.pth... [2023-03-08 22:23:32,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000078936_40415232.pth [2023-03-08 22:23:35,012][569911] Updated weights for policy 0, policy_version 79680 (0.0005) [2023-03-08 22:23:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 40828928. Throughput: 0: 11692.8. Samples: 40825584. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:37,800][569624] Avg episode reward: [(0, '4381.430')] [2023-03-08 22:23:38,357][569911] Updated weights for policy 0, policy_version 79760 (0.0005) [2023-03-08 22:23:41,730][569911] Updated weights for policy 0, policy_version 79840 (0.0004) [2023-03-08 22:23:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 40890368. Throughput: 0: 11775.4. Samples: 40864656. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:42,800][569624] Avg episode reward: [(0, '4471.395')] [2023-03-08 22:23:45,079][569911] Updated weights for policy 0, policy_version 79920 (0.0005) [2023-03-08 22:23:47,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 40951808. Throughput: 0: 11847.7. Samples: 40935800. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:47,789][569624] Avg episode reward: [(0, '4731.383')] [2023-03-08 22:23:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000079984_40951808.pth... 
[2023-03-08 22:23:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000079272_40587264.pth [2023-03-08 22:23:48,406][569911] Updated weights for policy 0, policy_version 80000 (0.0005) [2023-03-08 22:23:51,672][569911] Updated weights for policy 0, policy_version 80080 (0.0005) [2023-03-08 22:23:52,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 41013248. Throughput: 0: 12074.7. Samples: 41012188. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:52,789][569624] Avg episode reward: [(0, '4482.130')] [2023-03-08 22:23:54,949][569911] Updated weights for policy 0, policy_version 80160 (0.0004) [2023-03-08 22:23:57,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12093.6). Total num frames: 41074688. Throughput: 0: 12125.0. Samples: 41049576. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:23:57,789][569624] Avg episode reward: [(0, '4452.366')] [2023-03-08 22:23:58,424][569911] Updated weights for policy 0, policy_version 80240 (0.0006) [2023-03-08 22:24:01,791][569911] Updated weights for policy 0, policy_version 80320 (0.0005) [2023-03-08 22:24:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 41132032. Throughput: 0: 12117.9. Samples: 41120192. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:24:02,789][569624] Avg episode reward: [(0, '4037.097')] [2023-03-08 22:24:02,797][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000080344_41136128.pth... [2023-03-08 22:24:02,799][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000079624_40767488.pth [2023-03-08 22:24:05,162][569911] Updated weights for policy 0, policy_version 80400 (0.0004) [2023-03-08 22:24:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12093.6). 
Total num frames: 41193472. Throughput: 0: 12176.2. Samples: 41192600. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:24:07,789][569624] Avg episode reward: [(0, '4489.626')] [2023-03-08 22:24:08,711][569911] Updated weights for policy 0, policy_version 80480 (0.0006) [2023-03-08 22:24:12,116][569911] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-08 22:24:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 41250816. Throughput: 0: 12134.2. Samples: 41227508. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:24:12,789][569624] Avg episode reward: [(0, '4655.229')] [2023-03-08 22:24:15,323][569911] Updated weights for policy 0, policy_version 80640 (0.0005) [2023-03-08 22:24:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 41316352. Throughput: 0: 12194.6. Samples: 41303396. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:24:17,789][569624] Avg episode reward: [(0, '4622.139')] [2023-03-08 22:24:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000080696_41316352.pth... [2023-03-08 22:24:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000079984_40951808.pth [2023-03-08 22:24:18,670][569911] Updated weights for policy 0, policy_version 80720 (0.0006) [2023-03-08 22:24:21,947][569911] Updated weights for policy 0, policy_version 80800 (0.0005) [2023-03-08 22:24:22,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 41377792. Throughput: 0: 12256.2. Samples: 41377112. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:24:22,789][569624] Avg episode reward: [(0, '4584.572')] [2023-03-08 22:24:25,337][569911] Updated weights for policy 0, policy_version 80880 (0.0005) [2023-03-08 22:24:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 41439232. Throughput: 0: 12193.4. Samples: 41413360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:24:27,789][569624] Avg episode reward: [(0, '4577.960')] [2023-03-08 22:24:28,787][569911] Updated weights for policy 0, policy_version 80960 (0.0005) [2023-03-08 22:24:32,086][569911] Updated weights for policy 0, policy_version 81040 (0.0005) [2023-03-08 22:24:32,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 41500672. Throughput: 0: 12231.5. Samples: 41486220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:24:32,789][569624] Avg episode reward: [(0, '4780.189')] [2023-03-08 22:24:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000081056_41500672.pth... [2023-03-08 22:24:32,797][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000080344_41136128.pth [2023-03-08 22:24:35,505][569911] Updated weights for policy 0, policy_version 81120 (0.0005) [2023-03-08 22:24:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 41558016. Throughput: 0: 12125.8. Samples: 41557848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:24:37,789][569624] Avg episode reward: [(0, '4674.310')] [2023-03-08 22:24:38,779][569911] Updated weights for policy 0, policy_version 81200 (0.0004) [2023-03-08 22:24:42,111][569911] Updated weights for policy 0, policy_version 81280 (0.0005) [2023-03-08 22:24:42,789][569624] Fps is (10 sec: 11878.6, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 41619456. Throughput: 0: 12143.8. 
Samples: 41596048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:24:42,789][569624] Avg episode reward: [(0, '4443.512')] [2023-03-08 22:24:45,443][569911] Updated weights for policy 0, policy_version 81360 (0.0005) [2023-03-08 22:24:47,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 41684992. Throughput: 0: 12203.6. Samples: 41669356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:24:47,789][569624] Avg episode reward: [(0, '4745.288')] [2023-03-08 22:24:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000081416_41684992.pth... [2023-03-08 22:24:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000080696_41316352.pth [2023-03-08 22:24:48,736][569911] Updated weights for policy 0, policy_version 81440 (0.0005) [2023-03-08 22:24:52,188][569911] Updated weights for policy 0, policy_version 81520 (0.0005) [2023-03-08 22:24:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 41742336. Throughput: 0: 12217.8. Samples: 41742400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:24:52,789][569624] Avg episode reward: [(0, '4681.897')] [2023-03-08 22:24:55,442][569911] Updated weights for policy 0, policy_version 81600 (0.0005) [2023-03-08 22:24:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 41803776. Throughput: 0: 12273.9. Samples: 41779832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:24:57,789][569624] Avg episode reward: [(0, '4699.206')] [2023-03-08 22:24:58,929][569911] Updated weights for policy 0, policy_version 81680 (0.0005) [2023-03-08 22:25:02,300][569911] Updated weights for policy 0, policy_version 81760 (0.0005) [2023-03-08 22:25:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 41865216. 
Throughput: 0: 12205.7. Samples: 41852652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:25:02,789][569624] Avg episode reward: [(0, '4508.138')] [2023-03-08 22:25:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000081768_41865216.pth... [2023-03-08 22:25:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000081056_41500672.pth [2023-03-08 22:25:05,868][569911] Updated weights for policy 0, policy_version 81840 (0.0005) [2023-03-08 22:25:07,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 41922560. Throughput: 0: 12122.6. Samples: 41922632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:25:07,789][569624] Avg episode reward: [(0, '4642.127')] [2023-03-08 22:25:09,150][569911] Updated weights for policy 0, policy_version 81920 (0.0005) [2023-03-08 22:25:12,517][569911] Updated weights for policy 0, policy_version 82000 (0.0005) [2023-03-08 22:25:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 41984000. Throughput: 0: 12142.1. Samples: 41959756. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:12,789][569624] Avg episode reward: [(0, '4697.960')] [2023-03-08 22:25:15,748][569911] Updated weights for policy 0, policy_version 82080 (0.0004) [2023-03-08 22:25:17,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 42049536. Throughput: 0: 12192.2. Samples: 42034868. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:17,789][569624] Avg episode reward: [(0, '4643.347')] [2023-03-08 22:25:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000082128_42049536.pth... 
[2023-03-08 22:25:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000081416_41684992.pth [2023-03-08 22:25:19,191][569911] Updated weights for policy 0, policy_version 82160 (0.0005) [2023-03-08 22:25:22,698][569911] Updated weights for policy 0, policy_version 82240 (0.0004) [2023-03-08 22:25:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 42106880. Throughput: 0: 12142.9. Samples: 42104280. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:22,789][569624] Avg episode reward: [(0, '4595.474')] [2023-03-08 22:25:26,184][569911] Updated weights for policy 0, policy_version 82320 (0.0005) [2023-03-08 22:25:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 42164224. Throughput: 0: 12086.4. Samples: 42139936. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:27,789][569624] Avg episode reward: [(0, '4489.504')] [2023-03-08 22:25:29,764][569911] Updated weights for policy 0, policy_version 82400 (0.0005) [2023-03-08 22:25:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 42221568. Throughput: 0: 12010.9. Samples: 42209844. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:32,789][569624] Avg episode reward: [(0, '4561.566')] [2023-03-08 22:25:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000082464_42221568.pth... [2023-03-08 22:25:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000081768_41865216.pth [2023-03-08 22:25:33,200][569911] Updated weights for policy 0, policy_version 82480 (0.0004) [2023-03-08 22:25:36,595][569911] Updated weights for policy 0, policy_version 82560 (0.0005) [2023-03-08 22:25:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12052.0). 
Total num frames: 42283008. Throughput: 0: 12014.9. Samples: 42283072. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:37,789][569624] Avg episode reward: [(0, '4687.576')] [2023-03-08 22:25:39,901][569911] Updated weights for policy 0, policy_version 82640 (0.0005) [2023-03-08 22:25:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 42340352. Throughput: 0: 11998.1. Samples: 42319744. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:42,789][569624] Avg episode reward: [(0, '4424.672')] [2023-03-08 22:25:43,551][569911] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-08 22:25:46,887][569911] Updated weights for policy 0, policy_version 82800 (0.0005) [2023-03-08 22:25:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 42401792. Throughput: 0: 11931.4. Samples: 42389568. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:47,789][569624] Avg episode reward: [(0, '4683.734')] [2023-03-08 22:25:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000082816_42401792.pth... [2023-03-08 22:25:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000082128_42049536.pth [2023-03-08 22:25:50,129][569911] Updated weights for policy 0, policy_version 82880 (0.0005) [2023-03-08 22:25:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 42463232. Throughput: 0: 12024.7. Samples: 42463744. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:52,789][569624] Avg episode reward: [(0, '4706.489')] [2023-03-08 22:25:53,471][569911] Updated weights for policy 0, policy_version 82960 (0.0005) [2023-03-08 22:25:56,874][569911] Updated weights for policy 0, policy_version 83040 (0.0005) [2023-03-08 22:25:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 42524672. Throughput: 0: 12015.1. Samples: 42500436. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:25:57,793][569624] Avg episode reward: [(0, '4675.489')] [2023-03-08 22:26:00,372][569911] Updated weights for policy 0, policy_version 83120 (0.0005) [2023-03-08 22:26:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 42582016. Throughput: 0: 11885.4. Samples: 42569712. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:02,800][569624] Avg episode reward: [(0, '4683.640')] [2023-03-08 22:26:02,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000083168_42582016.pth... [2023-03-08 22:26:02,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000082464_42221568.pth [2023-03-08 22:26:03,843][569911] Updated weights for policy 0, policy_version 83200 (0.0006) [2023-03-08 22:26:07,290][569911] Updated weights for policy 0, policy_version 83280 (0.0005) [2023-03-08 22:26:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 42643456. Throughput: 0: 11966.7. Samples: 42642780. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:07,789][569624] Avg episode reward: [(0, '4688.844')] [2023-03-08 22:26:10,382][569911] Updated weights for policy 0, policy_version 83360 (0.0005) [2023-03-08 22:26:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 42704896. Throughput: 0: 12056.5. 
Samples: 42682476. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:12,800][569624] Avg episode reward: [(0, '4523.036')] [2023-03-08 22:26:14,043][569911] Updated weights for policy 0, policy_version 83440 (0.0005) [2023-03-08 22:26:17,716][569911] Updated weights for policy 0, policy_version 83520 (0.0005) [2023-03-08 22:26:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 42762240. Throughput: 0: 12007.8. Samples: 42750196. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:17,800][569624] Avg episode reward: [(0, '4191.101')] [2023-03-08 22:26:17,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000083520_42762240.pth... [2023-03-08 22:26:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000082816_42401792.pth [2023-03-08 22:26:21,383][569911] Updated weights for policy 0, policy_version 83600 (0.0005) [2023-03-08 22:26:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 42819584. Throughput: 0: 11846.2. Samples: 42816152. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:22,800][569624] Avg episode reward: [(0, '4082.621')] [2023-03-08 22:26:24,944][569911] Updated weights for policy 0, policy_version 83680 (0.0005) [2023-03-08 22:26:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 42876928. Throughput: 0: 11833.4. Samples: 42852248. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:27,800][569624] Avg episode reward: [(0, '4383.556')] [2023-03-08 22:26:28,299][569911] Updated weights for policy 0, policy_version 83760 (0.0005) [2023-03-08 22:26:31,755][569911] Updated weights for policy 0, policy_version 83840 (0.0005) [2023-03-08 22:26:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 42938368. 
Throughput: 0: 11864.2. Samples: 42923456. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:32,800][569624] Avg episode reward: [(0, '4526.843')] [2023-03-08 22:26:32,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000083864_42938368.pth... [2023-03-08 22:26:32,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000083168_42582016.pth [2023-03-08 22:26:34,806][569911] Updated weights for policy 0, policy_version 83920 (0.0005) [2023-03-08 22:26:37,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 43003904. Throughput: 0: 12005.9. Samples: 43004008. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:37,796][569911] Updated weights for policy 0, policy_version 84000 (0.0005) [2023-03-08 22:26:37,800][569624] Avg episode reward: [(0, '4212.473')] [2023-03-08 22:26:41,168][569911] Updated weights for policy 0, policy_version 84080 (0.0005) [2023-03-08 22:26:42,789][569624] Fps is (10 sec: 13107.2, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 43069440. Throughput: 0: 12005.6. Samples: 43040688. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:42,800][569624] Avg episode reward: [(0, '4447.054')] [2023-03-08 22:26:44,414][569911] Updated weights for policy 0, policy_version 84160 (0.0005) [2023-03-08 22:26:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 43126784. Throughput: 0: 12125.6. Samples: 43115364. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:47,789][569624] Avg episode reward: [(0, '4362.832')] [2023-03-08 22:26:47,820][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000084240_43130880.pth... 
[2023-03-08 22:26:47,821][569911] Updated weights for policy 0, policy_version 84240 (0.0005) [2023-03-08 22:26:47,822][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000083520_42762240.pth [2023-03-08 22:26:51,221][569911] Updated weights for policy 0, policy_version 84320 (0.0004) [2023-03-08 22:26:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 43188224. Throughput: 0: 12117.8. Samples: 43188080. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:26:52,789][569624] Avg episode reward: [(0, '4018.285')] [2023-03-08 22:26:54,618][569911] Updated weights for policy 0, policy_version 84400 (0.0004) [2023-03-08 22:26:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 43249664. Throughput: 0: 12055.6. Samples: 43224976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:26:57,789][569624] Avg episode reward: [(0, '4284.017')] [2023-03-08 22:26:58,024][569911] Updated weights for policy 0, policy_version 84480 (0.0005) [2023-03-08 22:27:01,636][569911] Updated weights for policy 0, policy_version 84560 (0.0006) [2023-03-08 22:27:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 43307008. Throughput: 0: 12087.9. Samples: 43294152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:27:02,789][569624] Avg episode reward: [(0, '4614.861')] [2023-03-08 22:27:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000084584_43307008.pth... [2023-03-08 22:27:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000083864_42938368.pth [2023-03-08 22:27:04,922][569911] Updated weights for policy 0, policy_version 84640 (0.0005) [2023-03-08 22:27:07,789][569624] Fps is (10 sec: 11468.9, 60 sec: 12015.0, 300 sec: 12038.1). 
Total num frames: 43364352. Throughput: 0: 12181.9. Samples: 43364336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:27:07,789][569624] Avg episode reward: [(0, '4221.795')] [2023-03-08 22:27:08,606][569911] Updated weights for policy 0, policy_version 84720 (0.0005) [2023-03-08 22:27:12,041][569911] Updated weights for policy 0, policy_version 84800 (0.0005) [2023-03-08 22:27:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 43425792. Throughput: 0: 12171.6. Samples: 43399972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:27:12,789][569624] Avg episode reward: [(0, '4115.495')] [2023-03-08 22:27:15,600][569911] Updated weights for policy 0, policy_version 84880 (0.0005) [2023-03-08 22:27:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 43479040. Throughput: 0: 12088.2. Samples: 43467424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:27:17,789][569624] Avg episode reward: [(0, '4147.519')] [2023-03-08 22:27:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000084920_43479040.pth... [2023-03-08 22:27:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000084240_43130880.pth [2023-03-08 22:27:19,401][569911] Updated weights for policy 0, policy_version 84960 (0.0005) [2023-03-08 22:27:22,723][569911] Updated weights for policy 0, policy_version 85040 (0.0005) [2023-03-08 22:27:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 43540480. Throughput: 0: 11849.1. Samples: 43537220. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:27:22,789][569624] Avg episode reward: [(0, '4608.622')] [2023-03-08 22:27:26,379][569911] Updated weights for policy 0, policy_version 85120 (0.0005) [2023-03-08 22:27:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 43597824. Throughput: 0: 11832.6. Samples: 43573156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:27:27,789][569624] Avg episode reward: [(0, '4725.269')] [2023-03-08 22:27:29,638][569911] Updated weights for policy 0, policy_version 85200 (0.0006) [2023-03-08 22:27:32,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 43659264. Throughput: 0: 11806.1. Samples: 43646640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:27:32,789][569624] Avg episode reward: [(0, '4533.241')] [2023-03-08 22:27:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000085272_43659264.pth... [2023-03-08 22:27:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000084584_43307008.pth [2023-03-08 22:27:32,941][569911] Updated weights for policy 0, policy_version 85280 (0.0005) [2023-03-08 22:27:36,267][569911] Updated weights for policy 0, policy_version 85360 (0.0005) [2023-03-08 22:27:37,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 43720704. Throughput: 0: 11825.9. Samples: 43720244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:27:37,789][569624] Avg episode reward: [(0, '4707.899')] [2023-03-08 22:27:39,682][569911] Updated weights for policy 0, policy_version 85440 (0.0005) [2023-03-08 22:27:42,307][569867] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000010 [2023-03-08 22:27:42,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 43782144. 
Throughput: 0: 11791.9. Samples: 43755612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:27:42,789][569624] Avg episode reward: [(0, '4716.282')] [2023-03-08 22:27:42,977][569911] Updated weights for policy 0, policy_version 85520 (0.0005) [2023-03-08 22:27:46,128][569911] Updated weights for policy 0, policy_version 85600 (0.0005) [2023-03-08 22:27:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 43843584. Throughput: 0: 11958.7. Samples: 43832292. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:27:47,789][569624] Avg episode reward: [(0, '4251.449')] [2023-03-08 22:27:47,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000085640_43847680.pth... [2023-03-08 22:27:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000084920_43479040.pth [2023-03-08 22:27:49,531][569911] Updated weights for policy 0, policy_version 85680 (0.0005) [2023-03-08 22:27:52,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 43905024. Throughput: 0: 12016.7. Samples: 43905088. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:27:52,789][569624] Avg episode reward: [(0, '4691.838')] [2023-03-08 22:27:52,882][569911] Updated weights for policy 0, policy_version 85760 (0.0005) [2023-03-08 22:27:56,264][569911] Updated weights for policy 0, policy_version 85840 (0.0005) [2023-03-08 22:27:57,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 43966464. Throughput: 0: 12040.3. Samples: 43941784. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:27:57,789][569624] Avg episode reward: [(0, '4452.777')] [2023-03-08 22:27:59,661][569911] Updated weights for policy 0, policy_version 85920 (0.0005) [2023-03-08 22:28:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12015.0, 300 sec: 12065.8). 
Total num frames: 44027904. Throughput: 0: 12130.1. Samples: 44013280. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:28:02,789][569624] Avg episode reward: [(0, '4582.573')] [2023-03-08 22:28:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000085992_44027904.pth... [2023-03-08 22:28:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000085272_43659264.pth [2023-03-08 22:28:02,903][569911] Updated weights for policy 0, policy_version 86000 (0.0006) [2023-03-08 22:28:06,215][569911] Updated weights for policy 0, policy_version 86080 (0.0005) [2023-03-08 22:28:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 44089344. Throughput: 0: 12267.9. Samples: 44089276. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:28:07,789][569624] Avg episode reward: [(0, '4394.737')] [2023-03-08 22:28:09,699][569911] Updated weights for policy 0, policy_version 86160 (0.0005) [2023-03-08 22:28:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 44146688. Throughput: 0: 12253.3. Samples: 44124552. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:28:12,789][569624] Avg episode reward: [(0, '4487.149')] [2023-03-08 22:28:13,196][569911] Updated weights for policy 0, policy_version 86240 (0.0005) [2023-03-08 22:28:16,569][569911] Updated weights for policy 0, policy_version 86320 (0.0005) [2023-03-08 22:28:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 44208128. Throughput: 0: 12203.3. Samples: 44195788. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:28:17,789][569624] Avg episode reward: [(0, '4279.213')] [2023-03-08 22:28:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000086344_44208128.pth... 
[2023-03-08 22:28:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000085640_43847680.pth [2023-03-08 22:28:19,995][569911] Updated weights for policy 0, policy_version 86400 (0.0005) [2023-03-08 22:28:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 44265472. Throughput: 0: 12122.2. Samples: 44265744. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:28:22,789][569624] Avg episode reward: [(0, '4182.684')] [2023-03-08 22:28:23,506][569911] Updated weights for policy 0, policy_version 86480 (0.0004) [2023-03-08 22:28:26,829][569911] Updated weights for policy 0, policy_version 86560 (0.0005) [2023-03-08 22:28:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 44326912. Throughput: 0: 12150.8. Samples: 44302400. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:28:27,789][569624] Avg episode reward: [(0, '4314.966')] [2023-03-08 22:28:30,283][569911] Updated weights for policy 0, policy_version 86640 (0.0004) [2023-03-08 22:28:32,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 44388352. Throughput: 0: 12064.0. Samples: 44375172. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 22:28:32,789][569624] Avg episode reward: [(0, '4600.713')] [2023-03-08 22:28:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000086696_44388352.pth... [2023-03-08 22:28:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000085992_44027904.pth [2023-03-08 22:28:33,741][569911] Updated weights for policy 0, policy_version 86720 (0.0005) [2023-03-08 22:28:37,060][569911] Updated weights for policy 0, policy_version 86800 (0.0005) [2023-03-08 22:28:37,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12065.8). 
Total num frames: 44449792. Throughput: 0: 12072.7. Samples: 44448360. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:28:37,789][569624] Avg episode reward: [(0, '4475.685')] [2023-03-08 22:28:40,239][569911] Updated weights for policy 0, policy_version 86880 (0.0005) [2023-03-08 22:28:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 44511232. Throughput: 0: 12104.5. Samples: 44486484. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:28:42,789][569624] Avg episode reward: [(0, '4599.714')] [2023-03-08 22:28:43,819][569911] Updated weights for policy 0, policy_version 86960 (0.0005) [2023-03-08 22:28:46,910][569911] Updated weights for policy 0, policy_version 87040 (0.0005) [2023-03-08 22:28:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 44572672. Throughput: 0: 12138.7. Samples: 44559524. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:28:47,789][569624] Avg episode reward: [(0, '4674.664')] [2023-03-08 22:28:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000087056_44572672.pth... [2023-03-08 22:28:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000086344_44208128.pth [2023-03-08 22:28:50,253][569911] Updated weights for policy 0, policy_version 87120 (0.0005) [2023-03-08 22:28:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 44634112. Throughput: 0: 12107.3. Samples: 44634104. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:28:52,789][569624] Avg episode reward: [(0, '4586.131')] [2023-03-08 22:28:53,565][569911] Updated weights for policy 0, policy_version 87200 (0.0004) [2023-03-08 22:28:57,016][569911] Updated weights for policy 0, policy_version 87280 (0.0005) [2023-03-08 22:28:57,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 44695552. Throughput: 0: 12133.4. Samples: 44670556. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:28:57,789][569624] Avg episode reward: [(0, '4656.645')] [2023-03-08 22:29:00,281][569911] Updated weights for policy 0, policy_version 87360 (0.0004) [2023-03-08 22:29:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 12079.7). Total num frames: 44756992. Throughput: 0: 12178.8. Samples: 44743836. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:29:02,790][569624] Avg episode reward: [(0, '4688.550')] [2023-03-08 22:29:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000087416_44756992.pth... [2023-03-08 22:29:02,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000086696_44388352.pth [2023-03-08 22:29:03,556][569911] Updated weights for policy 0, policy_version 87440 (0.0004) [2023-03-08 22:29:06,736][569911] Updated weights for policy 0, policy_version 87520 (0.0004) [2023-03-08 22:29:07,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 44822528. Throughput: 0: 12329.6. Samples: 44820576. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:29:07,789][569624] Avg episode reward: [(0, '4257.498')] [2023-03-08 22:29:10,079][569911] Updated weights for policy 0, policy_version 87600 (0.0005) [2023-03-08 22:29:12,789][569624] Fps is (10 sec: 12288.3, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 44879872. Throughput: 0: 12298.8. 
Samples: 44855844. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:29:12,789][569624] Avg episode reward: [(0, '4731.554')] [2023-03-08 22:29:13,536][569911] Updated weights for policy 0, policy_version 87680 (0.0004) [2023-03-08 22:29:16,828][569911] Updated weights for policy 0, policy_version 87760 (0.0005) [2023-03-08 22:29:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 44941312. Throughput: 0: 12308.0. Samples: 44929032. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:29:17,789][569624] Avg episode reward: [(0, '4473.403')] [2023-03-08 22:29:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000087776_44941312.pth... [2023-03-08 22:29:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000087056_44572672.pth [2023-03-08 22:29:20,131][569911] Updated weights for policy 0, policy_version 87840 (0.0005) [2023-03-08 22:29:22,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 45002752. Throughput: 0: 12313.6. Samples: 45002472. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:29:22,789][569624] Avg episode reward: [(0, '4315.881')] [2023-03-08 22:29:23,692][569911] Updated weights for policy 0, policy_version 87920 (0.0005) [2023-03-08 22:29:27,015][569911] Updated weights for policy 0, policy_version 88000 (0.0005) [2023-03-08 22:29:27,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 45064192. Throughput: 0: 12243.6. Samples: 45037448. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:29:27,789][569624] Avg episode reward: [(0, '4470.253')] [2023-03-08 22:29:30,369][569911] Updated weights for policy 0, policy_version 88080 (0.0005) [2023-03-08 22:29:32,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12093.6). Total num frames: 45125632. 
Throughput: 0: 12279.7. Samples: 45112112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:29:32,789][569624] Avg episode reward: [(0, '4606.874')] [2023-03-08 22:29:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000088136_45125632.pth... [2023-03-08 22:29:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000087416_44756992.pth [2023-03-08 22:29:33,746][569911] Updated weights for policy 0, policy_version 88160 (0.0004) [2023-03-08 22:29:37,170][569911] Updated weights for policy 0, policy_version 88240 (0.0005) [2023-03-08 22:29:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 45182976. Throughput: 0: 12198.6. Samples: 45183040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:29:37,790][569624] Avg episode reward: [(0, '4595.295')] [2023-03-08 22:29:40,523][569911] Updated weights for policy 0, policy_version 88320 (0.0004) [2023-03-08 22:29:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 45248512. Throughput: 0: 12207.7. Samples: 45219904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:29:42,789][569624] Avg episode reward: [(0, '4550.870')] [2023-03-08 22:29:43,637][569911] Updated weights for policy 0, policy_version 88400 (0.0005) [2023-03-08 22:29:47,256][569911] Updated weights for policy 0, policy_version 88480 (0.0005) [2023-03-08 22:29:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.8, 300 sec: 12079.7). Total num frames: 45305856. Throughput: 0: 12216.5. Samples: 45293576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:29:47,800][569624] Avg episode reward: [(0, '4336.241')] [2023-03-08 22:29:47,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000088488_45305856.pth... 
[2023-03-08 22:29:47,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000087776_44941312.pth [2023-03-08 22:29:50,946][569911] Updated weights for policy 0, policy_version 88560 (0.0006) [2023-03-08 22:29:52,789][569624] Fps is (10 sec: 11059.2, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 45359104. Throughput: 0: 11971.8. Samples: 45359308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:29:52,800][569624] Avg episode reward: [(0, '4574.555')] [2023-03-08 22:29:54,620][569911] Updated weights for policy 0, policy_version 88640 (0.0005) [2023-03-08 22:29:57,789][569624] Fps is (10 sec: 11059.3, 60 sec: 12015.0, 300 sec: 12038.1). Total num frames: 45416448. Throughput: 0: 11995.1. Samples: 45395624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:29:57,799][569624] Avg episode reward: [(0, '4594.384')] [2023-03-08 22:29:58,213][569911] Updated weights for policy 0, policy_version 88720 (0.0005) [2023-03-08 22:30:01,618][569911] Updated weights for policy 0, policy_version 88800 (0.0005) [2023-03-08 22:30:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 12052.0). Total num frames: 45477888. Throughput: 0: 11902.2. Samples: 45464632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:02,789][569624] Avg episode reward: [(0, '4595.772')] [2023-03-08 22:30:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000088824_45477888.pth... [2023-03-08 22:30:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000088136_45125632.pth [2023-03-08 22:30:05,124][569911] Updated weights for policy 0, policy_version 88880 (0.0004) [2023-03-08 22:30:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 45535232. Throughput: 0: 11830.1. Samples: 45534824. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:07,789][569624] Avg episode reward: [(0, '4471.127')] [2023-03-08 22:30:08,711][569911] Updated weights for policy 0, policy_version 88960 (0.0005) [2023-03-08 22:30:11,946][569911] Updated weights for policy 0, policy_version 89040 (0.0004) [2023-03-08 22:30:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 45596672. Throughput: 0: 11836.9. Samples: 45570108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:12,789][569624] Avg episode reward: [(0, '4547.645')] [2023-03-08 22:30:15,153][569911] Updated weights for policy 0, policy_version 89120 (0.0004) [2023-03-08 22:30:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 45658112. Throughput: 0: 11858.3. Samples: 45645736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:17,789][569624] Avg episode reward: [(0, '4513.212')] [2023-03-08 22:30:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000089176_45658112.pth... [2023-03-08 22:30:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000088488_45305856.pth [2023-03-08 22:30:18,763][569911] Updated weights for policy 0, policy_version 89200 (0.0005) [2023-03-08 22:30:22,270][569911] Updated weights for policy 0, policy_version 89280 (0.0005) [2023-03-08 22:30:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 45715456. Throughput: 0: 11826.0. Samples: 45715212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:22,789][569624] Avg episode reward: [(0, '4583.510')] [2023-03-08 22:30:25,437][569911] Updated weights for policy 0, policy_version 89360 (0.0005) [2023-03-08 22:30:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 45776896. Throughput: 0: 11855.2. 
Samples: 45753388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:27,789][569624] Avg episode reward: [(0, '4536.334')] [2023-03-08 22:30:28,835][569911] Updated weights for policy 0, policy_version 89440 (0.0005) [2023-03-08 22:30:32,120][569911] Updated weights for policy 0, policy_version 89520 (0.0004) [2023-03-08 22:30:32,789][569624] Fps is (10 sec: 12697.6, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 45842432. Throughput: 0: 11840.1. Samples: 45826380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:32,789][569624] Avg episode reward: [(0, '4803.167')] [2023-03-08 22:30:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000089536_45842432.pth... [2023-03-08 22:30:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000088824_45477888.pth [2023-03-08 22:30:35,519][569911] Updated weights for policy 0, policy_version 89600 (0.0005) [2023-03-08 22:30:37,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 45903872. Throughput: 0: 12049.0. Samples: 45901512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:37,789][569624] Avg episode reward: [(0, '4730.609')] [2023-03-08 22:30:38,800][569911] Updated weights for policy 0, policy_version 89680 (0.0005) [2023-03-08 22:30:42,219][569911] Updated weights for policy 0, policy_version 89760 (0.0004) [2023-03-08 22:30:42,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11878.4, 300 sec: 12065.9). Total num frames: 45961216. Throughput: 0: 12043.4. Samples: 45937576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:42,799][569624] Avg episode reward: [(0, '4632.966')] [2023-03-08 22:30:45,503][569911] Updated weights for policy 0, policy_version 89840 (0.0005) [2023-03-08 22:30:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 46022656. 
Throughput: 0: 12129.0. Samples: 46010440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:47,789][569624] Avg episode reward: [(0, '4527.550')] [2023-03-08 22:30:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000089888_46022656.pth... [2023-03-08 22:30:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000089176_45658112.pth [2023-03-08 22:30:48,942][569911] Updated weights for policy 0, policy_version 89920 (0.0005) [2023-03-08 22:30:52,390][569911] Updated weights for policy 0, policy_version 90000 (0.0005) [2023-03-08 22:30:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 46084096. Throughput: 0: 12162.7. Samples: 46082144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:52,800][569624] Avg episode reward: [(0, '4459.778')] [2023-03-08 22:30:55,529][569911] Updated weights for policy 0, policy_version 90080 (0.0005) [2023-03-08 22:30:57,789][569624] Fps is (10 sec: 12288.2, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 46145536. Throughput: 0: 12242.6. Samples: 46121024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:30:57,800][569624] Avg episode reward: [(0, '4617.353')] [2023-03-08 22:30:58,933][569911] Updated weights for policy 0, policy_version 90160 (0.0004) [2023-03-08 22:31:02,424][569911] Updated weights for policy 0, policy_version 90240 (0.0005) [2023-03-08 22:31:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 46206976. Throughput: 0: 12176.0. Samples: 46193656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:02,800][569624] Avg episode reward: [(0, '4588.259')] [2023-03-08 22:31:02,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000090248_46206976.pth... 
[2023-03-08 22:31:02,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000089536_45842432.pth [2023-03-08 22:31:05,872][569911] Updated weights for policy 0, policy_version 90320 (0.0004) [2023-03-08 22:31:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 46264320. Throughput: 0: 12192.6. Samples: 46263880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:07,800][569624] Avg episode reward: [(0, '4468.093')] [2023-03-08 22:31:09,173][569911] Updated weights for policy 0, policy_version 90400 (0.0004) [2023-03-08 22:31:12,572][569911] Updated weights for policy 0, policy_version 90480 (0.0005) [2023-03-08 22:31:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 46325760. Throughput: 0: 12171.8. Samples: 46301120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:12,800][569624] Avg episode reward: [(0, '4389.892')] [2023-03-08 22:31:16,018][569911] Updated weights for policy 0, policy_version 90560 (0.0005) [2023-03-08 22:31:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 46387200. Throughput: 0: 12157.3. Samples: 46373460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:17,800][569624] Avg episode reward: [(0, '4198.538')] [2023-03-08 22:31:17,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000090600_46387200.pth... [2023-03-08 22:31:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000089888_46022656.pth [2023-03-08 22:31:19,420][569911] Updated weights for policy 0, policy_version 90640 (0.0005) [2023-03-08 22:31:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 46444544. Throughput: 0: 12068.8. Samples: 46444608. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:22,800][569624] Avg episode reward: [(0, '4593.950')] [2023-03-08 22:31:22,859][569911] Updated weights for policy 0, policy_version 90720 (0.0005) [2023-03-08 22:31:26,276][569911] Updated weights for policy 0, policy_version 90800 (0.0005) [2023-03-08 22:31:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 46505984. Throughput: 0: 12082.0. Samples: 46481268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:27,789][569624] Avg episode reward: [(0, '4455.519')] [2023-03-08 22:31:29,780][569911] Updated weights for policy 0, policy_version 90880 (0.0005) [2023-03-08 22:31:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 46563328. Throughput: 0: 12014.8. Samples: 46551104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:32,789][569624] Avg episode reward: [(0, '4630.838')] [2023-03-08 22:31:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000090944_46563328.pth... [2023-03-08 22:31:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000090248_46206976.pth [2023-03-08 22:31:33,172][569911] Updated weights for policy 0, policy_version 90960 (0.0005) [2023-03-08 22:31:36,611][569911] Updated weights for policy 0, policy_version 91040 (0.0005) [2023-03-08 22:31:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 46624768. Throughput: 0: 12059.8. Samples: 46624832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:37,789][569624] Avg episode reward: [(0, '4490.708')] [2023-03-08 22:31:39,874][569911] Updated weights for policy 0, policy_version 91120 (0.0005) [2023-03-08 22:31:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 46686208. Throughput: 0: 12010.6. 
Samples: 46661500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:42,789][569624] Avg episode reward: [(0, '4669.273')] [2023-03-08 22:31:43,391][569911] Updated weights for policy 0, policy_version 91200 (0.0005) [2023-03-08 22:31:46,808][569911] Updated weights for policy 0, policy_version 91280 (0.0005) [2023-03-08 22:31:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12015.0, 300 sec: 12052.0). Total num frames: 46743552. Throughput: 0: 11948.3. Samples: 46731328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:47,800][569624] Avg episode reward: [(0, '4607.947')] [2023-03-08 22:31:47,844][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000091304_46747648.pth... [2023-03-08 22:31:47,845][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000090600_46387200.pth [2023-03-08 22:31:50,401][569911] Updated weights for policy 0, policy_version 91360 (0.0005) [2023-03-08 22:31:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 46804992. Throughput: 0: 11983.0. Samples: 46803116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:52,789][569624] Avg episode reward: [(0, '4069.457')] [2023-03-08 22:31:53,697][569911] Updated weights for policy 0, policy_version 91440 (0.0005) [2023-03-08 22:31:57,160][569911] Updated weights for policy 0, policy_version 91520 (0.0004) [2023-03-08 22:31:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 46862336. Throughput: 0: 11939.3. Samples: 46838388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:31:57,789][569624] Avg episode reward: [(0, '4303.938')] [2023-03-08 22:32:00,672][569911] Updated weights for policy 0, policy_version 91600 (0.0006) [2023-03-08 22:32:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 46923776. 
Throughput: 0: 11919.6. Samples: 46909840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:02,789][569624] Avg episode reward: [(0, '3971.490')] [2023-03-08 22:32:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000091648_46923776.pth... [2023-03-08 22:32:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000090944_46563328.pth [2023-03-08 22:32:04,050][569911] Updated weights for policy 0, policy_version 91680 (0.0005) [2023-03-08 22:32:07,651][569911] Updated weights for policy 0, policy_version 91760 (0.0005) [2023-03-08 22:32:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 46981120. Throughput: 0: 11913.8. Samples: 46980728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:07,789][569624] Avg episode reward: [(0, '3808.068')] [2023-03-08 22:32:11,143][569911] Updated weights for policy 0, policy_version 91840 (0.0005) [2023-03-08 22:32:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 47038464. Throughput: 0: 11880.0. Samples: 47015868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:12,789][569624] Avg episode reward: [(0, '3738.116')] [2023-03-08 22:32:14,852][569911] Updated weights for policy 0, policy_version 91920 (0.0005) [2023-03-08 22:32:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 12052.0). Total num frames: 47095808. Throughput: 0: 11830.6. Samples: 47083480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:17,789][569624] Avg episode reward: [(0, '4502.728')] [2023-03-08 22:32:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000091984_47095808.pth... 
[2023-03-08 22:32:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000091304_46747648.pth [2023-03-08 22:32:18,164][569911] Updated weights for policy 0, policy_version 92000 (0.0005) [2023-03-08 22:32:21,684][569911] Updated weights for policy 0, policy_version 92080 (0.0004) [2023-03-08 22:32:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 47157248. Throughput: 0: 11786.5. Samples: 47155224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:22,789][569624] Avg episode reward: [(0, '4131.726')] [2023-03-08 22:32:25,012][569911] Updated weights for policy 0, policy_version 92160 (0.0005) [2023-03-08 22:32:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 12052.0). Total num frames: 47214592. Throughput: 0: 11770.7. Samples: 47191180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:27,789][569624] Avg episode reward: [(0, '4207.785')] [2023-03-08 22:32:28,549][569911] Updated weights for policy 0, policy_version 92240 (0.0006) [2023-03-08 22:32:32,122][569911] Updated weights for policy 0, policy_version 92320 (0.0005) [2023-03-08 22:32:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 47276032. Throughput: 0: 11775.5. Samples: 47261224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:32,789][569624] Avg episode reward: [(0, '4241.339')] [2023-03-08 22:32:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000092336_47276032.pth... [2023-03-08 22:32:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000091648_46923776.pth [2023-03-08 22:32:35,732][569911] Updated weights for policy 0, policy_version 92400 (0.0005) [2023-03-08 22:32:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 12038.1). 
Total num frames: 47333376. Throughput: 0: 11741.6. Samples: 47331488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:37,789][569624] Avg episode reward: [(0, '4116.737')] [2023-03-08 22:32:39,090][569911] Updated weights for policy 0, policy_version 92480 (0.0004) [2023-03-08 22:32:42,596][569911] Updated weights for policy 0, policy_version 92560 (0.0005) [2023-03-08 22:32:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 12024.2). Total num frames: 47390720. Throughput: 0: 11731.2. Samples: 47366292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:42,789][569624] Avg episode reward: [(0, '4228.336')] [2023-03-08 22:32:45,984][569911] Updated weights for policy 0, policy_version 92640 (0.0005) [2023-03-08 22:32:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 47452160. Throughput: 0: 11758.7. Samples: 47438984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:47,789][569624] Avg episode reward: [(0, '4047.499')] [2023-03-08 22:32:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000092680_47452160.pth... [2023-03-08 22:32:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000091984_47095808.pth [2023-03-08 22:32:49,501][569911] Updated weights for policy 0, policy_version 92720 (0.0005) [2023-03-08 22:32:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 12010.3). Total num frames: 47509504. Throughput: 0: 11706.6. Samples: 47507524. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:52,789][569624] Avg episode reward: [(0, '4416.421')] [2023-03-08 22:32:53,090][569911] Updated weights for policy 0, policy_version 92800 (0.0006) [2023-03-08 22:32:56,284][569911] Updated weights for policy 0, policy_version 92880 (0.0005) [2023-03-08 22:32:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 12010.3). Total num frames: 47570944. Throughput: 0: 11749.9. Samples: 47544616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:32:57,789][569624] Avg episode reward: [(0, '4510.294')] [2023-03-08 22:32:59,889][569911] Updated weights for policy 0, policy_version 92960 (0.0005) [2023-03-08 22:33:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11996.4). Total num frames: 47628288. Throughput: 0: 11825.4. Samples: 47615624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:02,789][569624] Avg episode reward: [(0, '3897.443')] [2023-03-08 22:33:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000093024_47628288.pth... [2023-03-08 22:33:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000092336_47276032.pth [2023-03-08 22:33:03,468][569911] Updated weights for policy 0, policy_version 93040 (0.0004) [2023-03-08 22:33:06,932][569911] Updated weights for policy 0, policy_version 93120 (0.0005) [2023-03-08 22:33:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 11996.4). Total num frames: 47685632. Throughput: 0: 11770.3. Samples: 47684888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:07,789][569624] Avg episode reward: [(0, '3974.595')] [2023-03-08 22:33:10,264][569911] Updated weights for policy 0, policy_version 93200 (0.0005) [2023-03-08 22:33:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11996.4). Total num frames: 47747072. Throughput: 0: 11796.5. 
Samples: 47722024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:12,789][569624] Avg episode reward: [(0, '4240.244')] [2023-03-08 22:33:13,656][569911] Updated weights for policy 0, policy_version 93280 (0.0005) [2023-03-08 22:33:16,965][569911] Updated weights for policy 0, policy_version 93360 (0.0005) [2023-03-08 22:33:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 47808512. Throughput: 0: 11878.5. Samples: 47795756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:17,789][569624] Avg episode reward: [(0, '4634.982')] [2023-03-08 22:33:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000093376_47808512.pth... [2023-03-08 22:33:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000092680_47452160.pth [2023-03-08 22:33:20,225][569911] Updated weights for policy 0, policy_version 93440 (0.0005) [2023-03-08 22:33:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 47869952. Throughput: 0: 11961.4. Samples: 47869752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:22,789][569624] Avg episode reward: [(0, '4665.881')] [2023-03-08 22:33:23,701][569911] Updated weights for policy 0, policy_version 93520 (0.0005) [2023-03-08 22:33:27,185][569911] Updated weights for policy 0, policy_version 93600 (0.0005) [2023-03-08 22:33:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 47927296. Throughput: 0: 11939.7. Samples: 47903580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:27,789][569624] Avg episode reward: [(0, '4506.392')] [2023-03-08 22:33:30,717][569911] Updated weights for policy 0, policy_version 93680 (0.0005) [2023-03-08 22:33:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 47984640. 
Throughput: 0: 11876.2. Samples: 47973412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:32,789][569624] Avg episode reward: [(0, '4654.551')] [2023-03-08 22:33:32,848][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000093728_47988736.pth... [2023-03-08 22:33:32,850][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000093024_47628288.pth [2023-03-08 22:33:34,176][569911] Updated weights for policy 0, policy_version 93760 (0.0005) [2023-03-08 22:33:37,505][569911] Updated weights for policy 0, policy_version 93840 (0.0005) [2023-03-08 22:33:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 48046080. Throughput: 0: 11969.5. Samples: 48046152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:37,789][569624] Avg episode reward: [(0, '4698.875')] [2023-03-08 22:33:40,822][569911] Updated weights for policy 0, policy_version 93920 (0.0005) [2023-03-08 22:33:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 48107520. Throughput: 0: 11968.2. Samples: 48083184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:42,789][569624] Avg episode reward: [(0, '4501.031')] [2023-03-08 22:33:44,241][569911] Updated weights for policy 0, policy_version 94000 (0.0005) [2023-03-08 22:33:47,623][569911] Updated weights for policy 0, policy_version 94080 (0.0005) [2023-03-08 22:33:47,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 48168960. Throughput: 0: 12024.9. Samples: 48156744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:47,789][569624] Avg episode reward: [(0, '4730.931')] [2023-03-08 22:33:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000094080_48168960.pth... 
[2023-03-08 22:33:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000093376_47808512.pth [2023-03-08 22:33:50,933][569911] Updated weights for policy 0, policy_version 94160 (0.0005) [2023-03-08 22:33:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 48230400. Throughput: 0: 12116.0. Samples: 48230108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:52,789][569624] Avg episode reward: [(0, '4650.990')] [2023-03-08 22:33:54,481][569911] Updated weights for policy 0, policy_version 94240 (0.0005) [2023-03-08 22:33:57,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 48287744. Throughput: 0: 12031.9. Samples: 48263460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:33:57,789][569624] Avg episode reward: [(0, '4676.232')] [2023-03-08 22:33:57,990][569911] Updated weights for policy 0, policy_version 94320 (0.0005) [2023-03-08 22:34:01,159][569911] Updated weights for policy 0, policy_version 94400 (0.0005) [2023-03-08 22:34:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 48349184. Throughput: 0: 12027.7. Samples: 48337000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:34:02,789][569624] Avg episode reward: [(0, '4683.507')] [2023-03-08 22:34:02,835][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000094440_48353280.pth... [2023-03-08 22:34:02,837][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000093728_47988736.pth [2023-03-08 22:34:04,509][569911] Updated weights for policy 0, policy_version 94480 (0.0005) [2023-03-08 22:34:07,683][569911] Updated weights for policy 0, policy_version 94560 (0.0005) [2023-03-08 22:34:07,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12151.5, 300 sec: 11982.5). 
Total num frames: 48414720. Throughput: 0: 12067.9. Samples: 48412808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:34:07,789][569624] Avg episode reward: [(0, '4648.799')] [2023-03-08 22:34:11,058][569911] Updated weights for policy 0, policy_version 94640 (0.0005) [2023-03-08 22:34:12,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12151.5, 300 sec: 11982.5). Total num frames: 48476160. Throughput: 0: 12112.8. Samples: 48448656. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:12,789][569624] Avg episode reward: [(0, '4383.518')] [2023-03-08 22:34:14,491][569911] Updated weights for policy 0, policy_version 94720 (0.0005) [2023-03-08 22:34:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 48533504. Throughput: 0: 12165.3. Samples: 48520852. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:17,789][569624] Avg episode reward: [(0, '4743.198')] [2023-03-08 22:34:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000094792_48533504.pth... [2023-03-08 22:34:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000094080_48168960.pth [2023-03-08 22:34:17,989][569911] Updated weights for policy 0, policy_version 94800 (0.0005) [2023-03-08 22:34:21,409][569911] Updated weights for policy 0, policy_version 94880 (0.0005) [2023-03-08 22:34:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 48594944. Throughput: 0: 12138.1. Samples: 48592368. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:22,789][569624] Avg episode reward: [(0, '4585.774')] [2023-03-08 22:34:24,756][569911] Updated weights for policy 0, policy_version 94960 (0.0005) [2023-03-08 22:34:27,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 48656384. Throughput: 0: 12146.0. Samples: 48629752. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:27,789][569624] Avg episode reward: [(0, '4675.921')] [2023-03-08 22:34:28,059][569911] Updated weights for policy 0, policy_version 95040 (0.0005) [2023-03-08 22:34:31,499][569911] Updated weights for policy 0, policy_version 95120 (0.0005) [2023-03-08 22:34:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 48713728. Throughput: 0: 12106.0. Samples: 48701512. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:32,789][569624] Avg episode reward: [(0, '4651.287')] [2023-03-08 22:34:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000095144_48713728.pth... [2023-03-08 22:34:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000094440_48353280.pth [2023-03-08 22:34:35,071][569911] Updated weights for policy 0, policy_version 95200 (0.0005) [2023-03-08 22:34:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 11940.9). Total num frames: 48771072. Throughput: 0: 12027.6. Samples: 48771348. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:37,789][569624] Avg episode reward: [(0, '4541.811')] [2023-03-08 22:34:38,579][569911] Updated weights for policy 0, policy_version 95280 (0.0005) [2023-03-08 22:34:42,002][569911] Updated weights for policy 0, policy_version 95360 (0.0004) [2023-03-08 22:34:42,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11954.8). Total num frames: 48832512. Throughput: 0: 12087.4. Samples: 48807392. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:42,789][569624] Avg episode reward: [(0, '4500.723')] [2023-03-08 22:34:45,245][569911] Updated weights for policy 0, policy_version 95440 (0.0005) [2023-03-08 22:34:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 48893952. Throughput: 0: 12097.9. 
Samples: 48881404. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:47,789][569624] Avg episode reward: [(0, '4657.322')] [2023-03-08 22:34:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000095496_48893952.pth... [2023-03-08 22:34:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000094792_48533504.pth [2023-03-08 22:34:48,555][569911] Updated weights for policy 0, policy_version 95520 (0.0004) [2023-03-08 22:34:51,734][569911] Updated weights for policy 0, policy_version 95600 (0.0004) [2023-03-08 22:34:52,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 48959488. Throughput: 0: 12106.1. Samples: 48957584. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:52,789][569624] Avg episode reward: [(0, '4565.547')] [2023-03-08 22:34:55,044][569911] Updated weights for policy 0, policy_version 95680 (0.0005) [2023-03-08 22:34:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 11996.4). Total num frames: 49016832. Throughput: 0: 12121.2. Samples: 48994108. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:34:57,789][569624] Avg episode reward: [(0, '4468.375')] [2023-03-08 22:34:58,571][569911] Updated weights for policy 0, policy_version 95760 (0.0004) [2023-03-08 22:35:02,009][569911] Updated weights for policy 0, policy_version 95840 (0.0005) [2023-03-08 22:35:02,789][569624] Fps is (10 sec: 11878.2, 60 sec: 12151.4, 300 sec: 12010.3). Total num frames: 49078272. Throughput: 0: 12105.9. Samples: 49065620. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:35:02,790][569624] Avg episode reward: [(0, '4607.098')] [2023-03-08 22:35:02,795][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000095856_49078272.pth... 
[2023-03-08 22:35:02,798][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000095144_48713728.pth [2023-03-08 22:35:05,598][569911] Updated weights for policy 0, policy_version 95920 (0.0005) [2023-03-08 22:35:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 49135616. Throughput: 0: 12015.2. Samples: 49133052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:07,789][569624] Avg episode reward: [(0, '4617.538')] [2023-03-08 22:35:09,041][569911] Updated weights for policy 0, policy_version 96000 (0.0005) [2023-03-08 22:35:12,387][569911] Updated weights for policy 0, policy_version 96080 (0.0005) [2023-03-08 22:35:12,789][569624] Fps is (10 sec: 11878.6, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 49197056. Throughput: 0: 12003.8. Samples: 49169924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:12,789][569624] Avg episode reward: [(0, '4565.786')] [2023-03-08 22:35:15,835][569911] Updated weights for policy 0, policy_version 96160 (0.0005) [2023-03-08 22:35:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 49254400. Throughput: 0: 12014.9. Samples: 49242184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:17,789][569624] Avg episode reward: [(0, '4585.936')] [2023-03-08 22:35:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000096200_49254400.pth... [2023-03-08 22:35:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000095496_48893952.pth [2023-03-08 22:35:19,309][569911] Updated weights for policy 0, policy_version 96240 (0.0005) [2023-03-08 22:35:22,711][569911] Updated weights for policy 0, policy_version 96320 (0.0005) [2023-03-08 22:35:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). 
Total num frames: 49315840. Throughput: 0: 12072.6. Samples: 49314616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:22,789][569624] Avg episode reward: [(0, '4336.992')] [2023-03-08 22:35:26,067][569911] Updated weights for policy 0, policy_version 96400 (0.0005) [2023-03-08 22:35:27,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 49377280. Throughput: 0: 12080.5. Samples: 49351016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:27,789][569624] Avg episode reward: [(0, '4514.784')] [2023-03-08 22:35:29,516][569911] Updated weights for policy 0, policy_version 96480 (0.0005) [2023-03-08 22:35:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 49434624. Throughput: 0: 11992.5. Samples: 49421068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:32,789][569624] Avg episode reward: [(0, '4435.773')] [2023-03-08 22:35:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000096552_49434624.pth... [2023-03-08 22:35:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000095856_49078272.pth [2023-03-08 22:35:33,066][569911] Updated weights for policy 0, policy_version 96560 (0.0005) [2023-03-08 22:35:36,574][569911] Updated weights for policy 0, policy_version 96640 (0.0005) [2023-03-08 22:35:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 49491968. Throughput: 0: 11874.4. Samples: 49491932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:37,789][569624] Avg episode reward: [(0, '4516.296')] [2023-03-08 22:35:40,084][569911] Updated weights for policy 0, policy_version 96720 (0.0005) [2023-03-08 22:35:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 49549312. Throughput: 0: 11845.9. Samples: 49527176. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:42,789][569624] Avg episode reward: [(0, '4348.542')] [2023-03-08 22:35:43,466][569911] Updated weights for policy 0, policy_version 96800 (0.0005) [2023-03-08 22:35:46,991][569911] Updated weights for policy 0, policy_version 96880 (0.0006) [2023-03-08 22:35:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 49610752. Throughput: 0: 11835.1. Samples: 49598196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:47,789][569624] Avg episode reward: [(0, '4579.688')] [2023-03-08 22:35:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000096896_49610752.pth... [2023-03-08 22:35:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000096200_49254400.pth [2023-03-08 22:35:50,646][569911] Updated weights for policy 0, policy_version 96960 (0.0006) [2023-03-08 22:35:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 49668096. Throughput: 0: 11889.1. Samples: 49668060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:52,789][569624] Avg episode reward: [(0, '4538.121')] [2023-03-08 22:35:54,049][569911] Updated weights for policy 0, policy_version 97040 (0.0005) [2023-03-08 22:35:57,408][569911] Updated weights for policy 0, policy_version 97120 (0.0005) [2023-03-08 22:35:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11940.9). Total num frames: 49729536. Throughput: 0: 11856.7. Samples: 49703476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:35:57,789][569624] Avg episode reward: [(0, '4661.981')] [2023-03-08 22:36:00,757][569911] Updated weights for policy 0, policy_version 97200 (0.0005) [2023-03-08 22:36:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 49790976. Throughput: 0: 11832.7. 
Samples: 49774656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:02,789][569624] Avg episode reward: [(0, '4795.566')] [2023-03-08 22:36:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000097248_49790976.pth... [2023-03-08 22:36:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000096552_49434624.pth [2023-03-08 22:36:04,226][569911] Updated weights for policy 0, policy_version 97280 (0.0005) [2023-03-08 22:36:07,527][569911] Updated weights for policy 0, policy_version 97360 (0.0005) [2023-03-08 22:36:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11940.9). Total num frames: 49848320. Throughput: 0: 11860.3. Samples: 49848328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:07,789][569624] Avg episode reward: [(0, '4559.157')] [2023-03-08 22:36:10,821][569911] Updated weights for policy 0, policy_version 97440 (0.0004) [2023-03-08 22:36:12,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 49913856. Throughput: 0: 11873.9. Samples: 49885340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:12,789][569624] Avg episode reward: [(0, '4611.553')] [2023-03-08 22:36:14,143][569911] Updated weights for policy 0, policy_version 97520 (0.0005) [2023-03-08 22:36:17,694][569911] Updated weights for policy 0, policy_version 97600 (0.0005) [2023-03-08 22:36:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 49971200. Throughput: 0: 11934.3. Samples: 49958112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:17,789][569624] Avg episode reward: [(0, '4371.098')] [2023-03-08 22:36:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000097600_49971200.pth... 
[2023-03-08 22:36:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000096896_49610752.pth [2023-03-08 22:36:21,335][569911] Updated weights for policy 0, policy_version 97680 (0.0005) [2023-03-08 22:36:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11810.1, 300 sec: 11927.0). Total num frames: 50024448. Throughput: 0: 11834.0. Samples: 50024460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:22,789][569624] Avg episode reward: [(0, '4345.462')] [2023-03-08 22:36:25,060][569911] Updated weights for policy 0, policy_version 97760 (0.0005) [2023-03-08 22:36:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11741.9, 300 sec: 11927.0). Total num frames: 50081792. Throughput: 0: 11813.9. Samples: 50058800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:27,789][569624] Avg episode reward: [(0, '4554.378')] [2023-03-08 22:36:28,639][569911] Updated weights for policy 0, policy_version 97840 (0.0006) [2023-03-08 22:36:32,115][569911] Updated weights for policy 0, policy_version 97920 (0.0005) [2023-03-08 22:36:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 11913.1). Total num frames: 50139136. Throughput: 0: 11789.7. Samples: 50128732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:32,789][569624] Avg episode reward: [(0, '4647.662')] [2023-03-08 22:36:32,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000097936_50143232.pth... [2023-03-08 22:36:32,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000097248_49790976.pth [2023-03-08 22:36:35,609][569911] Updated weights for policy 0, policy_version 98000 (0.0005) [2023-03-08 22:36:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 50200576. Throughput: 0: 11814.6. Samples: 50199716. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:37,789][569624] Avg episode reward: [(0, '4405.634')] [2023-03-08 22:36:39,044][569911] Updated weights for policy 0, policy_version 98080 (0.0005) [2023-03-08 22:36:42,510][569911] Updated weights for policy 0, policy_version 98160 (0.0005) [2023-03-08 22:36:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 50257920. Throughput: 0: 11780.8. Samples: 50233612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:42,789][569624] Avg episode reward: [(0, '4683.027')] [2023-03-08 22:36:46,167][569911] Updated weights for policy 0, policy_version 98240 (0.0005) [2023-03-08 22:36:47,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11899.2). Total num frames: 50315264. Throughput: 0: 11740.8. Samples: 50302992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:47,789][569624] Avg episode reward: [(0, '4701.027')] [2023-03-08 22:36:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000098272_50315264.pth... [2023-03-08 22:36:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000097600_49971200.pth [2023-03-08 22:36:49,733][569911] Updated weights for policy 0, policy_version 98320 (0.0005) [2023-03-08 22:36:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 50376704. Throughput: 0: 11660.1. Samples: 50373032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:52,789][569624] Avg episode reward: [(0, '4632.619')] [2023-03-08 22:36:53,102][569911] Updated weights for policy 0, policy_version 98400 (0.0005) [2023-03-08 22:36:56,656][569911] Updated weights for policy 0, policy_version 98480 (0.0004) [2023-03-08 22:36:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11899.2). Total num frames: 50434048. Throughput: 0: 11637.0. 
Samples: 50409004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:36:57,789][569624] Avg episode reward: [(0, '4739.406')] [2023-03-08 22:37:00,122][569911] Updated weights for policy 0, policy_version 98560 (0.0005) [2023-03-08 22:37:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11899.2). Total num frames: 50491392. Throughput: 0: 11577.9. Samples: 50479116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:02,789][569624] Avg episode reward: [(0, '4677.871')] [2023-03-08 22:37:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000098616_50491392.pth... [2023-03-08 22:37:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000097936_50143232.pth [2023-03-08 22:37:03,598][569911] Updated weights for policy 0, policy_version 98640 (0.0004) [2023-03-08 22:37:07,029][569911] Updated weights for policy 0, policy_version 98720 (0.0004) [2023-03-08 22:37:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11913.1). Total num frames: 50552832. Throughput: 0: 11727.6. Samples: 50552200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:07,789][569624] Avg episode reward: [(0, '4517.150')] [2023-03-08 22:37:10,357][569911] Updated weights for policy 0, policy_version 98800 (0.0004) [2023-03-08 22:37:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11913.1). Total num frames: 50610176. Throughput: 0: 11763.9. Samples: 50588176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:12,789][569624] Avg episode reward: [(0, '4667.108')] [2023-03-08 22:37:14,101][569911] Updated weights for policy 0, policy_version 98880 (0.0005) [2023-03-08 22:37:17,716][569911] Updated weights for policy 0, policy_version 98960 (0.0005) [2023-03-08 22:37:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11899.2). Total num frames: 50667520. 
Throughput: 0: 11656.9. Samples: 50653292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:17,789][569624] Avg episode reward: [(0, '4739.909')] [2023-03-08 22:37:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000098960_50667520.pth... [2023-03-08 22:37:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000098272_50315264.pth [2023-03-08 22:37:21,218][569911] Updated weights for policy 0, policy_version 99040 (0.0004) [2023-03-08 22:37:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11899.2). Total num frames: 50724864. Throughput: 0: 11659.3. Samples: 50724384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:22,789][569624] Avg episode reward: [(0, '4638.420')] [2023-03-08 22:37:24,701][569911] Updated weights for policy 0, policy_version 99120 (0.0004) [2023-03-08 22:37:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11885.3). Total num frames: 50782208. Throughput: 0: 11717.9. Samples: 50760920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:27,790][569624] Avg episode reward: [(0, '4659.551')] [2023-03-08 22:37:28,185][569911] Updated weights for policy 0, policy_version 99200 (0.0005) [2023-03-08 22:37:31,706][569911] Updated weights for policy 0, policy_version 99280 (0.0005) [2023-03-08 22:37:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11885.3). Total num frames: 50839552. Throughput: 0: 11692.0. Samples: 50829132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:32,789][569624] Avg episode reward: [(0, '4667.149')] [2023-03-08 22:37:32,795][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000099304_50843648.pth... 
[2023-03-08 22:37:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000098616_50491392.pth [2023-03-08 22:37:35,314][569911] Updated weights for policy 0, policy_version 99360 (0.0006) [2023-03-08 22:37:37,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11885.3). Total num frames: 50896896. Throughput: 0: 11678.3. Samples: 50898556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:37,789][569624] Avg episode reward: [(0, '4728.466')] [2023-03-08 22:37:38,899][569911] Updated weights for policy 0, policy_version 99440 (0.0004) [2023-03-08 22:37:42,385][569911] Updated weights for policy 0, policy_version 99520 (0.0005) [2023-03-08 22:37:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11885.3). Total num frames: 50958336. Throughput: 0: 11650.0. Samples: 50933256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:42,789][569624] Avg episode reward: [(0, '4786.765')] [2023-03-08 22:37:45,895][569911] Updated weights for policy 0, policy_version 99600 (0.0005) [2023-03-08 22:37:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11885.3). Total num frames: 51015680. Throughput: 0: 11646.4. Samples: 51003204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:47,789][569624] Avg episode reward: [(0, '4636.097')] [2023-03-08 22:37:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000099640_51015680.pth... [2023-03-08 22:37:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000098960_50667520.pth [2023-03-08 22:37:49,489][569911] Updated weights for policy 0, policy_version 99680 (0.0005) [2023-03-08 22:37:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11871.5). Total num frames: 51073024. Throughput: 0: 11533.4. Samples: 51071204. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:52,789][569624] Avg episode reward: [(0, '4551.557')] [2023-03-08 22:37:53,062][569911] Updated weights for policy 0, policy_version 99760 (0.0005) [2023-03-08 22:37:56,694][569911] Updated weights for policy 0, policy_version 99840 (0.0005) [2023-03-08 22:37:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11871.5). Total num frames: 51130368. Throughput: 0: 11489.3. Samples: 51105196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:37:57,789][569624] Avg episode reward: [(0, '4718.865')] [2023-03-08 22:38:00,141][569911] Updated weights for policy 0, policy_version 99920 (0.0005) [2023-03-08 22:38:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11871.5). Total num frames: 51187712. Throughput: 0: 11604.4. Samples: 51175488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:02,789][569624] Avg episode reward: [(0, '4734.765')] [2023-03-08 22:38:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000099976_51187712.pth... [2023-03-08 22:38:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000099304_50843648.pth [2023-03-08 22:38:03,574][569911] Updated weights for policy 0, policy_version 100000 (0.0005) [2023-03-08 22:38:07,112][569911] Updated weights for policy 0, policy_version 100080 (0.0005) [2023-03-08 22:38:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11857.6). Total num frames: 51245056. Throughput: 0: 11583.0. Samples: 51245620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:07,789][569624] Avg episode reward: [(0, '4597.213')] [2023-03-08 22:38:10,773][569911] Updated weights for policy 0, policy_version 100160 (0.0005) [2023-03-08 22:38:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11605.3, 300 sec: 11857.6). Total num frames: 51306496. 
Throughput: 0: 11511.3. Samples: 51278928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:12,789][569624] Avg episode reward: [(0, '4221.845')] [2023-03-08 22:38:14,036][569911] Updated weights for policy 0, policy_version 100240 (0.0005) [2023-03-08 22:38:17,401][569911] Updated weights for policy 0, policy_version 100320 (0.0005) [2023-03-08 22:38:17,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11673.6, 300 sec: 11857.6). Total num frames: 51367936. Throughput: 0: 11652.8. Samples: 51353508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:17,789][569624] Avg episode reward: [(0, '4624.843')] [2023-03-08 22:38:17,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000100328_51367936.pth... [2023-03-08 22:38:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000099640_51015680.pth [2023-03-08 22:38:21,046][569911] Updated weights for policy 0, policy_version 100400 (0.0005) [2023-03-08 22:38:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11843.7). Total num frames: 51421184. Throughput: 0: 11615.4. Samples: 51421248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:22,789][569624] Avg episode reward: [(0, '4691.803')] [2023-03-08 22:38:24,585][569911] Updated weights for policy 0, policy_version 100480 (0.0005) [2023-03-08 22:38:27,789][569624] Fps is (10 sec: 11059.4, 60 sec: 11605.4, 300 sec: 11843.7). Total num frames: 51478528. Throughput: 0: 11588.5. Samples: 51454736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:27,789][569624] Avg episode reward: [(0, '4635.954')] [2023-03-08 22:38:28,160][569911] Updated weights for policy 0, policy_version 100560 (0.0004) [2023-03-08 22:38:31,707][569911] Updated weights for policy 0, policy_version 100640 (0.0005) [2023-03-08 22:38:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11843.7). 
Total num frames: 51539968. Throughput: 0: 11625.9. Samples: 51526372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:32,789][569624] Avg episode reward: [(0, '4522.057')] [2023-03-08 22:38:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000100664_51539968.pth... [2023-03-08 22:38:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000099976_51187712.pth [2023-03-08 22:38:35,059][569911] Updated weights for policy 0, policy_version 100720 (0.0005) [2023-03-08 22:38:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 51601408. Throughput: 0: 11692.9. Samples: 51597384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:37,789][569624] Avg episode reward: [(0, '4425.165')] [2023-03-08 22:38:38,532][569911] Updated weights for policy 0, policy_version 100800 (0.0005) [2023-03-08 22:38:42,104][569911] Updated weights for policy 0, policy_version 100880 (0.0005) [2023-03-08 22:38:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11815.9). Total num frames: 51654656. Throughput: 0: 11740.6. Samples: 51633524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:42,789][569624] Avg episode reward: [(0, '4486.205')] [2023-03-08 22:38:45,610][569911] Updated weights for policy 0, policy_version 100960 (0.0005) [2023-03-08 22:38:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 51716096. Throughput: 0: 11735.3. Samples: 51703576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:47,789][569624] Avg episode reward: [(0, '4440.540')] [2023-03-08 22:38:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000101008_51716096.pth... 
[2023-03-08 22:38:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000100328_51367936.pth [2023-03-08 22:38:49,115][569911] Updated weights for policy 0, policy_version 101040 (0.0005) [2023-03-08 22:38:52,450][569911] Updated weights for policy 0, policy_version 101120 (0.0005) [2023-03-08 22:38:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 51777536. Throughput: 0: 11742.0. Samples: 51774012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:52,789][569624] Avg episode reward: [(0, '4744.294')] [2023-03-08 22:38:55,743][569911] Updated weights for policy 0, policy_version 101200 (0.0005) [2023-03-08 22:38:57,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 51834880. Throughput: 0: 11831.9. Samples: 51811364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:38:57,789][569624] Avg episode reward: [(0, '4557.458')] [2023-03-08 22:38:59,197][569911] Updated weights for policy 0, policy_version 101280 (0.0005) [2023-03-08 22:39:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 51892224. Throughput: 0: 11762.6. Samples: 51882824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:39:02,789][569624] Avg episode reward: [(0, '4565.199')] [2023-03-08 22:39:02,790][569911] Updated weights for policy 0, policy_version 101360 (0.0005) [2023-03-08 22:39:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000101360_51896320.pth... [2023-03-08 22:39:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000100664_51539968.pth [2023-03-08 22:39:06,361][569911] Updated weights for policy 0, policy_version 101440 (0.0005) [2023-03-08 22:39:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11774.3). 
Total num frames: 51949568. Throughput: 0: 11742.1. Samples: 51949640. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:07,789][569624] Avg episode reward: [(0, '4511.074')] [2023-03-08 22:39:09,945][569911] Updated weights for policy 0, policy_version 101520 (0.0004) [2023-03-08 22:39:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11774.3). Total num frames: 52006912. Throughput: 0: 11789.8. Samples: 51985276. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:12,789][569624] Avg episode reward: [(0, '4576.999')] [2023-03-08 22:39:13,484][569911] Updated weights for policy 0, policy_version 101600 (0.0005) [2023-03-08 22:39:17,007][569911] Updated weights for policy 0, policy_version 101680 (0.0005) [2023-03-08 22:39:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11774.3). Total num frames: 52068352. Throughput: 0: 11751.7. Samples: 52055196. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:17,789][569624] Avg episode reward: [(0, '4758.062')] [2023-03-08 22:39:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000101696_52068352.pth... [2023-03-08 22:39:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000101008_51716096.pth [2023-03-08 22:39:20,442][569911] Updated weights for policy 0, policy_version 101760 (0.0005) [2023-03-08 22:39:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11760.4). Total num frames: 52125696. Throughput: 0: 11740.3. Samples: 52125696. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:22,789][569624] Avg episode reward: [(0, '4815.015')] [2023-03-08 22:39:24,004][569911] Updated weights for policy 0, policy_version 101840 (0.0005) [2023-03-08 22:39:27,599][569911] Updated weights for policy 0, policy_version 101920 (0.0004) [2023-03-08 22:39:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11760.4). Total num frames: 52183040. Throughput: 0: 11712.5. Samples: 52160588. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:27,789][569624] Avg episode reward: [(0, '4721.138')] [2023-03-08 22:39:31,273][569911] Updated weights for policy 0, policy_version 102000 (0.0005) [2023-03-08 22:39:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11760.4). Total num frames: 52240384. Throughput: 0: 11645.3. Samples: 52227616. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:32,789][569624] Avg episode reward: [(0, '4714.665')] [2023-03-08 22:39:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000102032_52240384.pth... [2023-03-08 22:39:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000101360_51896320.pth [2023-03-08 22:39:34,739][569911] Updated weights for policy 0, policy_version 102080 (0.0005) [2023-03-08 22:39:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11746.5). Total num frames: 52297728. Throughput: 0: 11639.7. Samples: 52297800. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:37,789][569624] Avg episode reward: [(0, '4588.111')] [2023-03-08 22:39:38,182][569911] Updated weights for policy 0, policy_version 102160 (0.0005) [2023-03-08 22:39:41,615][569911] Updated weights for policy 0, policy_version 102240 (0.0005) [2023-03-08 22:39:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11746.5). Total num frames: 52359168. 
Throughput: 0: 11605.1. Samples: 52333596. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:42,789][569624] Avg episode reward: [(0, '4784.223')] [2023-03-08 22:39:45,194][569911] Updated weights for policy 0, policy_version 102320 (0.0005) [2023-03-08 22:39:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11718.7). Total num frames: 52416512. Throughput: 0: 11578.0. Samples: 52403836. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:47,789][569624] Avg episode reward: [(0, '4583.112')] [2023-03-08 22:39:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000102376_52416512.pth... [2023-03-08 22:39:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000101696_52068352.pth [2023-03-08 22:39:48,838][569911] Updated weights for policy 0, policy_version 102400 (0.0004) [2023-03-08 22:39:52,225][569911] Updated weights for policy 0, policy_version 102480 (0.0005) [2023-03-08 22:39:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11718.7). Total num frames: 52473856. Throughput: 0: 11651.0. Samples: 52473936. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:52,789][569624] Avg episode reward: [(0, '4654.722')] [2023-03-08 22:39:55,696][569911] Updated weights for policy 0, policy_version 102560 (0.0005) [2023-03-08 22:39:57,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11673.6, 300 sec: 11718.7). Total num frames: 52535296. Throughput: 0: 11662.1. Samples: 52510068. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:39:57,789][569624] Avg episode reward: [(0, '4724.376')] [2023-03-08 22:39:58,990][569911] Updated weights for policy 0, policy_version 102640 (0.0005) [2023-03-08 22:40:02,581][569911] Updated weights for policy 0, policy_version 102720 (0.0004) [2023-03-08 22:40:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11718.7). 
Total num frames: 52592640. Throughput: 0: 11700.1. Samples: 52581700. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 22:40:02,789][569624] Avg episode reward: [(0, '4585.481')] [2023-03-08 22:40:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000102720_52592640.pth... [2023-03-08 22:40:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000102032_52240384.pth [2023-03-08 22:40:06,193][569911] Updated weights for policy 0, policy_version 102800 (0.0006) [2023-03-08 22:40:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11704.8). Total num frames: 52649984. Throughput: 0: 11647.3. Samples: 52649824. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:07,789][569624] Avg episode reward: [(0, '4640.776')] [2023-03-08 22:40:09,730][569911] Updated weights for policy 0, policy_version 102880 (0.0005) [2023-03-08 22:40:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11704.8). Total num frames: 52707328. Throughput: 0: 11605.1. Samples: 52682816. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:12,789][569624] Avg episode reward: [(0, '4800.453')] [2023-03-08 22:40:13,376][569911] Updated weights for policy 0, policy_version 102960 (0.0005) [2023-03-08 22:40:16,817][569911] Updated weights for policy 0, policy_version 103040 (0.0005) [2023-03-08 22:40:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11691.0). Total num frames: 52764672. Throughput: 0: 11666.9. Samples: 52752624. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:17,789][569624] Avg episode reward: [(0, '4531.140')] [2023-03-08 22:40:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000103064_52768768.pth... 
[2023-03-08 22:40:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000102376_52416512.pth [2023-03-08 22:40:20,167][569911] Updated weights for policy 0, policy_version 103120 (0.0005) [2023-03-08 22:40:22,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11691.0). Total num frames: 52826112. Throughput: 0: 11736.3. Samples: 52825932. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:22,789][569624] Avg episode reward: [(0, '4625.169')] [2023-03-08 22:40:23,695][569911] Updated weights for policy 0, policy_version 103200 (0.0004) [2023-03-08 22:40:27,182][569911] Updated weights for policy 0, policy_version 103280 (0.0005) [2023-03-08 22:40:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11691.0). Total num frames: 52883456. Throughput: 0: 11702.2. Samples: 52860196. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:27,789][569624] Avg episode reward: [(0, '4661.856')] [2023-03-08 22:40:30,648][569911] Updated weights for policy 0, policy_version 103360 (0.0005) [2023-03-08 22:40:32,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11704.8). Total num frames: 52944896. Throughput: 0: 11718.8. Samples: 52931180. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:32,789][569624] Avg episode reward: [(0, '4699.575')] [2023-03-08 22:40:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000103408_52944896.pth... [2023-03-08 22:40:32,797][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000102720_52592640.pth [2023-03-08 22:40:34,193][569911] Updated weights for policy 0, policy_version 103440 (0.0005) [2023-03-08 22:40:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11691.0). Total num frames: 52998144. Throughput: 0: 11678.3. Samples: 52999460. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:37,789][569624] Avg episode reward: [(0, '4630.205')] [2023-03-08 22:40:37,793][569911] Updated weights for policy 0, policy_version 103520 (0.0005) [2023-03-08 22:40:41,241][569911] Updated weights for policy 0, policy_version 103600 (0.0005) [2023-03-08 22:40:42,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11691.0). Total num frames: 53059584. Throughput: 0: 11666.7. Samples: 53035072. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:42,789][569624] Avg episode reward: [(0, '4769.831')] [2023-03-08 22:40:44,814][569911] Updated weights for policy 0, policy_version 103680 (0.0006) [2023-03-08 22:40:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11691.0). Total num frames: 53116928. Throughput: 0: 11616.3. Samples: 53104432. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:47,789][569624] Avg episode reward: [(0, '4695.747')] [2023-03-08 22:40:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000103744_53116928.pth... [2023-03-08 22:40:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000103064_52768768.pth [2023-03-08 22:40:48,488][569911] Updated weights for policy 0, policy_version 103760 (0.0005) [2023-03-08 22:40:52,306][569911] Updated weights for policy 0, policy_version 103840 (0.0004) [2023-03-08 22:40:52,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11605.3, 300 sec: 11663.2). Total num frames: 53170176. Throughput: 0: 11535.7. Samples: 53168932. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:52,789][569624] Avg episode reward: [(0, '4772.045')] [2023-03-08 22:40:55,743][569911] Updated weights for policy 0, policy_version 103920 (0.0005) [2023-03-08 22:40:57,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11537.1, 300 sec: 11649.3). Total num frames: 53227520. 
Throughput: 0: 11575.6. Samples: 53203720. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:40:57,789][569624] Avg episode reward: [(0, '4631.262')] [2023-03-08 22:40:59,150][569911] Updated weights for policy 0, policy_version 104000 (0.0005) [2023-03-08 22:41:02,361][569911] Updated weights for policy 0, policy_version 104080 (0.0005) [2023-03-08 22:41:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11673.6, 300 sec: 11677.1). Total num frames: 53293056. Throughput: 0: 11647.4. Samples: 53276756. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:41:02,789][569624] Avg episode reward: [(0, '4740.225')] [2023-03-08 22:41:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000104088_53293056.pth... [2023-03-08 22:41:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000103408_52944896.pth [2023-03-08 22:41:05,886][569911] Updated weights for policy 0, policy_version 104160 (0.0005) [2023-03-08 22:41:07,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11673.6, 300 sec: 11649.3). Total num frames: 53350400. Throughput: 0: 11648.7. Samples: 53350124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:07,789][569624] Avg episode reward: [(0, '4774.854')] [2023-03-08 22:41:09,127][569911] Updated weights for policy 0, policy_version 104240 (0.0005) [2023-03-08 22:41:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11649.3). Total num frames: 53407744. Throughput: 0: 11704.1. Samples: 53386880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:12,789][569624] Avg episode reward: [(0, '4568.134')] [2023-03-08 22:41:12,808][569911] Updated weights for policy 0, policy_version 104320 (0.0006) [2023-03-08 22:41:16,288][569911] Updated weights for policy 0, policy_version 104400 (0.0005) [2023-03-08 22:41:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11677.1). 
Total num frames: 53469184. Throughput: 0: 11672.4. Samples: 53456440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:17,789][569624] Avg episode reward: [(0, '4588.652')] [2023-03-08 22:41:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000104432_53469184.pth... [2023-03-08 22:41:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000103744_53116928.pth [2023-03-08 22:41:19,906][569911] Updated weights for policy 0, policy_version 104480 (0.0005) [2023-03-08 22:41:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11663.2). Total num frames: 53522432. Throughput: 0: 11620.5. Samples: 53522384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:22,789][569624] Avg episode reward: [(0, '4691.929')] [2023-03-08 22:41:23,587][569911] Updated weights for policy 0, policy_version 104560 (0.0005) [2023-03-08 22:41:27,237][569911] Updated weights for policy 0, policy_version 104640 (0.0005) [2023-03-08 22:41:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11605.3, 300 sec: 11663.2). Total num frames: 53579776. Throughput: 0: 11558.6. Samples: 53555208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:27,789][569624] Avg episode reward: [(0, '4685.203')] [2023-03-08 22:41:30,890][569911] Updated weights for policy 0, policy_version 104720 (0.0004) [2023-03-08 22:41:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11649.3). Total num frames: 53637120. Throughput: 0: 11564.7. Samples: 53624844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:32,789][569624] Avg episode reward: [(0, '4659.660')] [2023-03-08 22:41:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000104760_53637120.pth... 
[2023-03-08 22:41:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000104088_53293056.pth [2023-03-08 22:41:34,443][569911] Updated weights for policy 0, policy_version 104800 (0.0005) [2023-03-08 22:41:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11649.3). Total num frames: 53694464. Throughput: 0: 11678.5. Samples: 53694464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:37,789][569624] Avg episode reward: [(0, '4694.905')] [2023-03-08 22:41:37,905][569911] Updated weights for policy 0, policy_version 104880 (0.0005) [2023-03-08 22:41:41,330][569911] Updated weights for policy 0, policy_version 104960 (0.0005) [2023-03-08 22:41:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11663.2). Total num frames: 53755904. Throughput: 0: 11719.8. Samples: 53731112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:42,789][569624] Avg episode reward: [(0, '4323.180')] [2023-03-08 22:41:44,795][569911] Updated weights for policy 0, policy_version 105040 (0.0005) [2023-03-08 22:41:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11673.6, 300 sec: 11663.2). Total num frames: 53817344. Throughput: 0: 11666.2. Samples: 53801736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:47,789][569624] Avg episode reward: [(0, '4045.541')] [2023-03-08 22:41:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000105112_53817344.pth... [2023-03-08 22:41:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000104432_53469184.pth [2023-03-08 22:41:48,116][569911] Updated weights for policy 0, policy_version 105120 (0.0005) [2023-03-08 22:41:51,508][569911] Updated weights for policy 0, policy_version 105200 (0.0005) [2023-03-08 22:41:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11663.2). 
Total num frames: 53874688. Throughput: 0: 11666.6. Samples: 53875120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:52,789][569624] Avg episode reward: [(0, '4598.342')] [2023-03-08 22:41:54,977][569911] Updated weights for policy 0, policy_version 105280 (0.0005) [2023-03-08 22:41:57,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11677.1). Total num frames: 53936128. Throughput: 0: 11646.3. Samples: 53910964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:41:57,789][569624] Avg episode reward: [(0, '4708.278')] [2023-03-08 22:41:58,342][569911] Updated weights for policy 0, policy_version 105360 (0.0005) [2023-03-08 22:42:01,699][569911] Updated weights for policy 0, policy_version 105440 (0.0005) [2023-03-08 22:42:02,789][569624] Fps is (10 sec: 12288.2, 60 sec: 11741.9, 300 sec: 11677.1). Total num frames: 53997568. Throughput: 0: 11707.5. Samples: 53983276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:02,789][569624] Avg episode reward: [(0, '4595.104')] [2023-03-08 22:42:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000105464_53997568.pth... [2023-03-08 22:42:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000104760_53637120.pth [2023-03-08 22:42:05,192][569911] Updated weights for policy 0, policy_version 105520 (0.0005) [2023-03-08 22:42:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11677.1). Total num frames: 54054912. Throughput: 0: 11792.8. Samples: 54053060. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:07,789][569624] Avg episode reward: [(0, '4533.499')] [2023-03-08 22:42:08,684][569911] Updated weights for policy 0, policy_version 105600 (0.0005) [2023-03-08 22:42:12,285][569911] Updated weights for policy 0, policy_version 105680 (0.0005) [2023-03-08 22:42:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11677.1). Total num frames: 54112256. Throughput: 0: 11832.7. Samples: 54087680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:12,789][569624] Avg episode reward: [(0, '4547.048')] [2023-03-08 22:42:15,642][569911] Updated weights for policy 0, policy_version 105760 (0.0005) [2023-03-08 22:42:17,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11691.0). Total num frames: 54173696. Throughput: 0: 11881.5. Samples: 54159512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:17,789][569624] Avg episode reward: [(0, '4413.321')] [2023-03-08 22:42:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000105808_54173696.pth... [2023-03-08 22:42:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000105112_53817344.pth [2023-03-08 22:42:19,035][569911] Updated weights for policy 0, policy_version 105840 (0.0005) [2023-03-08 22:42:22,682][569911] Updated weights for policy 0, policy_version 105920 (0.0005) [2023-03-08 22:42:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11691.0). Total num frames: 54231040. Throughput: 0: 11893.9. Samples: 54229688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:22,789][569624] Avg episode reward: [(0, '4571.359')] [2023-03-08 22:42:26,111][569911] Updated weights for policy 0, policy_version 106000 (0.0005) [2023-03-08 22:42:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11704.8). Total num frames: 54292480. 
Throughput: 0: 11873.7. Samples: 54265428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:27,789][569624] Avg episode reward: [(0, '4729.404')] [2023-03-08 22:42:29,433][569911] Updated weights for policy 0, policy_version 106080 (0.0005) [2023-03-08 22:42:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11704.8). Total num frames: 54349824. Throughput: 0: 11933.1. Samples: 54338728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:32,789][569624] Avg episode reward: [(0, '4682.215')] [2023-03-08 22:42:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000106152_54349824.pth... [2023-03-08 22:42:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000105464_53997568.pth [2023-03-08 22:42:32,891][569911] Updated weights for policy 0, policy_version 106160 (0.0005) [2023-03-08 22:42:36,296][569911] Updated weights for policy 0, policy_version 106240 (0.0005) [2023-03-08 22:42:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11704.8). Total num frames: 54411264. Throughput: 0: 11911.8. Samples: 54411148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:37,789][569624] Avg episode reward: [(0, '4276.814')] [2023-03-08 22:42:39,692][569911] Updated weights for policy 0, policy_version 106320 (0.0005) [2023-03-08 22:42:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11704.8). Total num frames: 54468608. Throughput: 0: 11847.4. Samples: 54444096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:42,789][569624] Avg episode reward: [(0, '4108.137')] [2023-03-08 22:42:43,270][569911] Updated weights for policy 0, policy_version 106400 (0.0004) [2023-03-08 22:42:46,794][569911] Updated weights for policy 0, policy_version 106480 (0.0004) [2023-03-08 22:42:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11704.8). 
Total num frames: 54525952. Throughput: 0: 11808.8. Samples: 54514672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:47,789][569624] Avg episode reward: [(0, '4223.264')] [2023-03-08 22:42:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000106496_54525952.pth... [2023-03-08 22:42:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000105808_54173696.pth [2023-03-08 22:42:50,255][569911] Updated weights for policy 0, policy_version 106560 (0.0005) [2023-03-08 22:42:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11718.7). Total num frames: 54587392. Throughput: 0: 11799.8. Samples: 54584052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:52,789][569624] Avg episode reward: [(0, '4455.298')] [2023-03-08 22:42:53,764][569911] Updated weights for policy 0, policy_version 106640 (0.0005) [2023-03-08 22:42:57,183][569911] Updated weights for policy 0, policy_version 106720 (0.0005) [2023-03-08 22:42:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11718.7). Total num frames: 54644736. Throughput: 0: 11855.2. Samples: 54621164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:42:57,789][569624] Avg episode reward: [(0, '4294.369')] [2023-03-08 22:43:00,505][569911] Updated weights for policy 0, policy_version 106800 (0.0004) [2023-03-08 22:43:02,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11746.5). Total num frames: 54710272. Throughput: 0: 11880.1. Samples: 54694116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:43:02,789][569624] Avg episode reward: [(0, '4453.004')] [2023-03-08 22:43:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000106856_54710272.pth... 
[2023-03-08 22:43:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000106152_54349824.pth [2023-03-08 22:43:03,853][569911] Updated weights for policy 0, policy_version 106880 (0.0005) [2023-03-08 22:43:07,341][569911] Updated weights for policy 0, policy_version 106960 (0.0005) [2023-03-08 22:43:07,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11732.6). Total num frames: 54767616. Throughput: 0: 11921.0. Samples: 54766132. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:07,789][569624] Avg episode reward: [(0, '4456.665')] [2023-03-08 22:43:10,725][569911] Updated weights for policy 0, policy_version 107040 (0.0005) [2023-03-08 22:43:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11718.7). Total num frames: 54824960. Throughput: 0: 11906.5. Samples: 54801220. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:12,789][569624] Avg episode reward: [(0, '4502.358')] [2023-03-08 22:43:14,381][569911] Updated weights for policy 0, policy_version 107120 (0.0005) [2023-03-08 22:43:17,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 54882304. Throughput: 0: 11836.7. Samples: 54871380. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:17,789][569624] Avg episode reward: [(0, '4454.037')] [2023-03-08 22:43:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000107192_54882304.pth... [2023-03-08 22:43:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000106496_54525952.pth [2023-03-08 22:43:17,845][569911] Updated weights for policy 0, policy_version 107200 (0.0005) [2023-03-08 22:43:21,147][569911] Updated weights for policy 0, policy_version 107280 (0.0005) [2023-03-08 22:43:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11746.5). 
Total num frames: 54943744. Throughput: 0: 11837.1. Samples: 54943816. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:22,789][569624] Avg episode reward: [(0, '4497.675')] [2023-03-08 22:43:24,482][569911] Updated weights for policy 0, policy_version 107360 (0.0005) [2023-03-08 22:43:27,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11746.5). Total num frames: 55005184. Throughput: 0: 11923.9. Samples: 54980672. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:27,789][569624] Avg episode reward: [(0, '4481.779')] [2023-03-08 22:43:27,870][569911] Updated weights for policy 0, policy_version 107440 (0.0005) [2023-03-08 22:43:31,078][569911] Updated weights for policy 0, policy_version 107520 (0.0005) [2023-03-08 22:43:32,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 11760.4). Total num frames: 55070720. Throughput: 0: 12032.9. Samples: 55056156. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:32,789][569624] Avg episode reward: [(0, '4331.122')] [2023-03-08 22:43:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000107560_55070720.pth... [2023-03-08 22:43:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000106856_54710272.pth [2023-03-08 22:43:34,455][569911] Updated weights for policy 0, policy_version 107600 (0.0004) [2023-03-08 22:43:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 55128064. Throughput: 0: 12081.6. Samples: 55127724. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:37,789][569624] Avg episode reward: [(0, '4687.033')] [2023-03-08 22:43:37,985][569911] Updated weights for policy 0, policy_version 107680 (0.0005) [2023-03-08 22:43:41,451][569911] Updated weights for policy 0, policy_version 107760 (0.0005) [2023-03-08 22:43:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11774.3). Total num frames: 55189504. Throughput: 0: 12019.7. Samples: 55162052. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:42,789][569624] Avg episode reward: [(0, '4417.928')] [2023-03-08 22:43:44,898][569911] Updated weights for policy 0, policy_version 107840 (0.0005) [2023-03-08 22:43:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11774.3). Total num frames: 55250944. Throughput: 0: 12076.4. Samples: 55237556. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:47,789][569624] Avg episode reward: [(0, '4685.571')] [2023-03-08 22:43:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000107912_55250944.pth... [2023-03-08 22:43:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000107192_54882304.pth [2023-03-08 22:43:47,945][569911] Updated weights for policy 0, policy_version 107920 (0.0006) [2023-03-08 22:43:51,363][569911] Updated weights for policy 0, policy_version 108000 (0.0005) [2023-03-08 22:43:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11788.1). Total num frames: 55312384. Throughput: 0: 12104.4. Samples: 55310832. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:52,789][569624] Avg episode reward: [(0, '4631.041')] [2023-03-08 22:43:54,721][569911] Updated weights for policy 0, policy_version 108080 (0.0005) [2023-03-08 22:43:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11788.1). Total num frames: 55369728. 
Throughput: 0: 12103.8. Samples: 55345892. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:43:57,789][569624] Avg episode reward: [(0, '4611.221')] [2023-03-08 22:43:58,149][569911] Updated weights for policy 0, policy_version 108160 (0.0005) [2023-03-08 22:44:01,463][569911] Updated weights for policy 0, policy_version 108240 (0.0005) [2023-03-08 22:44:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11802.0). Total num frames: 55431168. Throughput: 0: 12176.5. Samples: 55419324. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:44:02,789][569624] Avg episode reward: [(0, '4655.794')] [2023-03-08 22:44:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000108264_55431168.pth... [2023-03-08 22:44:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000107560_55070720.pth [2023-03-08 22:44:04,996][569911] Updated weights for policy 0, policy_version 108320 (0.0005) [2023-03-08 22:44:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11815.9). Total num frames: 55492608. Throughput: 0: 12194.9. Samples: 55492588. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:44:07,789][569624] Avg episode reward: [(0, '4384.540')] [2023-03-08 22:44:08,178][569911] Updated weights for policy 0, policy_version 108400 (0.0005) [2023-03-08 22:44:11,564][569911] Updated weights for policy 0, policy_version 108480 (0.0005) [2023-03-08 22:44:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11815.9). Total num frames: 55554048. Throughput: 0: 12204.4. Samples: 55529872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:12,789][569624] Avg episode reward: [(0, '4415.043')] [2023-03-08 22:44:15,053][569911] Updated weights for policy 0, policy_version 108560 (0.0005) [2023-03-08 22:44:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 11829.8). 
Total num frames: 55615488. Throughput: 0: 12124.4. Samples: 55601752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:17,789][569624] Avg episode reward: [(0, '4713.252')] [2023-03-08 22:44:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000108624_55615488.pth... [2023-03-08 22:44:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000107912_55250944.pth [2023-03-08 22:44:18,427][569911] Updated weights for policy 0, policy_version 108640 (0.0005) [2023-03-08 22:44:21,806][569911] Updated weights for policy 0, policy_version 108720 (0.0005) [2023-03-08 22:44:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11829.8). Total num frames: 55672832. Throughput: 0: 12113.4. Samples: 55672828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:22,789][569624] Avg episode reward: [(0, '4520.540')] [2023-03-08 22:44:25,267][569911] Updated weights for policy 0, policy_version 108800 (0.0004) [2023-03-08 22:44:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 11843.7). Total num frames: 55734272. Throughput: 0: 12162.8. Samples: 55709376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:27,789][569624] Avg episode reward: [(0, '4459.088')] [2023-03-08 22:44:28,648][569911] Updated weights for policy 0, policy_version 108880 (0.0004) [2023-03-08 22:44:32,173][569911] Updated weights for policy 0, policy_version 108960 (0.0005) [2023-03-08 22:44:32,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11843.7). Total num frames: 55791616. Throughput: 0: 12039.6. Samples: 55779336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:32,789][569624] Avg episode reward: [(0, '4356.701')] [2023-03-08 22:44:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000108968_55791616.pth... 
[2023-03-08 22:44:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000108264_55431168.pth [2023-03-08 22:44:35,551][569911] Updated weights for policy 0, policy_version 109040 (0.0004) [2023-03-08 22:44:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11843.7). Total num frames: 55853056. Throughput: 0: 12032.3. Samples: 55852284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:37,789][569624] Avg episode reward: [(0, '4385.621')] [2023-03-08 22:44:39,052][569911] Updated weights for policy 0, policy_version 109120 (0.0005) [2023-03-08 22:44:42,354][569911] Updated weights for policy 0, policy_version 109200 (0.0005) [2023-03-08 22:44:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11857.6). Total num frames: 55914496. Throughput: 0: 12043.9. Samples: 55887868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:42,789][569624] Avg episode reward: [(0, '4658.009')] [2023-03-08 22:44:45,476][569911] Updated weights for policy 0, policy_version 109280 (0.0004) [2023-03-08 22:44:47,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12151.5, 300 sec: 11885.3). Total num frames: 55980032. Throughput: 0: 12101.7. Samples: 55963900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:47,789][569624] Avg episode reward: [(0, '4785.684')] [2023-03-08 22:44:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000109336_55980032.pth... [2023-03-08 22:44:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000108624_55615488.pth [2023-03-08 22:44:48,863][569911] Updated weights for policy 0, policy_version 109360 (0.0005) [2023-03-08 22:44:52,238][569911] Updated weights for policy 0, policy_version 109440 (0.0005) [2023-03-08 22:44:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11871.5). 
Total num frames: 56037376. Throughput: 0: 12104.1. Samples: 56037272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:52,789][569624] Avg episode reward: [(0, '4648.874')] [2023-03-08 22:44:55,735][569911] Updated weights for policy 0, policy_version 109520 (0.0005) [2023-03-08 22:44:57,789][569624] Fps is (10 sec: 11878.6, 60 sec: 12151.5, 300 sec: 11885.3). Total num frames: 56098816. Throughput: 0: 12047.9. Samples: 56072028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:44:57,789][569624] Avg episode reward: [(0, '4642.190')] [2023-03-08 22:44:58,876][569911] Updated weights for policy 0, policy_version 109600 (0.0005) [2023-03-08 22:45:02,173][569911] Updated weights for policy 0, policy_version 109680 (0.0005) [2023-03-08 22:45:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11899.2). Total num frames: 56160256. Throughput: 0: 12139.6. Samples: 56148032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:45:02,789][569624] Avg episode reward: [(0, '4777.270')] [2023-03-08 22:45:02,830][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000109696_56164352.pth... [2023-03-08 22:45:02,832][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000108968_55791616.pth [2023-03-08 22:45:05,397][569911] Updated weights for policy 0, policy_version 109760 (0.0005) [2023-03-08 22:45:07,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 11913.1). Total num frames: 56221696. Throughput: 0: 12197.0. Samples: 56221692. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:45:07,789][569624] Avg episode reward: [(0, '4705.046')] [2023-03-08 22:45:08,833][569911] Updated weights for policy 0, policy_version 109840 (0.0005) [2023-03-08 22:45:12,309][569911] Updated weights for policy 0, policy_version 109920 (0.0005) [2023-03-08 22:45:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11927.0). Total num frames: 56283136. Throughput: 0: 12203.8. Samples: 56258548. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:12,789][569624] Avg episode reward: [(0, '4734.942')] [2023-03-08 22:45:15,578][569911] Updated weights for policy 0, policy_version 110000 (0.0005) [2023-03-08 22:45:17,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 11927.0). Total num frames: 56344576. Throughput: 0: 12266.2. Samples: 56331316. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:17,789][569624] Avg episode reward: [(0, '4772.652')] [2023-03-08 22:45:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000110048_56344576.pth... [2023-03-08 22:45:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000109336_55980032.pth [2023-03-08 22:45:18,961][569911] Updated weights for policy 0, policy_version 110080 (0.0005) [2023-03-08 22:45:22,373][569911] Updated weights for policy 0, policy_version 110160 (0.0005) [2023-03-08 22:45:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11927.0). Total num frames: 56401920. Throughput: 0: 12255.8. Samples: 56403796. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:22,789][569624] Avg episode reward: [(0, '4658.929')] [2023-03-08 22:45:25,697][569911] Updated weights for policy 0, policy_version 110240 (0.0004) [2023-03-08 22:45:27,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 11940.9). Total num frames: 56467456. 
Throughput: 0: 12296.1. Samples: 56441196. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:27,789][569624] Avg episode reward: [(0, '4795.927')] [2023-03-08 22:45:29,089][569911] Updated weights for policy 0, policy_version 110320 (0.0005) [2023-03-08 22:45:32,401][569911] Updated weights for policy 0, policy_version 110400 (0.0005) [2023-03-08 22:45:32,789][569624] Fps is (10 sec: 12697.7, 60 sec: 12288.0, 300 sec: 11968.7). Total num frames: 56528896. Throughput: 0: 12245.4. Samples: 56514940. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:32,789][569624] Avg episode reward: [(0, '4640.239')] [2023-03-08 22:45:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000110408_56528896.pth... [2023-03-08 22:45:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000109696_56164352.pth [2023-03-08 22:45:35,831][569911] Updated weights for policy 0, policy_version 110480 (0.0005) [2023-03-08 22:45:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 11954.8). Total num frames: 56586240. Throughput: 0: 12196.5. Samples: 56586116. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:37,789][569624] Avg episode reward: [(0, '4540.037')] [2023-03-08 22:45:39,269][569911] Updated weights for policy 0, policy_version 110560 (0.0005) [2023-03-08 22:45:42,722][569911] Updated weights for policy 0, policy_version 110640 (0.0005) [2023-03-08 22:45:42,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 11968.7). Total num frames: 56647680. Throughput: 0: 12212.1. Samples: 56621572. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:42,789][569624] Avg episode reward: [(0, '4679.297')] [2023-03-08 22:45:46,431][569911] Updated weights for policy 0, policy_version 110720 (0.0005) [2023-03-08 22:45:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11982.5). 
Total num frames: 56705024. Throughput: 0: 12036.9. Samples: 56689692. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:47,789][569624] Avg episode reward: [(0, '4703.628')] [2023-03-08 22:45:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000110752_56705024.pth... [2023-03-08 22:45:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000110048_56344576.pth [2023-03-08 22:45:49,830][569911] Updated weights for policy 0, policy_version 110800 (0.0005) [2023-03-08 22:45:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 56762368. Throughput: 0: 12030.3. Samples: 56763056. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:52,789][569624] Avg episode reward: [(0, '4740.373')] [2023-03-08 22:45:53,206][569911] Updated weights for policy 0, policy_version 110880 (0.0005) [2023-03-08 22:45:56,395][569911] Updated weights for policy 0, policy_version 110960 (0.0005) [2023-03-08 22:45:57,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 11982.5). Total num frames: 56827904. Throughput: 0: 12042.9. Samples: 56800480. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:45:57,789][569624] Avg episode reward: [(0, '4602.346')] [2023-03-08 22:45:59,802][569911] Updated weights for policy 0, policy_version 111040 (0.0004) [2023-03-08 22:46:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 56885248. Throughput: 0: 12046.8. Samples: 56873424. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:46:02,789][569624] Avg episode reward: [(0, '4543.255')] [2023-03-08 22:46:02,817][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000111112_56889344.pth... 
[2023-03-08 22:46:02,818][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000110408_56528896.pth [2023-03-08 22:46:03,096][569911] Updated weights for policy 0, policy_version 111120 (0.0005) [2023-03-08 22:46:06,080][569911] Updated weights for policy 0, policy_version 111200 (0.0005) [2023-03-08 22:46:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 56950784. Throughput: 0: 12159.7. Samples: 56950984. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 22:46:07,789][569624] Avg episode reward: [(0, '4741.098')] [2023-03-08 22:46:09,687][569911] Updated weights for policy 0, policy_version 111280 (0.0005) [2023-03-08 22:46:12,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 57012224. Throughput: 0: 12058.8. Samples: 56983840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:12,789][569624] Avg episode reward: [(0, '4728.125')] [2023-03-08 22:46:12,981][569911] Updated weights for policy 0, policy_version 111360 (0.0005) [2023-03-08 22:46:16,306][569911] Updated weights for policy 0, policy_version 111440 (0.0005) [2023-03-08 22:46:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 57073664. Throughput: 0: 12122.6. Samples: 57060456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:17,789][569624] Avg episode reward: [(0, '4798.030')] [2023-03-08 22:46:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000111472_57073664.pth... 
[2023-03-08 22:46:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000110752_56705024.pth [2023-03-08 22:46:19,646][569911] Updated weights for policy 0, policy_version 111520 (0.0005) [2023-03-08 22:46:22,732][569911] Updated weights for policy 0, policy_version 111600 (0.0005) [2023-03-08 22:46:22,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 57139200. Throughput: 0: 12216.7. Samples: 57135868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:22,789][569624] Avg episode reward: [(0, '4450.151')] [2023-03-08 22:46:25,938][569911] Updated weights for policy 0, policy_version 111680 (0.0005) [2023-03-08 22:46:27,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 57200640. Throughput: 0: 12286.5. Samples: 57174464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:27,789][569624] Avg episode reward: [(0, '4729.995')] [2023-03-08 22:46:29,227][569911] Updated weights for policy 0, policy_version 111760 (0.0004) [2023-03-08 22:46:32,547][569911] Updated weights for policy 0, policy_version 111840 (0.0005) [2023-03-08 22:46:32,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 57262080. Throughput: 0: 12446.8. Samples: 57249800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:32,789][569624] Avg episode reward: [(0, '4739.306')] [2023-03-08 22:46:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000111840_57262080.pth... [2023-03-08 22:46:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000111112_56889344.pth [2023-03-08 22:46:36,047][569911] Updated weights for policy 0, policy_version 111920 (0.0005) [2023-03-08 22:46:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12219.8, 300 sec: 12079.7). 
Total num frames: 57319424. Throughput: 0: 12395.8. Samples: 57320868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:37,789][569624] Avg episode reward: [(0, '4573.899')] [2023-03-08 22:46:39,375][569911] Updated weights for policy 0, policy_version 112000 (0.0005) [2023-03-08 22:46:42,674][569911] Updated weights for policy 0, policy_version 112080 (0.0005) [2023-03-08 22:46:42,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12093.6). Total num frames: 57384960. Throughput: 0: 12392.8. Samples: 57358156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:42,789][569624] Avg episode reward: [(0, '4753.262')] [2023-03-08 22:46:46,195][569911] Updated weights for policy 0, policy_version 112160 (0.0004) [2023-03-08 22:46:47,789][569624] Fps is (10 sec: 12287.8, 60 sec: 12288.0, 300 sec: 12093.6). Total num frames: 57442304. Throughput: 0: 12368.1. Samples: 57429988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:47,789][569624] Avg episode reward: [(0, '4675.962')] [2023-03-08 22:46:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000112192_57442304.pth... [2023-03-08 22:46:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000111472_57073664.pth [2023-03-08 22:46:49,684][569911] Updated weights for policy 0, policy_version 112240 (0.0005) [2023-03-08 22:46:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12356.3, 300 sec: 12093.6). Total num frames: 57503744. Throughput: 0: 12245.3. Samples: 57502024. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:52,789][569624] Avg episode reward: [(0, '4383.827')] [2023-03-08 22:46:53,080][569911] Updated weights for policy 0, policy_version 112320 (0.0005) [2023-03-08 22:46:56,406][569911] Updated weights for policy 0, policy_version 112400 (0.0005) [2023-03-08 22:46:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12288.0, 300 sec: 12093.6). Total num frames: 57565184. Throughput: 0: 12310.1. Samples: 57537796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:46:57,789][569624] Avg episode reward: [(0, '4395.068')] [2023-03-08 22:46:59,723][569911] Updated weights for policy 0, policy_version 112480 (0.0005) [2023-03-08 22:47:02,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12288.0, 300 sec: 12093.6). Total num frames: 57622528. Throughput: 0: 12226.2. Samples: 57610636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:02,789][569624] Avg episode reward: [(0, '4601.274')] [2023-03-08 22:47:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000112544_57622528.pth... [2023-03-08 22:47:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000111840_57262080.pth [2023-03-08 22:47:03,267][569911] Updated weights for policy 0, policy_version 112560 (0.0005) [2023-03-08 22:47:07,026][569911] Updated weights for policy 0, policy_version 112640 (0.0005) [2023-03-08 22:47:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 57679872. Throughput: 0: 12066.2. Samples: 57678848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:07,789][569624] Avg episode reward: [(0, '4360.508')] [2023-03-08 22:47:10,432][569911] Updated weights for policy 0, policy_version 112720 (0.0004) [2023-03-08 22:47:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 57737216. 
Throughput: 0: 11984.7. Samples: 57713776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:12,789][569624] Avg episode reward: [(0, '4494.979')] [2023-03-08 22:47:13,875][569911] Updated weights for policy 0, policy_version 112800 (0.0005) [2023-03-08 22:47:17,287][569911] Updated weights for policy 0, policy_version 112880 (0.0005) [2023-03-08 22:47:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 57798656. Throughput: 0: 11917.2. Samples: 57786072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:17,789][569624] Avg episode reward: [(0, '4148.031')] [2023-03-08 22:47:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000112888_57798656.pth... [2023-03-08 22:47:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000112192_57442304.pth [2023-03-08 22:47:20,806][569911] Updated weights for policy 0, policy_version 112960 (0.0005) [2023-03-08 22:47:22,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 57860096. Throughput: 0: 11926.0. Samples: 57857540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:22,789][569624] Avg episode reward: [(0, '3919.625')] [2023-03-08 22:47:24,128][569911] Updated weights for policy 0, policy_version 113040 (0.0005) [2023-03-08 22:47:27,525][569911] Updated weights for policy 0, policy_version 113120 (0.0005) [2023-03-08 22:47:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12093.6). Total num frames: 57917440. Throughput: 0: 11883.8. Samples: 57892928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:27,789][569624] Avg episode reward: [(0, '4439.262')] [2023-03-08 22:47:31,082][569911] Updated weights for policy 0, policy_version 113200 (0.0004) [2023-03-08 22:47:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 12079.7). 
Total num frames: 57974784. Throughput: 0: 11846.0. Samples: 57963060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:32,789][569624] Avg episode reward: [(0, '4310.555')] [2023-03-08 22:47:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000113232_57974784.pth... [2023-03-08 22:47:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000112544_57622528.pth [2023-03-08 22:47:34,548][569911] Updated weights for policy 0, policy_version 113280 (0.0005) [2023-03-08 22:47:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 12079.7). Total num frames: 58032128. Throughput: 0: 11803.3. Samples: 58033172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:37,789][569624] Avg episode reward: [(0, '4213.172')] [2023-03-08 22:47:38,099][569911] Updated weights for policy 0, policy_version 113360 (0.0005) [2023-03-08 22:47:41,457][569911] Updated weights for policy 0, policy_version 113440 (0.0004) [2023-03-08 22:47:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 12093.6). Total num frames: 58093568. Throughput: 0: 11841.5. Samples: 58070664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:42,789][569624] Avg episode reward: [(0, '4359.473')] [2023-03-08 22:47:44,823][569911] Updated weights for policy 0, policy_version 113520 (0.0005) [2023-03-08 22:47:47,789][569624] Fps is (10 sec: 12697.6, 60 sec: 11946.7, 300 sec: 12107.5). Total num frames: 58159104. Throughput: 0: 11870.9. Samples: 58144828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:47,789][569624] Avg episode reward: [(0, '4322.996')] [2023-03-08 22:47:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000113592_58159104.pth... 
[2023-03-08 22:47:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000112888_57798656.pth [2023-03-08 22:47:48,094][569911] Updated weights for policy 0, policy_version 113600 (0.0005) [2023-03-08 22:47:51,685][569911] Updated weights for policy 0, policy_version 113680 (0.0005) [2023-03-08 22:47:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 12107.5). Total num frames: 58216448. Throughput: 0: 11911.0. Samples: 58214844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:52,789][569624] Avg episode reward: [(0, '4188.649')] [2023-03-08 22:47:55,180][569911] Updated weights for policy 0, policy_version 113760 (0.0005) [2023-03-08 22:47:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 12079.7). Total num frames: 58273792. Throughput: 0: 11898.4. Samples: 58249204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:47:57,789][569624] Avg episode reward: [(0, '4590.592')] [2023-03-08 22:47:58,468][569911] Updated weights for policy 0, policy_version 113840 (0.0005) [2023-03-08 22:48:01,751][569911] Updated weights for policy 0, policy_version 113920 (0.0005) [2023-03-08 22:48:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 12093.6). Total num frames: 58335232. Throughput: 0: 11948.2. Samples: 58323744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:48:02,789][569624] Avg episode reward: [(0, '4776.437')] [2023-03-08 22:48:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000113936_58335232.pth... [2023-03-08 22:48:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000113232_57974784.pth [2023-03-08 22:48:05,230][569911] Updated weights for policy 0, policy_version 114000 (0.0005) [2023-03-08 22:48:07,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12107.5). 
Total num frames: 58396672. Throughput: 0: 11958.5. Samples: 58395672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:48:07,789][569624] Avg episode reward: [(0, '4767.655')] [2023-03-08 22:48:08,632][569911] Updated weights for policy 0, policy_version 114080 (0.0005) [2023-03-08 22:48:12,012][569911] Updated weights for policy 0, policy_version 114160 (0.0005) [2023-03-08 22:48:12,789][569624] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 58458112. Throughput: 0: 11982.6. Samples: 58432144. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:12,789][569624] Avg episode reward: [(0, '4615.429')] [2023-03-08 22:48:15,342][569911] Updated weights for policy 0, policy_version 114240 (0.0005) [2023-03-08 22:48:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 58519552. Throughput: 0: 12067.6. Samples: 58506100. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:17,789][569624] Avg episode reward: [(0, '4610.755')] [2023-03-08 22:48:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000114296_58519552.pth... [2023-03-08 22:48:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000113592_58159104.pth [2023-03-08 22:48:18,748][569911] Updated weights for policy 0, policy_version 114320 (0.0005) [2023-03-08 22:48:22,181][569911] Updated weights for policy 0, policy_version 114400 (0.0005) [2023-03-08 22:48:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12107.5). Total num frames: 58576896. Throughput: 0: 12098.7. Samples: 58577616. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:22,789][569624] Avg episode reward: [(0, '4598.571')] [2023-03-08 22:48:25,607][569911] Updated weights for policy 0, policy_version 114480 (0.0005) [2023-03-08 22:48:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 58638336. Throughput: 0: 12050.1. Samples: 58612920. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:27,789][569624] Avg episode reward: [(0, '4704.741')] [2023-03-08 22:48:29,197][569911] Updated weights for policy 0, policy_version 114560 (0.0005) [2023-03-08 22:48:32,598][569911] Updated weights for policy 0, policy_version 114640 (0.0006) [2023-03-08 22:48:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 58695680. Throughput: 0: 11958.9. Samples: 58682980. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:32,789][569624] Avg episode reward: [(0, '4661.405')] [2023-03-08 22:48:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000114640_58695680.pth... [2023-03-08 22:48:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000113936_58335232.pth [2023-03-08 22:48:35,865][569911] Updated weights for policy 0, policy_version 114720 (0.0005) [2023-03-08 22:48:37,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 58761216. Throughput: 0: 12080.8. Samples: 58758480. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:37,789][569624] Avg episode reward: [(0, '4732.066')] [2023-03-08 22:48:39,021][569911] Updated weights for policy 0, policy_version 114800 (0.0005) [2023-03-08 22:48:42,463][569911] Updated weights for policy 0, policy_version 114880 (0.0005) [2023-03-08 22:48:42,789][569624] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 58822656. 
Throughput: 0: 12154.1. Samples: 58796140. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:42,789][569624] Avg episode reward: [(0, '4612.542')] [2023-03-08 22:48:45,685][569911] Updated weights for policy 0, policy_version 114960 (0.0005) [2023-03-08 22:48:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 58884096. Throughput: 0: 12141.0. Samples: 58870088. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:47,789][569624] Avg episode reward: [(0, '4716.877')] [2023-03-08 22:48:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000115008_58884096.pth... [2023-03-08 22:48:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000114296_58519552.pth [2023-03-08 22:48:49,018][569911] Updated weights for policy 0, policy_version 115040 (0.0004) [2023-03-08 22:48:52,487][569911] Updated weights for policy 0, policy_version 115120 (0.0004) [2023-03-08 22:48:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 58941440. Throughput: 0: 12130.3. Samples: 58941536. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:52,789][569624] Avg episode reward: [(0, '4714.143')] [2023-03-08 22:48:56,056][569911] Updated weights for policy 0, policy_version 115200 (0.0005) [2023-03-08 22:48:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 59002880. Throughput: 0: 12090.3. Samples: 58976208. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:48:57,789][569624] Avg episode reward: [(0, '4636.105')] [2023-03-08 22:48:59,515][569911] Updated weights for policy 0, policy_version 115280 (0.0005) [2023-03-08 22:49:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 59060224. Throughput: 0: 12041.0. Samples: 59047944. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:49:02,789][569624] Avg episode reward: [(0, '4616.634')] [2023-03-08 22:49:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000115352_59060224.pth... [2023-03-08 22:49:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000114640_58695680.pth [2023-03-08 22:49:02,915][569911] Updated weights for policy 0, policy_version 115360 (0.0004) [2023-03-08 22:49:06,420][569911] Updated weights for policy 0, policy_version 115440 (0.0005) [2023-03-08 22:49:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 59121664. Throughput: 0: 12021.1. Samples: 59118564. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:49:07,789][569624] Avg episode reward: [(0, '4621.769')] [2023-03-08 22:49:09,866][569911] Updated weights for policy 0, policy_version 115520 (0.0005) [2023-03-08 22:49:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 59179008. Throughput: 0: 12035.0. Samples: 59154496. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:49:12,789][569624] Avg episode reward: [(0, '4519.850')] [2023-03-08 22:49:13,299][569911] Updated weights for policy 0, policy_version 115600 (0.0005) [2023-03-08 22:49:17,099][569911] Updated weights for policy 0, policy_version 115680 (0.0005) [2023-03-08 22:49:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 59236352. Throughput: 0: 12011.5. Samples: 59223496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:49:17,789][569624] Avg episode reward: [(0, '4540.910')] [2023-03-08 22:49:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000115696_59236352.pth... 
[2023-03-08 22:49:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000115008_58884096.pth [2023-03-08 22:49:20,526][569911] Updated weights for policy 0, policy_version 115760 (0.0004) [2023-03-08 22:49:22,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 59289600. Throughput: 0: 11835.6. Samples: 59291084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:49:22,789][569624] Avg episode reward: [(0, '4462.948')] [2023-03-08 22:49:24,255][569911] Updated weights for policy 0, policy_version 115840 (0.0005) [2023-03-08 22:49:27,720][569911] Updated weights for policy 0, policy_version 115920 (0.0005) [2023-03-08 22:49:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 59351040. Throughput: 0: 11788.5. Samples: 59326624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:49:27,789][569624] Avg episode reward: [(0, '4233.794')] [2023-03-08 22:49:31,152][569911] Updated weights for policy 0, policy_version 116000 (0.0006) [2023-03-08 22:49:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 59408384. Throughput: 0: 11700.6. Samples: 59396616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:49:32,789][569624] Avg episode reward: [(0, '4371.689')] [2023-03-08 22:49:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000116032_59408384.pth... [2023-03-08 22:49:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000115352_59060224.pth [2023-03-08 22:49:34,672][569911] Updated weights for policy 0, policy_version 116080 (0.0005) [2023-03-08 22:49:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 12038.1). Total num frames: 59465728. Throughput: 0: 11650.1. Samples: 59465792. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:49:37,789][569624] Avg episode reward: [(0, '4478.177')] [2023-03-08 22:49:38,258][569911] Updated weights for policy 0, policy_version 116160 (0.0004) [2023-03-08 22:49:41,580][569911] Updated weights for policy 0, policy_version 116240 (0.0004) [2023-03-08 22:49:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 12024.2). Total num frames: 59527168. Throughput: 0: 11696.5. Samples: 59502552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:49:42,789][569624] Avg episode reward: [(0, '4475.136')] [2023-03-08 22:49:44,848][569911] Updated weights for policy 0, policy_version 116320 (0.0005) [2023-03-08 22:49:47,789][569624] Fps is (10 sec: 12697.5, 60 sec: 11810.1, 300 sec: 12052.0). Total num frames: 59592704. Throughput: 0: 11775.4. Samples: 59577836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:49:47,789][569624] Avg episode reward: [(0, '4573.589')] [2023-03-08 22:49:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000116392_59592704.pth... [2023-03-08 22:49:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000115696_59236352.pth [2023-03-08 22:49:48,157][569911] Updated weights for policy 0, policy_version 116400 (0.0004) [2023-03-08 22:49:51,729][569911] Updated weights for policy 0, policy_version 116480 (0.0005) [2023-03-08 22:49:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 12024.2). Total num frames: 59645952. Throughput: 0: 11723.2. Samples: 59646108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:49:52,789][569624] Avg episode reward: [(0, '4413.853')] [2023-03-08 22:49:55,241][569911] Updated weights for policy 0, policy_version 116560 (0.0005) [2023-03-08 22:49:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11741.8, 300 sec: 12024.2). Total num frames: 59707392. 
Throughput: 0: 11740.2. Samples: 59682804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:49:57,789][569624] Avg episode reward: [(0, '4473.685')] [2023-03-08 22:49:58,568][569911] Updated weights for policy 0, policy_version 116640 (0.0005) [2023-03-08 22:50:02,134][569911] Updated weights for policy 0, policy_version 116720 (0.0005) [2023-03-08 22:50:02,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 12010.3). Total num frames: 59764736. Throughput: 0: 11771.3. Samples: 59753204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:02,789][569624] Avg episode reward: [(0, '4610.146')] [2023-03-08 22:50:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000116728_59764736.pth... [2023-03-08 22:50:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000116032_59408384.pth [2023-03-08 22:50:05,818][569911] Updated weights for policy 0, policy_version 116800 (0.0005) [2023-03-08 22:50:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11996.4). Total num frames: 59822080. Throughput: 0: 11799.3. Samples: 59822052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:07,789][569624] Avg episode reward: [(0, '4482.849')] [2023-03-08 22:50:09,566][569911] Updated weights for policy 0, policy_version 116880 (0.0005) [2023-03-08 22:50:12,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11605.3, 300 sec: 11968.7). Total num frames: 59875328. Throughput: 0: 11726.9. Samples: 59854336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:12,789][569624] Avg episode reward: [(0, '4488.571')] [2023-03-08 22:50:13,148][569911] Updated weights for policy 0, policy_version 116960 (0.0005) [2023-03-08 22:50:16,550][569911] Updated weights for policy 0, policy_version 117040 (0.0004) [2023-03-08 22:50:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11982.5). 
Total num frames: 59936768. Throughput: 0: 11730.5. Samples: 59924488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:17,789][569624] Avg episode reward: [(0, '4572.324')] [2023-03-08 22:50:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000117064_59936768.pth... [2023-03-08 22:50:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000116392_59592704.pth [2023-03-08 22:50:19,895][569911] Updated weights for policy 0, policy_version 117120 (0.0005) [2023-03-08 22:50:22,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11968.7). Total num frames: 59998208. Throughput: 0: 11847.7. Samples: 59998940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:22,789][569624] Avg episode reward: [(0, '4602.780')] [2023-03-08 22:50:23,123][569911] Updated weights for policy 0, policy_version 117200 (0.0004) [2023-03-08 22:50:26,538][569911] Updated weights for policy 0, policy_version 117280 (0.0004) [2023-03-08 22:50:27,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11968.6). Total num frames: 60059648. Throughput: 0: 11835.4. Samples: 60035144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:27,789][569624] Avg episode reward: [(0, '4571.864')] [2023-03-08 22:50:30,072][569911] Updated weights for policy 0, policy_version 117360 (0.0004) [2023-03-08 22:50:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11968.6). Total num frames: 60116992. Throughput: 0: 11708.4. Samples: 60104712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:32,789][569624] Avg episode reward: [(0, '4301.209')] [2023-03-08 22:50:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000117416_60116992.pth... 
[2023-03-08 22:50:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000116728_59764736.pth [2023-03-08 22:50:33,658][569911] Updated weights for policy 0, policy_version 117440 (0.0004) [2023-03-08 22:50:36,930][569911] Updated weights for policy 0, policy_version 117520 (0.0004) [2023-03-08 22:50:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 60178432. Throughput: 0: 11819.6. Samples: 60177988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:37,789][569624] Avg episode reward: [(0, '4554.756')] [2023-03-08 22:50:40,289][569911] Updated weights for policy 0, policy_version 117600 (0.0005) [2023-03-08 22:50:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11968.6). Total num frames: 60235776. Throughput: 0: 11823.6. Samples: 60214868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:42,789][569624] Avg episode reward: [(0, '4572.388')] [2023-03-08 22:50:44,097][569911] Updated weights for policy 0, policy_version 117680 (0.0005) [2023-03-08 22:50:47,698][569911] Updated weights for policy 0, policy_version 117760 (0.0005) [2023-03-08 22:50:47,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11968.7). Total num frames: 60293120. Throughput: 0: 11719.4. Samples: 60280576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:47,789][569624] Avg episode reward: [(0, '4390.389')] [2023-03-08 22:50:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000117760_60293120.pth... [2023-03-08 22:50:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000117064_59936768.pth [2023-03-08 22:50:51,101][569911] Updated weights for policy 0, policy_version 117840 (0.0005) [2023-03-08 22:50:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11954.8). 
Total num frames: 60354560. Throughput: 0: 11771.5. Samples: 60351772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:52,789][569624] Avg episode reward: [(0, '4587.480')] [2023-03-08 22:50:54,523][569911] Updated weights for policy 0, policy_version 117920 (0.0004) [2023-03-08 22:50:57,784][569911] Updated weights for policy 0, policy_version 118000 (0.0005) [2023-03-08 22:50:57,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11968.6). Total num frames: 60416000. Throughput: 0: 11844.4. Samples: 60387336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:50:57,789][569624] Avg episode reward: [(0, '4684.614')] [2023-03-08 22:51:01,403][569911] Updated weights for policy 0, policy_version 118080 (0.0005) [2023-03-08 22:51:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 60473344. Throughput: 0: 11870.5. Samples: 60458660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:51:02,789][569624] Avg episode reward: [(0, '4502.929')] [2023-03-08 22:51:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000118112_60473344.pth... [2023-03-08 22:51:02,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000117416_60116992.pth [2023-03-08 22:51:04,756][569911] Updated weights for policy 0, policy_version 118160 (0.0005) [2023-03-08 22:51:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11927.0). Total num frames: 60530688. Throughput: 0: 11776.6. Samples: 60528888. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:51:07,789][569624] Avg episode reward: [(0, '4561.804')] [2023-03-08 22:51:08,364][569911] Updated weights for policy 0, policy_version 118240 (0.0004) [2023-03-08 22:51:11,688][569911] Updated weights for policy 0, policy_version 118320 (0.0004) [2023-03-08 22:51:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11927.0). Total num frames: 60592128. Throughput: 0: 11828.9. Samples: 60567444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:51:12,789][569624] Avg episode reward: [(0, '4587.085')] [2023-03-08 22:51:15,081][569911] Updated weights for policy 0, policy_version 118400 (0.0005) [2023-03-08 22:51:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 60649472. Throughput: 0: 11867.0. Samples: 60638728. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:51:17,789][569624] Avg episode reward: [(0, '4483.228')] [2023-03-08 22:51:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000118456_60649472.pth... [2023-03-08 22:51:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000117760_60293120.pth [2023-03-08 22:51:18,567][569911] Updated weights for policy 0, policy_version 118480 (0.0005) [2023-03-08 22:51:22,060][569911] Updated weights for policy 0, policy_version 118560 (0.0005) [2023-03-08 22:51:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 60710912. Throughput: 0: 11772.7. Samples: 60707760. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:51:22,789][569624] Avg episode reward: [(0, '4494.607')] [2023-03-08 22:51:25,496][569911] Updated weights for policy 0, policy_version 118640 (0.0005) [2023-03-08 22:51:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 60768256. 
Throughput: 0: 11753.0. Samples: 60743752. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:51:27,789][569624] Avg episode reward: [(0, '4606.418')] [2023-03-08 22:51:29,013][569911] Updated weights for policy 0, policy_version 118720 (0.0004) [2023-03-08 22:51:32,557][569911] Updated weights for policy 0, policy_version 118800 (0.0005) [2023-03-08 22:51:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 60825600. Throughput: 0: 11850.8. Samples: 60813864. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:51:32,789][569624] Avg episode reward: [(0, '4580.912')] [2023-03-08 22:51:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000118800_60825600.pth... [2023-03-08 22:51:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000118112_60473344.pth [2023-03-08 22:51:36,068][569911] Updated weights for policy 0, policy_version 118880 (0.0005) [2023-03-08 22:51:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 60887040. Throughput: 0: 11849.3. Samples: 60884988. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:51:37,789][569624] Avg episode reward: [(0, '4809.574')] [2023-03-08 22:51:39,309][569911] Updated weights for policy 0, policy_version 118960 (0.0005) [2023-03-08 22:51:42,314][569911] Updated weights for policy 0, policy_version 119040 (0.0005) [2023-03-08 22:51:42,789][569624] Fps is (10 sec: 12697.7, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 60952576. Throughput: 0: 11923.9. Samples: 60923912. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:51:42,789][569624] Avg episode reward: [(0, '4750.639')] [2023-03-08 22:51:45,734][569911] Updated weights for policy 0, policy_version 119120 (0.0005) [2023-03-08 22:51:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.6, 300 sec: 11885.3). 
Total num frames: 61009920. Throughput: 0: 11981.4. Samples: 60997824. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:51:47,789][569624] Avg episode reward: [(0, '4679.600')] [2023-03-08 22:51:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000119160_61009920.pth... [2023-03-08 22:51:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000118456_60649472.pth [2023-03-08 22:51:49,200][569911] Updated weights for policy 0, policy_version 119200 (0.0005) [2023-03-08 22:51:52,683][569911] Updated weights for policy 0, policy_version 119280 (0.0005) [2023-03-08 22:51:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11885.3). Total num frames: 61071360. Throughput: 0: 12028.9. Samples: 61070188. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:51:52,789][569624] Avg episode reward: [(0, '4661.154')] [2023-03-08 22:51:56,196][569911] Updated weights for policy 0, policy_version 119360 (0.0005) [2023-03-08 22:51:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 61128704. Throughput: 0: 11929.4. Samples: 61104268. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:51:57,789][569624] Avg episode reward: [(0, '4668.327')] [2023-03-08 22:51:59,748][569911] Updated weights for policy 0, policy_version 119440 (0.0005) [2023-03-08 22:52:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 61186048. Throughput: 0: 11888.8. Samples: 61173724. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:52:02,789][569624] Avg episode reward: [(0, '4372.017')] [2023-03-08 22:52:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000119504_61186048.pth... 
[2023-03-08 22:52:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000118800_60825600.pth [2023-03-08 22:52:03,456][569911] Updated weights for policy 0, policy_version 119520 (0.0006) [2023-03-08 22:52:06,933][569911] Updated weights for policy 0, policy_version 119600 (0.0005) [2023-03-08 22:52:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 61243392. Throughput: 0: 11873.8. Samples: 61242080. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:52:07,789][569624] Avg episode reward: [(0, '4721.265')] [2023-03-08 22:52:10,148][569911] Updated weights for policy 0, policy_version 119680 (0.0005) [2023-03-08 22:52:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 61304832. Throughput: 0: 11922.6. Samples: 61280268. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:52:12,789][569624] Avg episode reward: [(0, '4295.677')] [2023-03-08 22:52:13,681][569911] Updated weights for policy 0, policy_version 119760 (0.0005) [2023-03-08 22:52:17,244][569911] Updated weights for policy 0, policy_version 119840 (0.0005) [2023-03-08 22:52:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 61362176. Throughput: 0: 11911.9. Samples: 61349900. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 22:52:17,789][569624] Avg episode reward: [(0, '4467.374')] [2023-03-08 22:52:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000119848_61362176.pth... [2023-03-08 22:52:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000119160_61009920.pth [2023-03-08 22:52:20,639][569911] Updated weights for policy 0, policy_version 119920 (0.0005) [2023-03-08 22:52:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11885.3). 
Total num frames: 61423616. Throughput: 0: 11903.2. Samples: 61420632. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:52:22,789][569624] Avg episode reward: [(0, '4333.925')] [2023-03-08 22:52:24,148][569911] Updated weights for policy 0, policy_version 120000 (0.0005) [2023-03-08 22:52:27,785][569911] Updated weights for policy 0, policy_version 120080 (0.0005) [2023-03-08 22:52:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 61480960. Throughput: 0: 11832.7. Samples: 61456384. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:52:27,789][569624] Avg episode reward: [(0, '4518.771')] [2023-03-08 22:52:31,129][569911] Updated weights for policy 0, policy_version 120160 (0.0005) [2023-03-08 22:52:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 61538304. Throughput: 0: 11742.4. Samples: 61526232. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:52:32,789][569624] Avg episode reward: [(0, '4616.513')] [2023-03-08 22:52:32,813][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000120200_61542400.pth... [2023-03-08 22:52:32,815][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000119504_61186048.pth [2023-03-08 22:52:34,455][569911] Updated weights for policy 0, policy_version 120240 (0.0005) [2023-03-08 22:52:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 61599744. Throughput: 0: 11755.2. Samples: 61599172. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:52:37,789][569624] Avg episode reward: [(0, '4514.768')] [2023-03-08 22:52:37,975][569911] Updated weights for policy 0, policy_version 120320 (0.0004) [2023-03-08 22:52:41,517][569911] Updated weights for policy 0, policy_version 120400 (0.0005) [2023-03-08 22:52:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 61657088. Throughput: 0: 11761.1. Samples: 61633516. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:52:42,789][569624] Avg episode reward: [(0, '4649.333')] [2023-03-08 22:52:44,998][569911] Updated weights for policy 0, policy_version 120480 (0.0005) [2023-03-08 22:52:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 61714432. Throughput: 0: 11786.3. Samples: 61704108. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:52:47,789][569624] Avg episode reward: [(0, '4655.146')] [2023-03-08 22:52:47,824][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000120544_61718528.pth... [2023-03-08 22:52:47,825][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000119848_61362176.pth [2023-03-08 22:52:48,507][569911] Updated weights for policy 0, policy_version 120560 (0.0005) [2023-03-08 22:52:52,093][569911] Updated weights for policy 0, policy_version 120640 (0.0005) [2023-03-08 22:52:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11871.5). Total num frames: 61775872. Throughput: 0: 11793.5. Samples: 61772788. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:52:52,789][569624] Avg episode reward: [(0, '4687.295')] [2023-03-08 22:52:55,311][569911] Updated weights for policy 0, policy_version 120720 (0.0005) [2023-03-08 22:52:57,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 61833216. 
Throughput: 0: 11815.1. Samples: 61811948. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:52:57,789][569624] Avg episode reward: [(0, '4403.319')] [2023-03-08 22:52:58,856][569911] Updated weights for policy 0, policy_version 120800 (0.0005) [2023-03-08 22:53:02,434][569911] Updated weights for policy 0, policy_version 120880 (0.0005) [2023-03-08 22:53:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 61890560. Throughput: 0: 11790.0. Samples: 61880448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:53:02,789][569624] Avg episode reward: [(0, '4712.101')] [2023-03-08 22:53:02,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000120888_61894656.pth... [2023-03-08 22:53:02,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000120200_61542400.pth [2023-03-08 22:53:06,014][569911] Updated weights for policy 0, policy_version 120960 (0.0005) [2023-03-08 22:53:07,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 61952000. Throughput: 0: 11772.1. Samples: 61950376. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:53:07,789][569624] Avg episode reward: [(0, '4804.658')] [2023-03-08 22:53:09,390][569911] Updated weights for policy 0, policy_version 121040 (0.0005) [2023-03-08 22:53:12,780][569911] Updated weights for policy 0, policy_version 121120 (0.0005) [2023-03-08 22:53:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 62013440. Throughput: 0: 11757.1. Samples: 61985456. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:53:12,789][569624] Avg episode reward: [(0, '4774.685')] [2023-03-08 22:53:16,225][569911] Updated weights for policy 0, policy_version 121200 (0.0005) [2023-03-08 22:53:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11843.7). 
Total num frames: 62070784. Throughput: 0: 11827.7. Samples: 62058480. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 22:53:17,789][569624] Avg episode reward: [(0, '4725.438')] [2023-03-08 22:53:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000121232_62070784.pth... [2023-03-08 22:53:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000120544_61718528.pth [2023-03-08 22:53:19,786][569911] Updated weights for policy 0, policy_version 121280 (0.0005) [2023-03-08 22:53:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 62128128. Throughput: 0: 11753.6. Samples: 62128084. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:53:22,789][569624] Avg episode reward: [(0, '4821.340')] [2023-03-08 22:53:22,790][569867] Saving new best policy, reward=4821.340! [2023-03-08 22:53:23,317][569911] Updated weights for policy 0, policy_version 121360 (0.0005) [2023-03-08 22:53:27,069][569911] Updated weights for policy 0, policy_version 121440 (0.0005) [2023-03-08 22:53:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 62185472. Throughput: 0: 11719.7. Samples: 62160904. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:53:27,789][569624] Avg episode reward: [(0, '4680.579')] [2023-03-08 22:53:30,350][569911] Updated weights for policy 0, policy_version 121520 (0.0005) [2023-03-08 22:53:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 62246912. Throughput: 0: 11737.0. Samples: 62232276. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:53:32,789][569624] Avg episode reward: [(0, '4689.161')] [2023-03-08 22:53:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000121576_62246912.pth... 
[2023-03-08 22:53:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000120888_61894656.pth [2023-03-08 22:53:33,754][569911] Updated weights for policy 0, policy_version 121600 (0.0005) [2023-03-08 22:53:37,302][569911] Updated weights for policy 0, policy_version 121680 (0.0005) [2023-03-08 22:53:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 62304256. Throughput: 0: 11801.6. Samples: 62303860. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:53:37,789][569624] Avg episode reward: [(0, '4719.354')] [2023-03-08 22:53:40,666][569911] Updated weights for policy 0, policy_version 121760 (0.0004) [2023-03-08 22:53:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 62365696. Throughput: 0: 11728.8. Samples: 62339744. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:53:42,789][569624] Avg episode reward: [(0, '4634.885')] [2023-03-08 22:53:44,032][569911] Updated weights for policy 0, policy_version 121840 (0.0005) [2023-03-08 22:53:47,376][569911] Updated weights for policy 0, policy_version 121920 (0.0005) [2023-03-08 22:53:47,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11815.9). Total num frames: 62427136. Throughput: 0: 11784.7. Samples: 62410760. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:53:47,789][569624] Avg episode reward: [(0, '4738.006')] [2023-03-08 22:53:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000121928_62427136.pth... [2023-03-08 22:53:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000121232_62070784.pth [2023-03-08 22:53:50,902][569911] Updated weights for policy 0, policy_version 122000 (0.0004) [2023-03-08 22:53:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11802.0). 
Total num frames: 62484480. Throughput: 0: 11855.7. Samples: 62483884. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:53:52,789][569624] Avg episode reward: [(0, '4737.144')] [2023-03-08 22:53:54,303][569911] Updated weights for policy 0, policy_version 122080 (0.0005) [2023-03-08 22:53:57,602][569911] Updated weights for policy 0, policy_version 122160 (0.0005) [2023-03-08 22:53:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11815.9). Total num frames: 62545920. Throughput: 0: 11856.0. Samples: 62518976. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:53:57,789][569624] Avg episode reward: [(0, '4774.368')] [2023-03-08 22:54:01,138][569911] Updated weights for policy 0, policy_version 122240 (0.0005) [2023-03-08 22:54:02,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11815.9). Total num frames: 62607360. Throughput: 0: 11856.4. Samples: 62592016. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:02,789][569624] Avg episode reward: [(0, '4779.706')] [2023-03-08 22:54:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000122280_62607360.pth... [2023-03-08 22:54:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000121576_62246912.pth [2023-03-08 22:54:04,519][569911] Updated weights for policy 0, policy_version 122320 (0.0005) [2023-03-08 22:54:07,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11815.9). Total num frames: 62664704. Throughput: 0: 11925.2. Samples: 62664716. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:07,789][569624] Avg episode reward: [(0, '4775.189')] [2023-03-08 22:54:07,883][569911] Updated weights for policy 0, policy_version 122400 (0.0005) [2023-03-08 22:54:11,259][569911] Updated weights for policy 0, policy_version 122480 (0.0005) [2023-03-08 22:54:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 62726144. Throughput: 0: 12013.0. Samples: 62701488. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:12,789][569624] Avg episode reward: [(0, '4770.491')] [2023-03-08 22:54:14,599][569911] Updated weights for policy 0, policy_version 122560 (0.0004) [2023-03-08 22:54:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 62783488. Throughput: 0: 12016.2. Samples: 62773004. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:17,789][569624] Avg episode reward: [(0, '4722.358')] [2023-03-08 22:54:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000122624_62783488.pth... [2023-03-08 22:54:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000121928_62427136.pth [2023-03-08 22:54:18,140][569911] Updated weights for policy 0, policy_version 122640 (0.0005) [2023-03-08 22:54:21,518][569911] Updated weights for policy 0, policy_version 122720 (0.0005) [2023-03-08 22:54:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 62844928. Throughput: 0: 12023.7. Samples: 62844928. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:22,789][569624] Avg episode reward: [(0, '4780.374')] [2023-03-08 22:54:25,078][569911] Updated weights for policy 0, policy_version 122800 (0.0005) [2023-03-08 22:54:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 62902272. 
Throughput: 0: 11966.6. Samples: 62878240. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:27,789][569624] Avg episode reward: [(0, '4837.950')] [2023-03-08 22:54:27,812][569867] Saving new best policy, reward=4837.950! [2023-03-08 22:54:28,583][569911] Updated weights for policy 0, policy_version 122880 (0.0005) [2023-03-08 22:54:31,835][569911] Updated weights for policy 0, policy_version 122960 (0.0005) [2023-03-08 22:54:32,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11857.6). Total num frames: 62963712. Throughput: 0: 12037.8. Samples: 62952460. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:32,789][569624] Avg episode reward: [(0, '4627.589')] [2023-03-08 22:54:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000122976_62963712.pth... [2023-03-08 22:54:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000122280_62607360.pth [2023-03-08 22:54:35,285][569911] Updated weights for policy 0, policy_version 123040 (0.0005) [2023-03-08 22:54:37,789][569624] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11857.6). Total num frames: 63025152. Throughput: 0: 11978.4. Samples: 63022912. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:37,789][569624] Avg episode reward: [(0, '4594.181')] [2023-03-08 22:54:38,734][569911] Updated weights for policy 0, policy_version 123120 (0.0005) [2023-03-08 22:54:42,273][569911] Updated weights for policy 0, policy_version 123200 (0.0005) [2023-03-08 22:54:42,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11829.8). Total num frames: 63082496. Throughput: 0: 11984.5. Samples: 63058280. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:42,789][569624] Avg episode reward: [(0, '4702.309')] [2023-03-08 22:54:45,478][569911] Updated weights for policy 0, policy_version 123280 (0.0004) [2023-03-08 22:54:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11857.6). Total num frames: 63143936. Throughput: 0: 12002.7. Samples: 63132136. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:47,789][569624] Avg episode reward: [(0, '4699.972')] [2023-03-08 22:54:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000123328_63143936.pth... [2023-03-08 22:54:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000122624_62783488.pth [2023-03-08 22:54:48,983][569911] Updated weights for policy 0, policy_version 123360 (0.0004) [2023-03-08 22:54:52,687][569911] Updated weights for policy 0, policy_version 123440 (0.0005) [2023-03-08 22:54:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 63201280. Throughput: 0: 11904.1. Samples: 63200400. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:52,789][569624] Avg episode reward: [(0, '4570.319')] [2023-03-08 22:54:56,296][569911] Updated weights for policy 0, policy_version 123520 (0.0005) [2023-03-08 22:54:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 63258624. Throughput: 0: 11834.9. Samples: 63234056. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:54:57,789][569624] Avg episode reward: [(0, '4626.956')] [2023-03-08 22:54:59,732][569911] Updated weights for policy 0, policy_version 123600 (0.0005) [2023-03-08 22:55:02,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.2, 300 sec: 11843.7). Total num frames: 63315968. Throughput: 0: 11830.1. Samples: 63305356. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:55:02,799][569624] Avg episode reward: [(0, '4600.811')] [2023-03-08 22:55:02,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000123664_63315968.pth... [2023-03-08 22:55:02,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000122976_62963712.pth [2023-03-08 22:55:03,213][569911] Updated weights for policy 0, policy_version 123680 (0.0005) [2023-03-08 22:55:06,711][569911] Updated weights for policy 0, policy_version 123760 (0.0005) [2023-03-08 22:55:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 63377408. Throughput: 0: 11784.8. Samples: 63375244. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:55:07,800][569624] Avg episode reward: [(0, '4578.319')] [2023-03-08 22:55:09,953][569911] Updated weights for policy 0, policy_version 123840 (0.0005) [2023-03-08 22:55:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 63438848. Throughput: 0: 11900.7. Samples: 63413772. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:55:12,799][569624] Avg episode reward: [(0, '4469.871')] [2023-03-08 22:55:13,122][569911] Updated weights for policy 0, policy_version 123920 (0.0005) [2023-03-08 22:55:16,374][569911] Updated weights for policy 0, policy_version 124000 (0.0005) [2023-03-08 22:55:17,789][569624] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 11885.3). Total num frames: 63504384. Throughput: 0: 11949.7. Samples: 63490200. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:55:17,800][569624] Avg episode reward: [(0, '4676.618')] [2023-03-08 22:55:17,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000124032_63504384.pth... 
[2023-03-08 22:55:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000123328_63143936.pth [2023-03-08 22:55:19,809][569911] Updated weights for policy 0, policy_version 124080 (0.0005) [2023-03-08 22:55:22,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 63561728. Throughput: 0: 11973.9. Samples: 63561736. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-08 22:55:22,800][569624] Avg episode reward: [(0, '4631.447')] [2023-03-08 22:55:23,207][569911] Updated weights for policy 0, policy_version 124160 (0.0005) [2023-03-08 22:55:26,982][569911] Updated weights for policy 0, policy_version 124240 (0.0005) [2023-03-08 22:55:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 63619072. Throughput: 0: 11962.2. Samples: 63596580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:55:27,800][569624] Avg episode reward: [(0, '4444.490')] [2023-03-08 22:55:30,433][569911] Updated weights for policy 0, policy_version 124320 (0.0005) [2023-03-08 22:55:32,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 63676416. Throughput: 0: 11859.8. Samples: 63665824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:55:32,799][569624] Avg episode reward: [(0, '4731.282')] [2023-03-08 22:55:32,805][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000124376_63680512.pth... [2023-03-08 22:55:32,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000123664_63315968.pth [2023-03-08 22:55:33,834][569911] Updated weights for policy 0, policy_version 124400 (0.0004) [2023-03-08 22:55:37,043][569911] Updated weights for policy 0, policy_version 124480 (0.0005) [2023-03-08 22:55:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11885.3). 
Total num frames: 63741952. Throughput: 0: 12008.3. Samples: 63740772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:55:37,800][569624] Avg episode reward: [(0, '4713.013')] [2023-03-08 22:55:40,540][569911] Updated weights for policy 0, policy_version 124560 (0.0006) [2023-03-08 22:55:42,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11885.3). Total num frames: 63799296. Throughput: 0: 12016.1. Samples: 63774784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:55:42,789][569624] Avg episode reward: [(0, '4607.268')] [2023-03-08 22:55:44,078][569911] Updated weights for policy 0, policy_version 124640 (0.0006) [2023-03-08 22:55:47,623][569911] Updated weights for policy 0, policy_version 124720 (0.0005) [2023-03-08 22:55:47,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 63856640. Throughput: 0: 11971.1. Samples: 63844056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:55:47,789][569624] Avg episode reward: [(0, '4549.758')] [2023-03-08 22:55:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000124720_63856640.pth... [2023-03-08 22:55:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000124032_63504384.pth [2023-03-08 22:55:50,991][569911] Updated weights for policy 0, policy_version 124800 (0.0005) [2023-03-08 22:55:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 63918080. Throughput: 0: 12014.1. Samples: 63915880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:55:52,790][569624] Avg episode reward: [(0, '4398.770')] [2023-03-08 22:55:54,387][569911] Updated weights for policy 0, policy_version 124880 (0.0005) [2023-03-08 22:55:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11871.5). Total num frames: 63975424. Throughput: 0: 11956.2. 
Samples: 63951804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:55:57,800][569624] Avg episode reward: [(0, '4544.606')] [2023-03-08 22:55:58,002][569911] Updated weights for policy 0, policy_version 124960 (0.0004) [2023-03-08 22:56:01,455][569911] Updated weights for policy 0, policy_version 125040 (0.0005) [2023-03-08 22:56:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11885.3). Total num frames: 64036864. Throughput: 0: 11798.0. Samples: 64021108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:56:02,789][569624] Avg episode reward: [(0, '4632.609')] [2023-03-08 22:56:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000125072_64036864.pth... [2023-03-08 22:56:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000124376_63680512.pth [2023-03-08 22:56:04,833][569911] Updated weights for policy 0, policy_version 125120 (0.0005) [2023-03-08 22:56:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 64094208. Throughput: 0: 11782.4. Samples: 64091944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:56:07,800][569624] Avg episode reward: [(0, '4299.518')] [2023-03-08 22:56:08,435][569911] Updated weights for policy 0, policy_version 125200 (0.0004) [2023-03-08 22:56:12,139][569911] Updated weights for policy 0, policy_version 125280 (0.0005) [2023-03-08 22:56:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 64151552. Throughput: 0: 11718.3. Samples: 64123904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:56:12,800][569624] Avg episode reward: [(0, '4542.115')] [2023-03-08 22:56:15,691][569911] Updated weights for policy 0, policy_version 125360 (0.0005) [2023-03-08 22:56:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11857.6). 
Total num frames: 64208896. Throughput: 0: 11749.0. Samples: 64194528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:56:17,800][569624] Avg episode reward: [(0, '4543.074')] [2023-03-08 22:56:17,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000125408_64208896.pth... [2023-03-08 22:56:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000124720_63856640.pth [2023-03-08 22:56:19,088][569911] Updated weights for policy 0, policy_version 125440 (0.0005) [2023-03-08 22:56:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 64262144. Throughput: 0: 11587.5. Samples: 64262208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:56:22,800][569624] Avg episode reward: [(0, '4497.723')] [2023-03-08 22:56:22,913][569911] Updated weights for policy 0, policy_version 125520 (0.0005) [2023-03-08 22:56:23,958][569867] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000007 [2023-03-08 22:56:26,351][569911] Updated weights for policy 0, policy_version 125600 (0.0004) [2023-03-08 22:56:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 64323584. Throughput: 0: 11590.2. Samples: 64296344. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:56:27,800][569624] Avg episode reward: [(0, '4547.774')] [2023-03-08 22:56:29,788][569911] Updated weights for policy 0, policy_version 125680 (0.0005) [2023-03-08 22:56:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 64380928. Throughput: 0: 11659.0. Samples: 64368712. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:56:32,800][569624] Avg episode reward: [(0, '4568.975')] [2023-03-08 22:56:32,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000125744_64380928.pth... 
[2023-03-08 22:56:32,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000125072_64036864.pth [2023-03-08 22:56:33,285][569911] Updated weights for policy 0, policy_version 125760 (0.0005) [2023-03-08 22:56:36,775][569911] Updated weights for policy 0, policy_version 125840 (0.0005) [2023-03-08 22:56:37,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.4, 300 sec: 11815.9). Total num frames: 64438272. Throughput: 0: 11618.0. Samples: 64438688. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:56:37,799][569624] Avg episode reward: [(0, '4414.209')] [2023-03-08 22:56:40,338][569911] Updated weights for policy 0, policy_version 125920 (0.0005) [2023-03-08 22:56:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11815.9). Total num frames: 64495616. Throughput: 0: 11586.5. Samples: 64473196. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:56:42,799][569624] Avg episode reward: [(0, '4375.359')] [2023-03-08 22:56:44,001][569911] Updated weights for policy 0, policy_version 126000 (0.0005) [2023-03-08 22:56:47,655][569911] Updated weights for policy 0, policy_version 126080 (0.0006) [2023-03-08 22:56:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11802.0). Total num frames: 64552960. Throughput: 0: 11546.1. Samples: 64540684. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:56:47,800][569624] Avg episode reward: [(0, '4592.547')] [2023-03-08 22:56:47,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000126080_64552960.pth... [2023-03-08 22:56:47,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000125408_64208896.pth [2023-03-08 22:56:51,163][569911] Updated weights for policy 0, policy_version 126160 (0.0005) [2023-03-08 22:56:52,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11802.0). 
Total num frames: 64610304. Throughput: 0: 11514.6. Samples: 64610100. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:56:52,800][569624] Avg episode reward: [(0, '4403.318')] [2023-03-08 22:56:54,910][569911] Updated weights for policy 0, policy_version 126240 (0.0006) [2023-03-08 22:56:57,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11537.1, 300 sec: 11802.0). Total num frames: 64667648. Throughput: 0: 11536.3. Samples: 64643036. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:56:57,789][569624] Avg episode reward: [(0, '4357.639')] [2023-03-08 22:56:58,466][569911] Updated weights for policy 0, policy_version 126320 (0.0005) [2023-03-08 22:57:02,072][569911] Updated weights for policy 0, policy_version 126400 (0.0005) [2023-03-08 22:57:02,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.6, 300 sec: 11788.2). Total num frames: 64720896. Throughput: 0: 11485.8. Samples: 64711388. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:57:02,789][569624] Avg episode reward: [(0, '4195.971')] [2023-03-08 22:57:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000126416_64724992.pth... [2023-03-08 22:57:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000125744_64380928.pth [2023-03-08 22:57:05,702][569911] Updated weights for policy 0, policy_version 126480 (0.0005) [2023-03-08 22:57:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11788.2). Total num frames: 64782336. Throughput: 0: 11489.5. Samples: 64779236. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:57:07,789][569624] Avg episode reward: [(0, '4088.313')] [2023-03-08 22:57:09,220][569911] Updated weights for policy 0, policy_version 126560 (0.0006) [2023-03-08 22:57:12,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11774.3). Total num frames: 64835584. Throughput: 0: 11526.5. 
Samples: 64815036. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:57:12,789][569624] Avg episode reward: [(0, '4108.649')] [2023-03-08 22:57:12,792][569911] Updated weights for policy 0, policy_version 126640 (0.0005) [2023-03-08 22:57:16,273][569911] Updated weights for policy 0, policy_version 126720 (0.0005) [2023-03-08 22:57:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11774.3). Total num frames: 64897024. Throughput: 0: 11459.7. Samples: 64884400. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:57:17,789][569624] Avg episode reward: [(0, '4020.450')] [2023-03-08 22:57:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000126752_64897024.pth... [2023-03-08 22:57:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000126080_64552960.pth [2023-03-08 22:57:19,753][569911] Updated weights for policy 0, policy_version 126800 (0.0005) [2023-03-08 22:57:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11537.1, 300 sec: 11774.3). Total num frames: 64954368. Throughput: 0: 11459.4. Samples: 64954360. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 22:57:22,789][569624] Avg episode reward: [(0, '4101.733')] [2023-03-08 22:57:23,290][569911] Updated weights for policy 0, policy_version 126880 (0.0005) [2023-03-08 22:57:26,772][569911] Updated weights for policy 0, policy_version 126960 (0.0004) [2023-03-08 22:57:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11537.1, 300 sec: 11788.2). Total num frames: 65015808. Throughput: 0: 11438.3. Samples: 64987920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:57:27,789][569624] Avg episode reward: [(0, '4114.324')] [2023-03-08 22:57:30,264][569911] Updated weights for policy 0, policy_version 127040 (0.0005) [2023-03-08 22:57:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11774.3). 
Total num frames: 65073152. Throughput: 0: 11543.6. Samples: 65060144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:57:32,789][569624] Avg episode reward: [(0, '4405.986')] [2023-03-08 22:57:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000127096_65073152.pth... [2023-03-08 22:57:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000126416_64724992.pth [2023-03-08 22:57:33,694][569911] Updated weights for policy 0, policy_version 127120 (0.0005) [2023-03-08 22:57:37,421][569911] Updated weights for policy 0, policy_version 127200 (0.0005) [2023-03-08 22:57:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.0, 300 sec: 11774.3). Total num frames: 65130496. Throughput: 0: 11497.1. Samples: 65127468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:57:37,789][569624] Avg episode reward: [(0, '4213.118')] [2023-03-08 22:57:41,033][569911] Updated weights for policy 0, policy_version 127280 (0.0004) [2023-03-08 22:57:42,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11468.8, 300 sec: 11760.4). Total num frames: 65183744. Throughput: 0: 11518.8. Samples: 65161380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:57:42,789][569624] Avg episode reward: [(0, '3992.462')] [2023-03-08 22:57:44,625][569911] Updated weights for policy 0, policy_version 127360 (0.0005) [2023-03-08 22:57:47,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11760.4). Total num frames: 65245184. Throughput: 0: 11539.1. Samples: 65230648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:57:47,789][569624] Avg episode reward: [(0, '4086.774')] [2023-03-08 22:57:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000127432_65245184.pth... 
[2023-03-08 22:57:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000126752_64897024.pth [2023-03-08 22:57:48,157][569911] Updated weights for policy 0, policy_version 127440 (0.0005) [2023-03-08 22:57:51,722][569911] Updated weights for policy 0, policy_version 127520 (0.0005) [2023-03-08 22:57:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11746.5). Total num frames: 65298432. Throughput: 0: 11565.9. Samples: 65299700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:57:52,789][569624] Avg episode reward: [(0, '3813.109')] [2023-03-08 22:57:55,301][569911] Updated weights for policy 0, policy_version 127600 (0.0005) [2023-03-08 22:57:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.0, 300 sec: 11760.4). Total num frames: 65359872. Throughput: 0: 11543.6. Samples: 65334500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:57:57,789][569624] Avg episode reward: [(0, '4336.364')] [2023-03-08 22:57:58,838][569911] Updated weights for policy 0, policy_version 127680 (0.0005) [2023-03-08 22:58:02,340][569911] Updated weights for policy 0, policy_version 127760 (0.0005) [2023-03-08 22:58:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11746.5). Total num frames: 65417216. Throughput: 0: 11526.9. Samples: 65403112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:02,789][569624] Avg episode reward: [(0, '4093.862')] [2023-03-08 22:58:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000127768_65417216.pth... [2023-03-08 22:58:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000127096_65073152.pth [2023-03-08 22:58:06,037][569911] Updated weights for policy 0, policy_version 127840 (0.0004) [2023-03-08 22:58:07,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11468.8, 300 sec: 11718.7). 
Total num frames: 65470464. Throughput: 0: 11469.2. Samples: 65470472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:07,789][569624] Avg episode reward: [(0, '4446.951')] [2023-03-08 22:58:09,800][569911] Updated weights for policy 0, policy_version 127920 (0.0004) [2023-03-08 22:58:12,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11537.1, 300 sec: 11718.7). Total num frames: 65527808. Throughput: 0: 11451.5. Samples: 65503240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:12,789][569624] Avg episode reward: [(0, '4272.685')] [2023-03-08 22:58:13,470][569911] Updated weights for policy 0, policy_version 128000 (0.0004) [2023-03-08 22:58:17,139][569911] Updated weights for policy 0, policy_version 128080 (0.0005) [2023-03-08 22:58:17,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11400.5, 300 sec: 11704.8). Total num frames: 65581056. Throughput: 0: 11323.5. Samples: 65569700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:17,789][569624] Avg episode reward: [(0, '4285.324')] [2023-03-08 22:58:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000128088_65581056.pth... [2023-03-08 22:58:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000127432_65245184.pth [2023-03-08 22:58:20,729][569911] Updated weights for policy 0, policy_version 128160 (0.0004) [2023-03-08 22:58:22,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.5, 300 sec: 11704.8). Total num frames: 65638400. Throughput: 0: 11375.5. Samples: 65639364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:22,789][569624] Avg episode reward: [(0, '4495.612')] [2023-03-08 22:58:24,151][569911] Updated weights for policy 0, policy_version 128240 (0.0005) [2023-03-08 22:58:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11691.0). Total num frames: 65695744. Throughput: 0: 11418.3. 
Samples: 65675204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:27,789][569624] Avg episode reward: [(0, '4516.598')] [2023-03-08 22:58:27,817][569911] Updated weights for policy 0, policy_version 128320 (0.0005) [2023-03-08 22:58:31,252][569911] Updated weights for policy 0, policy_version 128400 (0.0005) [2023-03-08 22:58:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11704.8). Total num frames: 65757184. Throughput: 0: 11414.0. Samples: 65744280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:32,789][569624] Avg episode reward: [(0, '4652.715')] [2023-03-08 22:58:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000128432_65757184.pth... [2023-03-08 22:58:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000127768_65417216.pth [2023-03-08 22:58:34,955][569911] Updated weights for policy 0, policy_version 128480 (0.0004) [2023-03-08 22:58:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11691.0). Total num frames: 65814528. Throughput: 0: 11370.2. Samples: 65811360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:37,789][569624] Avg episode reward: [(0, '4529.112')] [2023-03-08 22:58:38,451][569911] Updated weights for policy 0, policy_version 128560 (0.0005) [2023-03-08 22:58:42,053][569911] Updated weights for policy 0, policy_version 128640 (0.0005) [2023-03-08 22:58:42,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11468.8, 300 sec: 11677.1). Total num frames: 65871872. Throughput: 0: 11370.1. Samples: 65846152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:42,789][569624] Avg episode reward: [(0, '4521.303')] [2023-03-08 22:58:45,355][569911] Updated weights for policy 0, policy_version 128720 (0.0005) [2023-03-08 22:58:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11691.0). 
Total num frames: 65933312. Throughput: 0: 11483.2. Samples: 65919856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:47,789][569624] Avg episode reward: [(0, '4404.815')] [2023-03-08 22:58:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000128776_65933312.pth... [2023-03-08 22:58:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000128088_65581056.pth [2023-03-08 22:58:48,748][569911] Updated weights for policy 0, policy_version 128800 (0.0005) [2023-03-08 22:58:52,386][569911] Updated weights for policy 0, policy_version 128880 (0.0004) [2023-03-08 22:58:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11663.2). Total num frames: 65986560. Throughput: 0: 11503.0. Samples: 65988108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:52,789][569624] Avg episode reward: [(0, '4469.835')] [2023-03-08 22:58:56,197][569911] Updated weights for policy 0, policy_version 128960 (0.0005) [2023-03-08 22:58:57,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11649.3). Total num frames: 66043904. Throughput: 0: 11471.6. Samples: 66019460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:58:57,789][569624] Avg episode reward: [(0, '4411.153')] [2023-03-08 22:58:59,813][569911] Updated weights for policy 0, policy_version 129040 (0.0004) [2023-03-08 22:59:02,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11468.8, 300 sec: 11663.2). Total num frames: 66105344. Throughput: 0: 11539.1. Samples: 66088960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:02,789][569624] Avg episode reward: [(0, '4746.948')] [2023-03-08 22:59:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000129112_66105344.pth... 
[2023-03-08 22:59:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000128432_65757184.pth [2023-03-08 22:59:03,165][569911] Updated weights for policy 0, policy_version 129120 (0.0004) [2023-03-08 22:59:06,765][569911] Updated weights for policy 0, policy_version 129200 (0.0004) [2023-03-08 22:59:07,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11635.4). Total num frames: 66158592. Throughput: 0: 11558.7. Samples: 66159504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:07,789][569624] Avg episode reward: [(0, '4552.824')] [2023-03-08 22:59:10,528][569911] Updated weights for policy 0, policy_version 129280 (0.0005) [2023-03-08 22:59:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11468.8, 300 sec: 11635.4). Total num frames: 66215936. Throughput: 0: 11471.6. Samples: 66191424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:12,789][569624] Avg episode reward: [(0, '4275.675')] [2023-03-08 22:59:14,183][569911] Updated weights for policy 0, policy_version 129360 (0.0004) [2023-03-08 22:59:17,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11468.8, 300 sec: 11607.6). Total num frames: 66269184. Throughput: 0: 11391.7. Samples: 66256908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:17,789][569624] Avg episode reward: [(0, '4082.322')] [2023-03-08 22:59:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000129432_66269184.pth... [2023-03-08 22:59:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000128776_65933312.pth [2023-03-08 22:59:17,940][569911] Updated weights for policy 0, policy_version 129440 (0.0004) [2023-03-08 22:59:21,537][569911] Updated weights for policy 0, policy_version 129520 (0.0005) [2023-03-08 22:59:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11468.8, 300 sec: 11607.6). 
Total num frames: 66326528. Throughput: 0: 11426.1. Samples: 66325536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:22,789][569624] Avg episode reward: [(0, '4587.497')] [2023-03-08 22:59:25,202][569911] Updated weights for policy 0, policy_version 129600 (0.0005) [2023-03-08 22:59:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11593.8). Total num frames: 66383872. Throughput: 0: 11404.8. Samples: 66359368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:27,789][569624] Avg episode reward: [(0, '4663.837')] [2023-03-08 22:59:28,724][569911] Updated weights for policy 0, policy_version 129680 (0.0005) [2023-03-08 22:59:32,431][569911] Updated weights for policy 0, policy_version 129760 (0.0004) [2023-03-08 22:59:32,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11332.3, 300 sec: 11566.0). Total num frames: 66437120. Throughput: 0: 11240.2. Samples: 66425664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:32,789][569624] Avg episode reward: [(0, '4227.883')] [2023-03-08 22:59:32,797][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000129768_66441216.pth... [2023-03-08 22:59:32,798][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000129112_66105344.pth [2023-03-08 22:59:35,930][569911] Updated weights for policy 0, policy_version 129840 (0.0004) [2023-03-08 22:59:37,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11566.0). Total num frames: 66494464. Throughput: 0: 11281.2. Samples: 66495764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:37,789][569624] Avg episode reward: [(0, '4303.244')] [2023-03-08 22:59:39,657][569911] Updated weights for policy 0, policy_version 129920 (0.0004) [2023-03-08 22:59:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11552.1). Total num frames: 66551808. Throughput: 0: 11313.8. 
Samples: 66528580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:42,789][569624] Avg episode reward: [(0, '4334.945')] [2023-03-08 22:59:43,159][569911] Updated weights for policy 0, policy_version 130000 (0.0005) [2023-03-08 22:59:46,503][569911] Updated weights for policy 0, policy_version 130080 (0.0004) [2023-03-08 22:59:47,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11332.2, 300 sec: 11566.0). Total num frames: 66613248. Throughput: 0: 11379.4. Samples: 66601032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:47,800][569624] Avg episode reward: [(0, '4525.646')] [2023-03-08 22:59:47,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000130104_66613248.pth... [2023-03-08 22:59:47,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000129432_66269184.pth [2023-03-08 22:59:49,991][569911] Updated weights for policy 0, policy_version 130160 (0.0005) [2023-03-08 22:59:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11468.8, 300 sec: 11579.9). Total num frames: 66674688. Throughput: 0: 11409.0. Samples: 66672908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:52,800][569624] Avg episode reward: [(0, '4406.037')] [2023-03-08 22:59:53,476][569911] Updated weights for policy 0, policy_version 130240 (0.0005) [2023-03-08 22:59:56,931][569911] Updated weights for policy 0, policy_version 130320 (0.0005) [2023-03-08 22:59:57,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11468.8, 300 sec: 11579.9). Total num frames: 66732032. Throughput: 0: 11469.0. Samples: 66707528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 22:59:57,800][569624] Avg episode reward: [(0, '4412.429')] [2023-03-08 23:00:00,482][569911] Updated weights for policy 0, policy_version 130400 (0.0005) [2023-03-08 23:00:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11566.0). 
Total num frames: 66789376. Throughput: 0: 11559.8. Samples: 66777100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:02,800][569624] Avg episode reward: [(0, '4256.464')] [2023-03-08 23:00:02,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000130448_66789376.pth... [2023-03-08 23:00:02,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000129768_66441216.pth [2023-03-08 23:00:04,071][569911] Updated weights for policy 0, policy_version 130480 (0.0005) [2023-03-08 23:00:07,661][569911] Updated weights for policy 0, policy_version 130560 (0.0004) [2023-03-08 23:00:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11552.1). Total num frames: 66846720. Throughput: 0: 11568.1. Samples: 66846100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:07,800][569624] Avg episode reward: [(0, '4400.316')] [2023-03-08 23:00:11,123][569911] Updated weights for policy 0, policy_version 130640 (0.0005) [2023-03-08 23:00:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11524.3). Total num frames: 66904064. Throughput: 0: 11581.4. Samples: 66880532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:12,800][569624] Avg episode reward: [(0, '4244.040')] [2023-03-08 23:00:14,755][569911] Updated weights for policy 0, policy_version 130720 (0.0004) [2023-03-08 23:00:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11524.3). Total num frames: 66961408. Throughput: 0: 11635.2. Samples: 66949248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:17,789][569624] Avg episode reward: [(0, '4386.999')] [2023-03-08 23:00:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000130784_66961408.pth... 
[2023-03-08 23:00:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000130104_66613248.pth [2023-03-08 23:00:18,392][569911] Updated weights for policy 0, policy_version 130800 (0.0005) [2023-03-08 23:00:21,995][569911] Updated weights for policy 0, policy_version 130880 (0.0005) [2023-03-08 23:00:22,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11468.8, 300 sec: 11510.5). Total num frames: 67014656. Throughput: 0: 11559.9. Samples: 67015960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:22,789][569624] Avg episode reward: [(0, '4176.749')] [2023-03-08 23:00:25,643][569911] Updated weights for policy 0, policy_version 130960 (0.0004) [2023-03-08 23:00:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11468.8, 300 sec: 11510.5). Total num frames: 67072000. Throughput: 0: 11596.6. Samples: 67050428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:27,789][569624] Avg episode reward: [(0, '4231.496')] [2023-03-08 23:00:29,224][569911] Updated weights for policy 0, policy_version 131040 (0.0005) [2023-03-08 23:00:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11482.7). Total num frames: 67129344. Throughput: 0: 11523.4. Samples: 67119584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:32,789][569624] Avg episode reward: [(0, '4266.674')] [2023-03-08 23:00:32,824][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000131120_67133440.pth... [2023-03-08 23:00:32,825][569911] Updated weights for policy 0, policy_version 131120 (0.0004) [2023-03-08 23:00:32,826][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000130448_66789376.pth [2023-03-08 23:00:36,466][569911] Updated weights for policy 0, policy_version 131200 (0.0005) [2023-03-08 23:00:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11482.7). 
Total num frames: 67186688. Throughput: 0: 11410.0. Samples: 67186356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:37,789][569624] Avg episode reward: [(0, '4128.297')] [2023-03-08 23:00:40,354][569911] Updated weights for policy 0, policy_version 131280 (0.0005) [2023-03-08 23:00:42,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11468.8, 300 sec: 11468.8). Total num frames: 67239936. Throughput: 0: 11330.6. Samples: 67217404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:42,789][569624] Avg episode reward: [(0, '3944.232')] [2023-03-08 23:00:44,120][569911] Updated weights for policy 0, policy_version 131360 (0.0004) [2023-03-08 23:00:47,550][569911] Updated weights for policy 0, policy_version 131440 (0.0004) [2023-03-08 23:00:47,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11400.5, 300 sec: 11454.9). Total num frames: 67297280. Throughput: 0: 11312.9. Samples: 67286180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:47,789][569624] Avg episode reward: [(0, '4389.665')] [2023-03-08 23:00:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000131440_67297280.pth... [2023-03-08 23:00:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000130784_66961408.pth [2023-03-08 23:00:51,062][569911] Updated weights for policy 0, policy_version 131520 (0.0005) [2023-03-08 23:00:52,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11332.3, 300 sec: 11454.9). Total num frames: 67354624. Throughput: 0: 11300.7. Samples: 67354632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:52,789][569624] Avg episode reward: [(0, '4491.176')] [2023-03-08 23:00:54,638][569911] Updated weights for policy 0, policy_version 131600 (0.0006) [2023-03-08 23:00:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11441.0). Total num frames: 67411968. Throughput: 0: 11293.2. 
Samples: 67388728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:00:57,800][569624] Avg episode reward: [(0, '4381.918')] [2023-03-08 23:00:58,449][569911] Updated weights for policy 0, policy_version 131680 (0.0005) [2023-03-08 23:01:01,842][569911] Updated weights for policy 0, policy_version 131760 (0.0005) [2023-03-08 23:01:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11441.0). Total num frames: 67469312. Throughput: 0: 11284.1. Samples: 67457032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:02,800][569624] Avg episode reward: [(0, '4452.137')] [2023-03-08 23:01:02,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000131776_67469312.pth... [2023-03-08 23:01:02,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000131120_67133440.pth [2023-03-08 23:01:05,332][569911] Updated weights for policy 0, policy_version 131840 (0.0005) [2023-03-08 23:01:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11441.0). Total num frames: 67526656. Throughput: 0: 11350.2. Samples: 67526720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:07,789][569624] Avg episode reward: [(0, '4113.177')] [2023-03-08 23:01:08,941][569911] Updated weights for policy 0, policy_version 131920 (0.0005) [2023-03-08 23:01:12,514][569911] Updated weights for policy 0, policy_version 132000 (0.0005) [2023-03-08 23:01:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11441.0). Total num frames: 67584000. Throughput: 0: 11348.8. Samples: 67561124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:12,789][569624] Avg episode reward: [(0, '4110.397')] [2023-03-08 23:01:15,957][569911] Updated weights for policy 0, policy_version 132080 (0.0004) [2023-03-08 23:01:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11468.8). 
Total num frames: 67645440. Throughput: 0: 11387.3. Samples: 67632016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:17,789][569624] Avg episode reward: [(0, '4134.492')] [2023-03-08 23:01:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000132120_67645440.pth... [2023-03-08 23:01:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000131440_67297280.pth [2023-03-08 23:01:19,447][569911] Updated weights for policy 0, policy_version 132160 (0.0005) [2023-03-08 23:01:22,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11454.9). Total num frames: 67702784. Throughput: 0: 11423.0. Samples: 67700392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:22,789][569624] Avg episode reward: [(0, '4141.179')] [2023-03-08 23:01:23,103][569911] Updated weights for policy 0, policy_version 132240 (0.0005) [2023-03-08 23:01:26,594][569911] Updated weights for policy 0, policy_version 132320 (0.0005) [2023-03-08 23:01:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11454.9). Total num frames: 67760128. Throughput: 0: 11519.4. Samples: 67735776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:27,789][569624] Avg episode reward: [(0, '4154.318')] [2023-03-08 23:01:30,164][569911] Updated weights for policy 0, policy_version 132400 (0.0005) [2023-03-08 23:01:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11454.9). Total num frames: 67817472. Throughput: 0: 11509.3. Samples: 67804096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:32,789][569624] Avg episode reward: [(0, '3929.333')] [2023-03-08 23:01:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000132456_67817472.pth... 
[2023-03-08 23:01:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000131776_67469312.pth [2023-03-08 23:01:33,797][569911] Updated weights for policy 0, policy_version 132480 (0.0005) [2023-03-08 23:01:37,223][569911] Updated weights for policy 0, policy_version 132560 (0.0005) [2023-03-08 23:01:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11454.9). Total num frames: 67874816. Throughput: 0: 11557.8. Samples: 67874732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:37,789][569624] Avg episode reward: [(0, '4317.075')] [2023-03-08 23:01:40,525][569911] Updated weights for policy 0, policy_version 132640 (0.0005) [2023-03-08 23:01:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11468.8). Total num frames: 67936256. Throughput: 0: 11622.6. Samples: 67911744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:42,789][569624] Avg episode reward: [(0, '3893.323')] [2023-03-08 23:01:44,141][569911] Updated weights for policy 0, policy_version 132720 (0.0004) [2023-03-08 23:01:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11454.9). Total num frames: 67989504. Throughput: 0: 11593.8. Samples: 67978752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:47,789][569624] Avg episode reward: [(0, '3770.990')] [2023-03-08 23:01:47,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000132800_67993600.pth... [2023-03-08 23:01:47,803][569911] Updated weights for policy 0, policy_version 132800 (0.0005) [2023-03-08 23:01:47,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000132120_67645440.pth [2023-03-08 23:01:51,428][569911] Updated weights for policy 0, policy_version 132880 (0.0006) [2023-03-08 23:01:52,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11537.1, 300 sec: 11454.9). 
Total num frames: 68046848. Throughput: 0: 11558.6. Samples: 68046856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:52,789][569624] Avg episode reward: [(0, '4005.056')] [2023-03-08 23:01:55,056][569911] Updated weights for policy 0, policy_version 132960 (0.0005) [2023-03-08 23:01:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11468.8). Total num frames: 68104192. Throughput: 0: 11554.1. Samples: 68081060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:01:57,789][569624] Avg episode reward: [(0, '4068.355')] [2023-03-08 23:01:58,462][569911] Updated weights for policy 0, policy_version 133040 (0.0005) [2023-03-08 23:02:02,071][569911] Updated weights for policy 0, policy_version 133120 (0.0005) [2023-03-08 23:02:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11468.8). Total num frames: 68165632. Throughput: 0: 11549.7. Samples: 68151752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:02:02,789][569624] Avg episode reward: [(0, '4075.886')] [2023-03-08 23:02:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000133136_68165632.pth... [2023-03-08 23:02:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000132456_67817472.pth [2023-03-08 23:02:05,526][569911] Updated weights for policy 0, policy_version 133200 (0.0005) [2023-03-08 23:02:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11468.8). Total num frames: 68218880. Throughput: 0: 11529.0. Samples: 68219196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:02:07,789][569624] Avg episode reward: [(0, '3589.206')] [2023-03-08 23:02:09,522][569911] Updated weights for policy 0, policy_version 133280 (0.0005) [2023-03-08 23:02:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11537.1, 300 sec: 11454.9). Total num frames: 68276224. Throughput: 0: 11465.3. 
Samples: 68251712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:02:12,789][569624] Avg episode reward: [(0, '3854.920')] [2023-03-08 23:02:12,986][569911] Updated weights for policy 0, policy_version 133360 (0.0005) [2023-03-08 23:02:16,661][569911] Updated weights for policy 0, policy_version 133440 (0.0004) [2023-03-08 23:02:17,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11441.0). Total num frames: 68329472. Throughput: 0: 11473.2. Samples: 68320388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:02:17,789][569624] Avg episode reward: [(0, '3775.917')] [2023-03-08 23:02:17,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000133464_68333568.pth... [2023-03-08 23:02:17,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000132800_67993600.pth [2023-03-08 23:02:20,311][569911] Updated weights for policy 0, policy_version 133520 (0.0005) [2023-03-08 23:02:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.6, 300 sec: 11427.1). Total num frames: 68386816. Throughput: 0: 11367.5. Samples: 68386268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:02:22,789][569624] Avg episode reward: [(0, '3651.045')] [2023-03-08 23:02:24,163][569911] Updated weights for policy 0, policy_version 133600 (0.0006) [2023-03-08 23:02:27,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11413.3). Total num frames: 68440064. Throughput: 0: 11260.6. Samples: 68418468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:02:27,789][569624] Avg episode reward: [(0, '3784.164')] [2023-03-08 23:02:27,943][569911] Updated weights for policy 0, policy_version 133680 (0.0005) [2023-03-08 23:02:31,503][569911] Updated weights for policy 0, policy_version 133760 (0.0005) [2023-03-08 23:02:32,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11413.3). 
Total num frames: 68497408. Throughput: 0: 11254.2. Samples: 68485192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:02:32,789][569624] Avg episode reward: [(0, '3885.016')] [2023-03-08 23:02:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000133784_68497408.pth... [2023-03-08 23:02:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000133136_68165632.pth [2023-03-08 23:02:35,066][569911] Updated weights for policy 0, policy_version 133840 (0.0005) [2023-03-08 23:02:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11332.3, 300 sec: 11427.1). Total num frames: 68554752. Throughput: 0: 11235.1. Samples: 68552436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:02:37,789][569624] Avg episode reward: [(0, '4032.914')] [2023-03-08 23:02:38,785][569911] Updated weights for policy 0, policy_version 133920 (0.0004) [2023-03-08 23:02:42,260][569911] Updated weights for policy 0, policy_version 134000 (0.0005) [2023-03-08 23:02:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11413.3). Total num frames: 68612096. Throughput: 0: 11252.8. Samples: 68587436. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:02:42,789][569624] Avg episode reward: [(0, '4049.511')] [2023-03-08 23:02:45,813][569911] Updated weights for policy 0, policy_version 134080 (0.0005) [2023-03-08 23:02:47,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11427.1). Total num frames: 68669440. Throughput: 0: 11248.1. Samples: 68657916. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:02:47,789][569624] Avg episode reward: [(0, '3888.854')] [2023-03-08 23:02:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000134120_68669440.pth... 
[2023-03-08 23:02:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000133464_68333568.pth [2023-03-08 23:02:49,335][569911] Updated weights for policy 0, policy_version 134160 (0.0005) [2023-03-08 23:02:52,768][569911] Updated weights for policy 0, policy_version 134240 (0.0005) [2023-03-08 23:02:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11427.1). Total num frames: 68730880. Throughput: 0: 11296.8. Samples: 68727552. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:02:52,789][569624] Avg episode reward: [(0, '4137.041')] [2023-03-08 23:02:56,412][569911] Updated weights for policy 0, policy_version 134320 (0.0005) [2023-03-08 23:02:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11413.3). Total num frames: 68784128. Throughput: 0: 11365.2. Samples: 68763148. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:02:57,789][569624] Avg episode reward: [(0, '4515.745')] [2023-03-08 23:03:00,049][569911] Updated weights for policy 0, policy_version 134400 (0.0005) [2023-03-08 23:03:02,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 11427.1). Total num frames: 68841472. Throughput: 0: 11306.6. Samples: 68829184. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:03:02,789][569624] Avg episode reward: [(0, '4051.556')] [2023-03-08 23:03:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000134456_68841472.pth... [2023-03-08 23:03:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000133784_68497408.pth [2023-03-08 23:03:03,786][569911] Updated weights for policy 0, policy_version 134480 (0.0005) [2023-03-08 23:03:07,405][569911] Updated weights for policy 0, policy_version 134560 (0.0005) [2023-03-08 23:03:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11427.1). 
Total num frames: 68898816. Throughput: 0: 11330.3. Samples: 68896132. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:03:07,789][569624] Avg episode reward: [(0, '4480.360')] [2023-03-08 23:03:11,020][569911] Updated weights for policy 0, policy_version 134640 (0.0005) [2023-03-08 23:03:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11441.0). Total num frames: 68956160. Throughput: 0: 11389.4. Samples: 68930992. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:03:12,789][569624] Avg episode reward: [(0, '4502.633')] [2023-03-08 23:03:14,366][569911] Updated weights for policy 0, policy_version 134720 (0.0005) [2023-03-08 23:03:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11441.0). Total num frames: 69013504. Throughput: 0: 11478.0. Samples: 69001704. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:03:17,789][569624] Avg episode reward: [(0, '4553.275')] [2023-03-08 23:03:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000134792_69013504.pth... [2023-03-08 23:03:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000134120_68669440.pth [2023-03-08 23:03:17,880][569911] Updated weights for policy 0, policy_version 134800 (0.0005) [2023-03-08 23:03:21,612][569911] Updated weights for policy 0, policy_version 134880 (0.0005) [2023-03-08 23:03:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11441.0). Total num frames: 69070848. Throughput: 0: 11449.6. Samples: 69067668. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:03:22,789][569624] Avg episode reward: [(0, '4445.951')] [2023-03-08 23:03:25,463][569911] Updated weights for policy 0, policy_version 134960 (0.0005) [2023-03-08 23:03:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 69124096. Throughput: 0: 11389.4. 
Samples: 69099960. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:03:27,789][569624] Avg episode reward: [(0, '4088.946')] [2023-03-08 23:03:29,105][569911] Updated weights for policy 0, policy_version 135040 (0.0004) [2023-03-08 23:03:32,753][569911] Updated weights for policy 0, policy_version 135120 (0.0005) [2023-03-08 23:03:32,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 69181440. Throughput: 0: 11343.4. Samples: 69168372. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:03:32,789][569624] Avg episode reward: [(0, '4366.382')] [2023-03-08 23:03:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000135120_69181440.pth... [2023-03-08 23:03:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000134456_68841472.pth [2023-03-08 23:03:36,397][569911] Updated weights for policy 0, policy_version 135200 (0.0004) [2023-03-08 23:03:37,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11332.2, 300 sec: 11399.4). Total num frames: 69234688. Throughput: 0: 11286.3. Samples: 69235436. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:03:37,789][569624] Avg episode reward: [(0, '4283.796')] [2023-03-08 23:03:40,020][569911] Updated weights for policy 0, policy_version 135280 (0.0005) [2023-03-08 23:03:42,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11385.5). Total num frames: 69292032. Throughput: 0: 11252.0. Samples: 69269488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:03:42,789][569624] Avg episode reward: [(0, '4302.320')] [2023-03-08 23:03:43,558][569911] Updated weights for policy 0, policy_version 135360 (0.0005) [2023-03-08 23:03:47,189][569911] Updated weights for policy 0, policy_version 135440 (0.0005) [2023-03-08 23:03:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11399.4). 
Total num frames: 69349376. Throughput: 0: 11332.1. Samples: 69339128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:03:47,789][569624] Avg episode reward: [(0, '4446.676')] [2023-03-08 23:03:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000135448_69349376.pth... [2023-03-08 23:03:47,792][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000134792_69013504.pth [2023-03-08 23:03:50,704][569911] Updated weights for policy 0, policy_version 135520 (0.0004) [2023-03-08 23:03:52,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11399.4). Total num frames: 69406720. Throughput: 0: 11359.5. Samples: 69407312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:03:52,789][569624] Avg episode reward: [(0, '4635.864')] [2023-03-08 23:03:54,158][569911] Updated weights for policy 0, policy_version 135600 (0.0005) [2023-03-08 23:03:57,520][569911] Updated weights for policy 0, policy_version 135680 (0.0005) [2023-03-08 23:03:57,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11399.4). Total num frames: 69468160. Throughput: 0: 11388.3. Samples: 69443464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:03:57,789][569624] Avg episode reward: [(0, '4288.656')] [2023-03-08 23:04:01,221][569911] Updated weights for policy 0, policy_version 135760 (0.0005) [2023-03-08 23:04:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 69525504. Throughput: 0: 11358.8. Samples: 69512848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:04:02,789][569624] Avg episode reward: [(0, '4427.189')] [2023-03-08 23:04:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000135792_69525504.pth... 
[2023-03-08 23:04:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000135120_69181440.pth [2023-03-08 23:04:04,871][569911] Updated weights for policy 0, policy_version 135840 (0.0005) [2023-03-08 23:04:07,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 69582848. Throughput: 0: 11433.0. Samples: 69582152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:04:07,789][569624] Avg episode reward: [(0, '4471.830')] [2023-03-08 23:04:08,411][569911] Updated weights for policy 0, policy_version 135920 (0.0005) [2023-03-08 23:04:11,810][569911] Updated weights for policy 0, policy_version 136000 (0.0005) [2023-03-08 23:04:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.5, 300 sec: 11427.1). Total num frames: 69640192. Throughput: 0: 11476.2. Samples: 69616388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:04:12,789][569624] Avg episode reward: [(0, '4524.828')] [2023-03-08 23:04:15,469][569911] Updated weights for policy 0, policy_version 136080 (0.0005) [2023-03-08 23:04:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11427.1). Total num frames: 69697536. Throughput: 0: 11483.6. Samples: 69685132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:04:17,789][569624] Avg episode reward: [(0, '4494.280')] [2023-03-08 23:04:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000136128_69697536.pth... [2023-03-08 23:04:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000135448_69349376.pth [2023-03-08 23:04:19,177][569911] Updated weights for policy 0, policy_version 136160 (0.0005) [2023-03-08 23:04:22,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11413.3). Total num frames: 69750784. Throughput: 0: 11487.4. Samples: 69752368. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:04:22,789][569624] Avg episode reward: [(0, '4384.738')] [2023-03-08 23:04:22,810][569911] Updated weights for policy 0, policy_version 136240 (0.0005) [2023-03-08 23:04:26,616][569911] Updated weights for policy 0, policy_version 136320 (0.0005) [2023-03-08 23:04:27,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.5, 300 sec: 11427.1). Total num frames: 69808128. Throughput: 0: 11453.2. Samples: 69784880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:04:27,789][569624] Avg episode reward: [(0, '4382.622')] [2023-03-08 23:04:30,208][569911] Updated weights for policy 0, policy_version 136400 (0.0005) [2023-03-08 23:04:32,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11400.5, 300 sec: 11427.1). Total num frames: 69865472. Throughput: 0: 11397.6. Samples: 69852020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:04:32,789][569624] Avg episode reward: [(0, '4299.582')] [2023-03-08 23:04:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000136456_69865472.pth... [2023-03-08 23:04:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000135792_69525504.pth [2023-03-08 23:04:33,894][569911] Updated weights for policy 0, policy_version 136480 (0.0005) [2023-03-08 23:04:37,673][569911] Updated weights for policy 0, policy_version 136560 (0.0005) [2023-03-08 23:04:37,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 69918720. Throughput: 0: 11348.6. Samples: 69918000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:04:37,789][569624] Avg episode reward: [(0, '4303.980')] [2023-03-08 23:04:41,241][569911] Updated weights for policy 0, policy_version 136640 (0.0006) [2023-03-08 23:04:42,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11399.4). Total num frames: 69976064. 
Throughput: 0: 11298.0. Samples: 69951872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:04:42,789][569624] Avg episode reward: [(0, '4571.045')] [2023-03-08 23:04:44,928][569911] Updated weights for policy 0, policy_version 136720 (0.0005) [2023-03-08 23:04:47,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11371.6). Total num frames: 70029312. Throughput: 0: 11232.7. Samples: 70018320. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:04:47,789][569624] Avg episode reward: [(0, '4322.376')] [2023-03-08 23:04:47,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000136784_70033408.pth... [2023-03-08 23:04:47,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000136128_69697536.pth [2023-03-08 23:04:48,456][569911] Updated weights for policy 0, policy_version 136800 (0.0005) [2023-03-08 23:04:52,091][569911] Updated weights for policy 0, policy_version 136880 (0.0005) [2023-03-08 23:04:52,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11332.3, 300 sec: 11371.6). Total num frames: 70086656. Throughput: 0: 11228.9. Samples: 70087452. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:04:52,789][569624] Avg episode reward: [(0, '4511.177')] [2023-03-08 23:04:55,786][569911] Updated weights for policy 0, policy_version 136960 (0.0005) [2023-03-08 23:04:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11371.6). Total num frames: 70144000. Throughput: 0: 11204.3. Samples: 70120580. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:04:57,789][569624] Avg episode reward: [(0, '4061.593')] [2023-03-08 23:04:59,198][569911] Updated weights for policy 0, policy_version 137040 (0.0005) [2023-03-08 23:05:02,667][569911] Updated weights for policy 0, policy_version 137120 (0.0005) [2023-03-08 23:05:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11332.3, 300 sec: 11385.5). 
Total num frames: 70205440. Throughput: 0: 11281.5. Samples: 70192800. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:02,789][569624] Avg episode reward: [(0, '4568.504')] [2023-03-08 23:05:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000137120_70205440.pth... [2023-03-08 23:05:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000136456_69865472.pth [2023-03-08 23:05:06,467][569911] Updated weights for policy 0, policy_version 137200 (0.0005) [2023-03-08 23:05:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11371.6). Total num frames: 70258688. Throughput: 0: 11251.3. Samples: 70258676. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:07,789][569624] Avg episode reward: [(0, '4383.849')] [2023-03-08 23:05:10,237][569911] Updated weights for policy 0, policy_version 137280 (0.0005) [2023-03-08 23:05:12,789][569624] Fps is (10 sec: 10649.6, 60 sec: 11195.7, 300 sec: 11357.7). Total num frames: 70311936. Throughput: 0: 11247.3. Samples: 70291008. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:12,789][569624] Avg episode reward: [(0, '4234.774')] [2023-03-08 23:05:14,006][569911] Updated weights for policy 0, policy_version 137360 (0.0006) [2023-03-08 23:05:17,719][569911] Updated weights for policy 0, policy_version 137440 (0.0005) [2023-03-08 23:05:17,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 11371.6). Total num frames: 70369280. Throughput: 0: 11217.1. Samples: 70356788. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:17,789][569624] Avg episode reward: [(0, '4150.759')] [2023-03-08 23:05:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000137440_70369280.pth... 
[2023-03-08 23:05:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000136784_70033408.pth [2023-03-08 23:05:21,305][569911] Updated weights for policy 0, policy_version 137520 (0.0005) [2023-03-08 23:05:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11371.6). Total num frames: 70426624. Throughput: 0: 11265.9. Samples: 70424964. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:22,789][569624] Avg episode reward: [(0, '4418.650')] [2023-03-08 23:05:24,684][569911] Updated weights for policy 0, policy_version 137600 (0.0005) [2023-03-08 23:05:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11371.6). Total num frames: 70483968. Throughput: 0: 11291.4. Samples: 70459984. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:27,789][569624] Avg episode reward: [(0, '4424.952')] [2023-03-08 23:05:28,327][569911] Updated weights for policy 0, policy_version 137680 (0.0005) [2023-03-08 23:05:31,983][569911] Updated weights for policy 0, policy_version 137760 (0.0005) [2023-03-08 23:05:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11371.6). Total num frames: 70541312. Throughput: 0: 11323.9. Samples: 70527896. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:32,789][569624] Avg episode reward: [(0, '4558.111')] [2023-03-08 23:05:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000137776_70541312.pth... [2023-03-08 23:05:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000137120_70205440.pth [2023-03-08 23:05:35,626][569911] Updated weights for policy 0, policy_version 137840 (0.0005) [2023-03-08 23:05:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11385.5). Total num frames: 70598656. Throughput: 0: 11329.9. Samples: 70597296. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:37,789][569624] Avg episode reward: [(0, '4476.951')] [2023-03-08 23:05:39,110][569911] Updated weights for policy 0, policy_version 137920 (0.0005) [2023-03-08 23:05:42,500][569911] Updated weights for policy 0, policy_version 138000 (0.0005) [2023-03-08 23:05:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11385.5). Total num frames: 70656000. Throughput: 0: 11352.2. Samples: 70631432. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:42,789][569624] Avg episode reward: [(0, '4386.301')] [2023-03-08 23:05:46,335][569911] Updated weights for policy 0, policy_version 138080 (0.0006) [2023-03-08 23:05:47,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11400.5, 300 sec: 11385.5). Total num frames: 70713344. Throughput: 0: 11260.0. Samples: 70699500. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:47,789][569624] Avg episode reward: [(0, '4645.709')] [2023-03-08 23:05:47,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000138112_70713344.pth... [2023-03-08 23:05:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000137440_70369280.pth [2023-03-08 23:05:49,972][569911] Updated weights for policy 0, policy_version 138160 (0.0006) [2023-03-08 23:05:52,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11371.6). Total num frames: 70766592. Throughput: 0: 11287.0. Samples: 70766592. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:52,789][569624] Avg episode reward: [(0, '4643.440')] [2023-03-08 23:05:53,666][569911] Updated weights for policy 0, policy_version 138240 (0.0005) [2023-03-08 23:05:57,268][569911] Updated weights for policy 0, policy_version 138320 (0.0005) [2023-03-08 23:05:57,789][569624] Fps is (10 sec: 11059.4, 60 sec: 11332.2, 300 sec: 11371.6). Total num frames: 70823936. 
Throughput: 0: 11312.1. Samples: 70800052. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:05:57,789][569624] Avg episode reward: [(0, '4478.735')] [2023-03-08 23:06:00,829][569911] Updated weights for policy 0, policy_version 138400 (0.0005) [2023-03-08 23:06:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11371.6). Total num frames: 70881280. Throughput: 0: 11383.7. Samples: 70869056. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:02,789][569624] Avg episode reward: [(0, '4305.532')] [2023-03-08 23:06:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000138440_70881280.pth... [2023-03-08 23:06:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000137776_70541312.pth [2023-03-08 23:06:04,250][569911] Updated weights for policy 0, policy_version 138480 (0.0005) [2023-03-08 23:06:07,735][569911] Updated weights for policy 0, policy_version 138560 (0.0005) [2023-03-08 23:06:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11385.5). Total num frames: 70942720. Throughput: 0: 11466.0. Samples: 70940932. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:07,789][569624] Avg episode reward: [(0, '4446.321')] [2023-03-08 23:06:11,310][569911] Updated weights for policy 0, policy_version 138640 (0.0005) [2023-03-08 23:06:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11371.6). Total num frames: 71000064. Throughput: 0: 11451.2. Samples: 70975288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:12,789][569624] Avg episode reward: [(0, '4407.050')] [2023-03-08 23:06:14,899][569911] Updated weights for policy 0, policy_version 138720 (0.0005) [2023-03-08 23:06:17,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.6, 300 sec: 11357.7). Total num frames: 71053312. Throughput: 0: 11443.0. Samples: 71042832. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:17,799][569624] Avg episode reward: [(0, '4416.765')] [2023-03-08 23:06:17,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000138776_71053312.pth... [2023-03-08 23:06:17,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000138112_70713344.pth [2023-03-08 23:06:18,682][569911] Updated weights for policy 0, policy_version 138800 (0.0005) [2023-03-08 23:06:22,341][569911] Updated weights for policy 0, policy_version 138880 (0.0005) [2023-03-08 23:06:22,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.6, 300 sec: 11357.7). Total num frames: 71110656. Throughput: 0: 11373.2. Samples: 71109088. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:22,799][569624] Avg episode reward: [(0, '4398.856')] [2023-03-08 23:06:25,685][569911] Updated weights for policy 0, policy_version 138960 (0.0005) [2023-03-08 23:06:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11371.6). Total num frames: 71172096. Throughput: 0: 11441.7. Samples: 71146308. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:27,789][569624] Avg episode reward: [(0, '4345.556')] [2023-03-08 23:06:29,118][569911] Updated weights for policy 0, policy_version 139040 (0.0005) [2023-03-08 23:06:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11357.7). Total num frames: 71225344. Throughput: 0: 11486.6. Samples: 71216396. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:32,797][569911] Updated weights for policy 0, policy_version 139120 (0.0005) [2023-03-08 23:06:32,800][569624] Avg episode reward: [(0, '4528.342')] [2023-03-08 23:06:32,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000139120_71229440.pth... 
[2023-03-08 23:06:32,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000138440_70881280.pth [2023-03-08 23:06:36,518][569911] Updated weights for policy 0, policy_version 139200 (0.0005) [2023-03-08 23:06:37,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11400.5, 300 sec: 11343.8). Total num frames: 71282688. Throughput: 0: 11465.9. Samples: 71282556. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:37,799][569624] Avg episode reward: [(0, '4362.034')] [2023-03-08 23:06:40,265][569911] Updated weights for policy 0, policy_version 139280 (0.0006) [2023-03-08 23:06:42,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11343.8). Total num frames: 71335936. Throughput: 0: 11445.3. Samples: 71315088. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:42,789][569624] Avg episode reward: [(0, '4527.496')] [2023-03-08 23:06:43,928][569911] Updated weights for policy 0, policy_version 139360 (0.0005) [2023-03-08 23:06:47,430][569911] Updated weights for policy 0, policy_version 139440 (0.0005) [2023-03-08 23:06:47,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11400.5, 300 sec: 11357.7). Total num frames: 71397376. Throughput: 0: 11415.2. Samples: 71382744. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:47,790][569624] Avg episode reward: [(0, '4214.715')] [2023-03-08 23:06:47,795][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000139448_71397376.pth... [2023-03-08 23:06:47,797][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000138776_71053312.pth [2023-03-08 23:06:50,953][569911] Updated weights for policy 0, policy_version 139520 (0.0005) [2023-03-08 23:06:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11357.7). Total num frames: 71454720. Throughput: 0: 11381.5. Samples: 71453100. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:52,789][569624] Avg episode reward: [(0, '4356.279')] [2023-03-08 23:06:54,429][569911] Updated weights for policy 0, policy_version 139600 (0.0005) [2023-03-08 23:06:57,789][569624] Fps is (10 sec: 11469.1, 60 sec: 11468.8, 300 sec: 11343.8). Total num frames: 71512064. Throughput: 0: 11383.7. Samples: 71487552. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:06:57,789][569624] Avg episode reward: [(0, '4274.854')] [2023-03-08 23:06:58,073][569911] Updated weights for policy 0, policy_version 139680 (0.0005) [2023-03-08 23:07:01,637][569911] Updated weights for policy 0, policy_version 139760 (0.0005) [2023-03-08 23:07:02,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11343.8). Total num frames: 71565312. Throughput: 0: 11394.9. Samples: 71555604. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:02,789][569624] Avg episode reward: [(0, '4416.788')] [2023-03-08 23:07:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000139776_71565312.pth... [2023-03-08 23:07:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000139120_71229440.pth [2023-03-08 23:07:05,374][569911] Updated weights for policy 0, policy_version 139840 (0.0005) [2023-03-08 23:07:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11357.7). Total num frames: 71626752. Throughput: 0: 11430.0. Samples: 71623440. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:07,789][569624] Avg episode reward: [(0, '4368.224')] [2023-03-08 23:07:08,771][569911] Updated weights for policy 0, policy_version 139920 (0.0005) [2023-03-08 23:07:12,415][569911] Updated weights for policy 0, policy_version 140000 (0.0005) [2023-03-08 23:07:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 71680000. 
Throughput: 0: 11404.9. Samples: 71659528. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:12,789][569624] Avg episode reward: [(0, '4198.414')] [2023-03-08 23:07:16,117][569911] Updated weights for policy 0, policy_version 140080 (0.0005) [2023-03-08 23:07:17,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11357.7). Total num frames: 71737344. Throughput: 0: 11305.2. Samples: 71725128. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:17,789][569624] Avg episode reward: [(0, '4167.634')] [2023-03-08 23:07:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000140112_71737344.pth... [2023-03-08 23:07:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000139448_71397376.pth [2023-03-08 23:07:19,703][569911] Updated weights for policy 0, policy_version 140160 (0.0005) [2023-03-08 23:07:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11371.6). Total num frames: 71794688. Throughput: 0: 11379.4. Samples: 71794628. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:22,789][569624] Avg episode reward: [(0, '4369.227')] [2023-03-08 23:07:23,252][569911] Updated weights for policy 0, policy_version 140240 (0.0005) [2023-03-08 23:07:26,763][569911] Updated weights for policy 0, policy_version 140320 (0.0005) [2023-03-08 23:07:27,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11332.3, 300 sec: 11371.6). Total num frames: 71852032. Throughput: 0: 11435.9. Samples: 71829704. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:27,789][569624] Avg episode reward: [(0, '4302.641')] [2023-03-08 23:07:30,534][569911] Updated weights for policy 0, policy_version 140400 (0.0004) [2023-03-08 23:07:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11371.6). Total num frames: 71909376. Throughput: 0: 11410.4. Samples: 71896208. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:32,789][569624] Avg episode reward: [(0, '4211.601')] [2023-03-08 23:07:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000140448_71909376.pth... [2023-03-08 23:07:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000139776_71565312.pth [2023-03-08 23:07:34,038][569911] Updated weights for policy 0, policy_version 140480 (0.0005) [2023-03-08 23:07:37,694][569911] Updated weights for policy 0, policy_version 140560 (0.0005) [2023-03-08 23:07:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11371.6). Total num frames: 71966720. Throughput: 0: 11362.8. Samples: 71964424. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:37,789][569624] Avg episode reward: [(0, '4341.591')] [2023-03-08 23:07:41,511][569911] Updated weights for policy 0, policy_version 140640 (0.0004) [2023-03-08 23:07:42,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11357.7). Total num frames: 72019968. Throughput: 0: 11342.0. Samples: 71997944. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:42,789][569624] Avg episode reward: [(0, '3954.022')] [2023-03-08 23:07:45,224][569911] Updated weights for policy 0, policy_version 140720 (0.0004) [2023-03-08 23:07:47,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11343.8). Total num frames: 72077312. Throughput: 0: 11280.0. Samples: 72063204. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:47,789][569624] Avg episode reward: [(0, '3909.537')] [2023-03-08 23:07:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000140776_72077312.pth... 
[2023-03-08 23:07:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000140112_71737344.pth [2023-03-08 23:07:48,915][569911] Updated weights for policy 0, policy_version 140800 (0.0005) [2023-03-08 23:07:52,598][569911] Updated weights for policy 0, policy_version 140880 (0.0005) [2023-03-08 23:07:52,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 11343.8). Total num frames: 72130560. Throughput: 0: 11268.5. Samples: 72130524. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:07:52,789][569624] Avg episode reward: [(0, '4119.966')] [2023-03-08 23:07:56,346][569911] Updated weights for policy 0, policy_version 140960 (0.0005) [2023-03-08 23:07:57,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 11343.8). Total num frames: 72187904. Throughput: 0: 11199.6. Samples: 72163512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:07:57,789][569624] Avg episode reward: [(0, '4194.564')] [2023-03-08 23:08:00,025][569911] Updated weights for policy 0, policy_version 141040 (0.0005) [2023-03-08 23:08:02,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 11330.0). Total num frames: 72241152. Throughput: 0: 11193.6. Samples: 72228840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:02,789][569624] Avg episode reward: [(0, '4196.870')] [2023-03-08 23:08:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000141096_72241152.pth... [2023-03-08 23:08:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000140448_71909376.pth [2023-03-08 23:08:03,766][569911] Updated weights for policy 0, policy_version 141120 (0.0005) [2023-03-08 23:08:07,131][569911] Updated weights for policy 0, policy_version 141200 (0.0005) [2023-03-08 23:08:07,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 11330.0). 
Total num frames: 72298496. Throughput: 0: 11198.7. Samples: 72298568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:07,789][569624] Avg episode reward: [(0, '4069.933')] [2023-03-08 23:08:10,873][569911] Updated weights for policy 0, policy_version 141280 (0.0005) [2023-03-08 23:08:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11195.7, 300 sec: 11316.1). Total num frames: 72351744. Throughput: 0: 11151.4. Samples: 72331516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:12,789][569624] Avg episode reward: [(0, '3839.149')] [2023-03-08 23:08:14,461][569911] Updated weights for policy 0, policy_version 141360 (0.0005) [2023-03-08 23:08:17,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11195.7, 300 sec: 11316.1). Total num frames: 72409088. Throughput: 0: 11185.4. Samples: 72399552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:17,789][569624] Avg episode reward: [(0, '4308.502')] [2023-03-08 23:08:17,814][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000141432_72413184.pth... [2023-03-08 23:08:17,815][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000140776_72077312.pth [2023-03-08 23:08:18,227][569911] Updated weights for policy 0, policy_version 141440 (0.0005) [2023-03-08 23:08:21,611][569911] Updated weights for policy 0, policy_version 141520 (0.0004) [2023-03-08 23:08:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11195.8, 300 sec: 11330.0). Total num frames: 72466432. Throughput: 0: 11178.0. Samples: 72467432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:22,789][569624] Avg episode reward: [(0, '4232.444')] [2023-03-08 23:08:25,460][569911] Updated weights for policy 0, policy_version 141600 (0.0004) [2023-03-08 23:08:27,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11127.5, 300 sec: 11316.1). Total num frames: 72519680. Throughput: 0: 11149.5. 
Samples: 72499672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:27,789][569624] Avg episode reward: [(0, '4466.309')] [2023-03-08 23:08:29,317][569911] Updated weights for policy 0, policy_version 141680 (0.0005) [2023-03-08 23:08:32,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11127.5, 300 sec: 11330.0). Total num frames: 72577024. Throughput: 0: 11146.7. Samples: 72564808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:32,789][569624] Avg episode reward: [(0, '4425.768')] [2023-03-08 23:08:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000141752_72577024.pth... [2023-03-08 23:08:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000141096_72241152.pth [2023-03-08 23:08:32,915][569911] Updated weights for policy 0, policy_version 141760 (0.0004) [2023-03-08 23:08:36,397][569911] Updated weights for policy 0, policy_version 141840 (0.0005) [2023-03-08 23:08:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11127.5, 300 sec: 11330.0). Total num frames: 72634368. Throughput: 0: 11198.1. Samples: 72634440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:37,789][569624] Avg episode reward: [(0, '4480.963')] [2023-03-08 23:08:39,991][569911] Updated weights for policy 0, policy_version 141920 (0.0005) [2023-03-08 23:08:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11264.0, 300 sec: 11343.8). Total num frames: 72695808. Throughput: 0: 11222.7. Samples: 72668536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:42,789][569624] Avg episode reward: [(0, '4384.951')] [2023-03-08 23:08:43,463][569911] Updated weights for policy 0, policy_version 142000 (0.0005) [2023-03-08 23:08:46,837][569911] Updated weights for policy 0, policy_version 142080 (0.0005) [2023-03-08 23:08:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11264.0, 300 sec: 11343.8). 
Total num frames: 72753152. Throughput: 0: 11391.8. Samples: 72741472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:47,789][569624] Avg episode reward: [(0, '4524.260')] [2023-03-08 23:08:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000142096_72753152.pth... [2023-03-08 23:08:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000141432_72413184.pth [2023-03-08 23:08:50,475][569911] Updated weights for policy 0, policy_version 142160 (0.0005) [2023-03-08 23:08:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11330.0). Total num frames: 72810496. Throughput: 0: 11353.5. Samples: 72809476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:52,789][569624] Avg episode reward: [(0, '4484.838')] [2023-03-08 23:08:54,118][569911] Updated weights for policy 0, policy_version 142240 (0.0005) [2023-03-08 23:08:57,744][569911] Updated weights for policy 0, policy_version 142320 (0.0005) [2023-03-08 23:08:57,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11332.3, 300 sec: 11330.0). Total num frames: 72867840. Throughput: 0: 11371.3. Samples: 72843224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:08:57,789][569624] Avg episode reward: [(0, '4484.084')] [2023-03-08 23:09:01,319][569911] Updated weights for policy 0, policy_version 142400 (0.0005) [2023-03-08 23:09:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11329.9). Total num frames: 72925184. Throughput: 0: 11382.7. Samples: 72911772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:02,789][569624] Avg episode reward: [(0, '4511.748')] [2023-03-08 23:09:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000142432_72925184.pth... 
[2023-03-08 23:09:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000141752_72577024.pth [2023-03-08 23:09:04,989][569911] Updated weights for policy 0, policy_version 142480 (0.0005) [2023-03-08 23:09:07,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11316.1). Total num frames: 72978432. Throughput: 0: 11355.0. Samples: 72978408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:07,789][569624] Avg episode reward: [(0, '4533.384')] [2023-03-08 23:09:08,556][569911] Updated weights for policy 0, policy_version 142560 (0.0005) [2023-03-08 23:09:12,106][569911] Updated weights for policy 0, policy_version 142640 (0.0004) [2023-03-08 23:09:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.5, 300 sec: 11316.1). Total num frames: 73035776. Throughput: 0: 11402.6. Samples: 73012788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:12,789][569624] Avg episode reward: [(0, '4661.975')] [2023-03-08 23:09:15,596][569911] Updated weights for policy 0, policy_version 142720 (0.0004) [2023-03-08 23:09:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11343.8). Total num frames: 73097216. Throughput: 0: 11544.9. Samples: 73084328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:17,789][569624] Avg episode reward: [(0, '4535.473')] [2023-03-08 23:09:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000142768_73097216.pth... [2023-03-08 23:09:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000142096_72753152.pth [2023-03-08 23:09:19,170][569911] Updated weights for policy 0, policy_version 142800 (0.0004) [2023-03-08 23:09:22,682][569911] Updated weights for policy 0, policy_version 142880 (0.0005) [2023-03-08 23:09:22,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11343.8). 
Total num frames: 73154560. Throughput: 0: 11534.6. Samples: 73153496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:22,789][569624] Avg episode reward: [(0, '4547.725')] [2023-03-08 23:09:26,162][569911] Updated weights for policy 0, policy_version 142960 (0.0005) [2023-03-08 23:09:27,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11537.1, 300 sec: 11343.8). Total num frames: 73211904. Throughput: 0: 11528.7. Samples: 73187328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:27,789][569624] Avg episode reward: [(0, '4346.637')] [2023-03-08 23:09:29,540][569911] Updated weights for policy 0, policy_version 143040 (0.0005) [2023-03-08 23:09:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11357.7). Total num frames: 73269248. Throughput: 0: 11489.4. Samples: 73258496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:32,789][569624] Avg episode reward: [(0, '4575.343')] [2023-03-08 23:09:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000143104_73269248.pth... [2023-03-08 23:09:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000142432_72925184.pth [2023-03-08 23:09:33,297][569911] Updated weights for policy 0, policy_version 143120 (0.0005) [2023-03-08 23:09:36,900][569911] Updated weights for policy 0, policy_version 143200 (0.0005) [2023-03-08 23:09:37,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11537.1, 300 sec: 11357.7). Total num frames: 73326592. Throughput: 0: 11476.9. Samples: 73325936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:37,789][569624] Avg episode reward: [(0, '4361.299')] [2023-03-08 23:09:40,485][569911] Updated weights for policy 0, policy_version 143280 (0.0005) [2023-03-08 23:09:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11371.6). Total num frames: 73383936. Throughput: 0: 11473.2. 
Samples: 73359516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:42,789][569624] Avg episode reward: [(0, '4490.479')] [2023-03-08 23:09:44,116][569911] Updated weights for policy 0, policy_version 143360 (0.0005) [2023-03-08 23:09:47,714][569911] Updated weights for policy 0, policy_version 143440 (0.0005) [2023-03-08 23:09:47,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11371.6). Total num frames: 73441280. Throughput: 0: 11491.4. Samples: 73428884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:47,789][569624] Avg episode reward: [(0, '4256.573')] [2023-03-08 23:09:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000143440_73441280.pth... [2023-03-08 23:09:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000142768_73097216.pth [2023-03-08 23:09:51,380][569911] Updated weights for policy 0, policy_version 143520 (0.0005) [2023-03-08 23:09:52,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11357.7). Total num frames: 73494528. Throughput: 0: 11470.9. Samples: 73494600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:52,789][569624] Avg episode reward: [(0, '4442.701')] [2023-03-08 23:09:54,896][569911] Updated weights for policy 0, policy_version 143600 (0.0005) [2023-03-08 23:09:57,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.5, 300 sec: 11343.8). Total num frames: 73551872. Throughput: 0: 11516.0. Samples: 73531008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:09:57,789][569624] Avg episode reward: [(0, '4533.594')] [2023-03-08 23:09:58,553][569911] Updated weights for policy 0, policy_version 143680 (0.0005) [2023-03-08 23:10:02,067][569911] Updated weights for policy 0, policy_version 143760 (0.0005) [2023-03-08 23:10:02,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.5, 300 sec: 11357.7). 
Total num frames: 73609216. Throughput: 0: 11447.1. Samples: 73599444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:02,789][569624] Avg episode reward: [(0, '4591.152')] [2023-03-08 23:10:02,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000143776_73613312.pth... [2023-03-08 23:10:02,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000143104_73269248.pth [2023-03-08 23:10:05,733][569911] Updated weights for policy 0, policy_version 143840 (0.0005) [2023-03-08 23:10:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11371.6). Total num frames: 73666560. Throughput: 0: 11401.6. Samples: 73666568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:07,789][569624] Avg episode reward: [(0, '4678.452')] [2023-03-08 23:10:09,463][569911] Updated weights for policy 0, policy_version 143920 (0.0005) [2023-03-08 23:10:12,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11371.6). Total num frames: 73723904. Throughput: 0: 11371.1. Samples: 73699028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:12,789][569624] Avg episode reward: [(0, '4598.193')] [2023-03-08 23:10:13,119][569911] Updated weights for policy 0, policy_version 144000 (0.0005) [2023-03-08 23:10:16,707][569911] Updated weights for policy 0, policy_version 144080 (0.0005) [2023-03-08 23:10:17,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 73777152. Throughput: 0: 11317.7. Samples: 73767792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:17,789][569624] Avg episode reward: [(0, '4770.099')] [2023-03-08 23:10:17,823][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000144104_73781248.pth... 
[2023-03-08 23:10:17,825][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000143440_73441280.pth [2023-03-08 23:10:20,359][569911] Updated weights for policy 0, policy_version 144160 (0.0004) [2023-03-08 23:10:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 73834496. Throughput: 0: 11299.6. Samples: 73834416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:22,789][569624] Avg episode reward: [(0, '4665.119')] [2023-03-08 23:10:23,967][569911] Updated weights for policy 0, policy_version 144240 (0.0005) [2023-03-08 23:10:27,363][569911] Updated weights for policy 0, policy_version 144320 (0.0005) [2023-03-08 23:10:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11400.5, 300 sec: 11371.6). Total num frames: 73895936. Throughput: 0: 11290.2. Samples: 73867572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:27,789][569624] Avg episode reward: [(0, '4662.498')] [2023-03-08 23:10:30,729][569911] Updated weights for policy 0, policy_version 144400 (0.0004) [2023-03-08 23:10:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11371.6). Total num frames: 73953280. Throughput: 0: 11388.5. Samples: 73941368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:32,789][569624] Avg episode reward: [(0, '4625.738')] [2023-03-08 23:10:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000144440_73953280.pth... [2023-03-08 23:10:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000143776_73613312.pth [2023-03-08 23:10:34,466][569911] Updated weights for policy 0, policy_version 144480 (0.0004) [2023-03-08 23:10:37,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 74006528. Throughput: 0: 11376.2. Samples: 74006528. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:37,789][569624] Avg episode reward: [(0, '4493.553')] [2023-03-08 23:10:38,248][569911] Updated weights for policy 0, policy_version 144560 (0.0004) [2023-03-08 23:10:41,882][569911] Updated weights for policy 0, policy_version 144640 (0.0004) [2023-03-08 23:10:42,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 74063872. Throughput: 0: 11345.0. Samples: 74041532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:42,789][569624] Avg episode reward: [(0, '4532.658')] [2023-03-08 23:10:45,278][569911] Updated weights for policy 0, policy_version 144720 (0.0005) [2023-03-08 23:10:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11385.5). Total num frames: 74125312. Throughput: 0: 11399.7. Samples: 74112432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:47,789][569624] Avg episode reward: [(0, '4667.062')] [2023-03-08 23:10:47,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000144776_74125312.pth... [2023-03-08 23:10:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000144104_73781248.pth [2023-03-08 23:10:48,691][569911] Updated weights for policy 0, policy_version 144800 (0.0005) [2023-03-08 23:10:52,380][569911] Updated weights for policy 0, policy_version 144880 (0.0004) [2023-03-08 23:10:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11385.5). Total num frames: 74182656. Throughput: 0: 11413.1. Samples: 74180160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:52,789][569624] Avg episode reward: [(0, '4291.270')] [2023-03-08 23:10:56,104][569911] Updated weights for policy 0, policy_version 144960 (0.0005) [2023-03-08 23:10:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11385.5). Total num frames: 74240000. 
Throughput: 0: 11439.4. Samples: 74213800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:10:57,789][569624] Avg episode reward: [(0, '4413.342')] [2023-03-08 23:10:59,491][569911] Updated weights for policy 0, policy_version 145040 (0.0005) [2023-03-08 23:11:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11371.6). Total num frames: 74297344. Throughput: 0: 11485.1. Samples: 74284620. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:02,789][569624] Avg episode reward: [(0, '4210.277')] [2023-03-08 23:11:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000145112_74297344.pth... [2023-03-08 23:11:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000144440_73953280.pth [2023-03-08 23:11:03,090][569911] Updated weights for policy 0, policy_version 145120 (0.0005) [2023-03-08 23:11:06,670][569911] Updated weights for policy 0, policy_version 145200 (0.0004) [2023-03-08 23:11:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11371.6). Total num frames: 74354688. Throughput: 0: 11514.7. Samples: 74352576. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:07,789][569624] Avg episode reward: [(0, '4105.451')] [2023-03-08 23:11:10,471][569911] Updated weights for policy 0, policy_version 145280 (0.0005) [2023-03-08 23:11:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.6, 300 sec: 11371.6). Total num frames: 74407936. Throughput: 0: 11468.6. Samples: 74383660. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:12,789][569624] Avg episode reward: [(0, '4327.272')] [2023-03-08 23:11:14,080][569911] Updated weights for policy 0, policy_version 145360 (0.0005) [2023-03-08 23:11:17,768][569911] Updated weights for policy 0, policy_version 145440 (0.0005) [2023-03-08 23:11:17,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11468.8, 300 sec: 11371.6). 
Total num frames: 74465280. Throughput: 0: 11358.1. Samples: 74452484. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:17,790][569624] Avg episode reward: [(0, '4188.528')] [2023-03-08 23:11:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000145440_74465280.pth... [2023-03-08 23:11:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000144776_74125312.pth [2023-03-08 23:11:21,455][569911] Updated weights for policy 0, policy_version 145520 (0.0005) [2023-03-08 23:11:22,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11357.7). Total num frames: 74522624. Throughput: 0: 11379.4. Samples: 74518600. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:22,789][569624] Avg episode reward: [(0, '4244.026')] [2023-03-08 23:11:24,913][569911] Updated weights for policy 0, policy_version 145600 (0.0005) [2023-03-08 23:11:27,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11400.5, 300 sec: 11371.6). Total num frames: 74579968. Throughput: 0: 11418.0. Samples: 74555340. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:27,789][569624] Avg episode reward: [(0, '4533.138')] [2023-03-08 23:11:28,403][569911] Updated weights for policy 0, policy_version 145680 (0.0005) [2023-03-08 23:11:31,960][569911] Updated weights for policy 0, policy_version 145760 (0.0005) [2023-03-08 23:11:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11371.6). Total num frames: 74637312. Throughput: 0: 11390.9. Samples: 74625024. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:32,789][569624] Avg episode reward: [(0, '4573.459')] [2023-03-08 23:11:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000145776_74637312.pth... 
[2023-03-08 23:11:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000145112_74297344.pth [2023-03-08 23:11:35,592][569911] Updated weights for policy 0, policy_version 145840 (0.0004) [2023-03-08 23:11:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11385.5). Total num frames: 74694656. Throughput: 0: 11348.7. Samples: 74690852. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:37,789][569624] Avg episode reward: [(0, '4476.602')] [2023-03-08 23:11:39,271][569911] Updated weights for policy 0, policy_version 145920 (0.0005) [2023-03-08 23:11:42,789][569624] Fps is (10 sec: 11059.4, 60 sec: 11400.5, 300 sec: 11357.7). Total num frames: 74747904. Throughput: 0: 11396.0. Samples: 74726620. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:42,789][569624] Avg episode reward: [(0, '4591.630')] [2023-03-08 23:11:42,807][569911] Updated weights for policy 0, policy_version 146000 (0.0005) [2023-03-08 23:11:46,489][569911] Updated weights for policy 0, policy_version 146080 (0.0005) [2023-03-08 23:11:47,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 74805248. Throughput: 0: 11298.5. Samples: 74793052. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:47,789][569624] Avg episode reward: [(0, '4517.387')] [2023-03-08 23:11:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000146104_74805248.pth... [2023-03-08 23:11:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000145440_74465280.pth [2023-03-08 23:11:50,069][569911] Updated weights for policy 0, policy_version 146160 (0.0005) [2023-03-08 23:11:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 74862592. Throughput: 0: 11325.4. Samples: 74862220. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:52,789][569624] Avg episode reward: [(0, '4539.459')] [2023-03-08 23:11:53,671][569911] Updated weights for policy 0, policy_version 146240 (0.0004) [2023-03-08 23:11:57,366][569911] Updated weights for policy 0, policy_version 146320 (0.0005) [2023-03-08 23:11:57,789][569624] Fps is (10 sec: 11059.4, 60 sec: 11264.0, 300 sec: 11357.7). Total num frames: 74915840. Throughput: 0: 11373.2. Samples: 74895456. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:11:57,789][569624] Avg episode reward: [(0, '4667.112')] [2023-03-08 23:12:00,871][569911] Updated weights for policy 0, policy_version 146400 (0.0005) [2023-03-08 23:12:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 74977280. Throughput: 0: 11386.2. Samples: 74964864. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:12:02,789][569624] Avg episode reward: [(0, '4519.606')] [2023-03-08 23:12:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000146440_74977280.pth... [2023-03-08 23:12:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000145776_74637312.pth [2023-03-08 23:12:04,413][569911] Updated weights for policy 0, policy_version 146480 (0.0004) [2023-03-08 23:12:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11332.3, 300 sec: 11371.6). Total num frames: 75034624. Throughput: 0: 11443.5. Samples: 75033560. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:07,789][569624] Avg episode reward: [(0, '4543.343')] [2023-03-08 23:12:08,044][569911] Updated weights for policy 0, policy_version 146560 (0.0005) [2023-03-08 23:12:11,757][569911] Updated weights for policy 0, policy_version 146640 (0.0005) [2023-03-08 23:12:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 75087872. 
Throughput: 0: 11371.5. Samples: 75067056. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:12,789][569624] Avg episode reward: [(0, '4655.347')] [2023-03-08 23:12:15,630][569911] Updated weights for policy 0, policy_version 146720 (0.0005) [2023-03-08 23:12:17,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11357.7). Total num frames: 75145216. Throughput: 0: 11239.8. Samples: 75130816. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:17,789][569624] Avg episode reward: [(0, '4546.700')] [2023-03-08 23:12:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000146768_75145216.pth... [2023-03-08 23:12:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000146104_74805248.pth [2023-03-08 23:12:19,368][569911] Updated weights for policy 0, policy_version 146800 (0.0005) [2023-03-08 23:12:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 11343.8). Total num frames: 75198464. Throughput: 0: 11255.3. Samples: 75197340. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:22,789][569624] Avg episode reward: [(0, '4534.587')] [2023-03-08 23:12:23,055][569911] Updated weights for policy 0, policy_version 146880 (0.0005) [2023-03-08 23:12:26,777][569911] Updated weights for policy 0, policy_version 146960 (0.0005) [2023-03-08 23:12:27,789][569624] Fps is (10 sec: 10649.7, 60 sec: 11195.7, 300 sec: 11330.0). Total num frames: 75251712. Throughput: 0: 11184.6. Samples: 75229928. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:27,789][569624] Avg episode reward: [(0, '4690.211')] [2023-03-08 23:12:30,326][569911] Updated weights for policy 0, policy_version 147040 (0.0005) [2023-03-08 23:12:32,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.8, 300 sec: 11330.0). Total num frames: 75309056. Throughput: 0: 11231.1. Samples: 75298448. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:32,789][569624] Avg episode reward: [(0, '4710.495')] [2023-03-08 23:12:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000147088_75309056.pth... [2023-03-08 23:12:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000146440_74977280.pth [2023-03-08 23:12:33,986][569911] Updated weights for policy 0, policy_version 147120 (0.0005) [2023-03-08 23:12:37,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11127.5, 300 sec: 11330.0). Total num frames: 75362304. Throughput: 0: 11124.1. Samples: 75362804. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:37,789][569624] Avg episode reward: [(0, '4433.271')] [2023-03-08 23:12:37,821][569911] Updated weights for policy 0, policy_version 147200 (0.0004) [2023-03-08 23:12:41,174][569911] Updated weights for policy 0, policy_version 147280 (0.0005) [2023-03-08 23:12:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11343.8). Total num frames: 75423744. Throughput: 0: 11146.6. Samples: 75397052. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:42,789][569624] Avg episode reward: [(0, '4463.887')] [2023-03-08 23:12:44,777][569911] Updated weights for policy 0, policy_version 147360 (0.0005) [2023-03-08 23:12:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11264.0, 300 sec: 11357.7). Total num frames: 75481088. Throughput: 0: 11190.0. Samples: 75468412. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:47,789][569624] Avg episode reward: [(0, '4327.073')] [2023-03-08 23:12:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000147424_75481088.pth... 
[2023-03-08 23:12:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000146768_75145216.pth [2023-03-08 23:12:48,412][569911] Updated weights for policy 0, policy_version 147440 (0.0005) [2023-03-08 23:12:51,997][569911] Updated weights for policy 0, policy_version 147520 (0.0005) [2023-03-08 23:12:52,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11357.7). Total num frames: 75538432. Throughput: 0: 11167.8. Samples: 75536112. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:52,789][569624] Avg episode reward: [(0, '4601.232')] [2023-03-08 23:12:55,693][569911] Updated weights for policy 0, policy_version 147600 (0.0004) [2023-03-08 23:12:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11371.6). Total num frames: 75595776. Throughput: 0: 11166.8. Samples: 75569564. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:12:57,789][569624] Avg episode reward: [(0, '4632.037')] [2023-03-08 23:12:59,137][569911] Updated weights for policy 0, policy_version 147680 (0.0005) [2023-03-08 23:13:02,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11195.8, 300 sec: 11357.7). Total num frames: 75649024. Throughput: 0: 11312.9. Samples: 75639896. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:13:02,789][569624] Avg episode reward: [(0, '4693.176')] [2023-03-08 23:13:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000147752_75649024.pth... [2023-03-08 23:13:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000147088_75309056.pth [2023-03-08 23:13:02,899][569911] Updated weights for policy 0, policy_version 147760 (0.0004) [2023-03-08 23:13:06,598][569911] Updated weights for policy 0, policy_version 147840 (0.0004) [2023-03-08 23:13:07,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 11371.6). 
Total num frames: 75706368. Throughput: 0: 11275.5. Samples: 75704736. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:13:07,789][569624] Avg episode reward: [(0, '4563.762')] [2023-03-08 23:13:10,405][569911] Updated weights for policy 0, policy_version 147920 (0.0005) [2023-03-08 23:13:12,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11195.7, 300 sec: 11357.7). Total num frames: 75759616. Throughput: 0: 11251.4. Samples: 75736240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:12,789][569624] Avg episode reward: [(0, '4638.785')] [2023-03-08 23:13:14,141][569911] Updated weights for policy 0, policy_version 148000 (0.0004) [2023-03-08 23:13:17,789][569624] Fps is (10 sec: 10649.7, 60 sec: 11127.5, 300 sec: 11343.8). Total num frames: 75812864. Throughput: 0: 11165.5. Samples: 75800896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:17,789][569624] Avg episode reward: [(0, '4779.860')] [2023-03-08 23:13:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000148072_75812864.pth... [2023-03-08 23:13:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000147424_75481088.pth [2023-03-08 23:13:17,946][569911] Updated weights for policy 0, policy_version 148080 (0.0005) [2023-03-08 23:13:21,643][569911] Updated weights for policy 0, policy_version 148160 (0.0005) [2023-03-08 23:13:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 11357.7). Total num frames: 75870208. Throughput: 0: 11237.4. Samples: 75868488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:22,789][569624] Avg episode reward: [(0, '4651.180')] [2023-03-08 23:13:25,239][569911] Updated weights for policy 0, policy_version 148240 (0.0004) [2023-03-08 23:13:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 11343.8). Total num frames: 75923456. Throughput: 0: 11232.7. 
Samples: 75902524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:27,789][569624] Avg episode reward: [(0, '4704.024')] [2023-03-08 23:13:28,992][569911] Updated weights for policy 0, policy_version 148320 (0.0005) [2023-03-08 23:13:32,676][569911] Updated weights for policy 0, policy_version 148400 (0.0004) [2023-03-08 23:13:32,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 11343.8). Total num frames: 75980800. Throughput: 0: 11112.3. Samples: 75968468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:32,789][569624] Avg episode reward: [(0, '4745.962')] [2023-03-08 23:13:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000148400_75980800.pth... [2023-03-08 23:13:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000147752_75649024.pth [2023-03-08 23:13:36,267][569911] Updated weights for policy 0, policy_version 148480 (0.0005) [2023-03-08 23:13:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11330.0). Total num frames: 76038144. Throughput: 0: 11119.0. Samples: 76036464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:37,789][569624] Avg episode reward: [(0, '4643.877')] [2023-03-08 23:13:39,915][569911] Updated weights for policy 0, policy_version 148560 (0.0004) [2023-03-08 23:13:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11195.7, 300 sec: 11330.0). Total num frames: 76095488. Throughput: 0: 11141.1. Samples: 76070912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:42,789][569624] Avg episode reward: [(0, '4635.028')] [2023-03-08 23:13:43,597][569911] Updated weights for policy 0, policy_version 148640 (0.0004) [2023-03-08 23:13:47,097][569911] Updated weights for policy 0, policy_version 148720 (0.0005) [2023-03-08 23:13:47,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 11316.1). 
Total num frames: 76148736. Throughput: 0: 11081.7. Samples: 76138572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:47,789][569624] Avg episode reward: [(0, '4709.593')] [2023-03-08 23:13:47,856][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000148736_76152832.pth... [2023-03-08 23:13:47,858][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000148072_75812864.pth [2023-03-08 23:13:50,983][569911] Updated weights for policy 0, policy_version 148800 (0.0005) [2023-03-08 23:13:52,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11127.5, 300 sec: 11316.1). Total num frames: 76206080. Throughput: 0: 11070.5. Samples: 76202908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:52,789][569624] Avg episode reward: [(0, '4521.498')] [2023-03-08 23:13:54,807][569911] Updated weights for policy 0, policy_version 148880 (0.0004) [2023-03-08 23:13:57,789][569624] Fps is (10 sec: 10649.7, 60 sec: 10990.9, 300 sec: 11288.3). Total num frames: 76255232. Throughput: 0: 11076.1. Samples: 76234664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:13:57,789][569624] Avg episode reward: [(0, '4589.054')] [2023-03-08 23:13:58,544][569911] Updated weights for policy 0, policy_version 148960 (0.0004) [2023-03-08 23:14:02,428][569911] Updated weights for policy 0, policy_version 149040 (0.0004) [2023-03-08 23:14:02,789][569624] Fps is (10 sec: 10649.6, 60 sec: 11059.2, 300 sec: 11302.2). Total num frames: 76312576. Throughput: 0: 11091.7. Samples: 76300024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:02,789][569624] Avg episode reward: [(0, '4573.368')] [2023-03-08 23:14:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000149048_76312576.pth... 
[2023-03-08 23:14:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000148400_75980800.pth [2023-03-08 23:14:05,859][569911] Updated weights for policy 0, policy_version 149120 (0.0005) [2023-03-08 23:14:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11059.2, 300 sec: 11302.2). Total num frames: 76369920. Throughput: 0: 11110.1. Samples: 76368444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:07,789][569624] Avg episode reward: [(0, '4644.775')] [2023-03-08 23:14:09,452][569911] Updated weights for policy 0, policy_version 149200 (0.0005) [2023-03-08 23:14:12,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11059.2, 300 sec: 11274.4). Total num frames: 76423168. Throughput: 0: 11115.0. Samples: 76402700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:12,789][569624] Avg episode reward: [(0, '4661.187')] [2023-03-08 23:14:13,229][569911] Updated weights for policy 0, policy_version 149280 (0.0006) [2023-03-08 23:14:16,880][569911] Updated weights for policy 0, policy_version 149360 (0.0005) [2023-03-08 23:14:17,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 11274.4). Total num frames: 76480512. Throughput: 0: 11108.9. Samples: 76468368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:17,789][569624] Avg episode reward: [(0, '4774.625')] [2023-03-08 23:14:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000149376_76480512.pth... [2023-03-08 23:14:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000148736_76152832.pth [2023-03-08 23:14:20,601][569911] Updated weights for policy 0, policy_version 149440 (0.0005) [2023-03-08 23:14:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11127.5, 300 sec: 11274.4). Total num frames: 76537856. Throughput: 0: 11080.3. Samples: 76535076. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:22,789][569624] Avg episode reward: [(0, '4728.277')] [2023-03-08 23:14:24,209][569911] Updated weights for policy 0, policy_version 149520 (0.0005) [2023-03-08 23:14:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 11260.5). Total num frames: 76591104. Throughput: 0: 11070.0. Samples: 76569064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:27,789][569624] Avg episode reward: [(0, '4454.534')] [2023-03-08 23:14:27,911][569911] Updated weights for policy 0, policy_version 149600 (0.0005) [2023-03-08 23:14:31,525][569911] Updated weights for policy 0, policy_version 149680 (0.0005) [2023-03-08 23:14:32,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11127.5, 300 sec: 11260.5). Total num frames: 76648448. Throughput: 0: 11057.8. Samples: 76636172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:32,789][569624] Avg episode reward: [(0, '4528.283')] [2023-03-08 23:14:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000149704_76648448.pth... [2023-03-08 23:14:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000149048_76312576.pth [2023-03-08 23:14:34,880][569911] Updated weights for policy 0, policy_version 149760 (0.0004) [2023-03-08 23:14:37,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11127.5, 300 sec: 11260.5). Total num frames: 76705792. Throughput: 0: 11175.2. Samples: 76705792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:37,789][569624] Avg episode reward: [(0, '4585.707')] [2023-03-08 23:14:38,649][569911] Updated weights for policy 0, policy_version 149840 (0.0005) [2023-03-08 23:14:42,319][569911] Updated weights for policy 0, policy_version 149920 (0.0004) [2023-03-08 23:14:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11127.4, 300 sec: 11260.5). Total num frames: 76763136. 
Throughput: 0: 11234.4. Samples: 76740212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:42,789][569624] Avg episode reward: [(0, '4358.729')] [2023-03-08 23:14:45,833][569911] Updated weights for policy 0, policy_version 150000 (0.0005) [2023-03-08 23:14:47,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11195.7, 300 sec: 11274.4). Total num frames: 76820480. Throughput: 0: 11289.1. Samples: 76808032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:47,789][569624] Avg episode reward: [(0, '4441.552')] [2023-03-08 23:14:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000150040_76820480.pth... [2023-03-08 23:14:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000149376_76480512.pth [2023-03-08 23:14:49,377][569911] Updated weights for policy 0, policy_version 150080 (0.0004) [2023-03-08 23:14:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11195.7, 300 sec: 11274.4). Total num frames: 76877824. Throughput: 0: 11318.3. Samples: 76877768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:52,789][569624] Avg episode reward: [(0, '4665.114')] [2023-03-08 23:14:52,943][569911] Updated weights for policy 0, policy_version 150160 (0.0005) [2023-03-08 23:14:56,580][569911] Updated weights for policy 0, policy_version 150240 (0.0005) [2023-03-08 23:14:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.2, 300 sec: 11274.4). Total num frames: 76935168. Throughput: 0: 11288.6. Samples: 76910688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:14:57,789][569624] Avg episode reward: [(0, '4670.442')] [2023-03-08 23:15:00,198][569911] Updated weights for policy 0, policy_version 150320 (0.0005) [2023-03-08 23:15:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11332.3, 300 sec: 11274.4). Total num frames: 76992512. Throughput: 0: 11383.1. Samples: 76980608. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:15:02,789][569624] Avg episode reward: [(0, '4663.021')] [2023-03-08 23:15:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000150376_76992512.pth... [2023-03-08 23:15:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000149704_76648448.pth [2023-03-08 23:15:03,587][569911] Updated weights for policy 0, policy_version 150400 (0.0005) [2023-03-08 23:15:07,230][569911] Updated weights for policy 0, policy_version 150480 (0.0004) [2023-03-08 23:15:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11274.4). Total num frames: 77049856. Throughput: 0: 11434.4. Samples: 77049624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:15:07,789][569624] Avg episode reward: [(0, '4710.597')] [2023-03-08 23:15:10,736][569911] Updated weights for policy 0, policy_version 150560 (0.0004) [2023-03-08 23:15:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11288.3). Total num frames: 77107200. Throughput: 0: 11456.5. Samples: 77084608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:15:12,789][569624] Avg episode reward: [(0, '4698.325')] [2023-03-08 23:15:14,227][569911] Updated weights for policy 0, policy_version 150640 (0.0005) [2023-03-08 23:15:17,717][569911] Updated weights for policy 0, policy_version 150720 (0.0005) [2023-03-08 23:15:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11302.2). Total num frames: 77168640. Throughput: 0: 11537.7. Samples: 77155368. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:15:17,789][569624] Avg episode reward: [(0, '4576.687')] [2023-03-08 23:15:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000150720_77168640.pth... 
[2023-03-08 23:15:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000150040_76820480.pth [2023-03-08 23:15:21,217][569911] Updated weights for policy 0, policy_version 150800 (0.0004) [2023-03-08 23:15:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11288.3). Total num frames: 77225984. Throughput: 0: 11552.2. Samples: 77225640. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:15:22,789][569624] Avg episode reward: [(0, '4559.196')] [2023-03-08 23:15:24,893][569911] Updated weights for policy 0, policy_version 150880 (0.0005) [2023-03-08 23:15:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11288.3). Total num frames: 77283328. Throughput: 0: 11516.1. Samples: 77258436. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:15:27,789][569624] Avg episode reward: [(0, '4711.411')] [2023-03-08 23:15:28,405][569911] Updated weights for policy 0, policy_version 150960 (0.0005) [2023-03-08 23:15:31,600][569911] Updated weights for policy 0, policy_version 151040 (0.0005) [2023-03-08 23:15:32,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11605.3, 300 sec: 11316.1). Total num frames: 77344768. Throughput: 0: 11650.6. Samples: 77332308. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:15:32,789][569624] Avg episode reward: [(0, '4455.162')] [2023-03-08 23:15:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000151064_77344768.pth... [2023-03-08 23:15:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000150376_76992512.pth [2023-03-08 23:15:35,216][569911] Updated weights for policy 0, policy_version 151120 (0.0005) [2023-03-08 23:15:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11316.1). Total num frames: 77402112. Throughput: 0: 11623.8. Samples: 77400840. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:15:37,789][569624] Avg episode reward: [(0, '4392.540')] [2023-03-08 23:15:38,802][569911] Updated weights for policy 0, policy_version 151200 (0.0005) [2023-03-08 23:15:42,378][569911] Updated weights for policy 0, policy_version 151280 (0.0005) [2023-03-08 23:15:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11302.2). Total num frames: 77459456. Throughput: 0: 11650.1. Samples: 77434944. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:15:42,789][569624] Avg episode reward: [(0, '4636.513')] [2023-03-08 23:15:46,171][569911] Updated weights for policy 0, policy_version 151360 (0.0004) [2023-03-08 23:15:47,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11537.1, 300 sec: 11288.3). Total num frames: 77512704. Throughput: 0: 11551.0. Samples: 77500404. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:15:47,789][569624] Avg episode reward: [(0, '4592.559')] [2023-03-08 23:15:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000151392_77512704.pth... [2023-03-08 23:15:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000150720_77168640.pth [2023-03-08 23:15:49,882][569911] Updated weights for policy 0, policy_version 151440 (0.0005) [2023-03-08 23:15:52,789][569624] Fps is (10 sec: 10649.6, 60 sec: 11468.8, 300 sec: 11274.4). Total num frames: 77565952. Throughput: 0: 11509.2. Samples: 77567536. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:15:52,789][569624] Avg episode reward: [(0, '4611.884')] [2023-03-08 23:15:53,610][569911] Updated weights for policy 0, policy_version 151520 (0.0005) [2023-03-08 23:15:57,285][569911] Updated weights for policy 0, policy_version 151600 (0.0005) [2023-03-08 23:15:57,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11468.8, 300 sec: 11274.4). Total num frames: 77623296. 
Throughput: 0: 11445.9. Samples: 77599672. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:15:57,789][569624] Avg episode reward: [(0, '4740.255')] [2023-03-08 23:16:01,219][569911] Updated weights for policy 0, policy_version 151680 (0.0004) [2023-03-08 23:16:02,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11400.5, 300 sec: 11260.5). Total num frames: 77676544. Throughput: 0: 11308.4. Samples: 77664248. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:16:02,789][569624] Avg episode reward: [(0, '4610.612')] [2023-03-08 23:16:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000151712_77676544.pth... [2023-03-08 23:16:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000151064_77344768.pth [2023-03-08 23:16:04,816][569911] Updated weights for policy 0, policy_version 151760 (0.0005) [2023-03-08 23:16:07,789][569624] Fps is (10 sec: 10649.6, 60 sec: 11332.3, 300 sec: 11260.5). Total num frames: 77729792. Throughput: 0: 11216.6. Samples: 77730388. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:16:07,789][569624] Avg episode reward: [(0, '4721.653')] [2023-03-08 23:16:08,505][569911] Updated weights for policy 0, policy_version 151840 (0.0004) [2023-03-08 23:16:12,234][569911] Updated weights for policy 0, policy_version 151920 (0.0004) [2023-03-08 23:16:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11260.5). Total num frames: 77787136. Throughput: 0: 11246.3. Samples: 77764520. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:16:12,789][569624] Avg episode reward: [(0, '4708.773')] [2023-03-08 23:16:15,890][569911] Updated weights for policy 0, policy_version 152000 (0.0005) [2023-03-08 23:16:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11260.5). Total num frames: 77844480. Throughput: 0: 11102.9. Samples: 77831940. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:16:17,789][569624] Avg episode reward: [(0, '4661.969')] [2023-03-08 23:16:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000152040_77844480.pth... [2023-03-08 23:16:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000151392_77512704.pth [2023-03-08 23:16:19,474][569911] Updated weights for policy 0, policy_version 152080 (0.0005) [2023-03-08 23:16:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11260.5). Total num frames: 77901824. Throughput: 0: 11156.7. Samples: 77902892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:16:22,789][569624] Avg episode reward: [(0, '4558.115')] [2023-03-08 23:16:22,800][569911] Updated weights for policy 0, policy_version 152160 (0.0005) [2023-03-08 23:16:26,206][569911] Updated weights for policy 0, policy_version 152240 (0.0004) [2023-03-08 23:16:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11332.3, 300 sec: 11274.4). Total num frames: 77963264. Throughput: 0: 11196.8. Samples: 77938800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:16:27,789][569624] Avg episode reward: [(0, '4662.871')] [2023-03-08 23:16:29,610][569911] Updated weights for policy 0, policy_version 152320 (0.0004) [2023-03-08 23:16:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11264.0, 300 sec: 11274.4). Total num frames: 78020608. Throughput: 0: 11348.6. Samples: 78011088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:16:32,789][569624] Avg episode reward: [(0, '4534.901')] [2023-03-08 23:16:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000152384_78020608.pth... 
[2023-03-08 23:16:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000151712_77676544.pth [2023-03-08 23:16:33,238][569911] Updated weights for policy 0, policy_version 152400 (0.0005) [2023-03-08 23:16:36,827][569911] Updated weights for policy 0, policy_version 152480 (0.0005) [2023-03-08 23:16:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11288.3). Total num frames: 78077952. Throughput: 0: 11344.0. Samples: 78078016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:16:37,789][569624] Avg episode reward: [(0, '4595.523')] [2023-03-08 23:16:40,301][569911] Updated weights for policy 0, policy_version 152560 (0.0005) [2023-03-08 23:16:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11288.3). Total num frames: 78135296. Throughput: 0: 11424.0. Samples: 78113752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:16:42,789][569624] Avg episode reward: [(0, '4669.801')] [2023-03-08 23:16:44,025][569911] Updated weights for policy 0, policy_version 152640 (0.0006) [2023-03-08 23:16:47,473][569911] Updated weights for policy 0, policy_version 152720 (0.0005) [2023-03-08 23:16:47,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11288.3). Total num frames: 78192640. Throughput: 0: 11492.0. Samples: 78181388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:16:47,789][569624] Avg episode reward: [(0, '4397.194')] [2023-03-08 23:16:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000152720_78192640.pth... [2023-03-08 23:16:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000152040_77844480.pth [2023-03-08 23:16:51,144][569911] Updated weights for policy 0, policy_version 152800 (0.0005) [2023-03-08 23:16:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11302.2). 
Total num frames: 78249984. Throughput: 0: 11544.0. Samples: 78249868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:16:52,789][569624] Avg episode reward: [(0, '4603.961')] [2023-03-08 23:16:54,775][569911] Updated weights for policy 0, policy_version 152880 (0.0005) [2023-03-08 23:16:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11288.3). Total num frames: 78307328. Throughput: 0: 11525.1. Samples: 78283152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:16:57,789][569624] Avg episode reward: [(0, '4460.336')] [2023-03-08 23:16:58,284][569911] Updated weights for policy 0, policy_version 152960 (0.0005) [2023-03-08 23:17:01,960][569911] Updated weights for policy 0, policy_version 153040 (0.0005) [2023-03-08 23:17:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11288.3). Total num frames: 78364672. Throughput: 0: 11560.6. Samples: 78352168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:17:02,789][569624] Avg episode reward: [(0, '4696.215')] [2023-03-08 23:17:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000153056_78364672.pth... [2023-03-08 23:17:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000152384_78020608.pth [2023-03-08 23:17:05,678][569911] Updated weights for policy 0, policy_version 153120 (0.0005) [2023-03-08 23:17:07,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11468.8, 300 sec: 11288.3). Total num frames: 78417920. Throughput: 0: 11442.4. Samples: 78417800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:17:07,789][569624] Avg episode reward: [(0, '4585.239')] [2023-03-08 23:17:09,473][569911] Updated weights for policy 0, policy_version 153200 (0.0004) [2023-03-08 23:17:12,789][569624] Fps is (10 sec: 10649.6, 60 sec: 11400.5, 300 sec: 11274.4). Total num frames: 78471168. Throughput: 0: 11352.7. 
Samples: 78449672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:17:12,789][569624] Avg episode reward: [(0, '4766.648')] [2023-03-08 23:17:13,292][569911] Updated weights for policy 0, policy_version 153280 (0.0005) [2023-03-08 23:17:17,202][569911] Updated weights for policy 0, policy_version 153360 (0.0005) [2023-03-08 23:17:17,789][569624] Fps is (10 sec: 10649.6, 60 sec: 11332.3, 300 sec: 11274.4). Total num frames: 78524416. Throughput: 0: 11151.2. Samples: 78512892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:17:17,789][569624] Avg episode reward: [(0, '4601.245')] [2023-03-08 23:17:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000153368_78524416.pth... [2023-03-08 23:17:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000152720_78192640.pth [2023-03-08 23:17:20,980][569911] Updated weights for policy 0, policy_version 153440 (0.0004) [2023-03-08 23:17:22,789][569624] Fps is (10 sec: 10649.7, 60 sec: 11264.0, 300 sec: 11274.4). Total num frames: 78577664. Throughput: 0: 11118.9. Samples: 78578364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:17:22,789][569624] Avg episode reward: [(0, '4676.827')] [2023-03-08 23:17:24,581][569911] Updated weights for policy 0, policy_version 153520 (0.0005) [2023-03-08 23:17:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 11274.4). Total num frames: 78635008. Throughput: 0: 11095.3. Samples: 78613040. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:17:27,789][569624] Avg episode reward: [(0, '4700.202')] [2023-03-08 23:17:28,231][569911] Updated weights for policy 0, policy_version 153600 (0.0005) [2023-03-08 23:17:31,824][569911] Updated weights for policy 0, policy_version 153680 (0.0005) [2023-03-08 23:17:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11195.7, 300 sec: 11288.3). 
Total num frames: 78692352. Throughput: 0: 11103.9. Samples: 78681064. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:17:32,789][569624] Avg episode reward: [(0, '4778.844')] [2023-03-08 23:17:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000153696_78692352.pth... [2023-03-08 23:17:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000153056_78364672.pth [2023-03-08 23:17:35,210][569911] Updated weights for policy 0, policy_version 153760 (0.0005) [2023-03-08 23:17:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11195.7, 300 sec: 11274.4). Total num frames: 78749696. Throughput: 0: 11108.7. Samples: 78749760. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:17:37,789][569624] Avg episode reward: [(0, '4753.517')] [2023-03-08 23:17:38,844][569911] Updated weights for policy 0, policy_version 153840 (0.0005) [2023-03-08 23:17:42,413][569911] Updated weights for policy 0, policy_version 153920 (0.0005) [2023-03-08 23:17:42,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11195.7, 300 sec: 11274.4). Total num frames: 78807040. Throughput: 0: 11162.7. Samples: 78785472. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:17:42,789][569624] Avg episode reward: [(0, '4675.878')] [2023-03-08 23:17:45,910][569911] Updated weights for policy 0, policy_version 154000 (0.0005) [2023-03-08 23:17:47,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11264.0, 300 sec: 11288.3). Total num frames: 78868480. Throughput: 0: 11202.0. Samples: 78856256. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:17:47,789][569624] Avg episode reward: [(0, '4471.781')] [2023-03-08 23:17:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000154040_78868480.pth... 
[2023-03-08 23:17:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000153368_78524416.pth [2023-03-08 23:17:49,377][569911] Updated weights for policy 0, policy_version 154080 (0.0005) [2023-03-08 23:17:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11264.0, 300 sec: 11288.3). Total num frames: 78925824. Throughput: 0: 11289.6. Samples: 78925832. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:17:52,789][569624] Avg episode reward: [(0, '4510.336')] [2023-03-08 23:17:52,904][569911] Updated weights for policy 0, policy_version 154160 (0.0004) [2023-03-08 23:17:56,591][569911] Updated weights for policy 0, policy_version 154240 (0.0005) [2023-03-08 23:17:57,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11302.2). Total num frames: 78983168. Throughput: 0: 11314.0. Samples: 78958800. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:17:57,789][569624] Avg episode reward: [(0, '4472.653')] [2023-03-08 23:18:00,283][569911] Updated weights for policy 0, policy_version 154320 (0.0004) [2023-03-08 23:18:02,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.8, 300 sec: 11288.3). Total num frames: 79036416. Throughput: 0: 11383.6. Samples: 79025152. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:18:02,789][569624] Avg episode reward: [(0, '4678.584')] [2023-03-08 23:18:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000154368_79036416.pth... [2023-03-08 23:18:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000153696_78692352.pth [2023-03-08 23:18:04,079][569911] Updated weights for policy 0, policy_version 154400 (0.0004) [2023-03-08 23:18:07,634][569911] Updated weights for policy 0, policy_version 154480 (0.0005) [2023-03-08 23:18:07,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 11302.2). 
Total num frames: 79093760. Throughput: 0: 11451.0. Samples: 79093660. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:18:07,789][569624] Avg episode reward: [(0, '4761.429')] [2023-03-08 23:18:11,320][569911] Updated weights for policy 0, policy_version 154560 (0.0005) [2023-03-08 23:18:12,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11264.0, 300 sec: 11302.2). Total num frames: 79147008. Throughput: 0: 11410.6. Samples: 79126516. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:18:12,789][569624] Avg episode reward: [(0, '4764.419')] [2023-03-08 23:18:14,987][569911] Updated weights for policy 0, policy_version 154640 (0.0005) [2023-03-08 23:18:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11316.1). Total num frames: 79208448. Throughput: 0: 11378.9. Samples: 79193116. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:18:17,789][569624] Avg episode reward: [(0, '4799.905')] [2023-03-08 23:18:17,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000154704_79208448.pth... [2023-03-08 23:18:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000154040_78868480.pth [2023-03-08 23:18:18,468][569911] Updated weights for policy 0, policy_version 154720 (0.0005) [2023-03-08 23:18:22,056][569911] Updated weights for policy 0, policy_version 154800 (0.0004) [2023-03-08 23:18:22,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11329.9). Total num frames: 79265792. Throughput: 0: 11431.1. Samples: 79264160. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:18:22,789][569624] Avg episode reward: [(0, '4684.381')] [2023-03-08 23:18:25,490][569911] Updated weights for policy 0, policy_version 154880 (0.0004) [2023-03-08 23:18:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11537.1, 300 sec: 11343.8). Total num frames: 79327232. Throughput: 0: 11403.6. 
Samples: 79298632. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:18:27,789][569624] Avg episode reward: [(0, '4692.679')] [2023-03-08 23:18:28,886][569911] Updated weights for policy 0, policy_version 154960 (0.0005) [2023-03-08 23:18:32,648][569911] Updated weights for policy 0, policy_version 155040 (0.0004) [2023-03-08 23:18:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11329.9). Total num frames: 79380480. Throughput: 0: 11380.5. Samples: 79368380. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:18:32,789][569624] Avg episode reward: [(0, '4617.051')] [2023-03-08 23:18:32,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000155040_79380480.pth... [2023-03-08 23:18:32,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000154368_79036416.pth [2023-03-08 23:18:36,454][569911] Updated weights for policy 0, policy_version 155120 (0.0004) [2023-03-08 23:18:37,789][569624] Fps is (10 sec: 10649.6, 60 sec: 11400.5, 300 sec: 11316.1). Total num frames: 79433728. Throughput: 0: 11288.2. Samples: 79433800. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:18:37,789][569624] Avg episode reward: [(0, '4693.387')] [2023-03-08 23:18:40,039][569911] Updated weights for policy 0, policy_version 155200 (0.0004) [2023-03-08 23:18:42,789][569624] Fps is (10 sec: 11059.4, 60 sec: 11400.5, 300 sec: 11330.0). Total num frames: 79491072. Throughput: 0: 11312.2. Samples: 79467848. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:18:42,789][569624] Avg episode reward: [(0, '4744.754')] [2023-03-08 23:18:43,529][569911] Updated weights for policy 0, policy_version 155280 (0.0004) [2023-03-08 23:18:46,846][569911] Updated weights for policy 0, policy_version 155360 (0.0005) [2023-03-08 23:18:47,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11400.5, 300 sec: 11343.8). 
Total num frames: 79552512. Throughput: 0: 11447.5. Samples: 79540288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:18:47,789][569624] Avg episode reward: [(0, '4693.645')] [2023-03-08 23:18:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000155376_79552512.pth... [2023-03-08 23:18:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000154704_79208448.pth [2023-03-08 23:18:50,457][569911] Updated weights for policy 0, policy_version 155440 (0.0005) [2023-03-08 23:18:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11371.6). Total num frames: 79609856. Throughput: 0: 11470.3. Samples: 79609824. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:18:52,789][569624] Avg episode reward: [(0, '4617.814')] [2023-03-08 23:18:53,847][569911] Updated weights for policy 0, policy_version 155520 (0.0004) [2023-03-08 23:18:57,389][569911] Updated weights for policy 0, policy_version 155600 (0.0004) [2023-03-08 23:18:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11371.6). Total num frames: 79667200. Throughput: 0: 11560.5. Samples: 79646736. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:18:57,789][569624] Avg episode reward: [(0, '4339.467')] [2023-03-08 23:19:01,057][569911] Updated weights for policy 0, policy_version 155680 (0.0004) [2023-03-08 23:19:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11371.6). Total num frames: 79724544. Throughput: 0: 11546.6. Samples: 79712712. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:19:02,789][569624] Avg episode reward: [(0, '4567.385')] [2023-03-08 23:19:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000155712_79724544.pth... 
[2023-03-08 23:19:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000155040_79380480.pth [2023-03-08 23:19:04,618][569911] Updated weights for policy 0, policy_version 155760 (0.0005) [2023-03-08 23:19:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11399.4). Total num frames: 79785984. Throughput: 0: 11547.3. Samples: 79783788. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:19:07,789][569624] Avg episode reward: [(0, '4553.988')] [2023-03-08 23:19:08,117][569911] Updated weights for policy 0, policy_version 155840 (0.0005) [2023-03-08 23:19:11,573][569911] Updated weights for policy 0, policy_version 155920 (0.0004) [2023-03-08 23:19:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11399.4). Total num frames: 79843328. Throughput: 0: 11560.4. Samples: 79818852. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:19:12,789][569624] Avg episode reward: [(0, '4743.604')] [2023-03-08 23:19:15,078][569911] Updated weights for policy 0, policy_version 156000 (0.0005) [2023-03-08 23:19:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11399.4). Total num frames: 79900672. Throughput: 0: 11557.3. Samples: 79888456. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:19:17,789][569624] Avg episode reward: [(0, '4663.072')] [2023-03-08 23:19:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000156056_79900672.pth... [2023-03-08 23:19:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000155376_79552512.pth [2023-03-08 23:19:18,651][569911] Updated weights for policy 0, policy_version 156080 (0.0005) [2023-03-08 23:19:22,090][569911] Updated weights for policy 0, policy_version 156160 (0.0005) [2023-03-08 23:19:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11413.3). 
Total num frames: 79958016. Throughput: 0: 11657.1. Samples: 79958368. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:19:22,789][569624] Avg episode reward: [(0, '4571.813')] [2023-03-08 23:19:25,606][569911] Updated weights for policy 0, policy_version 156240 (0.0005) [2023-03-08 23:19:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11413.3). Total num frames: 80015360. Throughput: 0: 11705.4. Samples: 79994592. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:19:27,789][569624] Avg episode reward: [(0, '4542.801')] [2023-03-08 23:19:29,251][569911] Updated weights for policy 0, policy_version 156320 (0.0004) [2023-03-08 23:19:32,459][569911] Updated weights for policy 0, policy_version 156400 (0.0005) [2023-03-08 23:19:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.4, 300 sec: 11427.1). Total num frames: 80076800. Throughput: 0: 11662.4. Samples: 80065096. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:19:32,789][569624] Avg episode reward: [(0, '4626.769')] [2023-03-08 23:19:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000156400_80076800.pth... [2023-03-08 23:19:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000155712_79724544.pth [2023-03-08 23:19:35,897][569911] Updated weights for policy 0, policy_version 156480 (0.0005) [2023-03-08 23:19:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11741.9, 300 sec: 11441.0). Total num frames: 80138240. Throughput: 0: 11674.3. Samples: 80135168. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:19:37,789][569624] Avg episode reward: [(0, '4549.156')] [2023-03-08 23:19:39,730][569911] Updated weights for policy 0, policy_version 156560 (0.0005) [2023-03-08 23:19:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11427.1). Total num frames: 80191488. Throughput: 0: 11592.2. 
Samples: 80168388. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:19:42,789][569624] Avg episode reward: [(0, '4762.609')] [2023-03-08 23:19:43,173][569911] Updated weights for policy 0, policy_version 156640 (0.0005) [2023-03-08 23:19:46,713][569911] Updated weights for policy 0, policy_version 156720 (0.0005) [2023-03-08 23:19:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11441.0). Total num frames: 80252928. Throughput: 0: 11706.9. Samples: 80239520. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:19:47,789][569624] Avg episode reward: [(0, '4806.237')] [2023-03-08 23:19:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000156744_80252928.pth... [2023-03-08 23:19:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000156056_79900672.pth [2023-03-08 23:19:50,139][569911] Updated weights for policy 0, policy_version 156800 (0.0005) [2023-03-08 23:19:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11441.0). Total num frames: 80310272. Throughput: 0: 11695.7. Samples: 80310096. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:19:52,789][569624] Avg episode reward: [(0, '4530.863')] [2023-03-08 23:19:53,646][569911] Updated weights for policy 0, policy_version 156880 (0.0004) [2023-03-08 23:19:57,191][569911] Updated weights for policy 0, policy_version 156960 (0.0005) [2023-03-08 23:19:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11441.0). Total num frames: 80367616. Throughput: 0: 11707.9. Samples: 80345708. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:19:57,789][569624] Avg episode reward: [(0, '4769.928')] [2023-03-08 23:20:00,678][569911] Updated weights for policy 0, policy_version 157040 (0.0004) [2023-03-08 23:20:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11454.9). 
Total num frames: 80429056. Throughput: 0: 11689.2. Samples: 80414468. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:20:02,789][569624] Avg episode reward: [(0, '4751.819')] [2023-03-08 23:20:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000157088_80429056.pth... [2023-03-08 23:20:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000156400_80076800.pth [2023-03-08 23:20:04,375][569911] Updated weights for policy 0, policy_version 157120 (0.0004) [2023-03-08 23:20:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.4, 300 sec: 11441.0). Total num frames: 80482304. Throughput: 0: 11643.2. Samples: 80482312. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:20:07,799][569624] Avg episode reward: [(0, '4720.062')] [2023-03-08 23:20:07,974][569911] Updated weights for policy 0, policy_version 157200 (0.0004) [2023-03-08 23:20:11,676][569911] Updated weights for policy 0, policy_version 157280 (0.0005) [2023-03-08 23:20:12,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11605.3, 300 sec: 11427.1). Total num frames: 80539648. Throughput: 0: 11566.4. Samples: 80515080. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:20:12,800][569624] Avg episode reward: [(0, '4684.936')] [2023-03-08 23:20:15,475][569911] Updated weights for policy 0, policy_version 157360 (0.0005) [2023-03-08 23:20:17,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11537.1, 300 sec: 11413.3). Total num frames: 80592896. Throughput: 0: 11457.2. Samples: 80580672. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:20:17,789][569624] Avg episode reward: [(0, '4665.657')] [2023-03-08 23:20:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000157408_80592896.pth... 
[2023-03-08 23:20:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000156744_80252928.pth [2023-03-08 23:20:19,188][569911] Updated weights for policy 0, policy_version 157440 (0.0005) [2023-03-08 23:20:22,789][569624] Fps is (10 sec: 10649.9, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 80646144. Throughput: 0: 11360.4. Samples: 80646384. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:20:22,799][569624] Avg episode reward: [(0, '4674.676')] [2023-03-08 23:20:22,896][569911] Updated weights for policy 0, policy_version 157520 (0.0005) [2023-03-08 23:20:26,563][569911] Updated weights for policy 0, policy_version 157600 (0.0005) [2023-03-08 23:20:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11468.8, 300 sec: 11385.5). Total num frames: 80703488. Throughput: 0: 11381.8. Samples: 80680568. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:20:27,800][569624] Avg episode reward: [(0, '4750.924')] [2023-03-08 23:20:30,199][569911] Updated weights for policy 0, policy_version 157680 (0.0005) [2023-03-08 23:20:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11385.5). Total num frames: 80760832. Throughput: 0: 11287.3. Samples: 80747448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-08 23:20:32,789][569624] Avg episode reward: [(0, '4556.499')] [2023-03-08 23:20:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000157736_80760832.pth... [2023-03-08 23:20:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000157088_80429056.pth [2023-03-08 23:20:33,807][569911] Updated weights for policy 0, policy_version 157760 (0.0005) [2023-03-08 23:20:37,502][569911] Updated weights for policy 0, policy_version 157840 (0.0005) [2023-03-08 23:20:37,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 11371.6). 
Total num frames: 80814080. Throughput: 0: 11201.2. Samples: 80814152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:20:37,789][569624] Avg episode reward: [(0, '4704.869')] [2023-03-08 23:20:41,177][569911] Updated weights for policy 0, policy_version 157920 (0.0004) [2023-03-08 23:20:42,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11385.5). Total num frames: 80871424. Throughput: 0: 11160.1. Samples: 80847912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:20:42,789][569624] Avg episode reward: [(0, '4632.094')] [2023-03-08 23:20:44,603][569911] Updated weights for policy 0, policy_version 158000 (0.0005) [2023-03-08 23:20:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11332.2, 300 sec: 11413.3). Total num frames: 80932864. Throughput: 0: 11186.1. Samples: 80917844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:20:47,789][569624] Avg episode reward: [(0, '4631.303')] [2023-03-08 23:20:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000158072_80932864.pth... [2023-03-08 23:20:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000157408_80592896.pth [2023-03-08 23:20:48,126][569911] Updated weights for policy 0, policy_version 158080 (0.0004) [2023-03-08 23:20:51,803][569911] Updated weights for policy 0, policy_version 158160 (0.0005) [2023-03-08 23:20:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11399.4). Total num frames: 80986112. Throughput: 0: 11195.5. Samples: 80986112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:20:52,789][569624] Avg episode reward: [(0, '4633.901')] [2023-03-08 23:20:55,438][569911] Updated weights for policy 0, policy_version 158240 (0.0005) [2023-03-08 23:20:57,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11264.0, 300 sec: 11413.3). Total num frames: 81043456. Throughput: 0: 11213.9. 
Samples: 81019704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:20:57,789][569624] Avg episode reward: [(0, '4629.485')] [2023-03-08 23:20:59,042][569911] Updated weights for policy 0, policy_version 158320 (0.0004) [2023-03-08 23:21:02,624][569911] Updated weights for policy 0, policy_version 158400 (0.0005) [2023-03-08 23:21:02,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11195.8, 300 sec: 11427.1). Total num frames: 81100800. Throughput: 0: 11286.8. Samples: 81088576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:02,789][569624] Avg episode reward: [(0, '4626.310')] [2023-03-08 23:21:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000158400_81100800.pth... [2023-03-08 23:21:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000157736_80760832.pth [2023-03-08 23:21:06,295][569911] Updated weights for policy 0, policy_version 158480 (0.0005) [2023-03-08 23:21:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11427.1). Total num frames: 81158144. Throughput: 0: 11343.0. Samples: 81156820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:07,789][569624] Avg episode reward: [(0, '4591.304')] [2023-03-08 23:21:09,818][569911] Updated weights for policy 0, policy_version 158560 (0.0005) [2023-03-08 23:21:12,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11427.1). Total num frames: 81215488. Throughput: 0: 11342.4. Samples: 81190976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:12,789][569624] Avg episode reward: [(0, '4612.123')] [2023-03-08 23:21:13,214][569911] Updated weights for policy 0, policy_version 158640 (0.0004) [2023-03-08 23:21:16,558][569911] Updated weights for policy 0, policy_version 158720 (0.0005) [2023-03-08 23:21:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11441.0). 
Total num frames: 81276928. Throughput: 0: 11488.5. Samples: 81264428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:17,789][569624] Avg episode reward: [(0, '4573.450')] [2023-03-08 23:21:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000158744_81276928.pth... [2023-03-08 23:21:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000158072_80932864.pth [2023-03-08 23:21:20,111][569911] Updated weights for policy 0, policy_version 158800 (0.0005) [2023-03-08 23:21:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11427.1). Total num frames: 81334272. Throughput: 0: 11541.4. Samples: 81333516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:22,789][569624] Avg episode reward: [(0, '4602.845')] [2023-03-08 23:21:23,793][569911] Updated weights for policy 0, policy_version 158880 (0.0005) [2023-03-08 23:21:27,553][569911] Updated weights for policy 0, policy_version 158960 (0.0004) [2023-03-08 23:21:27,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 81387520. Throughput: 0: 11471.7. Samples: 81364136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:27,789][569624] Avg episode reward: [(0, '4774.650')] [2023-03-08 23:21:31,017][569911] Updated weights for policy 0, policy_version 159040 (0.0005) [2023-03-08 23:21:32,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 81444864. Throughput: 0: 11493.0. Samples: 81435028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:32,800][569624] Avg episode reward: [(0, '4670.400')] [2023-03-08 23:21:32,817][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000159080_81448960.pth... 
[2023-03-08 23:21:32,819][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000158400_81100800.pth [2023-03-08 23:21:34,645][569911] Updated weights for policy 0, policy_version 159120 (0.0005) [2023-03-08 23:21:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11413.3). Total num frames: 81502208. Throughput: 0: 11465.4. Samples: 81502056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:37,800][569624] Avg episode reward: [(0, '4677.499')] [2023-03-08 23:21:38,345][569911] Updated weights for policy 0, policy_version 159200 (0.0005) [2023-03-08 23:21:41,844][569911] Updated weights for policy 0, policy_version 159280 (0.0005) [2023-03-08 23:21:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11413.3). Total num frames: 81559552. Throughput: 0: 11470.9. Samples: 81535892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:42,800][569624] Avg episode reward: [(0, '4732.358')] [2023-03-08 23:21:45,549][569911] Updated weights for policy 0, policy_version 159360 (0.0004) [2023-03-08 23:21:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 81616896. Throughput: 0: 11467.3. Samples: 81604604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:47,800][569624] Avg episode reward: [(0, '4707.292')] [2023-03-08 23:21:47,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000159408_81616896.pth... [2023-03-08 23:21:47,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000158744_81276928.pth [2023-03-08 23:21:49,129][569911] Updated weights for policy 0, policy_version 159440 (0.0005) [2023-03-08 23:21:52,708][569911] Updated weights for policy 0, policy_version 159520 (0.0005) [2023-03-08 23:21:52,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11413.3). 
Total num frames: 81674240. Throughput: 0: 11449.6. Samples: 81672052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:52,800][569624] Avg episode reward: [(0, '4624.330')] [2023-03-08 23:21:56,154][569911] Updated weights for policy 0, policy_version 159600 (0.0005) [2023-03-08 23:21:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11413.3). Total num frames: 81731584. Throughput: 0: 11469.0. Samples: 81707080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:21:57,800][569624] Avg episode reward: [(0, '4665.578')] [2023-03-08 23:21:59,880][569911] Updated weights for policy 0, policy_version 159680 (0.0004) [2023-03-08 23:22:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11427.1). Total num frames: 81788928. Throughput: 0: 11366.8. Samples: 81775936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:22:02,789][569624] Avg episode reward: [(0, '4710.748')] [2023-03-08 23:22:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000159744_81788928.pth... [2023-03-08 23:22:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000159080_81448960.pth [2023-03-08 23:22:03,472][569911] Updated weights for policy 0, policy_version 159760 (0.0005) [2023-03-08 23:22:06,743][569911] Updated weights for policy 0, policy_version 159840 (0.0005) [2023-03-08 23:22:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11441.0). Total num frames: 81846272. Throughput: 0: 11396.0. Samples: 81846336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:22:07,789][569624] Avg episode reward: [(0, '4521.726')] [2023-03-08 23:22:10,257][569911] Updated weights for policy 0, policy_version 159920 (0.0004) [2023-03-08 23:22:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11537.1, 300 sec: 11468.8). Total num frames: 81907712. Throughput: 0: 11530.1. 
Samples: 81882992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:22:12,789][569624] Avg episode reward: [(0, '4731.963')] [2023-03-08 23:22:13,703][569911] Updated weights for policy 0, policy_version 160000 (0.0005) [2023-03-08 23:22:17,223][569911] Updated weights for policy 0, policy_version 160080 (0.0005) [2023-03-08 23:22:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11482.7). Total num frames: 81965056. Throughput: 0: 11504.4. Samples: 81952728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:22:17,789][569624] Avg episode reward: [(0, '4783.563')] [2023-03-08 23:22:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000160088_81965056.pth... [2023-03-08 23:22:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000159408_81616896.pth [2023-03-08 23:22:20,426][569911] Updated weights for policy 0, policy_version 160160 (0.0005) [2023-03-08 23:22:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11537.1, 300 sec: 11496.6). Total num frames: 82026496. Throughput: 0: 11680.3. Samples: 82027668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:22:22,789][569624] Avg episode reward: [(0, '4738.812')] [2023-03-08 23:22:23,962][569911] Updated weights for policy 0, policy_version 160240 (0.0005) [2023-03-08 23:22:27,600][569911] Updated weights for policy 0, policy_version 160320 (0.0005) [2023-03-08 23:22:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11496.6). Total num frames: 82083840. Throughput: 0: 11687.6. Samples: 82061836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:22:27,789][569624] Avg episode reward: [(0, '4648.155')] [2023-03-08 23:22:31,094][569911] Updated weights for policy 0, policy_version 160400 (0.0005) [2023-03-08 23:22:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11496.6). 
Total num frames: 82141184. Throughput: 0: 11652.7. Samples: 82128976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:22:32,789][569624] Avg episode reward: [(0, '4601.907')] [2023-03-08 23:22:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000160432_82141184.pth... [2023-03-08 23:22:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000159744_81788928.pth [2023-03-08 23:22:34,912][569911] Updated weights for policy 0, policy_version 160480 (0.0005) [2023-03-08 23:22:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11496.6). Total num frames: 82198528. Throughput: 0: 11694.0. Samples: 82198284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:22:37,789][569624] Avg episode reward: [(0, '4697.434')] [2023-03-08 23:22:38,384][569911] Updated weights for policy 0, policy_version 160560 (0.0005) [2023-03-08 23:22:41,892][569911] Updated weights for policy 0, policy_version 160640 (0.0004) [2023-03-08 23:22:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11482.7). Total num frames: 82255872. Throughput: 0: 11664.8. Samples: 82231992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:22:42,789][569624] Avg episode reward: [(0, '4733.559')] [2023-03-08 23:22:45,489][569911] Updated weights for policy 0, policy_version 160720 (0.0005) [2023-03-08 23:22:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11482.7). Total num frames: 82313216. Throughput: 0: 11661.5. Samples: 82300704. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:22:47,789][569624] Avg episode reward: [(0, '4394.471')] [2023-03-08 23:22:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000160768_82313216.pth... 
[2023-03-08 23:22:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000160088_81965056.pth [2023-03-08 23:22:49,088][569911] Updated weights for policy 0, policy_version 160800 (0.0005) [2023-03-08 23:22:52,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11537.1, 300 sec: 11468.8). Total num frames: 82366464. Throughput: 0: 11586.0. Samples: 82367704. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:22:52,789][569624] Avg episode reward: [(0, '4525.487')] [2023-03-08 23:22:52,793][569911] Updated weights for policy 0, policy_version 160880 (0.0005) [2023-03-08 23:22:56,212][569911] Updated weights for policy 0, policy_version 160960 (0.0005) [2023-03-08 23:22:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.4, 300 sec: 11496.6). Total num frames: 82427904. Throughput: 0: 11562.9. Samples: 82403320. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:22:57,789][569624] Avg episode reward: [(0, '4398.670')] [2023-03-08 23:22:59,916][569911] Updated weights for policy 0, policy_version 161040 (0.0004) [2023-03-08 23:23:02,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11605.3, 300 sec: 11496.6). Total num frames: 82485248. Throughput: 0: 11549.0. Samples: 82472432. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:02,789][569624] Avg episode reward: [(0, '4110.924')] [2023-03-08 23:23:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000161104_82485248.pth... [2023-03-08 23:23:02,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000160432_82141184.pth [2023-03-08 23:23:03,350][569911] Updated weights for policy 0, policy_version 161120 (0.0005) [2023-03-08 23:23:07,057][569911] Updated weights for policy 0, policy_version 161200 (0.0005) [2023-03-08 23:23:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11510.5). 
Total num frames: 82542592. Throughput: 0: 11375.4. Samples: 82539560. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:07,789][569624] Avg episode reward: [(0, '4440.308')] [2023-03-08 23:23:10,583][569911] Updated weights for policy 0, policy_version 161280 (0.0005) [2023-03-08 23:23:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11496.6). Total num frames: 82599936. Throughput: 0: 11409.5. Samples: 82575264. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:12,789][569624] Avg episode reward: [(0, '4470.545')] [2023-03-08 23:23:14,094][569911] Updated weights for policy 0, policy_version 161360 (0.0005) [2023-03-08 23:23:17,628][569911] Updated weights for policy 0, policy_version 161440 (0.0004) [2023-03-08 23:23:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11496.6). Total num frames: 82657280. Throughput: 0: 11464.3. Samples: 82644868. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:17,789][569624] Avg episode reward: [(0, '4530.511')] [2023-03-08 23:23:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000161440_82657280.pth... [2023-03-08 23:23:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000160768_82313216.pth [2023-03-08 23:23:21,156][569911] Updated weights for policy 0, policy_version 161520 (0.0004) [2023-03-08 23:23:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11482.7). Total num frames: 82714624. Throughput: 0: 11475.7. Samples: 82714688. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:22,789][569624] Avg episode reward: [(0, '4733.039')] [2023-03-08 23:23:24,603][569911] Updated weights for policy 0, policy_version 161600 (0.0005) [2023-03-08 23:23:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11496.6). Total num frames: 82771968. Throughput: 0: 11537.2. 
Samples: 82751168. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:27,789][569624] Avg episode reward: [(0, '4697.823')] [2023-03-08 23:23:28,325][569911] Updated weights for policy 0, policy_version 161680 (0.0005) [2023-03-08 23:23:32,032][569911] Updated weights for policy 0, policy_version 161760 (0.0005) [2023-03-08 23:23:32,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11468.8, 300 sec: 11510.5). Total num frames: 82829312. Throughput: 0: 11467.4. Samples: 82816736. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:32,789][569624] Avg episode reward: [(0, '4640.011')] [2023-03-08 23:23:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000161776_82829312.pth... [2023-03-08 23:23:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000161104_82485248.pth [2023-03-08 23:23:35,806][569911] Updated weights for policy 0, policy_version 161840 (0.0005) [2023-03-08 23:23:37,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11400.5, 300 sec: 11496.6). Total num frames: 82882560. Throughput: 0: 11365.9. Samples: 82879172. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:37,789][569624] Avg episode reward: [(0, '4469.435')] [2023-03-08 23:23:39,529][569911] Updated weights for policy 0, policy_version 161920 (0.0005) [2023-03-08 23:23:42,789][569624] Fps is (10 sec: 10649.7, 60 sec: 11332.3, 300 sec: 11468.8). Total num frames: 82935808. Throughput: 0: 11372.3. Samples: 82915076. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:42,789][569624] Avg episode reward: [(0, '4571.439')] [2023-03-08 23:23:43,148][569911] Updated weights for policy 0, policy_version 162000 (0.0005) [2023-03-08 23:23:46,711][569911] Updated weights for policy 0, policy_version 162080 (0.0004) [2023-03-08 23:23:47,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.6, 300 sec: 11482.7). 
Total num frames: 82997248. Throughput: 0: 11363.6. Samples: 82983792. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:23:47,789][569624] Avg episode reward: [(0, '4797.781')] [2023-03-08 23:23:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000162104_82997248.pth... [2023-03-08 23:23:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000161440_82657280.pth [2023-03-08 23:23:50,360][569911] Updated weights for policy 0, policy_version 162160 (0.0004) [2023-03-08 23:23:52,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 83050496. Throughput: 0: 11355.5. Samples: 83050560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:23:52,789][569624] Avg episode reward: [(0, '4721.012')] [2023-03-08 23:23:53,969][569911] Updated weights for policy 0, policy_version 162240 (0.0005) [2023-03-08 23:23:57,655][569911] Updated weights for policy 0, policy_version 162320 (0.0005) [2023-03-08 23:23:57,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11468.8). Total num frames: 83107840. Throughput: 0: 11291.7. Samples: 83083388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:23:57,789][569624] Avg episode reward: [(0, '4611.219')] [2023-03-08 23:24:00,834][569911] Updated weights for policy 0, policy_version 162400 (0.0005) [2023-03-08 23:24:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 83169280. Throughput: 0: 11379.8. Samples: 83156960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:02,789][569624] Avg episode reward: [(0, '4706.326')] [2023-03-08 23:24:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000162440_83169280.pth... 
[2023-03-08 23:24:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000161776_82829312.pth [2023-03-08 23:24:04,228][569911] Updated weights for policy 0, policy_version 162480 (0.0005) [2023-03-08 23:24:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 83226624. Throughput: 0: 11371.5. Samples: 83226408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:07,789][569624] Avg episode reward: [(0, '4789.196')] [2023-03-08 23:24:07,964][569911] Updated weights for policy 0, policy_version 162560 (0.0005) [2023-03-08 23:24:11,523][569911] Updated weights for policy 0, policy_version 162640 (0.0005) [2023-03-08 23:24:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 83283968. Throughput: 0: 11318.6. Samples: 83260504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:12,789][569624] Avg episode reward: [(0, '4732.126')] [2023-03-08 23:24:15,277][569911] Updated weights for policy 0, policy_version 162720 (0.0005) [2023-03-08 23:24:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 83341312. Throughput: 0: 11349.2. Samples: 83327448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:17,789][569624] Avg episode reward: [(0, '4790.594')] [2023-03-08 23:24:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000162776_83341312.pth... [2023-03-08 23:24:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000162104_82997248.pth [2023-03-08 23:24:18,611][569911] Updated weights for policy 0, policy_version 162800 (0.0004) [2023-03-08 23:24:22,213][569911] Updated weights for policy 0, policy_version 162880 (0.0005) [2023-03-08 23:24:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.5, 300 sec: 11468.8). 
Total num frames: 83398656. Throughput: 0: 11545.7. Samples: 83398728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:22,789][569624] Avg episode reward: [(0, '4697.006')] [2023-03-08 23:24:25,718][569911] Updated weights for policy 0, policy_version 162960 (0.0004) [2023-03-08 23:24:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11468.8, 300 sec: 11468.8). Total num frames: 83460096. Throughput: 0: 11511.4. Samples: 83433088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:27,789][569624] Avg episode reward: [(0, '4730.702')] [2023-03-08 23:24:29,032][569911] Updated weights for policy 0, policy_version 163040 (0.0005) [2023-03-08 23:24:32,635][569911] Updated weights for policy 0, policy_version 163120 (0.0004) [2023-03-08 23:24:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11454.9). Total num frames: 83517440. Throughput: 0: 11585.8. Samples: 83505152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:32,789][569624] Avg episode reward: [(0, '4776.898')] [2023-03-08 23:24:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000163120_83517440.pth... [2023-03-08 23:24:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000162440_83169280.pth [2023-03-08 23:24:36,150][569911] Updated weights for policy 0, policy_version 163200 (0.0004) [2023-03-08 23:24:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11468.8). Total num frames: 83574784. Throughput: 0: 11649.1. Samples: 83574768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:37,789][569624] Avg episode reward: [(0, '4737.946')] [2023-03-08 23:24:39,876][569911] Updated weights for policy 0, policy_version 163280 (0.0004) [2023-03-08 23:24:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11454.9). Total num frames: 83632128. Throughput: 0: 11648.1. 
Samples: 83607552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:42,789][569624] Avg episode reward: [(0, '4702.843')] [2023-03-08 23:24:43,521][569911] Updated weights for policy 0, policy_version 163360 (0.0005) [2023-03-08 23:24:47,174][569911] Updated weights for policy 0, policy_version 163440 (0.0005) [2023-03-08 23:24:47,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11468.8, 300 sec: 11441.0). Total num frames: 83685376. Throughput: 0: 11494.3. Samples: 83674204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:24:47,791][569624] Avg episode reward: [(0, '4793.037')] [2023-03-08 23:24:47,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000163448_83685376.pth... [2023-03-08 23:24:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000162776_83341312.pth [2023-03-08 23:24:50,894][569911] Updated weights for policy 0, policy_version 163520 (0.0005) [2023-03-08 23:24:52,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11537.1, 300 sec: 11441.0). Total num frames: 83742720. Throughput: 0: 11419.8. Samples: 83740300. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:24:52,789][569624] Avg episode reward: [(0, '4794.418')] [2023-03-08 23:24:54,605][569911] Updated weights for policy 0, policy_version 163600 (0.0004) [2023-03-08 23:24:57,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11468.8, 300 sec: 11413.3). Total num frames: 83795968. Throughput: 0: 11432.5. Samples: 83774968. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:24:57,789][569624] Avg episode reward: [(0, '4754.445')] [2023-03-08 23:24:58,191][569911] Updated weights for policy 0, policy_version 163680 (0.0005) [2023-03-08 23:25:01,671][569911] Updated weights for policy 0, policy_version 163760 (0.0005) [2023-03-08 23:25:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11441.0). 
Total num frames: 83857408. Throughput: 0: 11460.4. Samples: 83843164. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:02,789][569624] Avg episode reward: [(0, '4723.677')] [2023-03-08 23:25:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000163784_83857408.pth... [2023-03-08 23:25:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000163120_83517440.pth [2023-03-08 23:25:05,245][569911] Updated weights for policy 0, policy_version 163840 (0.0005) [2023-03-08 23:25:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11441.0). Total num frames: 83914752. Throughput: 0: 11407.9. Samples: 83912084. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:07,789][569624] Avg episode reward: [(0, '4731.216')] [2023-03-08 23:25:08,898][569911] Updated weights for policy 0, policy_version 163920 (0.0004) [2023-03-08 23:25:12,689][569911] Updated weights for policy 0, policy_version 164000 (0.0005) [2023-03-08 23:25:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.5, 300 sec: 11441.0). Total num frames: 83968000. Throughput: 0: 11355.0. Samples: 83944064. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:12,789][569624] Avg episode reward: [(0, '4734.610')] [2023-03-08 23:25:16,439][569911] Updated weights for policy 0, policy_version 164080 (0.0005) [2023-03-08 23:25:17,789][569624] Fps is (10 sec: 10649.7, 60 sec: 11332.3, 300 sec: 11441.0). Total num frames: 84021248. Throughput: 0: 11214.2. Samples: 84009788. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:17,789][569624] Avg episode reward: [(0, '4700.876')] [2023-03-08 23:25:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000164104_84021248.pth... 
[2023-03-08 23:25:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000163448_83685376.pth [2023-03-08 23:25:20,014][569911] Updated weights for policy 0, policy_version 164160 (0.0005) [2023-03-08 23:25:22,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11332.2, 300 sec: 11441.0). Total num frames: 84078592. Throughput: 0: 11191.1. Samples: 84078368. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:22,789][569624] Avg episode reward: [(0, '4356.037')] [2023-03-08 23:25:23,744][569911] Updated weights for policy 0, policy_version 164240 (0.0004) [2023-03-08 23:25:27,207][569911] Updated weights for policy 0, policy_version 164320 (0.0004) [2023-03-08 23:25:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11264.0, 300 sec: 11441.0). Total num frames: 84135936. Throughput: 0: 11205.9. Samples: 84111816. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:27,799][569624] Avg episode reward: [(0, '4222.756')] [2023-03-08 23:25:30,748][569911] Updated weights for policy 0, policy_version 164400 (0.0005) [2023-03-08 23:25:32,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11264.0, 300 sec: 11454.9). Total num frames: 84193280. Throughput: 0: 11269.4. Samples: 84181324. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:32,789][569624] Avg episode reward: [(0, '4467.419')] [2023-03-08 23:25:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000164440_84193280.pth... [2023-03-08 23:25:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000163784_83857408.pth [2023-03-08 23:25:34,364][569911] Updated weights for policy 0, policy_version 164480 (0.0005) [2023-03-08 23:25:37,744][569911] Updated weights for policy 0, policy_version 164560 (0.0005) [2023-03-08 23:25:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11332.3, 300 sec: 11468.8). 
Total num frames: 84254720. Throughput: 0: 11379.6. Samples: 84252380. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:37,789][569624] Avg episode reward: [(0, '4522.807')] [2023-03-08 23:25:41,568][569911] Updated weights for policy 0, policy_version 164640 (0.0004) [2023-03-08 23:25:42,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11441.0). Total num frames: 84307968. Throughput: 0: 11385.5. Samples: 84287316. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:42,789][569624] Avg episode reward: [(0, '4622.252')] [2023-03-08 23:25:45,052][569911] Updated weights for policy 0, policy_version 164720 (0.0005) [2023-03-08 23:25:47,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11332.3, 300 sec: 11454.9). Total num frames: 84365312. Throughput: 0: 11353.6. Samples: 84354076. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:47,789][569624] Avg episode reward: [(0, '4705.270')] [2023-03-08 23:25:47,818][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000164784_84369408.pth... [2023-03-08 23:25:47,820][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000164104_84021248.pth [2023-03-08 23:25:48,505][569911] Updated weights for policy 0, policy_version 164800 (0.0005) [2023-03-08 23:25:52,031][569911] Updated weights for policy 0, policy_version 164880 (0.0004) [2023-03-08 23:25:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 84426752. Throughput: 0: 11420.9. Samples: 84426024. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:25:52,789][569624] Avg episode reward: [(0, '4564.809')] [2023-03-08 23:25:55,629][569911] Updated weights for policy 0, policy_version 164960 (0.0005) [2023-03-08 23:25:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11468.8). Total num frames: 84484096. Throughput: 0: 11451.4. 
Samples: 84459376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:25:57,789][569624] Avg episode reward: [(0, '4563.165')] [2023-03-08 23:25:59,172][569911] Updated weights for policy 0, policy_version 165040 (0.0005) [2023-03-08 23:26:02,661][569911] Updated weights for policy 0, policy_version 165120 (0.0005) [2023-03-08 23:26:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 84541440. Throughput: 0: 11541.7. Samples: 84529164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:02,789][569624] Avg episode reward: [(0, '4776.426')] [2023-03-08 23:26:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000165120_84541440.pth... [2023-03-08 23:26:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000164440_84193280.pth [2023-03-08 23:26:06,478][569911] Updated weights for policy 0, policy_version 165200 (0.0004) [2023-03-08 23:26:07,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11454.9). Total num frames: 84594688. Throughput: 0: 11473.6. Samples: 84594680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:07,789][569624] Avg episode reward: [(0, '4787.375')] [2023-03-08 23:26:10,137][569911] Updated weights for policy 0, policy_version 165280 (0.0005) [2023-03-08 23:26:12,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.5, 300 sec: 11441.0). Total num frames: 84652032. Throughput: 0: 11468.6. Samples: 84627904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:12,789][569624] Avg episode reward: [(0, '4727.296')] [2023-03-08 23:26:13,738][569911] Updated weights for policy 0, policy_version 165360 (0.0005) [2023-03-08 23:26:17,404][569911] Updated weights for policy 0, policy_version 165440 (0.0004) [2023-03-08 23:26:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11441.0). 
Total num frames: 84709376. Throughput: 0: 11452.4. Samples: 84696684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:17,800][569624] Avg episode reward: [(0, '4678.368')] [2023-03-08 23:26:17,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000165448_84709376.pth... [2023-03-08 23:26:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000164784_84369408.pth [2023-03-08 23:26:20,952][569911] Updated weights for policy 0, policy_version 165520 (0.0005) [2023-03-08 23:26:22,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11454.9). Total num frames: 84766720. Throughput: 0: 11431.2. Samples: 84766784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:22,799][569624] Avg episode reward: [(0, '4586.757')] [2023-03-08 23:26:24,242][569911] Updated weights for policy 0, policy_version 165600 (0.0005) [2023-03-08 23:26:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11454.9). Total num frames: 84824064. Throughput: 0: 11442.6. Samples: 84802232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:27,800][569624] Avg episode reward: [(0, '4695.492')] [2023-03-08 23:26:27,904][569911] Updated weights for policy 0, policy_version 165680 (0.0005) [2023-03-08 23:26:31,466][569911] Updated weights for policy 0, policy_version 165760 (0.0005) [2023-03-08 23:26:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11454.9). Total num frames: 84881408. Throughput: 0: 11453.9. Samples: 84869500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:32,800][569624] Avg episode reward: [(0, '4772.899')] [2023-03-08 23:26:32,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000165784_84881408.pth... 
[2023-03-08 23:26:32,803][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000165120_84541440.pth [2023-03-08 23:26:35,118][569911] Updated weights for policy 0, policy_version 165840 (0.0005) [2023-03-08 23:26:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11468.8). Total num frames: 84942848. Throughput: 0: 11412.5. Samples: 84939588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:37,800][569624] Avg episode reward: [(0, '4805.036')] [2023-03-08 23:26:38,457][569911] Updated weights for policy 0, policy_version 165920 (0.0005) [2023-03-08 23:26:41,899][569911] Updated weights for policy 0, policy_version 166000 (0.0005) [2023-03-08 23:26:42,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11468.8). Total num frames: 85000192. Throughput: 0: 11462.7. Samples: 84975196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:42,800][569624] Avg episode reward: [(0, '4622.117')] [2023-03-08 23:26:45,472][569911] Updated weights for policy 0, policy_version 166080 (0.0005) [2023-03-08 23:26:47,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11468.8). Total num frames: 85057536. Throughput: 0: 11476.5. Samples: 85045604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:47,800][569624] Avg episode reward: [(0, '4676.847')] [2023-03-08 23:26:47,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000166128_85057536.pth... [2023-03-08 23:26:47,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000165448_84709376.pth [2023-03-08 23:26:48,954][569911] Updated weights for policy 0, policy_version 166160 (0.0005) [2023-03-08 23:26:52,568][569911] Updated weights for policy 0, policy_version 166240 (0.0005) [2023-03-08 23:26:52,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11468.8). 
Total num frames: 85114880. Throughput: 0: 11556.4. Samples: 85114716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:52,800][569624] Avg episode reward: [(0, '4682.367')] [2023-03-08 23:26:55,920][569911] Updated weights for policy 0, policy_version 166320 (0.0005) [2023-03-08 23:26:57,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11537.1, 300 sec: 11482.7). Total num frames: 85176320. Throughput: 0: 11613.1. Samples: 85150492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:26:57,800][569624] Avg episode reward: [(0, '4591.777')] [2023-03-08 23:26:59,479][569911] Updated weights for policy 0, policy_version 166400 (0.0005) [2023-03-08 23:27:02,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11537.1, 300 sec: 11482.7). Total num frames: 85233664. Throughput: 0: 11616.2. Samples: 85219412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:02,789][569624] Avg episode reward: [(0, '4678.762')] [2023-03-08 23:27:02,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000166472_85233664.pth... [2023-03-08 23:27:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000165784_84881408.pth [2023-03-08 23:27:03,132][569911] Updated weights for policy 0, policy_version 166480 (0.0005) [2023-03-08 23:27:06,629][569911] Updated weights for policy 0, policy_version 166560 (0.0006) [2023-03-08 23:27:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11468.8). Total num frames: 85291008. Throughput: 0: 11609.0. Samples: 85289188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:07,789][569624] Avg episode reward: [(0, '4782.196')] [2023-03-08 23:27:10,302][569911] Updated weights for policy 0, policy_version 166640 (0.0005) [2023-03-08 23:27:12,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11468.8). Total num frames: 85348352. Throughput: 0: 11578.7. 
Samples: 85323272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:12,789][569624] Avg episode reward: [(0, '4715.559')] [2023-03-08 23:27:13,913][569911] Updated weights for policy 0, policy_version 166720 (0.0005) [2023-03-08 23:27:17,652][569911] Updated weights for policy 0, policy_version 166800 (0.0004) [2023-03-08 23:27:17,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11537.1, 300 sec: 11441.0). Total num frames: 85401600. Throughput: 0: 11552.8. Samples: 85389376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:17,789][569624] Avg episode reward: [(0, '4817.757')] [2023-03-08 23:27:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000166800_85401600.pth... [2023-03-08 23:27:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000166128_85057536.pth [2023-03-08 23:27:21,160][569911] Updated weights for policy 0, policy_version 166880 (0.0004) [2023-03-08 23:27:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11537.1, 300 sec: 11441.0). Total num frames: 85458944. Throughput: 0: 11532.3. Samples: 85458540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:22,789][569624] Avg episode reward: [(0, '4531.479')] [2023-03-08 23:27:24,769][569911] Updated weights for policy 0, policy_version 166960 (0.0005) [2023-03-08 23:27:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11605.3, 300 sec: 11454.9). Total num frames: 85520384. Throughput: 0: 11546.1. Samples: 85494768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:27,789][569624] Avg episode reward: [(0, '4753.756')] [2023-03-08 23:27:27,955][569911] Updated weights for policy 0, policy_version 167040 (0.0005) [2023-03-08 23:27:31,700][569911] Updated weights for policy 0, policy_version 167120 (0.0004) [2023-03-08 23:27:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11441.0). 
Total num frames: 85573632. Throughput: 0: 11517.1. Samples: 85563876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:32,789][569624] Avg episode reward: [(0, '4696.872')] [2023-03-08 23:27:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000167136_85573632.pth... [2023-03-08 23:27:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000166472_85233664.pth [2023-03-08 23:27:35,295][569911] Updated weights for policy 0, policy_version 167200 (0.0004) [2023-03-08 23:27:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11454.9). Total num frames: 85635072. Throughput: 0: 11493.9. Samples: 85631940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:37,789][569624] Avg episode reward: [(0, '4709.710')] [2023-03-08 23:27:38,840][569911] Updated weights for policy 0, policy_version 167280 (0.0005) [2023-03-08 23:27:42,253][569911] Updated weights for policy 0, policy_version 167360 (0.0004) [2023-03-08 23:27:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11537.1, 300 sec: 11454.9). Total num frames: 85692416. Throughput: 0: 11498.0. Samples: 85667904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:42,789][569624] Avg episode reward: [(0, '4706.369')] [2023-03-08 23:27:45,839][569911] Updated weights for policy 0, policy_version 167440 (0.0005) [2023-03-08 23:27:47,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11605.3, 300 sec: 11482.7). Total num frames: 85753856. Throughput: 0: 11528.3. Samples: 85738188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:47,789][569624] Avg episode reward: [(0, '4603.964')] [2023-03-08 23:27:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000167488_85753856.pth... 
[2023-03-08 23:27:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000166800_85401600.pth [2023-03-08 23:27:49,198][569911] Updated weights for policy 0, policy_version 167520 (0.0004) [2023-03-08 23:27:52,753][569911] Updated weights for policy 0, policy_version 167600 (0.0004) [2023-03-08 23:27:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11605.3, 300 sec: 11468.8). Total num frames: 85811200. Throughput: 0: 11548.1. Samples: 85808852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:52,789][569624] Avg episode reward: [(0, '4643.602')] [2023-03-08 23:27:56,399][569911] Updated weights for policy 0, policy_version 167680 (0.0005) [2023-03-08 23:27:57,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11468.8, 300 sec: 11454.9). Total num frames: 85864448. Throughput: 0: 11559.4. Samples: 85843444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:27:57,789][569624] Avg episode reward: [(0, '4597.246')] [2023-03-08 23:27:59,881][569911] Updated weights for policy 0, policy_version 167760 (0.0005) [2023-03-08 23:28:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.0, 300 sec: 11468.8). Total num frames: 85925888. Throughput: 0: 11656.0. Samples: 85913896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:28:02,789][569624] Avg episode reward: [(0, '4738.907')] [2023-03-08 23:28:02,832][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000167832_85929984.pth... [2023-03-08 23:28:02,833][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000167136_85573632.pth [2023-03-08 23:28:03,174][569911] Updated weights for policy 0, policy_version 167840 (0.0005) [2023-03-08 23:28:06,620][569911] Updated weights for policy 0, policy_version 167920 (0.0005) [2023-03-08 23:28:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11605.3, 300 sec: 11482.7). 
Total num frames: 85987328. Throughput: 0: 11708.3. Samples: 85985412. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:07,789][569624] Avg episode reward: [(0, '4581.508')] [2023-03-08 23:28:10,234][569911] Updated weights for policy 0, policy_version 168000 (0.0005) [2023-03-08 23:28:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11605.3, 300 sec: 11482.7). Total num frames: 86044672. Throughput: 0: 11670.6. Samples: 86019944. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:12,789][569624] Avg episode reward: [(0, '4696.270')] [2023-03-08 23:28:13,673][569911] Updated weights for policy 0, policy_version 168080 (0.0005) [2023-03-08 23:28:17,212][569911] Updated weights for policy 0, policy_version 168160 (0.0005) [2023-03-08 23:28:17,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11482.7). Total num frames: 86102016. Throughput: 0: 11697.0. Samples: 86090240. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:17,789][569624] Avg episode reward: [(0, '4765.080')] [2023-03-08 23:28:17,837][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000168176_86106112.pth... [2023-03-08 23:28:17,839][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000167488_85753856.pth [2023-03-08 23:28:20,633][569911] Updated weights for policy 0, policy_version 168240 (0.0005) [2023-03-08 23:28:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11482.7). Total num frames: 86159360. Throughput: 0: 11727.3. Samples: 86159668. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:22,789][569624] Avg episode reward: [(0, '4671.972')] [2023-03-08 23:28:24,235][569911] Updated weights for policy 0, policy_version 168320 (0.0005) [2023-03-08 23:28:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11482.7). Total num frames: 86216704. Throughput: 0: 11710.8. 
Samples: 86194888. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:27,789][569624] Avg episode reward: [(0, '4723.981')] [2023-03-08 23:28:27,933][569911] Updated weights for policy 0, policy_version 168400 (0.0005) [2023-03-08 23:28:31,618][569911] Updated weights for policy 0, policy_version 168480 (0.0005) [2023-03-08 23:28:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11496.6). Total num frames: 86274048. Throughput: 0: 11628.7. Samples: 86261480. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:32,789][569624] Avg episode reward: [(0, '4475.916')] [2023-03-08 23:28:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000168504_86274048.pth... [2023-03-08 23:28:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000167832_85929984.pth [2023-03-08 23:28:35,085][569911] Updated weights for policy 0, policy_version 168560 (0.0005) [2023-03-08 23:28:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11510.5). Total num frames: 86331392. Throughput: 0: 11600.7. Samples: 86330884. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:37,789][569624] Avg episode reward: [(0, '4472.210')] [2023-03-08 23:28:38,836][569911] Updated weights for policy 0, policy_version 168640 (0.0004) [2023-03-08 23:28:42,492][569911] Updated weights for policy 0, policy_version 168720 (0.0005) [2023-03-08 23:28:42,789][569624] Fps is (10 sec: 11059.4, 60 sec: 11537.1, 300 sec: 11482.7). Total num frames: 86384640. Throughput: 0: 11518.6. Samples: 86361780. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:42,789][569624] Avg episode reward: [(0, '4666.789')] [2023-03-08 23:28:45,955][569911] Updated weights for policy 0, policy_version 168800 (0.0005) [2023-03-08 23:28:47,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11468.8, 300 sec: 11496.6). 
Total num frames: 86441984. Throughput: 0: 11535.2. Samples: 86432980. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:47,789][569624] Avg episode reward: [(0, '4568.505')] [2023-03-08 23:28:47,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000168840_86446080.pth... [2023-03-08 23:28:47,804][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000168176_86106112.pth [2023-03-08 23:28:49,646][569911] Updated weights for policy 0, policy_version 168880 (0.0005) [2023-03-08 23:28:52,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11496.6). Total num frames: 86499328. Throughput: 0: 11419.5. Samples: 86499292. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:52,789][569624] Avg episode reward: [(0, '4623.989')] [2023-03-08 23:28:53,284][569911] Updated weights for policy 0, policy_version 168960 (0.0005) [2023-03-08 23:28:56,987][569911] Updated weights for policy 0, policy_version 169040 (0.0004) [2023-03-08 23:28:57,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11482.7). Total num frames: 86556672. Throughput: 0: 11385.7. Samples: 86532300. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:28:57,789][569624] Avg episode reward: [(0, '4593.224')] [2023-03-08 23:29:00,488][569911] Updated weights for policy 0, policy_version 169120 (0.0004) [2023-03-08 23:29:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11482.7). Total num frames: 86614016. Throughput: 0: 11363.5. Samples: 86601600. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:29:02,789][569624] Avg episode reward: [(0, '4720.518')] [2023-03-08 23:29:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000169168_86614016.pth... 
[2023-03-08 23:29:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000168504_86274048.pth [2023-03-08 23:29:04,045][569911] Updated weights for policy 0, policy_version 169200 (0.0004) [2023-03-08 23:29:07,695][569911] Updated weights for policy 0, policy_version 169280 (0.0004) [2023-03-08 23:29:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11482.7). Total num frames: 86671360. Throughput: 0: 11367.6. Samples: 86671212. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:29:07,789][569624] Avg episode reward: [(0, '4651.946')] [2023-03-08 23:29:11,181][569911] Updated weights for policy 0, policy_version 169360 (0.0005) [2023-03-08 23:29:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.5, 300 sec: 11482.7). Total num frames: 86728704. Throughput: 0: 11327.5. Samples: 86704624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:12,789][569624] Avg episode reward: [(0, '4688.545')] [2023-03-08 23:29:14,683][569911] Updated weights for policy 0, policy_version 169440 (0.0004) [2023-03-08 23:29:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11482.7). Total num frames: 86786048. Throughput: 0: 11387.2. Samples: 86773904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:17,789][569624] Avg episode reward: [(0, '4676.490')] [2023-03-08 23:29:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000169504_86786048.pth... [2023-03-08 23:29:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000168840_86446080.pth [2023-03-08 23:29:18,351][569911] Updated weights for policy 0, policy_version 169520 (0.0005) [2023-03-08 23:29:22,136][569911] Updated weights for policy 0, policy_version 169600 (0.0005) [2023-03-08 23:29:22,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11454.9). 
Total num frames: 86839296. Throughput: 0: 11299.7. Samples: 86839368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:22,789][569624] Avg episode reward: [(0, '4566.467')] [2023-03-08 23:29:25,637][569911] Updated weights for policy 0, policy_version 169680 (0.0004) [2023-03-08 23:29:27,789][569624] Fps is (10 sec: 11059.4, 60 sec: 11332.3, 300 sec: 11454.9). Total num frames: 86896640. Throughput: 0: 11425.1. Samples: 86875908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:27,789][569624] Avg episode reward: [(0, '4672.146')] [2023-03-08 23:29:29,362][569911] Updated weights for policy 0, policy_version 169760 (0.0004) [2023-03-08 23:29:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11332.3, 300 sec: 11454.9). Total num frames: 86953984. Throughput: 0: 11305.0. Samples: 86941704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:32,789][569624] Avg episode reward: [(0, '4754.116')] [2023-03-08 23:29:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000169832_86953984.pth... [2023-03-08 23:29:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000169168_86614016.pth [2023-03-08 23:29:32,977][569911] Updated weights for policy 0, policy_version 169840 (0.0004) [2023-03-08 23:29:36,655][569911] Updated weights for policy 0, policy_version 169920 (0.0004) [2023-03-08 23:29:37,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11332.3, 300 sec: 11454.9). Total num frames: 87011328. Throughput: 0: 11349.7. Samples: 87010028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:37,789][569624] Avg episode reward: [(0, '4785.352')] [2023-03-08 23:29:40,071][569911] Updated weights for policy 0, policy_version 170000 (0.0005) [2023-03-08 23:29:42,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 87068672. Throughput: 0: 11420.8. 
Samples: 87046236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:42,789][569624] Avg episode reward: [(0, '4716.176')] [2023-03-08 23:29:43,754][569911] Updated weights for policy 0, policy_version 170080 (0.0004) [2023-03-08 23:29:47,243][569911] Updated weights for policy 0, policy_version 170160 (0.0005) [2023-03-08 23:29:47,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 87126016. Throughput: 0: 11379.4. Samples: 87113672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:47,789][569624] Avg episode reward: [(0, '4730.694')] [2023-03-08 23:29:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000170168_87126016.pth... [2023-03-08 23:29:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000169504_86786048.pth [2023-03-08 23:29:50,792][569911] Updated weights for policy 0, policy_version 170240 (0.0005) [2023-03-08 23:29:52,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11400.5, 300 sec: 11482.7). Total num frames: 87183360. Throughput: 0: 11363.9. Samples: 87182588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:52,789][569624] Avg episode reward: [(0, '4764.179')] [2023-03-08 23:29:54,434][569911] Updated weights for policy 0, policy_version 170320 (0.0005) [2023-03-08 23:29:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.6, 300 sec: 11468.8). Total num frames: 87240704. Throughput: 0: 11366.9. Samples: 87216136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:29:57,789][569624] Avg episode reward: [(0, '4773.099')] [2023-03-08 23:29:57,839][569911] Updated weights for policy 0, policy_version 170400 (0.0005) [2023-03-08 23:30:01,394][569911] Updated weights for policy 0, policy_version 170480 (0.0005) [2023-03-08 23:30:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11482.7). 
Total num frames: 87302144. Throughput: 0: 11413.1. Samples: 87287492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:02,789][569624] Avg episode reward: [(0, '4602.767')] [2023-03-08 23:30:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000170512_87302144.pth... [2023-03-08 23:30:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000169832_86953984.pth [2023-03-08 23:30:04,886][569911] Updated weights for policy 0, policy_version 170560 (0.0005) [2023-03-08 23:30:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11496.6). Total num frames: 87359488. Throughput: 0: 11543.5. Samples: 87358828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:07,789][569624] Avg episode reward: [(0, '4803.423')] [2023-03-08 23:30:08,297][569911] Updated weights for policy 0, policy_version 170640 (0.0005) [2023-03-08 23:30:11,804][569911] Updated weights for policy 0, policy_version 170720 (0.0005) [2023-03-08 23:30:12,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11468.8, 300 sec: 11510.5). Total num frames: 87416832. Throughput: 0: 11474.7. Samples: 87392268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:12,789][569624] Avg episode reward: [(0, '4724.244')] [2023-03-08 23:30:15,266][569911] Updated weights for policy 0, policy_version 170800 (0.0005) [2023-03-08 23:30:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11524.3). Total num frames: 87478272. Throughput: 0: 11647.6. Samples: 87465848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:17,789][569624] Avg episode reward: [(0, '4686.804')] [2023-03-08 23:30:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000170856_87478272.pth... 
[2023-03-08 23:30:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000170168_87126016.pth [2023-03-08 23:30:18,742][569911] Updated weights for policy 0, policy_version 170880 (0.0005) [2023-03-08 23:30:22,240][569911] Updated weights for policy 0, policy_version 170960 (0.0005) [2023-03-08 23:30:22,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11524.3). Total num frames: 87535616. Throughput: 0: 11677.7. Samples: 87535524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:22,789][569624] Avg episode reward: [(0, '4787.537')] [2023-03-08 23:30:25,738][569911] Updated weights for policy 0, policy_version 171040 (0.0005) [2023-03-08 23:30:27,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11605.3, 300 sec: 11524.3). Total num frames: 87592960. Throughput: 0: 11646.2. Samples: 87570316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:27,789][569624] Avg episode reward: [(0, '4713.826')] [2023-03-08 23:30:29,357][569911] Updated weights for policy 0, policy_version 171120 (0.0005) [2023-03-08 23:30:32,786][569911] Updated weights for policy 0, policy_version 171200 (0.0005) [2023-03-08 23:30:32,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11524.3). Total num frames: 87654400. Throughput: 0: 11678.3. Samples: 87639196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:32,789][569624] Avg episode reward: [(0, '4801.904')] [2023-03-08 23:30:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000171200_87654400.pth... [2023-03-08 23:30:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000170512_87302144.pth [2023-03-08 23:30:36,226][569911] Updated weights for policy 0, policy_version 171280 (0.0004) [2023-03-08 23:30:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11538.2). 
Total num frames: 87711744. Throughput: 0: 11727.2. Samples: 87710312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:37,789][569624] Avg episode reward: [(0, '4803.424')] [2023-03-08 23:30:39,820][569911] Updated weights for policy 0, policy_version 171360 (0.0004) [2023-03-08 23:30:42,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11538.2). Total num frames: 87769088. Throughput: 0: 11748.5. Samples: 87744820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:42,789][569624] Avg episode reward: [(0, '4776.533')] [2023-03-08 23:30:43,274][569911] Updated weights for policy 0, policy_version 171440 (0.0005) [2023-03-08 23:30:46,734][569911] Updated weights for policy 0, policy_version 171520 (0.0005) [2023-03-08 23:30:47,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11538.2). Total num frames: 87830528. Throughput: 0: 11742.4. Samples: 87815900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:47,789][569624] Avg episode reward: [(0, '4722.108')] [2023-03-08 23:30:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000171544_87830528.pth... [2023-03-08 23:30:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000170856_87478272.pth [2023-03-08 23:30:50,133][569911] Updated weights for policy 0, policy_version 171600 (0.0005) [2023-03-08 23:30:52,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11538.2). Total num frames: 87887872. Throughput: 0: 11756.7. Samples: 87887880. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:52,789][569624] Avg episode reward: [(0, '4786.693')] [2023-03-08 23:30:53,568][569911] Updated weights for policy 0, policy_version 171680 (0.0005) [2023-03-08 23:30:57,237][569911] Updated weights for policy 0, policy_version 171760 (0.0006) [2023-03-08 23:30:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11538.2). Total num frames: 87945216. Throughput: 0: 11776.3. Samples: 87922200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:30:57,789][569624] Avg episode reward: [(0, '4667.811')] [2023-03-08 23:31:00,679][569911] Updated weights for policy 0, policy_version 171840 (0.0005) [2023-03-08 23:31:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11552.1). Total num frames: 88002560. Throughput: 0: 11665.4. Samples: 87990792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:31:02,789][569624] Avg episode reward: [(0, '4766.571')] [2023-03-08 23:31:02,800][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000171888_88006656.pth... [2023-03-08 23:31:02,802][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000171200_87654400.pth [2023-03-08 23:31:04,200][569911] Updated weights for policy 0, policy_version 171920 (0.0005) [2023-03-08 23:31:07,676][569911] Updated weights for policy 0, policy_version 172000 (0.0005) [2023-03-08 23:31:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11566.0). Total num frames: 88064000. Throughput: 0: 11723.7. Samples: 88063092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:31:07,789][569624] Avg episode reward: [(0, '4781.522')] [2023-03-08 23:31:11,245][569911] Updated weights for policy 0, policy_version 172080 (0.0005) [2023-03-08 23:31:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11566.0). Total num frames: 88121344. 
Throughput: 0: 11701.1. Samples: 88096868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:31:12,789][569624] Avg episode reward: [(0, '4816.789')] [2023-03-08 23:31:14,573][569911] Updated weights for policy 0, policy_version 172160 (0.0005) [2023-03-08 23:31:17,740][569911] Updated weights for policy 0, policy_version 172240 (0.0005) [2023-03-08 23:31:17,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11593.8). Total num frames: 88186880. Throughput: 0: 11876.0. Samples: 88173616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:31:17,789][569624] Avg episode reward: [(0, '4769.323')] [2023-03-08 23:31:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000172240_88186880.pth... [2023-03-08 23:31:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000171544_87830528.pth [2023-03-08 23:31:21,335][569911] Updated weights for policy 0, policy_version 172320 (0.0006) [2023-03-08 23:31:22,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11593.8). Total num frames: 88244224. Throughput: 0: 11801.3. Samples: 88241372. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:31:22,789][569624] Avg episode reward: [(0, '4584.788')] [2023-03-08 23:31:24,855][569911] Updated weights for policy 0, policy_version 172400 (0.0005) [2023-03-08 23:31:27,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11741.9, 300 sec: 11579.9). Total num frames: 88297472. Throughput: 0: 11824.8. Samples: 88276936. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:31:27,789][569624] Avg episode reward: [(0, '4619.621')] [2023-03-08 23:31:28,711][569911] Updated weights for policy 0, policy_version 172480 (0.0004) [2023-03-08 23:31:32,189][569911] Updated weights for policy 0, policy_version 172560 (0.0005) [2023-03-08 23:31:32,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11673.6, 300 sec: 11566.0). 
Total num frames: 88354816. Throughput: 0: 11704.2. Samples: 88342592. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:31:32,789][569624] Avg episode reward: [(0, '4326.445')] [2023-03-08 23:31:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000172568_88354816.pth... [2023-03-08 23:31:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000171888_88006656.pth [2023-03-08 23:31:35,870][569911] Updated weights for policy 0, policy_version 172640 (0.0005) [2023-03-08 23:31:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11566.0). Total num frames: 88412160. Throughput: 0: 11628.3. Samples: 88411152. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:31:37,789][569624] Avg episode reward: [(0, '4678.524')] [2023-03-08 23:31:39,127][569911] Updated weights for policy 0, policy_version 172720 (0.0004) [2023-03-08 23:31:42,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11673.6, 300 sec: 11566.0). Total num frames: 88469504. Throughput: 0: 11706.6. Samples: 88448996. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:31:42,789][569624] Avg episode reward: [(0, '4660.816')] [2023-03-08 23:31:42,846][569911] Updated weights for policy 0, policy_version 172800 (0.0004) [2023-03-08 23:31:46,397][569911] Updated weights for policy 0, policy_version 172880 (0.0005) [2023-03-08 23:31:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11566.0). Total num frames: 88526848. Throughput: 0: 11672.1. Samples: 88516036. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:31:47,789][569624] Avg episode reward: [(0, '4572.159')] [2023-03-08 23:31:47,807][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000172912_88530944.pth... 
[2023-03-08 23:31:47,809][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000172240_88186880.pth [2023-03-08 23:31:49,813][569911] Updated weights for policy 0, policy_version 172960 (0.0005) [2023-03-08 23:31:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11566.0). Total num frames: 88588288. Throughput: 0: 11624.4. Samples: 88586192. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:31:52,789][569624] Avg episode reward: [(0, '4535.862')] [2023-03-08 23:31:53,403][569911] Updated weights for policy 0, policy_version 173040 (0.0005) [2023-03-08 23:31:56,886][569911] Updated weights for policy 0, policy_version 173120 (0.0005) [2023-03-08 23:31:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11566.0). Total num frames: 88645632. Throughput: 0: 11648.8. Samples: 88621064. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:31:57,789][569624] Avg episode reward: [(0, '4705.277')] [2023-03-08 23:32:00,499][569911] Updated weights for policy 0, policy_version 173200 (0.0005) [2023-03-08 23:32:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11566.0). Total num frames: 88702976. Throughput: 0: 11490.8. Samples: 88690700. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:32:02,789][569624] Avg episode reward: [(0, '4794.832')] [2023-03-08 23:32:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000173248_88702976.pth... [2023-03-08 23:32:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000172568_88354816.pth [2023-03-08 23:32:03,886][569911] Updated weights for policy 0, policy_version 173280 (0.0005) [2023-03-08 23:32:07,656][569911] Updated weights for policy 0, policy_version 173360 (0.0005) [2023-03-08 23:32:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11566.0). 
Total num frames: 88760320. Throughput: 0: 11519.7. Samples: 88759756. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:32:07,789][569624] Avg episode reward: [(0, '4797.852')] [2023-03-08 23:32:11,186][569911] Updated weights for policy 0, policy_version 173440 (0.0005) [2023-03-08 23:32:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11579.9). Total num frames: 88817664. Throughput: 0: 11471.5. Samples: 88793152. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:32:12,789][569624] Avg episode reward: [(0, '4768.155')] [2023-03-08 23:32:14,906][569911] Updated weights for policy 0, policy_version 173520 (0.0005) [2023-03-08 23:32:17,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11400.6, 300 sec: 11566.0). Total num frames: 88870912. Throughput: 0: 11498.3. Samples: 88860016. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:32:17,789][569624] Avg episode reward: [(0, '4761.282')] [2023-03-08 23:32:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000173576_88870912.pth... [2023-03-08 23:32:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000172912_88530944.pth [2023-03-08 23:32:18,661][569911] Updated weights for policy 0, policy_version 173600 (0.0006) [2023-03-08 23:32:22,161][569911] Updated weights for policy 0, policy_version 173680 (0.0005) [2023-03-08 23:32:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11552.1). Total num frames: 88928256. Throughput: 0: 11491.6. Samples: 88928272. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:32:22,789][569624] Avg episode reward: [(0, '4823.422')] [2023-03-08 23:32:25,922][569911] Updated weights for policy 0, policy_version 173760 (0.0005) [2023-03-08 23:32:27,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11566.0). Total num frames: 88985600. Throughput: 0: 11380.0. 
Samples: 88961096. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:32:27,789][569624] Avg episode reward: [(0, '4801.241')] [2023-03-08 23:32:29,343][569911] Updated weights for policy 0, policy_version 173840 (0.0005) [2023-03-08 23:32:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11552.1). Total num frames: 89042944. Throughput: 0: 11434.5. Samples: 89030588. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:32:32,789][569624] Avg episode reward: [(0, '4691.221')] [2023-03-08 23:32:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000173912_89042944.pth... [2023-03-08 23:32:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000173248_88702976.pth [2023-03-08 23:32:33,071][569911] Updated weights for policy 0, policy_version 173920 (0.0005) [2023-03-08 23:32:36,586][569911] Updated weights for policy 0, policy_version 174000 (0.0006) [2023-03-08 23:32:37,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11552.1). Total num frames: 89100288. Throughput: 0: 11416.7. Samples: 89099944. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:32:37,789][569624] Avg episode reward: [(0, '4719.627')] [2023-03-08 23:32:40,111][569911] Updated weights for policy 0, policy_version 174080 (0.0005) [2023-03-08 23:32:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11538.2). Total num frames: 89157632. Throughput: 0: 11426.7. Samples: 89135264. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:32:42,789][569624] Avg episode reward: [(0, '4705.288')] [2023-03-08 23:32:43,618][569911] Updated weights for policy 0, policy_version 174160 (0.0005) [2023-03-08 23:32:47,288][569911] Updated weights for policy 0, policy_version 174240 (0.0005) [2023-03-08 23:32:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11538.2). 
Total num frames: 89214976. Throughput: 0: 11375.8. Samples: 89202612. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:32:47,789][569624] Avg episode reward: [(0, '4661.992')] [2023-03-08 23:32:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000174248_89214976.pth... [2023-03-08 23:32:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000173576_88870912.pth [2023-03-08 23:32:50,717][569911] Updated weights for policy 0, policy_version 174320 (0.0005) [2023-03-08 23:32:52,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11552.1). Total num frames: 89272320. Throughput: 0: 11389.3. Samples: 89272276. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:32:52,789][569624] Avg episode reward: [(0, '4742.850')] [2023-03-08 23:32:54,363][569911] Updated weights for policy 0, policy_version 174400 (0.0005) [2023-03-08 23:32:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11538.2). Total num frames: 89329664. Throughput: 0: 11406.0. Samples: 89306424. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:32:57,789][569624] Avg episode reward: [(0, '4748.711')] [2023-03-08 23:32:57,883][569911] Updated weights for policy 0, policy_version 174480 (0.0004) [2023-03-08 23:33:01,582][569911] Updated weights for policy 0, policy_version 174560 (0.0005) [2023-03-08 23:33:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11524.3). Total num frames: 89387008. Throughput: 0: 11436.1. Samples: 89374644. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:33:02,789][569624] Avg episode reward: [(0, '4530.667')] [2023-03-08 23:33:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000174584_89387008.pth... 
[2023-03-08 23:33:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000173912_89042944.pth [2023-03-08 23:33:05,198][569911] Updated weights for policy 0, policy_version 174640 (0.0004) [2023-03-08 23:33:07,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11510.5). Total num frames: 89440256. Throughput: 0: 11381.9. Samples: 89440456. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:33:07,789][569624] Avg episode reward: [(0, '4514.873')] [2023-03-08 23:33:08,919][569911] Updated weights for policy 0, policy_version 174720 (0.0006) [2023-03-08 23:33:12,520][569911] Updated weights for policy 0, policy_version 174800 (0.0005) [2023-03-08 23:33:12,789][569624] Fps is (10 sec: 11059.4, 60 sec: 11332.3, 300 sec: 11510.5). Total num frames: 89497600. Throughput: 0: 11455.3. Samples: 89476584. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:33:12,789][569624] Avg episode reward: [(0, '4483.165')] [2023-03-08 23:33:16,239][569911] Updated weights for policy 0, policy_version 174880 (0.0005) [2023-03-08 23:33:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11510.4). Total num frames: 89554944. Throughput: 0: 11380.7. Samples: 89542720. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:33:17,789][569624] Avg episode reward: [(0, '4396.834')] [2023-03-08 23:33:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000174912_89554944.pth... [2023-03-08 23:33:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000174248_89214976.pth [2023-03-08 23:33:19,754][569911] Updated weights for policy 0, policy_version 174960 (0.0005) [2023-03-08 23:33:22,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11510.5). Total num frames: 89612288. Throughput: 0: 11384.7. Samples: 89612256. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:33:22,789][569624] Avg episode reward: [(0, '4612.657')] [2023-03-08 23:33:23,308][569911] Updated weights for policy 0, policy_version 175040 (0.0005) [2023-03-08 23:33:26,846][569911] Updated weights for policy 0, policy_version 175120 (0.0005) [2023-03-08 23:33:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.6, 300 sec: 11510.5). Total num frames: 89669632. Throughput: 0: 11359.6. Samples: 89646444. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-08 23:33:27,789][569624] Avg episode reward: [(0, '4528.489')] [2023-03-08 23:33:30,444][569911] Updated weights for policy 0, policy_version 175200 (0.0005) [2023-03-08 23:33:32,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11400.6, 300 sec: 11510.5). Total num frames: 89726976. Throughput: 0: 11380.9. Samples: 89714752. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:33:32,789][569624] Avg episode reward: [(0, '4419.147')] [2023-03-08 23:33:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000175248_89726976.pth... [2023-03-08 23:33:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000174584_89387008.pth [2023-03-08 23:33:33,970][569911] Updated weights for policy 0, policy_version 175280 (0.0005) [2023-03-08 23:33:37,524][569911] Updated weights for policy 0, policy_version 175360 (0.0005) [2023-03-08 23:33:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11524.3). Total num frames: 89784320. Throughput: 0: 11380.2. Samples: 89784384. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:33:37,789][569624] Avg episode reward: [(0, '4673.054')] [2023-03-08 23:33:41,118][569911] Updated weights for policy 0, policy_version 175440 (0.0005) [2023-03-08 23:33:42,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11524.3). Total num frames: 89841664. 
Throughput: 0: 11382.6. Samples: 89818640. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:33:42,789][569624] Avg episode reward: [(0, '4821.509')] [2023-03-08 23:33:44,806][569911] Updated weights for policy 0, policy_version 175520 (0.0005) [2023-03-08 23:33:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11524.3). Total num frames: 89899008. Throughput: 0: 11377.3. Samples: 89886624. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:33:47,789][569624] Avg episode reward: [(0, '4668.617')] [2023-03-08 23:33:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000175584_89899008.pth... [2023-03-08 23:33:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000174912_89554944.pth [2023-03-08 23:33:48,494][569911] Updated weights for policy 0, policy_version 175600 (0.0005) [2023-03-08 23:33:52,274][569911] Updated weights for policy 0, policy_version 175680 (0.0004) [2023-03-08 23:33:52,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11510.5). Total num frames: 89952256. Throughput: 0: 11363.6. Samples: 89951816. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:33:52,789][569624] Avg episode reward: [(0, '4616.430')] [2023-03-08 23:33:55,997][569911] Updated weights for policy 0, policy_version 175760 (0.0005) [2023-03-08 23:33:57,789][569624] Fps is (10 sec: 10649.7, 60 sec: 11264.0, 300 sec: 11496.6). Total num frames: 90005504. Throughput: 0: 11294.7. Samples: 89984844. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:33:57,789][569624] Avg episode reward: [(0, '4678.995')] [2023-03-08 23:33:59,743][569911] Updated weights for policy 0, policy_version 175840 (0.0005) [2023-03-08 23:34:02,789][569624] Fps is (10 sec: 10649.5, 60 sec: 11195.7, 300 sec: 11482.7). Total num frames: 90058752. Throughput: 0: 11265.4. Samples: 90049664. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:34:02,789][569624] Avg episode reward: [(0, '4732.184')] [2023-03-08 23:34:02,836][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000175904_90062848.pth... [2023-03-08 23:34:02,838][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000175248_89726976.pth [2023-03-08 23:34:03,498][569911] Updated weights for policy 0, policy_version 175920 (0.0005) [2023-03-08 23:34:07,141][569911] Updated weights for policy 0, policy_version 176000 (0.0005) [2023-03-08 23:34:07,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 11482.7). Total num frames: 90116096. Throughput: 0: 11196.6. Samples: 90116104. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:34:07,789][569624] Avg episode reward: [(0, '4716.107')] [2023-03-08 23:34:10,875][569911] Updated weights for policy 0, policy_version 176080 (0.0005) [2023-03-08 23:34:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11482.7). Total num frames: 90173440. Throughput: 0: 11164.8. Samples: 90148860. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:34:12,789][569624] Avg episode reward: [(0, '4840.873')] [2023-03-08 23:34:12,790][569867] Saving new best policy, reward=4840.873! [2023-03-08 23:34:14,433][569911] Updated weights for policy 0, policy_version 176160 (0.0005) [2023-03-08 23:34:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11496.6). Total num frames: 90230784. Throughput: 0: 11189.8. Samples: 90218296. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:34:17,789][569624] Avg episode reward: [(0, '4830.693')] [2023-03-08 23:34:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000176232_90230784.pth... 
[2023-03-08 23:34:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000175584_89899008.pth [2023-03-08 23:34:17,925][569911] Updated weights for policy 0, policy_version 176240 (0.0005) [2023-03-08 23:34:21,633][569911] Updated weights for policy 0, policy_version 176320 (0.0005) [2023-03-08 23:34:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11264.0, 300 sec: 11496.6). Total num frames: 90288128. Throughput: 0: 11152.7. Samples: 90286256. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:34:22,789][569624] Avg episode reward: [(0, '4817.435')] [2023-03-08 23:34:25,402][569911] Updated weights for policy 0, policy_version 176400 (0.0005) [2023-03-08 23:34:27,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11195.7, 300 sec: 11482.7). Total num frames: 90341376. Throughput: 0: 11108.4. Samples: 90318520. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:34:27,789][569624] Avg episode reward: [(0, '4822.232')] [2023-03-08 23:34:28,975][569911] Updated weights for policy 0, policy_version 176480 (0.0005) [2023-03-08 23:34:32,758][569911] Updated weights for policy 0, policy_version 176560 (0.0004) [2023-03-08 23:34:32,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11195.7, 300 sec: 11482.7). Total num frames: 90398720. Throughput: 0: 11099.1. Samples: 90386084. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:34:32,789][569624] Avg episode reward: [(0, '4825.003')] [2023-03-08 23:34:32,794][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000176560_90398720.pth... [2023-03-08 23:34:32,797][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000175904_90062848.pth [2023-03-08 23:34:36,262][569911] Updated weights for policy 0, policy_version 176640 (0.0005) [2023-03-08 23:34:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11195.7, 300 sec: 11482.7). 
Total num frames: 90456064. Throughput: 0: 11177.3. Samples: 90454796. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:34:37,789][569624] Avg episode reward: [(0, '4738.770')] [2023-03-08 23:34:39,809][569911] Updated weights for policy 0, policy_version 176720 (0.0005) [2023-03-08 23:34:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11195.7, 300 sec: 11482.7). Total num frames: 90513408. Throughput: 0: 11199.4. Samples: 90488816. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:34:42,789][569624] Avg episode reward: [(0, '4828.786')] [2023-03-08 23:34:43,455][569911] Updated weights for policy 0, policy_version 176800 (0.0005) [2023-03-08 23:34:47,242][569911] Updated weights for policy 0, policy_version 176880 (0.0005) [2023-03-08 23:34:47,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11127.5, 300 sec: 11468.8). Total num frames: 90566656. Throughput: 0: 11237.8. Samples: 90555364. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:34:47,789][569624] Avg episode reward: [(0, '4687.340')] [2023-03-08 23:34:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000176888_90566656.pth... [2023-03-08 23:34:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000176232_90230784.pth [2023-03-08 23:34:50,914][569911] Updated weights for policy 0, policy_version 176960 (0.0005) [2023-03-08 23:34:52,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11195.7, 300 sec: 11468.8). Total num frames: 90624000. Throughput: 0: 11245.6. Samples: 90622156. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:34:52,789][569624] Avg episode reward: [(0, '4777.564')] [2023-03-08 23:34:54,544][569911] Updated weights for policy 0, policy_version 177040 (0.0004) [2023-03-08 23:34:57,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11264.0, 300 sec: 11454.9). Total num frames: 90681344. Throughput: 0: 11293.0. 
Samples: 90657044. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:34:57,789][569624] Avg episode reward: [(0, '4780.124')] [2023-03-08 23:34:57,973][569911] Updated weights for policy 0, policy_version 177120 (0.0005) [2023-03-08 23:35:01,566][569911] Updated weights for policy 0, policy_version 177200 (0.0005) [2023-03-08 23:35:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11454.9). Total num frames: 90738688. Throughput: 0: 11290.8. Samples: 90726384. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:35:02,789][569624] Avg episode reward: [(0, '4717.014')] [2023-03-08 23:35:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000177224_90738688.pth... [2023-03-08 23:35:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000176560_90398720.pth [2023-03-08 23:35:05,080][569911] Updated weights for policy 0, policy_version 177280 (0.0005) [2023-03-08 23:35:07,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11454.9). Total num frames: 90796032. Throughput: 0: 11327.5. Samples: 90795992. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:35:07,789][569624] Avg episode reward: [(0, '4633.510')] [2023-03-08 23:35:08,706][569911] Updated weights for policy 0, policy_version 177360 (0.0005) [2023-03-08 23:35:12,372][569911] Updated weights for policy 0, policy_version 177440 (0.0004) [2023-03-08 23:35:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11441.0). Total num frames: 90853376. Throughput: 0: 11336.5. Samples: 90828664. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:35:12,789][569624] Avg episode reward: [(0, '4768.596')] [2023-03-08 23:35:15,870][569911] Updated weights for policy 0, policy_version 177520 (0.0005) [2023-03-08 23:35:17,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11332.3, 300 sec: 11441.0). 
Total num frames: 90910720. Throughput: 0: 11376.0. Samples: 90898004. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:35:17,789][569624] Avg episode reward: [(0, '4733.999')] [2023-03-08 23:35:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000177560_90910720.pth... [2023-03-08 23:35:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000176888_90566656.pth [2023-03-08 23:35:19,574][569911] Updated weights for policy 0, policy_version 177600 (0.0004) [2023-03-08 23:35:22,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11264.0, 300 sec: 11427.1). Total num frames: 90963968. Throughput: 0: 11332.7. Samples: 90964768. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:35:22,789][569624] Avg episode reward: [(0, '4701.680')] [2023-03-08 23:35:23,179][569911] Updated weights for policy 0, policy_version 177680 (0.0005) [2023-03-08 23:35:26,869][569911] Updated weights for policy 0, policy_version 177760 (0.0005) [2023-03-08 23:35:27,789][569624] Fps is (10 sec: 11059.4, 60 sec: 11332.3, 300 sec: 11413.3). Total num frames: 91021312. Throughput: 0: 11289.4. Samples: 90996840. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:35:27,789][569624] Avg episode reward: [(0, '4727.343')] [2023-03-08 23:35:30,410][569911] Updated weights for policy 0, policy_version 177840 (0.0005) [2023-03-08 23:35:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11413.3). Total num frames: 91078656. Throughput: 0: 11391.4. Samples: 91067976. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:35:32,789][569624] Avg episode reward: [(0, '4754.986')] [2023-03-08 23:35:32,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000177888_91078656.pth... 
[2023-03-08 23:35:32,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000177224_90738688.pth [2023-03-08 23:35:33,909][569911] Updated weights for policy 0, policy_version 177920 (0.0005) [2023-03-08 23:35:37,036][569911] Updated weights for policy 0, policy_version 178000 (0.0005) [2023-03-08 23:35:37,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11468.8, 300 sec: 11441.0). Total num frames: 91144192. Throughput: 0: 11572.7. Samples: 91142928. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:35:37,789][569624] Avg episode reward: [(0, '4617.608')] [2023-03-08 23:35:40,538][569911] Updated weights for policy 0, policy_version 178080 (0.0004) [2023-03-08 23:35:42,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11468.8, 300 sec: 11427.1). Total num frames: 91201536. Throughput: 0: 11553.9. Samples: 91176968. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-08 23:35:42,799][569624] Avg episode reward: [(0, '4660.401')] [2023-03-08 23:35:43,785][569911] Updated weights for policy 0, policy_version 178160 (0.0005) [2023-03-08 23:35:47,357][569911] Updated weights for policy 0, policy_version 178240 (0.0004) [2023-03-08 23:35:47,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11441.0). Total num frames: 91262976. Throughput: 0: 11645.9. Samples: 91250448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:35:47,789][569624] Avg episode reward: [(0, '4769.730')] [2023-03-08 23:35:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000178248_91262976.pth... [2023-03-08 23:35:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000177560_90910720.pth [2023-03-08 23:35:50,901][569911] Updated weights for policy 0, policy_version 178320 (0.0005) [2023-03-08 23:35:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11441.0). 
Total num frames: 91320320. Throughput: 0: 11601.7. Samples: 91318068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:35:52,789][569624] Avg episode reward: [(0, '4777.808')] [2023-03-08 23:35:54,521][569911] Updated weights for policy 0, policy_version 178400 (0.0005) [2023-03-08 23:35:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11441.0). Total num frames: 91377664. Throughput: 0: 11652.5. Samples: 91353024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:35:57,789][569624] Avg episode reward: [(0, '4715.324')] [2023-03-08 23:35:57,966][569911] Updated weights for policy 0, policy_version 178480 (0.0005) [2023-03-08 23:36:01,476][569911] Updated weights for policy 0, policy_version 178560 (0.0004) [2023-03-08 23:36:02,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.4, 300 sec: 11427.1). Total num frames: 91435008. Throughput: 0: 11666.3. Samples: 91422984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:02,799][569624] Avg episode reward: [(0, '4488.744')] [2023-03-08 23:36:02,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000178584_91435008.pth... [2023-03-08 23:36:02,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000177888_91078656.pth [2023-03-08 23:36:05,193][569911] Updated weights for policy 0, policy_version 178640 (0.0004) [2023-03-08 23:36:07,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11537.1, 300 sec: 11413.3). Total num frames: 91488256. Throughput: 0: 11640.4. Samples: 91488584. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:07,800][569624] Avg episode reward: [(0, '4736.197')] [2023-03-08 23:36:09,096][569911] Updated weights for policy 0, policy_version 178720 (0.0005) [2023-03-08 23:36:12,558][569911] Updated weights for policy 0, policy_version 178800 (0.0005) [2023-03-08 23:36:12,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11537.1, 300 sec: 11385.5). Total num frames: 91545600. Throughput: 0: 11650.0. Samples: 91521088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:12,799][569624] Avg episode reward: [(0, '4713.411')] [2023-03-08 23:36:16,036][569911] Updated weights for policy 0, policy_version 178880 (0.0004) [2023-03-08 23:36:17,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11399.4). Total num frames: 91607040. Throughput: 0: 11640.6. Samples: 91591804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:17,800][569624] Avg episode reward: [(0, '4804.331')] [2023-03-08 23:36:17,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000178920_91607040.pth... [2023-03-08 23:36:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000178248_91262976.pth [2023-03-08 23:36:19,348][569911] Updated weights for policy 0, policy_version 178960 (0.0004) [2023-03-08 23:36:22,741][569911] Updated weights for policy 0, policy_version 179040 (0.0005) [2023-03-08 23:36:22,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11741.9, 300 sec: 11427.1). Total num frames: 91668480. Throughput: 0: 11629.5. Samples: 91666256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:22,789][569624] Avg episode reward: [(0, '4783.366')] [2023-03-08 23:36:26,428][569911] Updated weights for policy 0, policy_version 179120 (0.0004) [2023-03-08 23:36:27,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11673.6, 300 sec: 11413.3). Total num frames: 91721728. 
Throughput: 0: 11645.6. Samples: 91701020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:27,799][569624] Avg episode reward: [(0, '4826.813')] [2023-03-08 23:36:30,100][569911] Updated weights for policy 0, policy_version 179200 (0.0005) [2023-03-08 23:36:32,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11673.6, 300 sec: 11413.3). Total num frames: 91779072. Throughput: 0: 11491.3. Samples: 91767556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:32,800][569624] Avg episode reward: [(0, '4794.074')] [2023-03-08 23:36:32,803][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000179256_91779072.pth... [2023-03-08 23:36:32,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000178584_91435008.pth [2023-03-08 23:36:33,504][569911] Updated weights for policy 0, policy_version 179280 (0.0005) [2023-03-08 23:36:37,093][569911] Updated weights for policy 0, policy_version 179360 (0.0005) [2023-03-08 23:36:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11427.1). Total num frames: 91840512. Throughput: 0: 11534.2. Samples: 91837108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:37,800][569624] Avg episode reward: [(0, '4725.710')] [2023-03-08 23:36:40,720][569911] Updated weights for policy 0, policy_version 179440 (0.0004) [2023-03-08 23:36:42,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11413.3). Total num frames: 91893760. Throughput: 0: 11525.2. Samples: 91871656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:42,800][569624] Avg episode reward: [(0, '4804.232')] [2023-03-08 23:36:44,256][569911] Updated weights for policy 0, policy_version 179520 (0.0005) [2023-03-08 23:36:47,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 91951104. Throughput: 0: 11514.2. Samples: 91941124. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:47,797][569911] Updated weights for policy 0, policy_version 179600 (0.0005) [2023-03-08 23:36:47,800][569624] Avg episode reward: [(0, '4824.339')] [2023-03-08 23:36:47,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000179600_91955200.pth... [2023-03-08 23:36:47,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000178920_91607040.pth [2023-03-08 23:36:51,464][569911] Updated weights for policy 0, policy_version 179680 (0.0005) [2023-03-08 23:36:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 92008448. Throughput: 0: 11552.8. Samples: 92008460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:52,799][569624] Avg episode reward: [(0, '4823.744')] [2023-03-08 23:36:54,992][569911] Updated weights for policy 0, policy_version 179760 (0.0005) [2023-03-08 23:36:57,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 92065792. Throughput: 0: 11605.3. Samples: 92043328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:36:57,789][569624] Avg episode reward: [(0, '4744.649')] [2023-03-08 23:36:58,720][569911] Updated weights for policy 0, policy_version 179840 (0.0005) [2023-03-08 23:37:02,333][569911] Updated weights for policy 0, policy_version 179920 (0.0005) [2023-03-08 23:37:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 92123136. Throughput: 0: 11530.2. Samples: 92110660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:02,789][569624] Avg episode reward: [(0, '4825.177')] [2023-03-08 23:37:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000179928_92123136.pth... 
[2023-03-08 23:37:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000179256_91779072.pth [2023-03-08 23:37:05,970][569911] Updated weights for policy 0, policy_version 180000 (0.0005) [2023-03-08 23:37:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11399.4). Total num frames: 92180480. Throughput: 0: 11340.9. Samples: 92176596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:07,789][569624] Avg episode reward: [(0, '4774.509')] [2023-03-08 23:37:09,579][569911] Updated weights for policy 0, policy_version 180080 (0.0005) [2023-03-08 23:37:12,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 92233728. Throughput: 0: 11350.8. Samples: 92211808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:12,789][569624] Avg episode reward: [(0, '4682.472')] [2023-03-08 23:37:13,225][569911] Updated weights for policy 0, policy_version 180160 (0.0005) [2023-03-08 23:37:16,948][569911] Updated weights for policy 0, policy_version 180240 (0.0004) [2023-03-08 23:37:17,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11400.6, 300 sec: 11399.4). Total num frames: 92291072. Throughput: 0: 11338.8. Samples: 92277800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:17,789][569624] Avg episode reward: [(0, '4679.433')] [2023-03-08 23:37:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000180256_92291072.pth... [2023-03-08 23:37:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000179600_91955200.pth [2023-03-08 23:37:20,474][569911] Updated weights for policy 0, policy_version 180320 (0.0005) [2023-03-08 23:37:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11399.4). Total num frames: 92348416. Throughput: 0: 11354.6. Samples: 92348064. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:22,789][569624] Avg episode reward: [(0, '4685.480')] [2023-03-08 23:37:23,926][569911] Updated weights for policy 0, policy_version 180400 (0.0005) [2023-03-08 23:37:27,363][569911] Updated weights for policy 0, policy_version 180480 (0.0005) [2023-03-08 23:37:27,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11413.3). Total num frames: 92409856. Throughput: 0: 11411.9. Samples: 92385192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:27,789][569624] Avg episode reward: [(0, '4818.184')] [2023-03-08 23:37:30,943][569911] Updated weights for policy 0, policy_version 180560 (0.0005) [2023-03-08 23:37:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11413.3). Total num frames: 92467200. Throughput: 0: 11415.2. Samples: 92454808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:32,789][569624] Avg episode reward: [(0, '4676.484')] [2023-03-08 23:37:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000180600_92467200.pth... [2023-03-08 23:37:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000179928_92123136.pth [2023-03-08 23:37:34,174][569911] Updated weights for policy 0, policy_version 180640 (0.0005) [2023-03-08 23:37:37,623][569911] Updated weights for policy 0, policy_version 180720 (0.0005) [2023-03-08 23:37:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11427.1). Total num frames: 92528640. Throughput: 0: 11556.7. Samples: 92528512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:37,789][569624] Avg episode reward: [(0, '4732.766')] [2023-03-08 23:37:41,184][569911] Updated weights for policy 0, policy_version 180800 (0.0005) [2023-03-08 23:37:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11537.1, 300 sec: 11427.1). Total num frames: 92585984. 
Throughput: 0: 11522.9. Samples: 92561860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:42,789][569624] Avg episode reward: [(0, '4698.722')] [2023-03-08 23:37:44,942][569911] Updated weights for policy 0, policy_version 180880 (0.0004) [2023-03-08 23:37:47,789][569624] Fps is (10 sec: 11059.1, 60 sec: 11468.8, 300 sec: 11413.3). Total num frames: 92639232. Throughput: 0: 11527.0. Samples: 92629376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:47,789][569624] Avg episode reward: [(0, '4842.688')] [2023-03-08 23:37:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000180944_92643328.pth... [2023-03-08 23:37:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000180256_92291072.pth [2023-03-08 23:37:47,795][569867] Saving new best policy, reward=4842.688! [2023-03-08 23:37:48,493][569911] Updated weights for policy 0, policy_version 180960 (0.0005) [2023-03-08 23:37:51,893][569911] Updated weights for policy 0, policy_version 181040 (0.0005) [2023-03-08 23:37:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11427.1). Total num frames: 92700672. Throughput: 0: 11645.9. Samples: 92700664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:52,789][569624] Avg episode reward: [(0, '4847.195')] [2023-03-08 23:37:52,803][569867] Saving new best policy, reward=4847.195! [2023-03-08 23:37:55,035][569911] Updated weights for policy 0, policy_version 181120 (0.0005) [2023-03-08 23:37:57,789][569624] Fps is (10 sec: 12288.1, 60 sec: 11605.3, 300 sec: 11441.0). Total num frames: 92762112. Throughput: 0: 11706.2. Samples: 92738588. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:37:57,789][569624] Avg episode reward: [(0, '4680.766')] [2023-03-08 23:37:58,602][569911] Updated weights for policy 0, policy_version 181200 (0.0005) [2023-03-08 23:38:01,680][569911] Updated weights for policy 0, policy_version 181280 (0.0005) [2023-03-08 23:38:02,789][569624] Fps is (10 sec: 12697.5, 60 sec: 11741.8, 300 sec: 11482.7). Total num frames: 92827648. Throughput: 0: 11889.5. Samples: 92812828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:02,789][569624] Avg episode reward: [(0, '4755.578')] [2023-03-08 23:38:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000181304_92827648.pth... [2023-03-08 23:38:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000180600_92467200.pth [2023-03-08 23:38:04,862][569911] Updated weights for policy 0, policy_version 181360 (0.0005) [2023-03-08 23:38:07,789][569624] Fps is (10 sec: 12697.5, 60 sec: 11810.1, 300 sec: 11496.6). Total num frames: 92889088. Throughput: 0: 11956.2. Samples: 92886092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:07,789][569624] Avg episode reward: [(0, '4829.383')] [2023-03-08 23:38:08,312][569911] Updated weights for policy 0, policy_version 181440 (0.0005) [2023-03-08 23:38:11,736][569911] Updated weights for policy 0, policy_version 181520 (0.0005) [2023-03-08 23:38:12,789][569624] Fps is (10 sec: 12288.2, 60 sec: 11946.7, 300 sec: 11510.5). Total num frames: 92950528. Throughput: 0: 12010.9. Samples: 92925680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:12,789][569624] Avg episode reward: [(0, '4742.544')] [2023-03-08 23:38:15,068][569911] Updated weights for policy 0, policy_version 181600 (0.0005) [2023-03-08 23:38:17,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11510.5). Total num frames: 93007872. 
Throughput: 0: 12013.8. Samples: 92995428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:17,800][569624] Avg episode reward: [(0, '4830.860')] [2023-03-08 23:38:17,802][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000181656_93007872.pth... [2023-03-08 23:38:17,805][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000180944_92643328.pth [2023-03-08 23:38:18,843][569911] Updated weights for policy 0, policy_version 181680 (0.0005) [2023-03-08 23:38:22,488][569911] Updated weights for policy 0, policy_version 181760 (0.0005) [2023-03-08 23:38:22,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11878.4, 300 sec: 11496.6). Total num frames: 93061120. Throughput: 0: 11838.0. Samples: 93061220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:22,800][569624] Avg episode reward: [(0, '4847.582')] [2023-03-08 23:38:22,823][569867] Saving new best policy, reward=4847.582! [2023-03-08 23:38:25,900][569911] Updated weights for policy 0, policy_version 181840 (0.0005) [2023-03-08 23:38:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11510.5). Total num frames: 93122560. Throughput: 0: 11913.0. Samples: 93097944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:27,789][569624] Avg episode reward: [(0, '4828.631')] [2023-03-08 23:38:29,342][569911] Updated weights for policy 0, policy_version 181920 (0.0004) [2023-03-08 23:38:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11510.5). Total num frames: 93179904. Throughput: 0: 11985.7. Samples: 93168732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:32,790][569624] Avg episode reward: [(0, '4783.697')] [2023-03-08 23:38:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000181992_93179904.pth... 
[2023-03-08 23:38:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000181304_92827648.pth [2023-03-08 23:38:32,887][569911] Updated weights for policy 0, policy_version 182000 (0.0006) [2023-03-08 23:38:36,443][569911] Updated weights for policy 0, policy_version 182080 (0.0005) [2023-03-08 23:38:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11510.5). Total num frames: 93237248. Throughput: 0: 11950.8. Samples: 93238448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:37,800][569624] Avg episode reward: [(0, '4837.058')] [2023-03-08 23:38:39,884][569911] Updated weights for policy 0, policy_version 182160 (0.0005) [2023-03-08 23:38:42,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11524.3). Total num frames: 93298688. Throughput: 0: 11898.2. Samples: 93274008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:42,799][569624] Avg episode reward: [(0, '4803.395')] [2023-03-08 23:38:43,413][569911] Updated weights for policy 0, policy_version 182240 (0.0005) [2023-03-08 23:38:46,895][569911] Updated weights for policy 0, policy_version 182320 (0.0005) [2023-03-08 23:38:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11538.2). Total num frames: 93356032. Throughput: 0: 11798.3. Samples: 93343752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:47,789][569624] Avg episode reward: [(0, '4776.983')] [2023-03-08 23:38:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000182336_93356032.pth... [2023-03-08 23:38:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000181656_93007872.pth [2023-03-08 23:38:50,474][569911] Updated weights for policy 0, policy_version 182400 (0.0005) [2023-03-08 23:38:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11552.1). 
Total num frames: 93413376. Throughput: 0: 11702.6. Samples: 93412708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:52,789][569624] Avg episode reward: [(0, '4788.252')] [2023-03-08 23:38:54,030][569911] Updated weights for policy 0, policy_version 182480 (0.0004) [2023-03-08 23:38:57,573][569911] Updated weights for policy 0, policy_version 182560 (0.0005) [2023-03-08 23:38:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11566.0). Total num frames: 93470720. Throughput: 0: 11579.9. Samples: 93446776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:38:57,789][569624] Avg episode reward: [(0, '4818.476')] [2023-03-08 23:39:00,976][569911] Updated weights for policy 0, policy_version 182640 (0.0004) [2023-03-08 23:39:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11579.9). Total num frames: 93532160. Throughput: 0: 11642.0. Samples: 93519320. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:02,789][569624] Avg episode reward: [(0, '4683.707')] [2023-03-08 23:39:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000182680_93532160.pth... [2023-03-08 23:39:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000181992_93179904.pth [2023-03-08 23:39:04,531][569911] Updated weights for policy 0, policy_version 182720 (0.0005) [2023-03-08 23:39:07,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11579.9). Total num frames: 93589504. Throughput: 0: 11698.8. Samples: 93587668. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:07,789][569624] Avg episode reward: [(0, '4831.173')] [2023-03-08 23:39:08,073][569911] Updated weights for policy 0, policy_version 182800 (0.0005) [2023-03-08 23:39:11,650][569911] Updated weights for policy 0, policy_version 182880 (0.0005) [2023-03-08 23:39:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11579.9). Total num frames: 93646848. Throughput: 0: 11645.9. Samples: 93622008. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:12,789][569624] Avg episode reward: [(0, '4841.107')] [2023-03-08 23:39:15,140][569911] Updated weights for policy 0, policy_version 182960 (0.0005) [2023-03-08 23:39:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11579.9). Total num frames: 93704192. Throughput: 0: 11633.8. Samples: 93692252. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:17,789][569624] Avg episode reward: [(0, '4787.665')] [2023-03-08 23:39:17,795][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000183024_93708288.pth... [2023-03-08 23:39:17,797][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000182336_93356032.pth [2023-03-08 23:39:18,503][569911] Updated weights for policy 0, policy_version 183040 (0.0005) [2023-03-08 23:39:21,978][569911] Updated weights for policy 0, policy_version 183120 (0.0005) [2023-03-08 23:39:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11607.6). Total num frames: 93765632. Throughput: 0: 11707.4. Samples: 93765280. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:22,789][569624] Avg episode reward: [(0, '4827.854')] [2023-03-08 23:39:25,369][569911] Updated weights for policy 0, policy_version 183200 (0.0005) [2023-03-08 23:39:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11607.7). Total num frames: 93822976. 
Throughput: 0: 11698.7. Samples: 93800448. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:27,789][569624] Avg episode reward: [(0, '4816.082')] [2023-03-08 23:39:28,915][569911] Updated weights for policy 0, policy_version 183280 (0.0005) [2023-03-08 23:39:32,473][569911] Updated weights for policy 0, policy_version 183360 (0.0004) [2023-03-08 23:39:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11607.6). Total num frames: 93880320. Throughput: 0: 11695.0. Samples: 93870024. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:32,789][569624] Avg episode reward: [(0, '4816.423')] [2023-03-08 23:39:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000183360_93880320.pth... [2023-03-08 23:39:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000182680_93532160.pth [2023-03-08 23:39:36,015][569911] Updated weights for policy 0, policy_version 183440 (0.0005) [2023-03-08 23:39:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11621.5). Total num frames: 93941760. Throughput: 0: 11741.0. Samples: 93941052. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:37,789][569624] Avg episode reward: [(0, '4761.282')] [2023-03-08 23:39:39,392][569911] Updated weights for policy 0, policy_version 183520 (0.0005) [2023-03-08 23:39:42,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11673.6, 300 sec: 11635.4). Total num frames: 93999104. Throughput: 0: 11760.9. Samples: 93976016. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:42,790][569624] Avg episode reward: [(0, '4639.748')] [2023-03-08 23:39:42,948][569911] Updated weights for policy 0, policy_version 183600 (0.0005) [2023-03-08 23:39:46,372][569911] Updated weights for policy 0, policy_version 183680 (0.0005) [2023-03-08 23:39:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11649.3). 
Total num frames: 94060544. Throughput: 0: 11710.9. Samples: 94046312. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:47,789][569624] Avg episode reward: [(0, '4671.836')] [2023-03-08 23:39:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000183712_94060544.pth... [2023-03-08 23:39:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000183024_93708288.pth [2023-03-08 23:39:49,746][569911] Updated weights for policy 0, policy_version 183760 (0.0005) [2023-03-08 23:39:52,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11649.3). Total num frames: 94117888. Throughput: 0: 11780.7. Samples: 94117800. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:52,789][569624] Avg episode reward: [(0, '4830.785')] [2023-03-08 23:39:53,297][569911] Updated weights for policy 0, policy_version 183840 (0.0006) [2023-03-08 23:39:56,844][569911] Updated weights for policy 0, policy_version 183920 (0.0006) [2023-03-08 23:39:57,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11741.9, 300 sec: 11649.3). Total num frames: 94175232. Throughput: 0: 11777.0. Samples: 94151972. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:39:57,789][569624] Avg episode reward: [(0, '4835.477')] [2023-03-08 23:40:00,424][569911] Updated weights for policy 0, policy_version 184000 (0.0005) [2023-03-08 23:40:02,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11649.3). Total num frames: 94232576. Throughput: 0: 11791.1. Samples: 94222852. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-08 23:40:02,789][569624] Avg episode reward: [(0, '4742.777')] [2023-03-08 23:40:02,827][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000184056_94236672.pth... 
[2023-03-08 23:40:02,828][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000183360_93880320.pth [2023-03-08 23:40:03,920][569911] Updated weights for policy 0, policy_version 184080 (0.0005) [2023-03-08 23:40:07,257][569911] Updated weights for policy 0, policy_version 184160 (0.0005) [2023-03-08 23:40:07,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11663.2). Total num frames: 94294016. Throughput: 0: 11742.4. Samples: 94293688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:07,789][569624] Avg episode reward: [(0, '4841.704')] [2023-03-08 23:40:10,857][569911] Updated weights for policy 0, policy_version 184240 (0.0005) [2023-03-08 23:40:12,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11663.2). Total num frames: 94351360. Throughput: 0: 11697.9. Samples: 94326856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:12,789][569624] Avg episode reward: [(0, '4834.526')] [2023-03-08 23:40:14,442][569911] Updated weights for policy 0, policy_version 184320 (0.0005) [2023-03-08 23:40:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11677.1). Total num frames: 94408704. Throughput: 0: 11695.5. Samples: 94396324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:17,789][569624] Avg episode reward: [(0, '4744.614')] [2023-03-08 23:40:17,806][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000184400_94412800.pth... [2023-03-08 23:40:17,806][569911] Updated weights for policy 0, policy_version 184400 (0.0005) [2023-03-08 23:40:17,807][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000183712_94060544.pth [2023-03-08 23:40:21,381][569911] Updated weights for policy 0, policy_version 184480 (0.0005) [2023-03-08 23:40:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.8, 300 sec: 11691.0). 
Total num frames: 94470144. Throughput: 0: 11718.0. Samples: 94468360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:22,789][569624] Avg episode reward: [(0, '4826.720')] [2023-03-08 23:40:24,806][569911] Updated weights for policy 0, policy_version 184560 (0.0005) [2023-03-08 23:40:27,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11691.0). Total num frames: 94527488. Throughput: 0: 11708.8. Samples: 94502912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:27,789][569624] Avg episode reward: [(0, '4784.057')] [2023-03-08 23:40:28,408][569911] Updated weights for policy 0, policy_version 184640 (0.0004) [2023-03-08 23:40:31,951][569911] Updated weights for policy 0, policy_version 184720 (0.0005) [2023-03-08 23:40:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11663.2). Total num frames: 94584832. Throughput: 0: 11693.3. Samples: 94572508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:32,789][569624] Avg episode reward: [(0, '4745.206')] [2023-03-08 23:40:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000184736_94584832.pth... [2023-03-08 23:40:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000184056_94236672.pth [2023-03-08 23:40:35,558][569911] Updated weights for policy 0, policy_version 184800 (0.0005) [2023-03-08 23:40:37,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11663.2). Total num frames: 94642176. Throughput: 0: 11593.2. Samples: 94639492. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:37,789][569624] Avg episode reward: [(0, '4824.726')] [2023-03-08 23:40:39,233][569911] Updated weights for policy 0, policy_version 184880 (0.0005) [2023-03-08 23:40:42,737][569911] Updated weights for policy 0, policy_version 184960 (0.0005) [2023-03-08 23:40:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11649.3). Total num frames: 94699520. Throughput: 0: 11555.8. Samples: 94671984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:42,789][569624] Avg episode reward: [(0, '4794.717')] [2023-03-08 23:40:46,577][569911] Updated weights for policy 0, policy_version 185040 (0.0005) [2023-03-08 23:40:47,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11537.1, 300 sec: 11635.4). Total num frames: 94752768. Throughput: 0: 11502.9. Samples: 94740480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:47,789][569624] Avg episode reward: [(0, '4852.841')] [2023-03-08 23:40:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000185064_94752768.pth... [2023-03-08 23:40:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000184400_94412800.pth [2023-03-08 23:40:47,794][569867] Saving new best policy, reward=4852.841! [2023-03-08 23:40:50,035][569911] Updated weights for policy 0, policy_version 185120 (0.0005) [2023-03-08 23:40:52,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.4, 300 sec: 11649.3). Total num frames: 94814208. Throughput: 0: 11511.9. Samples: 94811724. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:52,800][569624] Avg episode reward: [(0, '4761.935')] [2023-03-08 23:40:53,467][569911] Updated weights for policy 0, policy_version 185200 (0.0005) [2023-03-08 23:40:57,067][569911] Updated weights for policy 0, policy_version 185280 (0.0005) [2023-03-08 23:40:57,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11649.3). Total num frames: 94871552. Throughput: 0: 11555.7. Samples: 94846864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:40:57,800][569624] Avg episode reward: [(0, '4853.091')] [2023-03-08 23:40:57,801][569867] Saving new best policy, reward=4853.091! [2023-03-08 23:41:00,453][569911] Updated weights for policy 0, policy_version 185360 (0.0005) [2023-03-08 23:41:02,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11663.2). Total num frames: 94928896. Throughput: 0: 11562.1. Samples: 94916616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:02,789][569624] Avg episode reward: [(0, '4840.181')] [2023-03-08 23:41:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000185408_94928896.pth... [2023-03-08 23:41:02,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000184736_94584832.pth [2023-03-08 23:41:03,962][569911] Updated weights for policy 0, policy_version 185440 (0.0005) [2023-03-08 23:41:07,542][569911] Updated weights for policy 0, policy_version 185520 (0.0005) [2023-03-08 23:41:07,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11663.2). Total num frames: 94986240. Throughput: 0: 11508.6. Samples: 94986248. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:07,789][569624] Avg episode reward: [(0, '4700.332')] [2023-03-08 23:41:11,065][569911] Updated weights for policy 0, policy_version 185600 (0.0004) [2023-03-08 23:41:12,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11649.3). Total num frames: 95043584. Throughput: 0: 11534.1. Samples: 95021944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:12,789][569624] Avg episode reward: [(0, '4820.035')] [2023-03-08 23:41:14,703][569911] Updated weights for policy 0, policy_version 185680 (0.0005) [2023-03-08 23:41:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11635.4). Total num frames: 95100928. Throughput: 0: 11465.5. Samples: 95088456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:17,789][569624] Avg episode reward: [(0, '4692.524')] [2023-03-08 23:41:17,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000185744_95100928.pth... [2023-03-08 23:41:17,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000185064_94752768.pth [2023-03-08 23:41:18,271][569911] Updated weights for policy 0, policy_version 185760 (0.0004) [2023-03-08 23:41:21,751][569911] Updated weights for policy 0, policy_version 185840 (0.0005) [2023-03-08 23:41:22,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11663.2). Total num frames: 95162368. Throughput: 0: 11557.8. Samples: 95159596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:22,789][569624] Avg episode reward: [(0, '4723.219')] [2023-03-08 23:41:24,864][569911] Updated weights for policy 0, policy_version 185920 (0.0005) [2023-03-08 23:41:27,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11605.3, 300 sec: 11677.1). Total num frames: 95223808. Throughput: 0: 11716.6. Samples: 95199232. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:27,789][569624] Avg episode reward: [(0, '4789.088')] [2023-03-08 23:41:28,354][569911] Updated weights for policy 0, policy_version 186000 (0.0005) [2023-03-08 23:41:31,775][569911] Updated weights for policy 0, policy_version 186080 (0.0005) [2023-03-08 23:41:32,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11663.2). Total num frames: 95281152. Throughput: 0: 11769.9. Samples: 95270128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:32,789][569624] Avg episode reward: [(0, '4707.861')] [2023-03-08 23:41:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000186096_95281152.pth... [2023-03-08 23:41:32,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000185408_94928896.pth [2023-03-08 23:41:35,271][569911] Updated weights for policy 0, policy_version 186160 (0.0004) [2023-03-08 23:41:37,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11691.0). Total num frames: 95342592. Throughput: 0: 11756.7. Samples: 95340776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:37,789][569624] Avg episode reward: [(0, '4797.481')] [2023-03-08 23:41:38,804][569911] Updated weights for policy 0, policy_version 186240 (0.0004) [2023-03-08 23:41:42,189][569911] Updated weights for policy 0, policy_version 186320 (0.0005) [2023-03-08 23:41:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11691.0). Total num frames: 95399936. Throughput: 0: 11745.8. Samples: 95375424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:42,789][569624] Avg episode reward: [(0, '4753.343')] [2023-03-08 23:41:45,466][569911] Updated weights for policy 0, policy_version 186400 (0.0005) [2023-03-08 23:41:47,789][569624] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11704.8). Total num frames: 95461376. 
Throughput: 0: 11834.1. Samples: 95449152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:47,789][569624] Avg episode reward: [(0, '4847.012')] [2023-03-08 23:41:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000186448_95461376.pth... [2023-03-08 23:41:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000185744_95100928.pth [2023-03-08 23:41:49,015][569911] Updated weights for policy 0, policy_version 186480 (0.0005) [2023-03-08 23:41:52,448][569911] Updated weights for policy 0, policy_version 186560 (0.0005) [2023-03-08 23:41:52,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11718.7). Total num frames: 95522816. Throughput: 0: 11850.5. Samples: 95519524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:52,789][569624] Avg episode reward: [(0, '4845.593')] [2023-03-08 23:41:56,023][569911] Updated weights for policy 0, policy_version 186640 (0.0005) [2023-03-08 23:41:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11718.7). Total num frames: 95580160. Throughput: 0: 11853.6. Samples: 95555356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:41:57,789][569624] Avg episode reward: [(0, '4591.621')] [2023-03-08 23:41:59,415][569911] Updated weights for policy 0, policy_version 186720 (0.0005) [2023-03-08 23:42:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11718.7). Total num frames: 95637504. Throughput: 0: 11928.0. Samples: 95625216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:02,789][569624] Avg episode reward: [(0, '4834.229')] [2023-03-08 23:42:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000186792_95637504.pth... 
[2023-03-08 23:42:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000186096_95281152.pth [2023-03-08 23:42:03,056][569911] Updated weights for policy 0, policy_version 186800 (0.0005) [2023-03-08 23:42:06,487][569911] Updated weights for policy 0, policy_version 186880 (0.0005) [2023-03-08 23:42:07,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 95694848. Throughput: 0: 11893.8. Samples: 95694816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:07,789][569624] Avg episode reward: [(0, '4783.782')] [2023-03-08 23:42:10,151][569911] Updated weights for policy 0, policy_version 186960 (0.0005) [2023-03-08 23:42:12,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 95752192. Throughput: 0: 11757.7. Samples: 95728328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:12,789][569624] Avg episode reward: [(0, '4745.401')] [2023-03-08 23:42:13,711][569911] Updated weights for policy 0, policy_version 187040 (0.0005) [2023-03-08 23:42:17,328][569911] Updated weights for policy 0, policy_version 187120 (0.0004) [2023-03-08 23:42:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 95809536. Throughput: 0: 11713.9. Samples: 95797256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:17,789][569624] Avg episode reward: [(0, '4848.553')] [2023-03-08 23:42:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000187128_95809536.pth... [2023-03-08 23:42:17,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000186448_95461376.pth [2023-03-08 23:42:20,632][569911] Updated weights for policy 0, policy_version 187200 (0.0005) [2023-03-08 23:42:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11810.2, 300 sec: 11732.6). 
Total num frames: 95870976. Throughput: 0: 11697.1. Samples: 95867144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:22,789][569624] Avg episode reward: [(0, '4780.581')] [2023-03-08 23:42:24,234][569911] Updated weights for policy 0, policy_version 187280 (0.0005) [2023-03-08 23:42:27,532][569911] Updated weights for policy 0, policy_version 187360 (0.0005) [2023-03-08 23:42:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11732.6). Total num frames: 95928320. Throughput: 0: 11746.1. Samples: 95904000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:27,789][569624] Avg episode reward: [(0, '4641.082')] [2023-03-08 23:42:31,199][569911] Updated weights for policy 0, policy_version 187440 (0.0005) [2023-03-08 23:42:32,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11741.9, 300 sec: 11718.7). Total num frames: 95985664. Throughput: 0: 11648.4. Samples: 95973328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:32,789][569624] Avg episode reward: [(0, '4789.958')] [2023-03-08 23:42:32,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000187472_95985664.pth... [2023-03-08 23:42:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000186792_95637504.pth [2023-03-08 23:42:34,779][569911] Updated weights for policy 0, policy_version 187520 (0.0005) [2023-03-08 23:42:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11718.7). Total num frames: 96043008. Throughput: 0: 11608.3. Samples: 96041896. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:37,789][569624] Avg episode reward: [(0, '4778.624')] [2023-03-08 23:42:38,481][569911] Updated weights for policy 0, policy_version 187600 (0.0004) [2023-03-08 23:42:41,995][569911] Updated weights for policy 0, policy_version 187680 (0.0005) [2023-03-08 23:42:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11732.6). Total num frames: 96100352. Throughput: 0: 11564.7. Samples: 96075768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:42,789][569624] Avg episode reward: [(0, '4653.087')] [2023-03-08 23:42:45,577][569911] Updated weights for policy 0, policy_version 187760 (0.0005) [2023-03-08 23:42:47,789][569624] Fps is (10 sec: 11059.3, 60 sec: 11537.1, 300 sec: 11704.8). Total num frames: 96153600. Throughput: 0: 11512.9. Samples: 96143296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:47,789][569624] Avg episode reward: [(0, '4829.135')] [2023-03-08 23:42:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000187800_96153600.pth... [2023-03-08 23:42:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000187128_95809536.pth [2023-03-08 23:42:49,302][569911] Updated weights for policy 0, policy_version 187840 (0.0005) [2023-03-08 23:42:52,635][569911] Updated weights for policy 0, policy_version 187920 (0.0005) [2023-03-08 23:42:52,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11704.8). Total num frames: 96215040. Throughput: 0: 11546.8. Samples: 96214424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:52,789][569624] Avg episode reward: [(0, '4771.949')] [2023-03-08 23:42:55,915][569911] Updated weights for policy 0, policy_version 188000 (0.0005) [2023-03-08 23:42:57,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11605.3, 300 sec: 11691.0). Total num frames: 96276480. 
Throughput: 0: 11635.3. Samples: 96251916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:42:57,789][569624] Avg episode reward: [(0, '4754.033')] [2023-03-08 23:42:59,421][569911] Updated weights for policy 0, policy_version 188080 (0.0005) [2023-03-08 23:43:02,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11677.1). Total num frames: 96333824. Throughput: 0: 11690.9. Samples: 96323348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:02,789][569624] Avg episode reward: [(0, '4736.680')] [2023-03-08 23:43:02,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000188152_96333824.pth... [2023-03-08 23:43:02,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000187472_95985664.pth [2023-03-08 23:43:02,862][569911] Updated weights for policy 0, policy_version 188160 (0.0005) [2023-03-08 23:43:06,400][569911] Updated weights for policy 0, policy_version 188240 (0.0005) [2023-03-08 23:43:07,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11663.2). Total num frames: 96391168. Throughput: 0: 11645.2. Samples: 96391180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:07,789][569624] Avg episode reward: [(0, '4649.945')] [2023-03-08 23:43:09,931][569911] Updated weights for policy 0, policy_version 188320 (0.0005) [2023-03-08 23:43:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11677.1). Total num frames: 96452608. Throughput: 0: 11629.4. Samples: 96427324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:12,789][569624] Avg episode reward: [(0, '4787.998')] [2023-03-08 23:43:13,491][569911] Updated weights for policy 0, policy_version 188400 (0.0005) [2023-03-08 23:43:16,737][569911] Updated weights for policy 0, policy_version 188480 (0.0005) [2023-03-08 23:43:17,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11741.8, 300 sec: 11704.8). 
Total num frames: 96514048. Throughput: 0: 11659.3. Samples: 96497996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:17,789][569624] Avg episode reward: [(0, '4811.355')] [2023-03-08 23:43:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000188504_96514048.pth... [2023-03-08 23:43:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000187800_96153600.pth [2023-03-08 23:43:20,097][569911] Updated weights for policy 0, policy_version 188560 (0.0005) [2023-03-08 23:43:22,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11691.0). Total num frames: 96571392. Throughput: 0: 11766.8. Samples: 96571400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:22,789][569624] Avg episode reward: [(0, '4825.728')] [2023-03-08 23:43:23,651][569911] Updated weights for policy 0, policy_version 188640 (0.0005) [2023-03-08 23:43:26,926][569911] Updated weights for policy 0, policy_version 188720 (0.0005) [2023-03-08 23:43:27,789][569624] Fps is (10 sec: 11878.7, 60 sec: 11741.9, 300 sec: 11704.8). Total num frames: 96632832. Throughput: 0: 11834.5. Samples: 96608320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:27,789][569624] Avg episode reward: [(0, '4700.800')] [2023-03-08 23:43:30,170][569911] Updated weights for policy 0, policy_version 188800 (0.0005) [2023-03-08 23:43:32,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11718.7). Total num frames: 96694272. Throughput: 0: 11963.0. Samples: 96681632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:32,789][569624] Avg episode reward: [(0, '4817.459')] [2023-03-08 23:43:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000188856_96694272.pth... 
[2023-03-08 23:43:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000188152_96333824.pth [2023-03-08 23:43:33,747][569911] Updated weights for policy 0, policy_version 188880 (0.0005) [2023-03-08 23:43:37,146][569911] Updated weights for policy 0, policy_version 188960 (0.0005) [2023-03-08 23:43:37,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11704.8). Total num frames: 96751616. Throughput: 0: 11937.8. Samples: 96751624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:37,789][569624] Avg episode reward: [(0, '4754.186')] [2023-03-08 23:43:40,547][569911] Updated weights for policy 0, policy_version 189040 (0.0005) [2023-03-08 23:43:42,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11718.7). Total num frames: 96813056. Throughput: 0: 11923.8. Samples: 96788488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:42,789][569624] Avg episode reward: [(0, '4689.343')] [2023-03-08 23:43:44,045][569911] Updated weights for policy 0, policy_version 189120 (0.0005) [2023-03-08 23:43:47,375][569911] Updated weights for policy 0, policy_version 189200 (0.0005) [2023-03-08 23:43:47,789][569624] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11732.6). Total num frames: 96874496. Throughput: 0: 11885.1. Samples: 96858176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:47,789][569624] Avg episode reward: [(0, '4768.873')] [2023-03-08 23:43:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000189208_96874496.pth... [2023-03-08 23:43:47,794][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000188504_96514048.pth [2023-03-08 23:43:50,814][569911] Updated weights for policy 0, policy_version 189280 (0.0005) [2023-03-08 23:43:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11732.6). 
Total num frames: 96931840. Throughput: 0: 11962.0. Samples: 96929472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:52,789][569624] Avg episode reward: [(0, '4586.830')] [2023-03-08 23:43:54,538][569911] Updated weights for policy 0, policy_version 189360 (0.0005) [2023-03-08 23:43:57,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11718.7). Total num frames: 96989184. Throughput: 0: 11938.8. Samples: 96964568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:43:57,789][569624] Avg episode reward: [(0, '4673.585')] [2023-03-08 23:43:58,066][569911] Updated weights for policy 0, policy_version 189440 (0.0005) [2023-03-08 23:44:01,732][569911] Updated weights for policy 0, policy_version 189520 (0.0005) [2023-03-08 23:44:02,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11718.7). Total num frames: 97046528. Throughput: 0: 11875.8. Samples: 97032404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:02,789][569624] Avg episode reward: [(0, '4643.052')] [2023-03-08 23:44:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000189544_97046528.pth... [2023-03-08 23:44:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000188856_96694272.pth [2023-03-08 23:44:05,233][569911] Updated weights for policy 0, policy_version 189600 (0.0005) [2023-03-08 23:44:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11718.7). Total num frames: 97103872. Throughput: 0: 11809.7. Samples: 97102836. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:07,789][569624] Avg episode reward: [(0, '4743.520')] [2023-03-08 23:44:08,748][569911] Updated weights for policy 0, policy_version 189680 (0.0005) [2023-03-08 23:44:11,991][569911] Updated weights for policy 0, policy_version 189760 (0.0005) [2023-03-08 23:44:12,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11732.6). Total num frames: 97165312. Throughput: 0: 11802.7. Samples: 97139444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:12,789][569624] Avg episode reward: [(0, '4788.575')] [2023-03-08 23:44:15,465][569911] Updated weights for policy 0, policy_version 189840 (0.0005) [2023-03-08 23:44:17,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11732.6). Total num frames: 97226752. Throughput: 0: 11775.5. Samples: 97211528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:17,789][569624] Avg episode reward: [(0, '4680.212')] [2023-03-08 23:44:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000189896_97226752.pth... [2023-03-08 23:44:17,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000189208_96874496.pth [2023-03-08 23:44:18,835][569911] Updated weights for policy 0, policy_version 189920 (0.0005) [2023-03-08 23:44:22,238][569911] Updated weights for policy 0, policy_version 190000 (0.0005) [2023-03-08 23:44:22,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11732.6). Total num frames: 97284096. Throughput: 0: 11832.1. Samples: 97284068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:22,789][569624] Avg episode reward: [(0, '4665.550')] [2023-03-08 23:44:25,893][569911] Updated weights for policy 0, policy_version 190080 (0.0005) [2023-03-08 23:44:27,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 97341440. 
Throughput: 0: 11741.7. Samples: 97316864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:27,789][569624] Avg episode reward: [(0, '4836.030')] [2023-03-08 23:44:29,666][569911] Updated weights for policy 0, policy_version 190160 (0.0005) [2023-03-08 23:44:32,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11673.6, 300 sec: 11704.8). Total num frames: 97394688. Throughput: 0: 11678.7. Samples: 97383716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:32,789][569624] Avg episode reward: [(0, '4653.459')] [2023-03-08 23:44:32,826][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000190232_97398784.pth... [2023-03-08 23:44:32,828][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000189544_97046528.pth [2023-03-08 23:44:33,160][569911] Updated weights for policy 0, policy_version 190240 (0.0005) [2023-03-08 23:44:36,654][569911] Updated weights for policy 0, policy_version 190320 (0.0004) [2023-03-08 23:44:37,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11718.7). Total num frames: 97456128. Throughput: 0: 11658.0. Samples: 97454080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:37,789][569624] Avg episode reward: [(0, '4738.244')] [2023-03-08 23:44:40,263][569911] Updated weights for policy 0, policy_version 190400 (0.0004) [2023-03-08 23:44:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11704.8). Total num frames: 97513472. Throughput: 0: 11638.7. Samples: 97488308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:42,789][569624] Avg episode reward: [(0, '4752.003')] [2023-03-08 23:44:43,558][569911] Updated weights for policy 0, policy_version 190480 (0.0004) [2023-03-08 23:44:47,152][569911] Updated weights for policy 0, policy_version 190560 (0.0005) [2023-03-08 23:44:47,789][569624] Fps is (10 sec: 11468.5, 60 sec: 11605.3, 300 sec: 11704.8). 
Total num frames: 97570816. Throughput: 0: 11726.8. Samples: 97560112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:47,789][569624] Avg episode reward: [(0, '4599.634')] [2023-03-08 23:44:47,814][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000190576_97574912.pth... [2023-03-08 23:44:47,816][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000189896_97226752.pth [2023-03-08 23:44:50,664][569911] Updated weights for policy 0, policy_version 190640 (0.0005) [2023-03-08 23:44:52,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11605.4, 300 sec: 11704.8). Total num frames: 97628160. Throughput: 0: 11695.5. Samples: 97629132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:52,789][569624] Avg episode reward: [(0, '4578.512')] [2023-03-08 23:44:54,255][569911] Updated weights for policy 0, policy_version 190720 (0.0004) [2023-03-08 23:44:57,788][569911] Updated weights for policy 0, policy_version 190800 (0.0005) [2023-03-08 23:44:57,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11673.6, 300 sec: 11718.7). Total num frames: 97689600. Throughput: 0: 11669.9. Samples: 97664588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:44:57,789][569624] Avg episode reward: [(0, '4700.187')] [2023-03-08 23:45:01,471][569911] Updated weights for policy 0, policy_version 190880 (0.0005) [2023-03-08 23:45:02,789][569624] Fps is (10 sec: 11468.6, 60 sec: 11605.3, 300 sec: 11691.0). Total num frames: 97742848. Throughput: 0: 11540.9. Samples: 97730868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:45:02,789][569624] Avg episode reward: [(0, '4655.909')] [2023-03-08 23:45:02,815][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000190912_97746944.pth... 
[2023-03-08 23:45:02,816][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000190232_97398784.pth [2023-03-08 23:45:04,869][569911] Updated weights for policy 0, policy_version 190960 (0.0005) [2023-03-08 23:45:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11704.8). Total num frames: 97804288. Throughput: 0: 11550.0. Samples: 97803820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:45:07,789][569624] Avg episode reward: [(0, '4750.388')] [2023-03-08 23:45:08,331][569911] Updated weights for policy 0, policy_version 191040 (0.0005) [2023-03-08 23:45:11,903][569911] Updated weights for policy 0, policy_version 191120 (0.0005) [2023-03-08 23:45:12,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11605.4, 300 sec: 11704.8). Total num frames: 97861632. Throughput: 0: 11560.0. Samples: 97837064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:45:12,789][569624] Avg episode reward: [(0, '4710.834')] [2023-03-08 23:45:15,569][569911] Updated weights for policy 0, policy_version 191200 (0.0006) [2023-03-08 23:45:17,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11691.0). Total num frames: 97918976. Throughput: 0: 11621.1. Samples: 97906664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:45:17,789][569624] Avg episode reward: [(0, '4668.733')] [2023-03-08 23:45:17,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000191248_97918976.pth... [2023-03-08 23:45:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000190576_97574912.pth [2023-03-08 23:45:19,030][569911] Updated weights for policy 0, policy_version 191280 (0.0005) [2023-03-08 23:45:22,740][569911] Updated weights for policy 0, policy_version 191360 (0.0005) [2023-03-08 23:45:22,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11691.0). 
Total num frames: 97976320. Throughput: 0: 11560.3. Samples: 97974296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:45:22,789][569624] Avg episode reward: [(0, '4682.583')] [2023-03-08 23:45:26,113][569911] Updated weights for policy 0, policy_version 191440 (0.0006) [2023-03-08 23:45:27,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11605.4, 300 sec: 11704.8). Total num frames: 98037760. Throughput: 0: 11630.1. Samples: 98011660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:45:27,789][569624] Avg episode reward: [(0, '4587.290')] [2023-03-08 23:45:29,392][569911] Updated weights for policy 0, policy_version 191520 (0.0005) [2023-03-08 23:45:32,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11704.8). Total num frames: 98095104. Throughput: 0: 11598.0. Samples: 98082020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:45:32,789][569624] Avg episode reward: [(0, '4658.380')] [2023-03-08 23:45:32,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000191592_98095104.pth... [2023-03-08 23:45:32,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000190912_97746944.pth [2023-03-08 23:45:33,054][569911] Updated weights for policy 0, policy_version 191600 (0.0004) [2023-03-08 23:45:36,592][569911] Updated weights for policy 0, policy_version 191680 (0.0005) [2023-03-08 23:45:37,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11704.8). Total num frames: 98152448. Throughput: 0: 11599.6. Samples: 98151116. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:45:37,789][569624] Avg episode reward: [(0, '4773.551')] [2023-03-08 23:45:40,254][569911] Updated weights for policy 0, policy_version 191760 (0.0005) [2023-03-08 23:45:42,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11718.7). Total num frames: 98209792. Throughput: 0: 11568.6. 
Samples: 98185176. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:45:42,789][569624] Avg episode reward: [(0, '4768.983')] [2023-03-08 23:45:43,527][569911] Updated weights for policy 0, policy_version 191840 (0.0004) [2023-03-08 23:45:46,905][569911] Updated weights for policy 0, policy_version 191920 (0.0005) [2023-03-08 23:45:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11718.7). Total num frames: 98271232. Throughput: 0: 11736.4. Samples: 98259008. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:45:47,789][569624] Avg episode reward: [(0, '4421.623')] [2023-03-08 23:45:47,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000191936_98271232.pth... [2023-03-08 23:45:47,796][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000191248_97918976.pth [2023-03-08 23:45:50,439][569911] Updated weights for policy 0, policy_version 192000 (0.0005) [2023-03-08 23:45:52,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11741.8, 300 sec: 11732.6). Total num frames: 98332672. Throughput: 0: 11670.9. Samples: 98329012. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:45:52,789][569624] Avg episode reward: [(0, '4573.265')] [2023-03-08 23:45:53,886][569911] Updated weights for policy 0, policy_version 192080 (0.0005) [2023-03-08 23:45:57,420][569911] Updated weights for policy 0, policy_version 192160 (0.0005) [2023-03-08 23:45:57,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11732.6). Total num frames: 98390016. Throughput: 0: 11724.2. Samples: 98364652. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:45:57,800][569624] Avg episode reward: [(0, '4742.479')] [2023-03-08 23:46:00,722][569911] Updated weights for policy 0, policy_version 192240 (0.0005) [2023-03-08 23:46:02,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11732.6). 
Total num frames: 98447360. Throughput: 0: 11768.1. Samples: 98436228. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:46:02,789][569624] Avg episode reward: [(0, '4641.810')] [2023-03-08 23:46:02,832][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000192288_98451456.pth... [2023-03-08 23:46:02,833][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000191592_98095104.pth [2023-03-08 23:46:04,299][569911] Updated weights for policy 0, policy_version 192320 (0.0006) [2023-03-08 23:46:07,722][569911] Updated weights for policy 0, policy_version 192400 (0.0005) [2023-03-08 23:46:07,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11746.5). Total num frames: 98508800. Throughput: 0: 11838.4. Samples: 98507024. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:46:07,789][569624] Avg episode reward: [(0, '4642.208')] [2023-03-08 23:46:11,028][569911] Updated weights for policy 0, policy_version 192480 (0.0005) [2023-03-08 23:46:12,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11760.4). Total num frames: 98570240. Throughput: 0: 11803.4. Samples: 98542816. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:46:12,800][569624] Avg episode reward: [(0, '4731.095')] [2023-03-08 23:46:14,410][569911] Updated weights for policy 0, policy_version 192560 (0.0005) [2023-03-08 23:46:17,678][569911] Updated weights for policy 0, policy_version 192640 (0.0005) [2023-03-08 23:46:17,789][569624] Fps is (10 sec: 12287.8, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 98631680. Throughput: 0: 11910.0. Samples: 98617968. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:46:17,800][569624] Avg episode reward: [(0, '4578.340')] [2023-03-08 23:46:17,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000192640_98631680.pth... 
[2023-03-08 23:46:17,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000191936_98271232.pth [2023-03-08 23:46:20,889][569911] Updated weights for policy 0, policy_version 192720 (0.0005) [2023-03-08 23:46:22,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11760.4). Total num frames: 98693120. Throughput: 0: 12012.1. Samples: 98691660. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:46:22,800][569624] Avg episode reward: [(0, '4838.893')] [2023-03-08 23:46:24,412][569911] Updated weights for policy 0, policy_version 192800 (0.0005) [2023-03-08 23:46:27,789][569624] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 98750464. Throughput: 0: 12034.7. Samples: 98726736. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:46:27,800][569624] Avg episode reward: [(0, '4788.564')] [2023-03-08 23:46:27,914][569911] Updated weights for policy 0, policy_version 192880 (0.0005) [2023-03-08 23:46:31,565][569911] Updated weights for policy 0, policy_version 192960 (0.0005) [2023-03-08 23:46:32,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11746.5). Total num frames: 98807808. Throughput: 0: 11920.8. Samples: 98795444. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:46:32,800][569624] Avg episode reward: [(0, '4450.481')] [2023-03-08 23:46:32,804][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000192984_98807808.pth... [2023-03-08 23:46:32,806][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000192288_98451456.pth [2023-03-08 23:46:35,050][569911] Updated weights for policy 0, policy_version 193040 (0.0005) [2023-03-08 23:46:37,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11760.4). Total num frames: 98869248. Throughput: 0: 11941.5. Samples: 98866380. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-08 23:46:37,800][569624] Avg episode reward: [(0, '4462.105')] [2023-03-08 23:46:38,432][569911] Updated weights for policy 0, policy_version 193120 (0.0005) [2023-03-08 23:46:41,993][569911] Updated weights for policy 0, policy_version 193200 (0.0005) [2023-03-08 23:46:42,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 11746.5). Total num frames: 98926592. Throughput: 0: 11941.7. Samples: 98902028. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:46:42,800][569624] Avg episode reward: [(0, '4630.465')] [2023-03-08 23:46:45,704][569911] Updated weights for policy 0, policy_version 193280 (0.0005) [2023-03-08 23:46:47,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11732.6). Total num frames: 98983936. Throughput: 0: 11864.9. Samples: 98970148. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:46:47,789][569624] Avg episode reward: [(0, '4719.090')] [2023-03-08 23:46:47,792][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000193328_98983936.pth... [2023-03-08 23:46:47,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000192640_98631680.pth [2023-03-08 23:46:49,166][569911] Updated weights for policy 0, policy_version 193360 (0.0005) [2023-03-08 23:46:52,789][569624] Fps is (10 sec: 11059.2, 60 sec: 11741.9, 300 sec: 11718.7). Total num frames: 99037184. Throughput: 0: 11807.6. Samples: 99038364. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:46:52,789][569624] Avg episode reward: [(0, '4787.710')] [2023-03-08 23:46:52,815][569911] Updated weights for policy 0, policy_version 193440 (0.0005) [2023-03-08 23:46:56,294][569911] Updated weights for policy 0, policy_version 193520 (0.0005) [2023-03-08 23:46:57,789][569624] Fps is (10 sec: 11469.0, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 99098624. 
Throughput: 0: 11805.1. Samples: 99074044. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:46:57,789][569624] Avg episode reward: [(0, '4681.150')] [2023-03-08 23:46:59,793][569911] Updated weights for policy 0, policy_version 193600 (0.0005) [2023-03-08 23:47:02,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 99155968. Throughput: 0: 11678.2. Samples: 99143488. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:02,789][569624] Avg episode reward: [(0, '4753.509')] [2023-03-08 23:47:02,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000193664_99155968.pth... [2023-03-08 23:47:02,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000192984_98807808.pth [2023-03-08 23:47:03,446][569911] Updated weights for policy 0, policy_version 193680 (0.0005) [2023-03-08 23:47:07,180][569911] Updated weights for policy 0, policy_version 193760 (0.0004) [2023-03-08 23:47:07,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11741.8, 300 sec: 11732.6). Total num frames: 99213312. Throughput: 0: 11502.8. Samples: 99209288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:07,789][569624] Avg episode reward: [(0, '4561.831')] [2023-03-08 23:47:10,516][569911] Updated weights for policy 0, policy_version 193840 (0.0005) [2023-03-08 23:47:12,789][569624] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11732.6). Total num frames: 99270656. Throughput: 0: 11542.4. Samples: 99246144. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:12,789][569624] Avg episode reward: [(0, '4690.958')] [2023-03-08 23:47:13,957][569911] Updated weights for policy 0, policy_version 193920 (0.0005) [2023-03-08 23:47:17,657][569911] Updated weights for policy 0, policy_version 194000 (0.0005) [2023-03-08 23:47:17,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11718.7). 
Total num frames: 99328000. Throughput: 0: 11550.9. Samples: 99315236. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:17,789][569624] Avg episode reward: [(0, '4777.091')] [2023-03-08 23:47:17,793][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000194000_99328000.pth... [2023-03-08 23:47:17,795][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000193328_98983936.pth [2023-03-08 23:47:21,134][569911] Updated weights for policy 0, policy_version 194080 (0.0005) [2023-03-08 23:47:22,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11718.7). Total num frames: 99385344. Throughput: 0: 11531.1. Samples: 99385280. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:22,789][569624] Avg episode reward: [(0, '4696.898')] [2023-03-08 23:47:24,740][569911] Updated weights for policy 0, policy_version 194160 (0.0005) [2023-03-08 23:47:27,789][569624] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11718.7). Total num frames: 99442688. Throughput: 0: 11484.2. Samples: 99418816. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:27,789][569624] Avg episode reward: [(0, '4682.406')] [2023-03-08 23:47:28,197][569911] Updated weights for policy 0, policy_version 194240 (0.0005) [2023-03-08 23:47:31,768][569911] Updated weights for policy 0, policy_version 194320 (0.0005) [2023-03-08 23:47:32,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11718.7). Total num frames: 99500032. Throughput: 0: 11525.9. Samples: 99488812. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:32,789][569624] Avg episode reward: [(0, '4651.173')] [2023-03-08 23:47:32,809][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000194344_99504128.pth... 
[2023-03-08 23:47:32,810][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000193664_99155968.pth [2023-03-08 23:47:35,266][569911] Updated weights for policy 0, policy_version 194400 (0.0005) [2023-03-08 23:47:37,789][569624] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11718.7). Total num frames: 99557376. Throughput: 0: 11542.5. Samples: 99557776. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:37,789][569624] Avg episode reward: [(0, '4733.321')] [2023-03-08 23:47:38,854][569911] Updated weights for policy 0, policy_version 194480 (0.0005) [2023-03-08 23:47:42,383][569911] Updated weights for policy 0, policy_version 194560 (0.0005) [2023-03-08 23:47:42,789][569624] Fps is (10 sec: 11878.4, 60 sec: 11537.1, 300 sec: 11746.5). Total num frames: 99618816. Throughput: 0: 11559.0. Samples: 99594200. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:42,789][569624] Avg episode reward: [(0, '4671.065')] [2023-03-08 23:47:45,822][569911] Updated weights for policy 0, policy_version 194640 (0.0005) [2023-03-08 23:47:47,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11732.6). Total num frames: 99676160. Throughput: 0: 11565.7. Samples: 99663944. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-08 23:47:47,789][569624] Avg episode reward: [(0, '4610.936')] [2023-03-08 23:47:47,791][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000194680_99676160.pth... [2023-03-08 23:47:47,793][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000194000_99328000.pth [2023-03-08 23:47:49,097][569911] Updated weights for policy 0, policy_version 194720 (0.0005) [2023-03-08 23:47:52,608][569911] Updated weights for policy 0, policy_version 194800 (0.0005) [2023-03-08 23:47:52,789][569624] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11732.6). 
Total num frames: 99737600. Throughput: 0: 11738.6. Samples: 99737524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:47:52,789][569624] Avg episode reward: [(0, '4755.841')] [2023-03-08 23:47:55,924][569911] Updated weights for policy 0, policy_version 194880 (0.0004) [2023-03-08 23:47:57,789][569624] Fps is (10 sec: 12287.9, 60 sec: 11673.6, 300 sec: 11746.5). Total num frames: 99799040. Throughput: 0: 11740.6. Samples: 99774472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:47:57,789][569624] Avg episode reward: [(0, '4608.591')] [2023-03-08 23:47:59,433][569911] Updated weights for policy 0, policy_version 194960 (0.0005) [2023-03-08 23:48:02,789][569624] Fps is (10 sec: 11878.6, 60 sec: 11673.6, 300 sec: 11746.5). Total num frames: 99856384. Throughput: 0: 11804.5. Samples: 99846436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:48:02,789][569624] Avg episode reward: [(0, '4735.188')] [2023-03-08 23:48:02,826][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000195040_99860480.pth... [2023-03-08 23:48:02,827][569911] Updated weights for policy 0, policy_version 195040 (0.0005) [2023-03-08 23:48:02,828][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000194344_99504128.pth [2023-03-08 23:48:06,048][569911] Updated weights for policy 0, policy_version 195120 (0.0004) [2023-03-08 23:48:07,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11760.4). Total num frames: 99921920. Throughput: 0: 11840.1. Samples: 99918084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:48:07,789][569624] Avg episode reward: [(0, '4693.304')] [2023-03-08 23:48:09,519][569911] Updated weights for policy 0, policy_version 195200 (0.0005) [2023-03-08 23:48:12,789][569624] Fps is (10 sec: 12288.0, 60 sec: 11810.2, 300 sec: 11746.5). Total num frames: 99979264. Throughput: 0: 11906.0. 
Samples: 99954584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-08 23:48:12,789][569624] Avg episode reward: [(0, '4593.858')] [2023-03-08 23:48:12,878][569911] Updated weights for policy 0, policy_version 195280 (0.0005) [2023-03-08 23:48:14,619][569867] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-08 23:48:14,928][569867] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-08 23:48:14,929][569949] Stopping RolloutWorker_w5... [2023-03-08 23:48:14,929][570012] Stopping RolloutWorker_w7... [2023-03-08 23:48:14,929][570012] Loop rollout_proc7_evt_loop terminating... [2023-03-08 23:48:14,929][569949] Loop rollout_proc5_evt_loop terminating... [2023-03-08 23:48:14,929][569912] Stopping RolloutWorker_w1... [2023-03-08 23:48:14,929][569928] Stopping RolloutWorker_w6... [2023-03-08 23:48:14,929][569914] Stopping RolloutWorker_w2... [2023-03-08 23:48:14,929][569915] Stopping RolloutWorker_w3... [2023-03-08 23:48:14,929][569912] Loop rollout_proc1_evt_loop terminating... [2023-03-08 23:48:14,929][569928] Loop rollout_proc6_evt_loop terminating... [2023-03-08 23:48:14,929][569914] Loop rollout_proc2_evt_loop terminating... [2023-03-08 23:48:14,929][569916] Stopping RolloutWorker_w4... [2023-03-08 23:48:14,929][569867] Stopping Batcher_0... [2023-03-08 23:48:14,929][569624] Component RolloutWorker_w5 stopped! [2023-03-08 23:48:14,929][569915] Loop rollout_proc3_evt_loop terminating... [2023-03-08 23:48:14,929][569913] Stopping RolloutWorker_w0... [2023-03-08 23:48:14,930][569916] Loop rollout_proc4_evt_loop terminating... [2023-03-08 23:48:14,930][569913] Loop rollout_proc0_evt_loop terminating... [2023-03-08 23:48:14,930][569867] Loop batcher_evt_loop terminating... [2023-03-08 23:48:14,930][569624] Component RolloutWorker_w7 stopped! [2023-03-08 23:48:14,930][569624] Component Batcher_0 stopped! 
[2023-03-08 23:48:14,930][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-08 23:48:14,930][569624] Component RolloutWorker_w1 stopped! [2023-03-08 23:48:14,930][569624] Component RolloutWorker_w6 stopped! [2023-03-08 23:48:14,931][569624] Component RolloutWorker_w3 stopped! [2023-03-08 23:48:14,931][569624] Component RolloutWorker_w2 stopped! [2023-03-08 23:48:14,931][569624] Component RolloutWorker_w4 stopped! [2023-03-08 23:48:14,931][569624] Component RolloutWorker_w0 stopped! [2023-03-08 23:48:14,933][569867] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000194680_99676160.pth [2023-03-08 23:48:14,933][569867] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/handle-press-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-08 23:48:14,936][569867] Stopping LearnerWorker_p0... [2023-03-08 23:48:14,936][569867] Loop learner_proc0_evt_loop terminating... [2023-03-08 23:48:14,936][569624] Component LearnerWorker_p0 stopped! [2023-03-08 23:48:14,956][569911] Weights refcount: 2 0 [2023-03-08 23:48:14,957][569911] Stopping InferenceWorker_p0-w0... [2023-03-08 23:48:14,957][569911] Loop inference_proc0-0_evt_loop terminating... [2023-03-08 23:48:14,957][569624] Component InferenceWorker_p0-w0 stopped! [2023-03-08 23:48:14,958][569624] Waiting for process learner_proc0 to stop... [2023-03-08 23:48:15,476][569624] Waiting for process inference_proc0-0 to join... [2023-03-08 23:48:15,476][569624] Waiting for process rollout_proc0 to join... [2023-03-08 23:48:15,476][569624] Waiting for process rollout_proc1 to join... [2023-03-08 23:48:15,477][569624] Waiting for process rollout_proc2 to join... [2023-03-08 23:48:15,477][569624] Waiting for process rollout_proc3 to join... [2023-03-08 23:48:15,477][569624] Waiting for process rollout_proc4 to join... 
[2023-03-08 23:48:15,477][569624] Waiting for process rollout_proc5 to join... [2023-03-08 23:48:15,478][569624] Waiting for process rollout_proc6 to join... [2023-03-08 23:48:15,478][569624] Waiting for process rollout_proc7 to join... [2023-03-08 23:48:15,478][569624] Batcher 0 profile tree view: batching: 17.3589, releasing_batches: 14.5749 [2023-03-08 23:48:15,479][569624] InferenceWorker_p0-w0 profile tree view: wait_policy: 0.0000 wait_policy_total: 2976.6702 update_model: 94.5618 weight_update: 0.0005 one_step: 0.0009 handle_policy_step: 4838.8487 deserialize: 207.2251, stack: 49.9797, obs_to_device_normalize: 834.7076, forward: 2394.6606, send_messages: 408.9424 prepare_outputs: 530.0341 to_cpu: 79.6535 [2023-03-08 23:48:15,479][569624] Learner 0 profile tree view: misc: 0.0953, prepare_batch: 78.1829 train: 1003.6210 epoch_init: 0.3569, minibatch_init: 10.6191, losses_postprocess: 12.0979, kl_divergence: 3.8207, after_optimizer: 5.4355 calculate_losses: 408.2741 losses_init: 0.2824, forward_head: 196.4878, bptt_initial: 1.1164, bptt: 1.1674, tail: 100.6706, advantages_returns: 8.2606, losses: 87.9205 update: 548.8813 clip: 49.7449 [2023-03-08 23:48:15,479][569624] RolloutWorker_w0 profile tree view: wait_for_trajectories: 4.3642, enqueue_policy_requests: 155.0565, env_step: 4421.3383, overhead: 339.6932, complete_rollouts: 3.6444 save_policy_outputs: 383.5156 split_output_tensors: 186.5653 [2023-03-08 23:48:15,479][569624] RolloutWorker_w7 profile tree view: wait_for_trajectories: 4.1220, enqueue_policy_requests: 155.2521, env_step: 4363.4798, overhead: 332.2715, complete_rollouts: 3.7285 save_policy_outputs: 373.4461 split_output_tensors: 182.2224 [2023-03-08 23:48:15,480][569624] Loop Runner_EvtLoop terminating... [2023-03-08 23:48:15,480][569624] Runner profile tree view: main_loop: 8485.3381 [2023-03-08 23:48:15,480][569624] Collected {0: 100007936}, FPS: 11786.0