[2023-03-10 19:58:02,014][1096160] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/config.json... [2023-03-10 19:58:02,030][1096160] Rollout worker 0 uses device cpu [2023-03-10 19:58:02,030][1096160] Rollout worker 1 uses device cpu [2023-03-10 19:58:02,031][1096160] Rollout worker 2 uses device cpu [2023-03-10 19:58:02,031][1096160] Rollout worker 3 uses device cpu [2023-03-10 19:58:02,031][1096160] Rollout worker 4 uses device cpu [2023-03-10 19:58:02,031][1096160] Rollout worker 5 uses device cpu [2023-03-10 19:58:02,031][1096160] Rollout worker 6 uses device cpu [2023-03-10 19:58:02,031][1096160] Rollout worker 7 uses device cpu [2023-03-10 19:58:02,031][1096160] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1 [2023-03-10 19:58:02,044][1096160] InferenceWorker_p0-w0: min num requests: 2 [2023-03-10 19:58:02,062][1096160] Starting all processes... [2023-03-10 19:58:02,063][1096160] Starting process learner_proc0 [2023-03-10 19:58:02,113][1096160] Starting all processes... [2023-03-10 19:58:02,132][1096160] Starting process inference_proc0-0 [2023-03-10 19:58:02,142][1096160] Starting process rollout_proc0 [2023-03-10 19:58:02,144][1096160] Starting process rollout_proc2 [2023-03-10 19:58:02,143][1096160] Starting process rollout_proc1 [2023-03-10 19:58:02,144][1096160] Starting process rollout_proc3 [2023-03-10 19:58:02,144][1096160] Starting process rollout_proc4 [2023-03-10 19:58:02,144][1096160] Starting process rollout_proc5 [2023-03-10 19:58:02,144][1096160] Starting process rollout_proc6 [2023-03-10 19:58:02,144][1096160] Starting process rollout_proc7 [2023-03-10 19:58:03,693][1096399] Starting seed is not provided [2023-03-10 19:58:03,693][1096399] Initializing actor-critic model on device cpu [2023-03-10 19:58:03,693][1096399] RunningMeanStd input shape: (39,) [2023-03-10 19:58:03,694][1096399] RunningMeanStd input shape: (1,) [2023-03-10 19:58:03,699][1096446] Worker 5 uses CPU cores [20, 21, 22, 23] [2023-03-10 19:58:03,761][1096399] Created Actor Critic model with architecture: [2023-03-10 19:58:03,762][1096399] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): MlpEncoder( (mlp_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=Tanh) (2): RecursiveScriptModule(original_name=Linear) (3): RecursiveScriptModule(original_name=Tanh) ) ) ) ) (core): ModelCoreIdentity() (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=64, out_features=1, bias=True) (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev( (distribution_linear): Linear(in_features=64, out_features=4, bias=True) ) ) [2023-03-10 19:58:03,935][1096447] Worker 2 uses CPU cores [8, 9, 10, 11] [2023-03-10 19:58:04,075][1096399] Using optimizer [2023-03-10 19:58:04,076][1096399] No checkpoints found [2023-03-10 19:58:04,076][1096399] Did not load from checkpoint, starting from scratch! [2023-03-10 19:58:04,076][1096399] Initialized policy 0 weights for model version 0 [2023-03-10 19:58:04,077][1096399] LearnerWorker_p0 finished initialization! [2023-03-10 19:58:04,079][1096443] RunningMeanStd input shape: (39,) [2023-03-10 19:58:04,079][1096443] RunningMeanStd input shape: (1,) [2023-03-10 19:58:04,088][1096449] Worker 4 uses CPU cores [16, 17, 18, 19] [2023-03-10 19:58:04,132][1096444] Worker 1 uses CPU cores [4, 5, 6, 7] [2023-03-10 19:58:04,177][1096160] Inference worker 0-0 is ready! [2023-03-10 19:58:04,178][1096160] All inference workers are ready! Signal rollout workers to start! [2023-03-10 19:58:04,236][1096445] Worker 3 uses CPU cores [12, 13, 14, 15] [2023-03-10 19:58:04,254][1096450] Worker 6 uses CPU cores [24, 25, 26, 27] [2023-03-10 19:58:04,473][1096448] Worker 0 uses CPU cores [0, 1, 2, 3] [2023-03-10 19:58:04,480][1096495] Worker 7 uses CPU cores [28, 29, 30, 31] [2023-03-10 19:58:04,742][1096160] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-10 19:58:07,860][1096447] Decorrelating experience for 0 frames... [2023-03-10 19:58:07,872][1096447] Decorrelating experience for 64 frames... [2023-03-10 19:58:07,889][1096449] Decorrelating experience for 0 frames... [2023-03-10 19:58:07,901][1096449] Decorrelating experience for 64 frames... [2023-03-10 19:58:07,902][1096446] Decorrelating experience for 0 frames... [2023-03-10 19:58:07,904][1096447] Decorrelating experience for 128 frames... [2023-03-10 19:58:07,921][1096446] Decorrelating experience for 64 frames... [2023-03-10 19:58:07,933][1096449] Decorrelating experience for 128 frames... [2023-03-10 19:58:07,956][1096447] Decorrelating experience for 192 frames... [2023-03-10 19:58:07,973][1096446] Decorrelating experience for 128 frames... [2023-03-10 19:58:07,986][1096449] Decorrelating experience for 192 frames... [2023-03-10 19:58:08,001][1096450] Decorrelating experience for 0 frames... [2023-03-10 19:58:08,013][1096450] Decorrelating experience for 64 frames... [2023-03-10 19:58:08,046][1096450] Decorrelating experience for 128 frames... [2023-03-10 19:58:08,055][1096446] Decorrelating experience for 192 frames... [2023-03-10 19:58:08,063][1096445] Decorrelating experience for 0 frames... [2023-03-10 19:58:08,075][1096445] Decorrelating experience for 64 frames... [2023-03-10 19:58:08,098][1096450] Decorrelating experience for 192 frames... [2023-03-10 19:58:08,107][1096445] Decorrelating experience for 128 frames... [2023-03-10 19:58:08,159][1096445] Decorrelating experience for 192 frames... [2023-03-10 19:58:08,179][1096448] Decorrelating experience for 0 frames... [2023-03-10 19:58:08,190][1096448] Decorrelating experience for 64 frames... [2023-03-10 19:58:08,194][1096495] Decorrelating experience for 0 frames... [2023-03-10 19:58:08,205][1096495] Decorrelating experience for 64 frames... [2023-03-10 19:58:08,222][1096448] Decorrelating experience for 128 frames... [2023-03-10 19:58:08,238][1096495] Decorrelating experience for 128 frames... [2023-03-10 19:58:08,275][1096448] Decorrelating experience for 192 frames... [2023-03-10 19:58:08,290][1096495] Decorrelating experience for 192 frames... [2023-03-10 19:58:09,044][1096444] Decorrelating experience for 0 frames... [2023-03-10 19:58:09,056][1096444] Decorrelating experience for 64 frames... [2023-03-10 19:58:09,094][1096444] Decorrelating experience for 128 frames... [2023-03-10 19:58:09,178][1096444] Decorrelating experience for 192 frames... [2023-03-10 19:58:09,742][1096160] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-10 19:58:11,588][1096447] Decorrelating experience for 256 frames... [2023-03-10 19:58:11,629][1096449] Decorrelating experience for 256 frames... [2023-03-10 19:58:11,680][1096447] Decorrelating experience for 320 frames... [2023-03-10 19:58:11,722][1096449] Decorrelating experience for 320 frames... [2023-03-10 19:58:11,752][1096450] Decorrelating experience for 256 frames... [2023-03-10 19:58:11,797][1096447] Decorrelating experience for 384 frames... [2023-03-10 19:58:11,807][1096445] Decorrelating experience for 256 frames... [2023-03-10 19:58:11,834][1096449] Decorrelating experience for 384 frames... [2023-03-10 19:58:11,845][1096450] Decorrelating experience for 320 frames... [2023-03-10 19:58:11,900][1096445] Decorrelating experience for 320 frames... [2023-03-10 19:58:11,905][1096448] Decorrelating experience for 256 frames... [2023-03-10 19:58:11,927][1096495] Decorrelating experience for 256 frames... [2023-03-10 19:58:11,933][1096447] Decorrelating experience for 448 frames... [2023-03-10 19:58:11,958][1096450] Decorrelating experience for 384 frames... [2023-03-10 19:58:11,968][1096449] Decorrelating experience for 448 frames... [2023-03-10 19:58:11,998][1096448] Decorrelating experience for 320 frames... [2023-03-10 19:58:12,014][1096445] Decorrelating experience for 384 frames... [2023-03-10 19:58:12,019][1096495] Decorrelating experience for 320 frames... [2023-03-10 19:58:12,043][1096446] Decorrelating experience for 256 frames... [2023-03-10 19:58:12,089][1096450] Decorrelating experience for 448 frames... [2023-03-10 19:58:12,132][1096495] Decorrelating experience for 384 frames... [2023-03-10 19:58:12,132][1096448] Decorrelating experience for 384 frames... [2023-03-10 19:58:12,143][1096446] Decorrelating experience for 320 frames... [2023-03-10 19:58:12,177][1096445] Decorrelating experience for 448 frames... [2023-03-10 19:58:12,255][1096446] Decorrelating experience for 384 frames... [2023-03-10 19:58:12,264][1096448] Decorrelating experience for 448 frames... [2023-03-10 19:58:12,266][1096495] Decorrelating experience for 448 frames... [2023-03-10 19:58:12,388][1096446] Decorrelating experience for 448 frames... [2023-03-10 19:58:13,290][1096444] Decorrelating experience for 256 frames... [2023-03-10 19:58:13,380][1096444] Decorrelating experience for 320 frames... [2023-03-10 19:58:13,489][1096444] Decorrelating experience for 384 frames... [2023-03-10 19:58:13,618][1096444] Decorrelating experience for 448 frames... [2023-03-10 19:58:14,742][1096160] Fps is (10 sec: 409.6, 60 sec: 409.6, 300 sec: 409.6). Total num frames: 4096. Throughput: 0: 358.4. Samples: 3584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:58:14,742][1096160] Avg episode reward: [(0, '166.238')] [2023-03-10 19:58:14,792][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000000016_8192.pth... [2023-03-10 19:58:17,493][1096443] Updated weights for policy 0, policy_version 80 (0.0005) [2023-03-10 19:58:19,741][1096160] Fps is (10 sec: 6553.7, 60 sec: 4369.1, 300 sec: 4369.1). Total num frames: 65536. Throughput: 0: 4168.8. Samples: 62532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:58:19,742][1096160] Avg episode reward: [(0, '2182.046')] [2023-03-10 19:58:20,911][1096443] Updated weights for policy 0, policy_version 160 (0.0004) [2023-03-10 19:58:22,039][1096160] Heartbeat connected on Batcher_0 [2023-03-10 19:58:22,041][1096160] Heartbeat connected on LearnerWorker_p0 [2023-03-10 19:58:22,045][1096160] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-10 19:58:22,049][1096160] Heartbeat connected on RolloutWorker_w0 [2023-03-10 19:58:22,051][1096160] Heartbeat connected on RolloutWorker_w1 [2023-03-10 19:58:22,054][1096160] Heartbeat connected on RolloutWorker_w2 [2023-03-10 19:58:22,056][1096160] Heartbeat connected on RolloutWorker_w3 [2023-03-10 19:58:22,058][1096160] Heartbeat connected on RolloutWorker_w4 [2023-03-10 19:58:22,059][1096160] Heartbeat connected on RolloutWorker_w5 [2023-03-10 19:58:22,061][1096160] Heartbeat connected on RolloutWorker_w6 [2023-03-10 19:58:22,064][1096160] Heartbeat connected on RolloutWorker_w7 [2023-03-10 19:58:24,207][1096443] Updated weights for policy 0, policy_version 240 (0.0005) [2023-03-10 19:58:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 6348.8, 300 sec: 6348.8). Total num frames: 126976. Throughput: 0: 4880.8. Samples: 97616. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 19:58:24,742][1096160] Avg episode reward: [(0, '3171.534')] [2023-03-10 19:58:24,743][1096399] Saving new best policy, reward=3171.534! [2023-03-10 19:58:27,715][1096443] Updated weights for policy 0, policy_version 320 (0.0006) [2023-03-10 19:58:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 7372.8, 300 sec: 7372.8). Total num frames: 184320. Throughput: 0: 6775.5. Samples: 169388. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 19:58:29,742][1096160] Avg episode reward: [(0, '3704.437')] [2023-03-10 19:58:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000000360_184320.pth... [2023-03-10 19:58:29,747][1096399] Saving new best policy, reward=3704.437! [2023-03-10 19:58:31,269][1096443] Updated weights for policy 0, policy_version 400 (0.0005) [2023-03-10 19:58:34,729][1096443] Updated weights for policy 0, policy_version 480 (0.0005) [2023-03-10 19:58:34,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 245760. Throughput: 0: 8037.5. Samples: 241124. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 19:58:34,742][1096160] Avg episode reward: [(0, '3786.698')] [2023-03-10 19:58:34,743][1096399] Saving new best policy, reward=3786.698! [2023-03-10 19:58:37,962][1096443] Updated weights for policy 0, policy_version 560 (0.0005) [2023-03-10 19:58:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 8777.1, 300 sec: 8777.1). Total num frames: 307200. Throughput: 0: 7915.0. Samples: 277024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:58:39,742][1096160] Avg episode reward: [(0, '4074.040')] [2023-03-10 19:58:39,743][1096399] Saving new best policy, reward=4074.040! [2023-03-10 19:58:41,285][1096443] Updated weights for policy 0, policy_version 640 (0.0005) [2023-03-10 19:58:44,632][1096443] Updated weights for policy 0, policy_version 720 (0.0005) [2023-03-10 19:58:44,742][1096160] Fps is (10 sec: 12287.7, 60 sec: 9216.0, 300 sec: 9216.0). Total num frames: 368640. Throughput: 0: 8806.7. Samples: 352268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:58:44,742][1096160] Avg episode reward: [(0, '4343.386')] [2023-03-10 19:58:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000000720_368640.pth... [2023-03-10 19:58:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000000016_8192.pth [2023-03-10 19:58:44,748][1096399] Saving new best policy, reward=4343.386! [2023-03-10 19:58:48,177][1096443] Updated weights for policy 0, policy_version 800 (0.0004) [2023-03-10 19:58:49,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 9466.3, 300 sec: 9466.3). Total num frames: 425984. Throughput: 0: 9377.6. Samples: 421992. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 19:58:49,742][1096160] Avg episode reward: [(0, '4683.180')] [2023-03-10 19:58:49,742][1096399] Saving new best policy, reward=4683.180! [2023-03-10 19:58:51,479][1096443] Updated weights for policy 0, policy_version 880 (0.0005) [2023-03-10 19:58:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 9748.5, 300 sec: 9748.5). Total num frames: 487424. Throughput: 0: 10194.1. Samples: 458736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:58:54,742][1096160] Avg episode reward: [(0, '4805.492')] [2023-03-10 19:58:54,743][1096399] Saving new best policy, reward=4805.492! [2023-03-10 19:58:54,884][1096443] Updated weights for policy 0, policy_version 960 (0.0005) [2023-03-10 19:58:58,238][1096443] Updated weights for policy 0, policy_version 1040 (0.0005) [2023-03-10 19:58:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 9979.3, 300 sec: 9979.3). Total num frames: 548864. Throughput: 0: 11753.1. Samples: 532472. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 19:58:59,742][1096160] Avg episode reward: [(0, '4815.379')] [2023-03-10 19:58:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000001072_548864.pth... [2023-03-10 19:58:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000000360_184320.pth [2023-03-10 19:58:59,749][1096399] Saving new best policy, reward=4815.379! [2023-03-10 19:59:01,618][1096443] Updated weights for policy 0, policy_version 1120 (0.0006) [2023-03-10 19:59:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 10171.7, 300 sec: 10171.7). Total num frames: 610304. Throughput: 0: 12059.0. Samples: 605188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:59:04,742][1096160] Avg episode reward: [(0, '4848.539')] [2023-03-10 19:59:04,743][1096399] Saving new best policy, reward=4848.539! [2023-03-10 19:59:05,106][1096443] Updated weights for policy 0, policy_version 1200 (0.0004) [2023-03-10 19:59:08,573][1096443] Updated weights for policy 0, policy_version 1280 (0.0005) [2023-03-10 19:59:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11127.5, 300 sec: 10271.5). Total num frames: 667648. Throughput: 0: 12053.3. Samples: 640016. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 19:59:09,742][1096160] Avg episode reward: [(0, '4854.341')] [2023-03-10 19:59:09,743][1096399] Saving new best policy, reward=4854.341! [2023-03-10 19:59:11,934][1096443] Updated weights for policy 0, policy_version 1360 (0.0005) [2023-03-10 19:59:14,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 10357.0). Total num frames: 724992. Throughput: 0: 12027.8. Samples: 710640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:59:14,742][1096160] Avg episode reward: [(0, '4842.963')] [2023-03-10 19:59:14,764][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000001424_729088.pth... [2023-03-10 19:59:14,766][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000000720_368640.pth [2023-03-10 19:59:15,514][1096443] Updated weights for policy 0, policy_version 1440 (0.0005) [2023-03-10 19:59:18,850][1096443] Updated weights for policy 0, policy_version 1520 (0.0005) [2023-03-10 19:59:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 10485.8). Total num frames: 786432. Throughput: 0: 12027.1. Samples: 782344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:59:19,742][1096160] Avg episode reward: [(0, '4848.039')] [2023-03-10 19:59:22,442][1096443] Updated weights for policy 0, policy_version 1600 (0.0005) [2023-03-10 19:59:24,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 10547.2). Total num frames: 843776. Throughput: 0: 11970.7. Samples: 815704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:59:24,742][1096160] Avg episode reward: [(0, '4853.177')] [2023-03-10 19:59:25,984][1096443] Updated weights for policy 0, policy_version 1680 (0.0005) [2023-03-10 19:59:29,490][1096443] Updated weights for policy 0, policy_version 1760 (0.0005) [2023-03-10 19:59:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 10601.4). Total num frames: 901120. Throughput: 0: 11862.0. Samples: 886056. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 19:59:29,742][1096160] Avg episode reward: [(0, '4852.580')] [2023-03-10 19:59:29,780][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000001768_905216.pth... [2023-03-10 19:59:29,782][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000001072_548864.pth [2023-03-10 19:59:32,786][1096443] Updated weights for policy 0, policy_version 1840 (0.0005) [2023-03-10 19:59:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 10695.1). Total num frames: 962560. Throughput: 0: 11915.3. Samples: 958180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:59:34,742][1096160] Avg episode reward: [(0, '4854.536')] [2023-03-10 19:59:34,743][1096399] Saving new best policy, reward=4854.536! [2023-03-10 19:59:36,422][1096443] Updated weights for policy 0, policy_version 1920 (0.0004) [2023-03-10 19:59:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 10735.8). Total num frames: 1019904. Throughput: 0: 11878.6. Samples: 993272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 19:59:39,742][1096160] Avg episode reward: [(0, '4851.412')] [2023-03-10 19:59:39,904][1096443] Updated weights for policy 0, policy_version 2000 (0.0005) [2023-03-10 19:59:43,466][1096443] Updated weights for policy 0, policy_version 2080 (0.0005) [2023-03-10 19:59:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.2, 300 sec: 10772.5). Total num frames: 1077248. Throughput: 0: 11743.5. Samples: 1060928. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 19:59:44,742][1096160] Avg episode reward: [(0, '4848.143')] [2023-03-10 19:59:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000002104_1077248.pth... [2023-03-10 19:59:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000001424_729088.pth [2023-03-10 19:59:46,978][1096443] Updated weights for policy 0, policy_version 2160 (0.0005) [2023-03-10 19:59:49,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 10805.6). Total num frames: 1134592. Throughput: 0: 11684.2. Samples: 1130976. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 19:59:49,742][1096160] Avg episode reward: [(0, '4852.291')] [2023-03-10 19:59:50,459][1096443] Updated weights for policy 0, policy_version 2240 (0.0005) [2023-03-10 19:59:53,925][1096443] Updated weights for policy 0, policy_version 2320 (0.0004) [2023-03-10 19:59:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 10873.0). Total num frames: 1196032. Throughput: 0: 11720.4. Samples: 1167432. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 19:59:54,742][1096160] Avg episode reward: [(0, '4860.491')] [2023-03-10 19:59:54,743][1096399] Saving new best policy, reward=4860.491! [2023-03-10 19:59:57,326][1096443] Updated weights for policy 0, policy_version 2400 (0.0005) [2023-03-10 19:59:59,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11810.2, 300 sec: 10934.6). Total num frames: 1257472. Throughput: 0: 11759.5. Samples: 1239816. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 19:59:59,742][1096160] Avg episode reward: [(0, '4857.969')] [2023-03-10 19:59:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000002456_1257472.pth... [2023-03-10 19:59:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000001768_905216.pth [2023-03-10 20:00:00,743][1096443] Updated weights for policy 0, policy_version 2480 (0.0005) [2023-03-10 20:00:03,987][1096443] Updated weights for policy 0, policy_version 2560 (0.0005) [2023-03-10 20:00:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11810.2, 300 sec: 10990.9). Total num frames: 1318912. Throughput: 0: 11817.9. Samples: 1314148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:04,742][1096160] Avg episode reward: [(0, '4854.850')] [2023-03-10 20:00:07,497][1096443] Updated weights for policy 0, policy_version 2640 (0.0005) [2023-03-10 20:00:09,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.2, 300 sec: 11010.1). Total num frames: 1376256. Throughput: 0: 11841.1. Samples: 1348552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:09,742][1096160] Avg episode reward: [(0, '4857.612')] [2023-03-10 20:00:10,987][1096443] Updated weights for policy 0, policy_version 2720 (0.0005) [2023-03-10 20:00:14,430][1096443] Updated weights for policy 0, policy_version 2800 (0.0004) [2023-03-10 20:00:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11027.7). Total num frames: 1433600. Throughput: 0: 11854.9. Samples: 1419524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:14,742][1096160] Avg episode reward: [(0, '4858.970')] [2023-03-10 20:00:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000002800_1433600.pth... [2023-03-10 20:00:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000002104_1077248.pth [2023-03-10 20:00:17,959][1096443] Updated weights for policy 0, policy_version 2880 (0.0005) [2023-03-10 20:00:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11074.4). Total num frames: 1495040. Throughput: 0: 11817.4. Samples: 1489964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:19,742][1096160] Avg episode reward: [(0, '4855.942')] [2023-03-10 20:00:21,607][1096443] Updated weights for policy 0, policy_version 2960 (0.0005) [2023-03-10 20:00:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11059.2). Total num frames: 1548288. Throughput: 0: 11777.0. Samples: 1523236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:24,742][1096160] Avg episode reward: [(0, '4860.904')] [2023-03-10 20:00:24,742][1096399] Saving new best policy, reward=4860.904! [2023-03-10 20:00:25,164][1096443] Updated weights for policy 0, policy_version 3040 (0.0006) [2023-03-10 20:00:28,958][1096443] Updated weights for policy 0, policy_version 3120 (0.0005) [2023-03-10 20:00:29,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11741.9, 300 sec: 11073.3). Total num frames: 1605632. Throughput: 0: 11749.5. Samples: 1589656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:29,742][1096160] Avg episode reward: [(0, '4856.763')] [2023-03-10 20:00:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000003136_1605632.pth... [2023-03-10 20:00:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000002456_1257472.pth [2023-03-10 20:00:32,576][1096443] Updated weights for policy 0, policy_version 3200 (0.0005) [2023-03-10 20:00:34,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11086.5). Total num frames: 1662976. Throughput: 0: 11701.8. Samples: 1657556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:34,742][1096160] Avg episode reward: [(0, '4857.054')] [2023-03-10 20:00:36,191][1096443] Updated weights for policy 0, policy_version 3280 (0.0006) [2023-03-10 20:00:39,506][1096443] Updated weights for policy 0, policy_version 3360 (0.0005) [2023-03-10 20:00:39,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11098.8). Total num frames: 1720320. Throughput: 0: 11689.0. Samples: 1693436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:39,742][1096160] Avg episode reward: [(0, '4856.290')] [2023-03-10 20:00:42,784][1096443] Updated weights for policy 0, policy_version 3440 (0.0004) [2023-03-10 20:00:44,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11161.6). Total num frames: 1785856. Throughput: 0: 11722.5. Samples: 1767332. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:00:44,742][1096160] Avg episode reward: [(0, '4825.695')] [2023-03-10 20:00:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000003488_1785856.pth... [2023-03-10 20:00:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000002800_1433600.pth [2023-03-10 20:00:46,089][1096443] Updated weights for policy 0, policy_version 3520 (0.0004) [2023-03-10 20:00:49,474][1096443] Updated weights for policy 0, policy_version 3600 (0.0005) [2023-03-10 20:00:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11170.9). Total num frames: 1843200. Throughput: 0: 11694.0. Samples: 1840380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:49,742][1096160] Avg episode reward: [(0, '4855.540')] [2023-03-10 20:00:53,201][1096443] Updated weights for policy 0, policy_version 3680 (0.0004) [2023-03-10 20:00:54,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11741.9, 300 sec: 11179.7). Total num frames: 1900544. Throughput: 0: 11650.9. Samples: 1872844. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:00:54,742][1096160] Avg episode reward: [(0, '4852.366')] [2023-03-10 20:00:56,543][1096443] Updated weights for policy 0, policy_version 3760 (0.0005) [2023-03-10 20:00:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.8, 300 sec: 11211.3). Total num frames: 1961984. Throughput: 0: 11690.6. Samples: 1945600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:00:59,742][1096160] Avg episode reward: [(0, '4850.576')] [2023-03-10 20:00:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000003832_1961984.pth... [2023-03-10 20:00:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000003136_1605632.pth [2023-03-10 20:00:59,914][1096443] Updated weights for policy 0, policy_version 3840 (0.0005) [2023-03-10 20:01:03,140][1096443] Updated weights for policy 0, policy_version 3920 (0.0004) [2023-03-10 20:01:04,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 11810.1, 300 sec: 11264.0). Total num frames: 2027520. Throughput: 0: 11797.6. Samples: 2020856. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:01:04,742][1096160] Avg episode reward: [(0, '4851.970')] [2023-03-10 20:01:06,366][1096443] Updated weights for policy 0, policy_version 4000 (0.0004) [2023-03-10 20:01:09,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11810.1, 300 sec: 11269.5). Total num frames: 2084864. Throughput: 0: 11860.3. Samples: 2056952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:01:09,742][1096160] Avg episode reward: [(0, '4845.615')] [2023-03-10 20:01:09,923][1096443] Updated weights for policy 0, policy_version 4080 (0.0005) [2023-03-10 20:01:13,285][1096443] Updated weights for policy 0, policy_version 4160 (0.0005) [2023-03-10 20:01:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11296.3). Total num frames: 2146304. Throughput: 0: 12004.4. Samples: 2129852. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:01:14,742][1096160] Avg episode reward: [(0, '4832.762')] [2023-03-10 20:01:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000004192_2146304.pth... [2023-03-10 20:01:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000003488_1785856.pth [2023-03-10 20:01:16,553][1096443] Updated weights for policy 0, policy_version 4240 (0.0005) [2023-03-10 20:01:19,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11321.8). Total num frames: 2207744. Throughput: 0: 12098.1. Samples: 2201968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:01:19,742][1096160] Avg episode reward: [(0, '4836.982')] [2023-03-10 20:01:20,102][1096443] Updated weights for policy 0, policy_version 4320 (0.0005) [2023-03-10 20:01:23,498][1096443] Updated weights for policy 0, policy_version 4400 (0.0005) [2023-03-10 20:01:24,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 11325.4). Total num frames: 2265088. Throughput: 0: 12074.7. Samples: 2236796. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:01:24,742][1096160] Avg episode reward: [(0, '4851.255')] [2023-03-10 20:01:26,930][1096443] Updated weights for policy 0, policy_version 4480 (0.0006) [2023-03-10 20:01:29,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 12014.9, 300 sec: 11348.9). Total num frames: 2326528. Throughput: 0: 12052.7. Samples: 2309704. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:01:29,742][1096160] Avg episode reward: [(0, '4849.959')] [2023-03-10 20:01:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000004544_2326528.pth... [2023-03-10 20:01:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000003832_1961984.pth [2023-03-10 20:01:30,314][1096443] Updated weights for policy 0, policy_version 4560 (0.0005) [2023-03-10 20:01:33,981][1096443] Updated weights for policy 0, policy_version 4640 (0.0005) [2023-03-10 20:01:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11351.8). Total num frames: 2383872. Throughput: 0: 11975.0. Samples: 2379256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:01:34,742][1096160] Avg episode reward: [(0, '4849.769')] [2023-03-10 20:01:37,326][1096443] Updated weights for policy 0, policy_version 4720 (0.0005) [2023-03-10 20:01:39,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12083.2, 300 sec: 11373.6). Total num frames: 2445312. Throughput: 0: 12065.4. Samples: 2415788. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:01:39,742][1096160] Avg episode reward: [(0, '4848.573')] [2023-03-10 20:01:40,597][1096443] Updated weights for policy 0, policy_version 4800 (0.0005) [2023-03-10 20:01:44,129][1096443] Updated weights for policy 0, policy_version 4880 (0.0005) [2023-03-10 20:01:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11375.7). Total num frames: 2502656. Throughput: 0: 12035.2. Samples: 2487184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:01:44,742][1096160] Avg episode reward: [(0, '4852.608')] [2023-03-10 20:01:44,767][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000004896_2506752.pth... [2023-03-10 20:01:44,769][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000004192_2146304.pth [2023-03-10 20:01:47,613][1096443] Updated weights for policy 0, policy_version 4960 (0.0006) [2023-03-10 20:01:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11396.0). Total num frames: 2564096. Throughput: 0: 11974.4. Samples: 2559704. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:01:49,742][1096160] Avg episode reward: [(0, '4851.064')] [2023-03-10 20:01:50,988][1096443] Updated weights for policy 0, policy_version 5040 (0.0005) [2023-03-10 20:01:54,275][1096443] Updated weights for policy 0, policy_version 5120 (0.0005) [2023-03-10 20:01:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11415.4). Total num frames: 2625536. Throughput: 0: 11995.7. Samples: 2596760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:01:54,742][1096160] Avg episode reward: [(0, '4853.610')] [2023-03-10 20:01:57,619][1096443] Updated weights for policy 0, policy_version 5200 (0.0005) [2023-03-10 20:01:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11433.9). Total num frames: 2686976. Throughput: 0: 12014.4. Samples: 2670500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:01:59,742][1096160] Avg episode reward: [(0, '4856.361')] [2023-03-10 20:01:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000005248_2686976.pth... [2023-03-10 20:01:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000004544_2326528.pth [2023-03-10 20:02:00,894][1096443] Updated weights for policy 0, policy_version 5280 (0.0005) [2023-03-10 20:02:04,422][1096443] Updated weights for policy 0, policy_version 5360 (0.0005) [2023-03-10 20:02:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11451.7). Total num frames: 2748416. Throughput: 0: 12019.2. Samples: 2742832. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:02:04,742][1096160] Avg episode reward: [(0, '4855.254')] [2023-03-10 20:02:07,742][1096443] Updated weights for policy 0, policy_version 5440 (0.0005) [2023-03-10 20:02:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11452.1). Total num frames: 2805760. Throughput: 0: 12051.7. Samples: 2779124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:02:09,742][1096160] Avg episode reward: [(0, '4843.930')] [2023-03-10 20:02:11,149][1096443] Updated weights for policy 0, policy_version 5520 (0.0005) [2023-03-10 20:02:14,472][1096443] Updated weights for policy 0, policy_version 5600 (0.0004) [2023-03-10 20:02:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 11468.8). Total num frames: 2867200. Throughput: 0: 12076.5. Samples: 2853144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:02:14,742][1096160] Avg episode reward: [(0, '4839.628')] [2023-03-10 20:02:14,787][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000005608_2871296.pth... [2023-03-10 20:02:14,789][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000004896_2506752.pth [2023-03-10 20:02:18,089][1096443] Updated weights for policy 0, policy_version 5680 (0.0004) [2023-03-10 20:02:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11468.8). Total num frames: 2924544. Throughput: 0: 12028.1. Samples: 2920520. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:02:19,742][1096160] Avg episode reward: [(0, '4846.724')] [2023-03-10 20:02:21,529][1096443] Updated weights for policy 0, policy_version 5760 (0.0005) [2023-03-10 20:02:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11484.6). Total num frames: 2985984. Throughput: 0: 12034.0. Samples: 2957320. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:02:24,742][1096160] Avg episode reward: [(0, '4850.297')] [2023-03-10 20:02:24,751][1096443] Updated weights for policy 0, policy_version 5840 (0.0005) [2023-03-10 20:02:28,209][1096443] Updated weights for policy 0, policy_version 5920 (0.0005) [2023-03-10 20:02:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11499.7). Total num frames: 3047424. Throughput: 0: 12087.1. Samples: 3031104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:02:29,742][1096160] Avg episode reward: [(0, '4848.457')] [2023-03-10 20:02:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000005952_3047424.pth... [2023-03-10 20:02:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000005248_2686976.pth [2023-03-10 20:02:31,658][1096443] Updated weights for policy 0, policy_version 6000 (0.0004) [2023-03-10 20:02:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11514.3). Total num frames: 3108864. Throughput: 0: 12048.9. Samples: 3101904. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:02:34,743][1096160] Avg episode reward: [(0, '4853.064')] [2023-03-10 20:02:35,004][1096443] Updated weights for policy 0, policy_version 6080 (0.0005) [2023-03-10 20:02:38,627][1096443] Updated weights for policy 0, policy_version 6160 (0.0005) [2023-03-10 20:02:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11513.5). Total num frames: 3166208. Throughput: 0: 12025.3. Samples: 3137900. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:02:39,742][1096160] Avg episode reward: [(0, '4852.574')] [2023-03-10 20:02:41,975][1096443] Updated weights for policy 0, policy_version 6240 (0.0005) [2023-03-10 20:02:44,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 12014.9, 300 sec: 11512.7). Total num frames: 3223552. Throughput: 0: 11964.6. Samples: 3208904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:02:44,742][1096160] Avg episode reward: [(0, '4855.819')] [2023-03-10 20:02:44,802][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000006304_3227648.pth... [2023-03-10 20:02:44,804][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000005608_2871296.pth [2023-03-10 20:02:45,458][1096443] Updated weights for policy 0, policy_version 6320 (0.0005) [2023-03-10 20:02:48,698][1096443] Updated weights for policy 0, policy_version 6400 (0.0005) [2023-03-10 20:02:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11540.7). Total num frames: 3289088. Throughput: 0: 12008.4. Samples: 3283212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:02:49,742][1096160] Avg episode reward: [(0, '4857.365')] [2023-03-10 20:02:52,103][1096443] Updated weights for policy 0, policy_version 6480 (0.0005) [2023-03-10 20:02:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11539.4). Total num frames: 3346432. Throughput: 0: 11998.1. Samples: 3319040. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:02:54,742][1096160] Avg episode reward: [(0, '4858.383')] [2023-03-10 20:02:55,551][1096443] Updated weights for policy 0, policy_version 6560 (0.0005) [2023-03-10 20:02:59,024][1096443] Updated weights for policy 0, policy_version 6640 (0.0005) [2023-03-10 20:02:59,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11538.2). Total num frames: 3403776. Throughput: 0: 11939.7. Samples: 3390428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:02:59,742][1096160] Avg episode reward: [(0, '4853.336')] [2023-03-10 20:02:59,767][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000006656_3407872.pth... [2023-03-10 20:02:59,769][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000005952_3047424.pth [2023-03-10 20:03:02,453][1096443] Updated weights for policy 0, policy_version 6720 (0.0005) [2023-03-10 20:03:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11746.5). Total num frames: 3465216. Throughput: 0: 12011.3. Samples: 3461028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:03:04,753][1096160] Avg episode reward: [(0, '4852.433')] [2023-03-10 20:03:06,021][1096443] Updated weights for policy 0, policy_version 6800 (0.0005) [2023-03-10 20:03:09,619][1096443] Updated weights for policy 0, policy_version 6880 (0.0005) [2023-03-10 20:03:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 3522560. Throughput: 0: 11934.8. Samples: 3494388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:03:09,753][1096160] Avg episode reward: [(0, '4854.480')] [2023-03-10 20:03:13,119][1096443] Updated weights for policy 0, policy_version 6960 (0.0005) [2023-03-10 20:03:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 3584000. Throughput: 0: 11853.9. Samples: 3564528. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:03:14,742][1096160] Avg episode reward: [(0, '4853.306')] [2023-03-10 20:03:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000007000_3584000.pth... [2023-03-10 20:03:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000006304_3227648.pth [2023-03-10 20:03:16,461][1096443] Updated weights for policy 0, policy_version 7040 (0.0005) [2023-03-10 20:03:19,654][1096443] Updated weights for policy 0, policy_version 7120 (0.0004) [2023-03-10 20:03:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11927.0). Total num frames: 3645440. Throughput: 0: 11955.1. Samples: 3639880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:03:19,742][1096160] Avg episode reward: [(0, '4853.421')] [2023-03-10 20:03:23,051][1096443] Updated weights for policy 0, policy_version 7200 (0.0005) [2023-03-10 20:03:24,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 3702784. Throughput: 0: 11983.2. Samples: 3677144. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:03:24,742][1096160] Avg episode reward: [(0, '4850.932')] [2023-03-10 20:03:26,531][1096443] Updated weights for policy 0, policy_version 7280 (0.0004) [2023-03-10 20:03:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 3764224. Throughput: 0: 11973.5. Samples: 3747712. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:03:29,742][1096160] Avg episode reward: [(0, '4857.221')] [2023-03-10 20:03:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000007352_3764224.pth... [2023-03-10 20:03:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000006656_3407872.pth [2023-03-10 20:03:29,980][1096443] Updated weights for policy 0, policy_version 7360 (0.0004) [2023-03-10 20:03:33,552][1096443] Updated weights for policy 0, policy_version 7440 (0.0005) [2023-03-10 20:03:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 3821568. Throughput: 0: 11872.6. Samples: 3817480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:03:34,742][1096160] Avg episode reward: [(0, '4849.669')] [2023-03-10 20:03:36,984][1096443] Updated weights for policy 0, policy_version 7520 (0.0005) [2023-03-10 20:03:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 3883008. Throughput: 0: 11882.3. Samples: 3853744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:03:39,742][1096160] Avg episode reward: [(0, '4852.376')] [2023-03-10 20:03:40,206][1096443] Updated weights for policy 0, policy_version 7600 (0.0005) [2023-03-10 20:03:43,870][1096443] Updated weights for policy 0, policy_version 7680 (0.0005) [2023-03-10 20:03:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11913.1). Total num frames: 3940352. Throughput: 0: 11856.4. Samples: 3923968. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:03:44,742][1096160] Avg episode reward: [(0, '4854.874')] [2023-03-10 20:03:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000007696_3940352.pth... [2023-03-10 20:03:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000007000_3584000.pth [2023-03-10 20:03:47,322][1096443] Updated weights for policy 0, policy_version 7760 (0.0006) [2023-03-10 20:03:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 4001792. Throughput: 0: 11887.1. Samples: 3995948. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:03:49,742][1096160] Avg episode reward: [(0, '4851.530')] [2023-03-10 20:03:50,713][1096443] Updated weights for policy 0, policy_version 7840 (0.0005) [2023-03-10 20:03:54,198][1096443] Updated weights for policy 0, policy_version 7920 (0.0004) [2023-03-10 20:03:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 4059136. Throughput: 0: 11942.8. Samples: 4031816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:03:54,742][1096160] Avg episode reward: [(0, '4767.785')] [2023-03-10 20:03:57,703][1096443] Updated weights for policy 0, policy_version 8000 (0.0004) [2023-03-10 20:03:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.6, 300 sec: 11899.2). Total num frames: 4120576. Throughput: 0: 11981.6. Samples: 4103700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:03:59,742][1096160] Avg episode reward: [(0, '4774.240')] [2023-03-10 20:03:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000008048_4120576.pth... [2023-03-10 20:03:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000007352_3764224.pth [2023-03-10 20:04:01,037][1096443] Updated weights for policy 0, policy_version 8080 (0.0005) [2023-03-10 20:04:04,422][1096443] Updated weights for policy 0, policy_version 8160 (0.0005) [2023-03-10 20:04:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 4177920. Throughput: 0: 11899.8. Samples: 4175372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:04:04,742][1096160] Avg episode reward: [(0, '4826.153')] [2023-03-10 20:04:08,000][1096443] Updated weights for policy 0, policy_version 8240 (0.0005) [2023-03-10 20:04:09,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 4235264. Throughput: 0: 11832.8. Samples: 4209620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:04:09,742][1096160] Avg episode reward: [(0, '4827.930')] [2023-03-10 20:04:11,553][1096443] Updated weights for policy 0, policy_version 8320 (0.0005) [2023-03-10 20:04:14,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 4292608. Throughput: 0: 11824.3. Samples: 4279804. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:04:14,742][1096160] Avg episode reward: [(0, '4834.042')] [2023-03-10 20:04:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000008392_4296704.pth... [2023-03-10 20:04:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000007696_3940352.pth [2023-03-10 20:04:15,073][1096443] Updated weights for policy 0, policy_version 8400 (0.0005) [2023-03-10 20:04:18,583][1096443] Updated weights for policy 0, policy_version 8480 (0.0005) [2023-03-10 20:04:19,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 4354048. Throughput: 0: 11826.8. Samples: 4349684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:04:19,742][1096160] Avg episode reward: [(0, '4839.232')] [2023-03-10 20:04:21,974][1096443] Updated weights for policy 0, policy_version 8560 (0.0005) [2023-03-10 20:04:24,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 4415488. Throughput: 0: 11828.8. Samples: 4386040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:04:24,742][1096160] Avg episode reward: [(0, '4846.997')] [2023-03-10 20:04:25,383][1096443] Updated weights for policy 0, policy_version 8640 (0.0006) [2023-03-10 20:04:28,716][1096443] Updated weights for policy 0, policy_version 8720 (0.0005) [2023-03-10 20:04:29,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 4476928. Throughput: 0: 11900.7. Samples: 4459500. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:04:29,742][1096160] Avg episode reward: [(0, '4850.278')] [2023-03-10 20:04:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000008744_4476928.pth... [2023-03-10 20:04:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000008048_4120576.pth [2023-03-10 20:04:32,191][1096443] Updated weights for policy 0, policy_version 8800 (0.0005) [2023-03-10 20:04:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 4534272. Throughput: 0: 11904.9. Samples: 4531668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:04:34,742][1096160] Avg episode reward: [(0, '4809.305')] [2023-03-10 20:04:35,632][1096443] Updated weights for policy 0, policy_version 8880 (0.0004) [2023-03-10 20:04:38,879][1096443] Updated weights for policy 0, policy_version 8960 (0.0005) [2023-03-10 20:04:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11927.0). Total num frames: 4595712. Throughput: 0: 11895.3. Samples: 4567104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:04:39,742][1096160] Avg episode reward: [(0, '4800.173')] [2023-03-10 20:04:42,158][1096443] Updated weights for policy 0, policy_version 9040 (0.0005) [2023-03-10 20:04:44,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11940.9). Total num frames: 4657152. Throughput: 0: 11935.1. Samples: 4640776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:04:44,742][1096160] Avg episode reward: [(0, '4842.260')] [2023-03-10 20:04:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000009096_4657152.pth... [2023-03-10 20:04:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000008392_4296704.pth [2023-03-10 20:04:45,591][1096443] Updated weights for policy 0, policy_version 9120 (0.0004) [2023-03-10 20:04:48,990][1096443] Updated weights for policy 0, policy_version 9200 (0.0005) [2023-03-10 20:04:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11940.9). Total num frames: 4718592. Throughput: 0: 11966.6. Samples: 4713868. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:04:49,742][1096160] Avg episode reward: [(0, '4855.387')] [2023-03-10 20:04:52,412][1096443] Updated weights for policy 0, policy_version 9280 (0.0005) [2023-03-10 20:04:54,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 4775936. Throughput: 0: 12002.3. Samples: 4749724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:04:54,742][1096160] Avg episode reward: [(0, '4849.123')] [2023-03-10 20:04:55,899][1096443] Updated weights for policy 0, policy_version 9360 (0.0005) [2023-03-10 20:04:59,366][1096443] Updated weights for policy 0, policy_version 9440 (0.0005) [2023-03-10 20:04:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 4837376. Throughput: 0: 12022.4. Samples: 4820812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:04:59,742][1096160] Avg episode reward: [(0, '4846.027')] [2023-03-10 20:04:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000009448_4837376.pth... [2023-03-10 20:04:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000008744_4476928.pth [2023-03-10 20:05:02,697][1096443] Updated weights for policy 0, policy_version 9520 (0.0005) [2023-03-10 20:05:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 4898816. Throughput: 0: 12107.8. Samples: 4894536. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:05:04,742][1096160] Avg episode reward: [(0, '4846.605')] [2023-03-10 20:05:05,965][1096443] Updated weights for policy 0, policy_version 9600 (0.0005) [2023-03-10 20:05:09,386][1096443] Updated weights for policy 0, policy_version 9680 (0.0005) [2023-03-10 20:05:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 4956160. Throughput: 0: 12098.4. Samples: 4930468. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:05:09,742][1096160] Avg episode reward: [(0, '4847.428')] [2023-03-10 20:05:13,064][1096443] Updated weights for policy 0, policy_version 9760 (0.0005) [2023-03-10 20:05:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 11927.0). Total num frames: 5013504. Throughput: 0: 11992.5. Samples: 4999164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:05:14,742][1096160] Avg episode reward: [(0, '4848.913')] [2023-03-10 20:05:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000009792_5013504.pth... [2023-03-10 20:05:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000009096_4657152.pth [2023-03-10 20:05:16,530][1096443] Updated weights for policy 0, policy_version 9840 (0.0005) [2023-03-10 20:05:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 5074944. Throughput: 0: 11980.0. Samples: 5070768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:05:19,742][1096160] Avg episode reward: [(0, '4850.700')] [2023-03-10 20:05:19,988][1096443] Updated weights for policy 0, policy_version 9920 (0.0005) [2023-03-10 20:05:23,437][1096443] Updated weights for policy 0, policy_version 10000 (0.0005) [2023-03-10 20:05:24,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 5136384. Throughput: 0: 11954.8. Samples: 5105072. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:05:24,742][1096160] Avg episode reward: [(0, '4851.971')] [2023-03-10 20:05:26,947][1096443] Updated weights for policy 0, policy_version 10080 (0.0004) [2023-03-10 20:05:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.6, 300 sec: 11968.6). Total num frames: 5193728. Throughput: 0: 11923.9. Samples: 5177352. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:05:29,742][1096160] Avg episode reward: [(0, '4845.631')] [2023-03-10 20:05:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000010144_5193728.pth... [2023-03-10 20:05:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000009448_4837376.pth [2023-03-10 20:05:30,341][1096443] Updated weights for policy 0, policy_version 10160 (0.0004) [2023-03-10 20:05:33,908][1096443] Updated weights for policy 0, policy_version 10240 (0.0005) [2023-03-10 20:05:34,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 5251072. Throughput: 0: 11848.3. Samples: 5247040. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:05:34,742][1096160] Avg episode reward: [(0, '4843.588')] [2023-03-10 20:05:37,324][1096443] Updated weights for policy 0, policy_version 10320 (0.0005) [2023-03-10 20:05:39,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11940.9). Total num frames: 5308416. Throughput: 0: 11863.6. Samples: 5283588. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:05:39,742][1096160] Avg episode reward: [(0, '4852.002')] [2023-03-10 20:05:40,774][1096443] Updated weights for policy 0, policy_version 10400 (0.0004) [2023-03-10 20:05:44,154][1096443] Updated weights for policy 0, policy_version 10480 (0.0005) [2023-03-10 20:05:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 5369856. Throughput: 0: 11855.1. Samples: 5354288. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:05:44,742][1096160] Avg episode reward: [(0, '4837.347')] [2023-03-10 20:05:44,761][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000010496_5373952.pth... [2023-03-10 20:05:44,763][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000009792_5013504.pth [2023-03-10 20:05:47,475][1096443] Updated weights for policy 0, policy_version 10560 (0.0005) [2023-03-10 20:05:49,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 5431296. Throughput: 0: 11838.4. Samples: 5427264. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:05:49,742][1096160] Avg episode reward: [(0, '4841.469')] [2023-03-10 20:05:50,974][1096443] Updated weights for policy 0, policy_version 10640 (0.0005) [2023-03-10 20:05:54,320][1096443] Updated weights for policy 0, policy_version 10720 (0.0005) [2023-03-10 20:05:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.6, 300 sec: 11968.7). Total num frames: 5492736. Throughput: 0: 11830.6. Samples: 5462848. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:05:54,742][1096160] Avg episode reward: [(0, '4848.752')] [2023-03-10 20:05:57,609][1096443] Updated weights for policy 0, policy_version 10800 (0.0005) [2023-03-10 20:05:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 5554176. Throughput: 0: 11968.3. Samples: 5537736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:05:59,742][1096160] Avg episode reward: [(0, '4845.998')] [2023-03-10 20:05:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000010848_5554176.pth... [2023-03-10 20:05:59,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000010144_5193728.pth [2023-03-10 20:06:00,772][1096443] Updated weights for policy 0, policy_version 10880 (0.0005) [2023-03-10 20:06:04,301][1096443] Updated weights for policy 0, policy_version 10960 (0.0005) [2023-03-10 20:06:04,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 5615616. Throughput: 0: 12015.4. Samples: 5611460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:06:04,742][1096160] Avg episode reward: [(0, '4847.442')] [2023-03-10 20:06:07,974][1096443] Updated weights for policy 0, policy_version 11040 (0.0005) [2023-03-10 20:06:09,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11940.9). Total num frames: 5668864. Throughput: 0: 11982.9. Samples: 5644300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:06:09,742][1096160] Avg episode reward: [(0, '4842.097')] [2023-03-10 20:06:11,461][1096443] Updated weights for policy 0, policy_version 11120 (0.0005) [2023-03-10 20:06:14,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11946.6, 300 sec: 11940.9). Total num frames: 5730304. Throughput: 0: 11935.0. Samples: 5714428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:06:14,742][1096160] Avg episode reward: [(0, '4841.565')] [2023-03-10 20:06:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000011192_5730304.pth... [2023-03-10 20:06:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000010496_5373952.pth [2023-03-10 20:06:14,897][1096443] Updated weights for policy 0, policy_version 11200 (0.0005) [2023-03-10 20:06:18,321][1096443] Updated weights for policy 0, policy_version 11280 (0.0005) [2023-03-10 20:06:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 5791744. Throughput: 0: 12014.9. Samples: 5787712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:06:19,742][1096160] Avg episode reward: [(0, '4840.783')] [2023-03-10 20:06:21,512][1096443] Updated weights for policy 0, policy_version 11360 (0.0005) [2023-03-10 20:06:24,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11940.9). Total num frames: 5849088. Throughput: 0: 12019.2. Samples: 5824452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:06:24,742][1096160] Avg episode reward: [(0, '4834.333')] [2023-03-10 20:06:25,144][1096443] Updated weights for policy 0, policy_version 11440 (0.0005) [2023-03-10 20:06:28,624][1096443] Updated weights for policy 0, policy_version 11520 (0.0005) [2023-03-10 20:06:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 5910528. Throughput: 0: 11998.2. Samples: 5894208. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:06:29,742][1096160] Avg episode reward: [(0, '4826.196')] [2023-03-10 20:06:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000011544_5910528.pth... [2023-03-10 20:06:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000010848_5554176.pth [2023-03-10 20:06:32,012][1096443] Updated weights for policy 0, policy_version 11600 (0.0004) [2023-03-10 20:06:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 5971968. Throughput: 0: 11987.5. Samples: 5966700. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:06:34,742][1096160] Avg episode reward: [(0, '4824.839')] [2023-03-10 20:06:35,419][1096443] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-10 20:06:38,804][1096443] Updated weights for policy 0, policy_version 11760 (0.0005) [2023-03-10 20:06:39,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 6029312. Throughput: 0: 11977.2. Samples: 6001820. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:06:39,742][1096160] Avg episode reward: [(0, '4830.859')] [2023-03-10 20:06:42,232][1096443] Updated weights for policy 0, policy_version 11840 (0.0005) [2023-03-10 20:06:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 6090752. Throughput: 0: 11921.5. Samples: 6074204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:06:44,742][1096160] Avg episode reward: [(0, '4833.486')] [2023-03-10 20:06:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000011896_6090752.pth... [2023-03-10 20:06:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000011192_5730304.pth [2023-03-10 20:06:45,694][1096443] Updated weights for policy 0, policy_version 11920 (0.0005) [2023-03-10 20:06:49,272][1096443] Updated weights for policy 0, policy_version 12000 (0.0005) [2023-03-10 20:06:49,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11940.9). Total num frames: 6148096. Throughput: 0: 11833.3. Samples: 6143956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:06:49,742][1096160] Avg episode reward: [(0, '4829.172')] [2023-03-10 20:06:52,645][1096443] Updated weights for policy 0, policy_version 12080 (0.0005) [2023-03-10 20:06:54,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11927.0). Total num frames: 6205440. Throughput: 0: 11921.4. Samples: 6180764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:06:54,742][1096160] Avg episode reward: [(0, '4834.203')] [2023-03-10 20:06:56,083][1096443] Updated weights for policy 0, policy_version 12160 (0.0004) [2023-03-10 20:06:59,523][1096443] Updated weights for policy 0, policy_version 12240 (0.0005) [2023-03-10 20:06:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11927.0). Total num frames: 6266880. Throughput: 0: 11920.1. Samples: 6250832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:06:59,742][1096160] Avg episode reward: [(0, '4827.948')] [2023-03-10 20:06:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000012240_6266880.pth... [2023-03-10 20:06:59,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000011544_5910528.pth [2023-03-10 20:07:03,025][1096443] Updated weights for policy 0, policy_version 12320 (0.0005) [2023-03-10 20:07:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11940.9). Total num frames: 6328320. Throughput: 0: 11901.1. Samples: 6323264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:04,742][1096160] Avg episode reward: [(0, '4827.859')] [2023-03-10 20:07:06,434][1096443] Updated weights for policy 0, policy_version 12400 (0.0005) [2023-03-10 20:07:09,505][1096443] Updated weights for policy 0, policy_version 12480 (0.0005) [2023-03-10 20:07:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 6389760. Throughput: 0: 11897.5. Samples: 6359840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:09,742][1096160] Avg episode reward: [(0, '4825.584')] [2023-03-10 20:07:12,574][1096443] Updated weights for policy 0, policy_version 12560 (0.0005) [2023-03-10 20:07:14,741][1096160] Fps is (10 sec: 12697.8, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 6455296. Throughput: 0: 12102.1. Samples: 6438800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:14,742][1096160] Avg episode reward: [(0, '4830.917')] [2023-03-10 20:07:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000012608_6455296.pth... [2023-03-10 20:07:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000011896_6090752.pth [2023-03-10 20:07:15,940][1096443] Updated weights for policy 0, policy_version 12640 (0.0004) [2023-03-10 20:07:19,156][1096443] Updated weights for policy 0, policy_version 12720 (0.0005) [2023-03-10 20:07:19,741][1096160] Fps is (10 sec: 12697.7, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 6516736. Throughput: 0: 12142.7. Samples: 6513120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:19,742][1096160] Avg episode reward: [(0, '4812.316')] [2023-03-10 20:07:22,615][1096443] Updated weights for policy 0, policy_version 12800 (0.0005) [2023-03-10 20:07:24,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 6578176. Throughput: 0: 12170.5. Samples: 6549492. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:07:24,742][1096160] Avg episode reward: [(0, '4768.376')] [2023-03-10 20:07:25,932][1096443] Updated weights for policy 0, policy_version 12880 (0.0004) [2023-03-10 20:07:29,310][1096443] Updated weights for policy 0, policy_version 12960 (0.0004) [2023-03-10 20:07:29,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 6639616. Throughput: 0: 12194.5. Samples: 6622960. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:07:29,742][1096160] Avg episode reward: [(0, '4752.827')] [2023-03-10 20:07:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000012968_6639616.pth... [2023-03-10 20:07:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000012240_6266880.pth [2023-03-10 20:07:32,834][1096443] Updated weights for policy 0, policy_version 13040 (0.0005) [2023-03-10 20:07:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 6696960. Throughput: 0: 12188.6. Samples: 6692444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:34,742][1096160] Avg episode reward: [(0, '4782.154')] [2023-03-10 20:07:36,460][1096443] Updated weights for policy 0, policy_version 13120 (0.0005) [2023-03-10 20:07:39,727][1096443] Updated weights for policy 0, policy_version 13200 (0.0005) [2023-03-10 20:07:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11982.5). Total num frames: 6758400. Throughput: 0: 12139.6. Samples: 6727048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:39,742][1096160] Avg episode reward: [(0, '4733.977')] [2023-03-10 20:07:42,953][1096443] Updated weights for policy 0, policy_version 13280 (0.0006) [2023-03-10 20:07:44,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 6819840. Throughput: 0: 12273.3. Samples: 6803128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:44,742][1096160] Avg episode reward: [(0, '4664.118')] [2023-03-10 20:07:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000013320_6819840.pth... [2023-03-10 20:07:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000012608_6455296.pth [2023-03-10 20:07:46,376][1096443] Updated weights for policy 0, policy_version 13360 (0.0005) [2023-03-10 20:07:49,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 6877184. Throughput: 0: 12234.3. Samples: 6873808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:49,742][1096160] Avg episode reward: [(0, '4707.576')] [2023-03-10 20:07:49,847][1096443] Updated weights for policy 0, policy_version 13440 (0.0005) [2023-03-10 20:07:53,392][1096443] Updated weights for policy 0, policy_version 13520 (0.0004) [2023-03-10 20:07:54,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 6934528. Throughput: 0: 12173.3. Samples: 6907640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:54,742][1096160] Avg episode reward: [(0, '4585.540')] [2023-03-10 20:07:56,971][1096443] Updated weights for policy 0, policy_version 13600 (0.0005) [2023-03-10 20:07:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 6995968. Throughput: 0: 12014.2. Samples: 6979440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:07:59,742][1096160] Avg episode reward: [(0, '4340.651')] [2023-03-10 20:07:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000013664_6995968.pth... [2023-03-10 20:07:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000012968_6639616.pth [2023-03-10 20:08:00,291][1096443] Updated weights for policy 0, policy_version 13680 (0.0005) [2023-03-10 20:08:03,578][1096443] Updated weights for policy 0, policy_version 13760 (0.0006) [2023-03-10 20:08:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 11982.5). Total num frames: 7057408. Throughput: 0: 12000.4. Samples: 7053140. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:08:04,742][1096160] Avg episode reward: [(0, '4656.860')] [2023-03-10 20:08:06,876][1096443] Updated weights for policy 0, policy_version 13840 (0.0005) [2023-03-10 20:08:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11982.5). Total num frames: 7118848. Throughput: 0: 12015.4. Samples: 7090184. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:08:09,742][1096160] Avg episode reward: [(0, '4668.745')] [2023-03-10 20:08:10,284][1096443] Updated weights for policy 0, policy_version 13920 (0.0005) [2023-03-10 20:08:13,751][1096443] Updated weights for policy 0, policy_version 14000 (0.0005) [2023-03-10 20:08:14,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 7176192. Throughput: 0: 11971.5. Samples: 7161676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:08:14,742][1096160] Avg episode reward: [(0, '4695.487')] [2023-03-10 20:08:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000014016_7176192.pth... [2023-03-10 20:08:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000013320_6819840.pth [2023-03-10 20:08:17,374][1096443] Updated weights for policy 0, policy_version 14080 (0.0005) [2023-03-10 20:08:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 7237632. Throughput: 0: 11982.8. Samples: 7231668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:08:19,742][1096160] Avg episode reward: [(0, '4714.585')] [2023-03-10 20:08:20,740][1096443] Updated weights for policy 0, policy_version 14160 (0.0005) [2023-03-10 20:08:24,175][1096443] Updated weights for policy 0, policy_version 14240 (0.0005) [2023-03-10 20:08:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 7294976. Throughput: 0: 12014.0. Samples: 7267676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:08:24,742][1096160] Avg episode reward: [(0, '4767.498')] [2023-03-10 20:08:27,484][1096443] Updated weights for policy 0, policy_version 14320 (0.0005) [2023-03-10 20:08:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 7356416. Throughput: 0: 11932.6. Samples: 7340096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:08:29,742][1096160] Avg episode reward: [(0, '4805.539')] [2023-03-10 20:08:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000014368_7356416.pth... [2023-03-10 20:08:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000013664_6995968.pth [2023-03-10 20:08:30,905][1096443] Updated weights for policy 0, policy_version 14400 (0.0005) [2023-03-10 20:08:34,338][1096443] Updated weights for policy 0, policy_version 14480 (0.0005) [2023-03-10 20:08:34,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 7417856. Throughput: 0: 11969.7. Samples: 7412444. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:08:34,742][1096160] Avg episode reward: [(0, '4812.916')] [2023-03-10 20:08:37,943][1096443] Updated weights for policy 0, policy_version 14560 (0.0004) [2023-03-10 20:08:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 7475200. Throughput: 0: 11975.0. Samples: 7446516. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:08:39,742][1096160] Avg episode reward: [(0, '4819.838')] [2023-03-10 20:08:41,458][1096443] Updated weights for policy 0, policy_version 14640 (0.0004) [2023-03-10 20:08:44,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 7532544. Throughput: 0: 11934.9. Samples: 7516512. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:08:44,742][1096160] Avg episode reward: [(0, '4823.019')] [2023-03-10 20:08:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000014712_7532544.pth... [2023-03-10 20:08:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000014016_7176192.pth [2023-03-10 20:08:44,911][1096443] Updated weights for policy 0, policy_version 14720 (0.0005) [2023-03-10 20:08:48,211][1096443] Updated weights for policy 0, policy_version 14800 (0.0005) [2023-03-10 20:08:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 7593984. Throughput: 0: 11928.1. Samples: 7589904. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:08:49,742][1096160] Avg episode reward: [(0, '4818.551')] [2023-03-10 20:08:51,529][1096443] Updated weights for policy 0, policy_version 14880 (0.0005) [2023-03-10 20:08:54,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 7655424. Throughput: 0: 11925.3. Samples: 7626824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:08:54,742][1096160] Avg episode reward: [(0, '4834.391')] [2023-03-10 20:08:55,055][1096443] Updated weights for policy 0, policy_version 14960 (0.0005) [2023-03-10 20:08:58,217][1096443] Updated weights for policy 0, policy_version 15040 (0.0004) [2023-03-10 20:08:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 7716864. Throughput: 0: 11974.8. Samples: 7700544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:08:59,742][1096160] Avg episode reward: [(0, '4835.482')] [2023-03-10 20:08:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000015072_7716864.pth... [2023-03-10 20:08:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000014368_7356416.pth [2023-03-10 20:09:01,614][1096443] Updated weights for policy 0, policy_version 15120 (0.0004) [2023-03-10 20:09:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 7774208. Throughput: 0: 12008.7. Samples: 7772060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:09:04,742][1096160] Avg episode reward: [(0, '4827.767')] [2023-03-10 20:09:05,130][1096443] Updated weights for policy 0, policy_version 15200 (0.0005) [2023-03-10 20:09:08,645][1096443] Updated weights for policy 0, policy_version 15280 (0.0005) [2023-03-10 20:09:09,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 7835648. Throughput: 0: 11983.7. Samples: 7806940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:09:09,742][1096160] Avg episode reward: [(0, '4833.848')] [2023-03-10 20:09:12,127][1096443] Updated weights for policy 0, policy_version 15360 (0.0005) [2023-03-10 20:09:14,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 7892992. Throughput: 0: 11942.6. Samples: 7877512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:09:14,742][1096160] Avg episode reward: [(0, '4828.285')] [2023-03-10 20:09:14,761][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000015424_7897088.pth... [2023-03-10 20:09:14,763][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000014712_7532544.pth [2023-03-10 20:09:15,529][1096443] Updated weights for policy 0, policy_version 15440 (0.0005) [2023-03-10 20:09:18,946][1096443] Updated weights for policy 0, policy_version 15520 (0.0005) [2023-03-10 20:09:19,741][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 7954432. Throughput: 0: 11950.7. Samples: 7950224. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:09:19,742][1096160] Avg episode reward: [(0, '4838.604')] [2023-03-10 20:09:22,399][1096443] Updated weights for policy 0, policy_version 15600 (0.0005) [2023-03-10 20:09:24,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 8011776. Throughput: 0: 11971.6. Samples: 7985236. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:09:24,742][1096160] Avg episode reward: [(0, '4841.982')] [2023-03-10 20:09:25,840][1096443] Updated weights for policy 0, policy_version 15680 (0.0005) [2023-03-10 20:09:29,441][1096443] Updated weights for policy 0, policy_version 15760 (0.0005) [2023-03-10 20:09:29,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 8069120. Throughput: 0: 12003.1. Samples: 8056652. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:09:29,742][1096160] Avg episode reward: [(0, '4840.826')] [2023-03-10 20:09:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000015760_8069120.pth... [2023-03-10 20:09:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000015072_7716864.pth [2023-03-10 20:09:32,759][1096443] Updated weights for policy 0, policy_version 15840 (0.0005) [2023-03-10 20:09:34,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 8130560. Throughput: 0: 11927.6. Samples: 8126644. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:09:34,742][1096160] Avg episode reward: [(0, '4854.710')] [2023-03-10 20:09:36,221][1096443] Updated weights for policy 0, policy_version 15920 (0.0005) [2023-03-10 20:09:39,731][1096443] Updated weights for policy 0, policy_version 16000 (0.0005) [2023-03-10 20:09:39,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 8192000. Throughput: 0: 11916.5. Samples: 8163064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:09:39,742][1096160] Avg episode reward: [(0, '4845.024')] [2023-03-10 20:09:43,162][1096443] Updated weights for policy 0, policy_version 16080 (0.0006) [2023-03-10 20:09:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 8249344. Throughput: 0: 11837.3. Samples: 8233224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:09:44,742][1096160] Avg episode reward: [(0, '4848.028')] [2023-03-10 20:09:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000016112_8249344.pth... [2023-03-10 20:09:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000015424_7897088.pth [2023-03-10 20:09:46,557][1096443] Updated weights for policy 0, policy_version 16160 (0.0005) [2023-03-10 20:09:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 8310784. Throughput: 0: 11877.1. Samples: 8306528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:09:49,742][1096160] Avg episode reward: [(0, '4845.561')] [2023-03-10 20:09:49,962][1096443] Updated weights for policy 0, policy_version 16240 (0.0005) [2023-03-10 20:09:53,284][1096443] Updated weights for policy 0, policy_version 16320 (0.0004) [2023-03-10 20:09:54,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 8372224. Throughput: 0: 11906.2. Samples: 8342720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:09:54,742][1096160] Avg episode reward: [(0, '4847.728')] [2023-03-10 20:09:56,740][1096443] Updated weights for policy 0, policy_version 16400 (0.0005) [2023-03-10 20:09:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 8429568. Throughput: 0: 11921.3. Samples: 8413972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:09:59,742][1096160] Avg episode reward: [(0, '4851.222')] [2023-03-10 20:09:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000016464_8429568.pth... [2023-03-10 20:09:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000015760_8069120.pth [2023-03-10 20:10:00,340][1096443] Updated weights for policy 0, policy_version 16480 (0.0005) [2023-03-10 20:10:03,779][1096443] Updated weights for policy 0, policy_version 16560 (0.0005) [2023-03-10 20:10:04,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 8486912. Throughput: 0: 11865.9. Samples: 8484192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:10:04,742][1096160] Avg episode reward: [(0, '4842.144')] [2023-03-10 20:10:07,115][1096443] Updated weights for policy 0, policy_version 16640 (0.0005) [2023-03-10 20:10:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 8548352. Throughput: 0: 11904.2. Samples: 8520928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:10:09,742][1096160] Avg episode reward: [(0, '4838.257')] [2023-03-10 20:10:10,658][1096443] Updated weights for policy 0, policy_version 16720 (0.0004) [2023-03-10 20:10:13,947][1096443] Updated weights for policy 0, policy_version 16800 (0.0005) [2023-03-10 20:10:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 8609792. Throughput: 0: 11921.2. Samples: 8593108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:10:14,742][1096160] Avg episode reward: [(0, '4843.242')] [2023-03-10 20:10:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000016816_8609792.pth... [2023-03-10 20:10:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000016112_8249344.pth [2023-03-10 20:10:17,247][1096443] Updated weights for policy 0, policy_version 16880 (0.0005) [2023-03-10 20:10:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.6, 300 sec: 11982.5). Total num frames: 8671232. Throughput: 0: 11990.7. Samples: 8666228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:10:19,742][1096160] Avg episode reward: [(0, '4847.585')] [2023-03-10 20:10:20,699][1096443] Updated weights for policy 0, policy_version 16960 (0.0006) [2023-03-10 20:10:23,961][1096443] Updated weights for policy 0, policy_version 17040 (0.0005) [2023-03-10 20:10:24,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 8732672. Throughput: 0: 12022.2. Samples: 8704064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:10:24,742][1096160] Avg episode reward: [(0, '4850.287')] [2023-03-10 20:10:27,426][1096443] Updated weights for policy 0, policy_version 17120 (0.0004) [2023-03-10 20:10:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 8790016. Throughput: 0: 12009.2. Samples: 8773640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:10:29,742][1096160] Avg episode reward: [(0, '4839.070')] [2023-03-10 20:10:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000017168_8790016.pth... [2023-03-10 20:10:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000016464_8429568.pth [2023-03-10 20:10:31,112][1096443] Updated weights for policy 0, policy_version 17200 (0.0005) [2023-03-10 20:10:34,649][1096443] Updated weights for policy 0, policy_version 17280 (0.0005) [2023-03-10 20:10:34,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11946.6, 300 sec: 11996.4). Total num frames: 8847360. Throughput: 0: 11927.6. Samples: 8843272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:10:34,742][1096160] Avg episode reward: [(0, '4841.584')] [2023-03-10 20:10:38,176][1096443] Updated weights for policy 0, policy_version 17360 (0.0006) [2023-03-10 20:10:39,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 8904704. Throughput: 0: 11885.4. Samples: 8877560. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:10:39,742][1096160] Avg episode reward: [(0, '4845.761')] [2023-03-10 20:10:41,499][1096443] Updated weights for policy 0, policy_version 17440 (0.0005) [2023-03-10 20:10:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 8966144. Throughput: 0: 11908.7. Samples: 8949864. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:10:44,742][1096160] Avg episode reward: [(0, '4847.515')] [2023-03-10 20:10:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000017512_8966144.pth... [2023-03-10 20:10:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000016816_8609792.pth [2023-03-10 20:10:44,813][1096443] Updated weights for policy 0, policy_version 17520 (0.0005) [2023-03-10 20:10:48,107][1096443] Updated weights for policy 0, policy_version 17600 (0.0005) [2023-03-10 20:10:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 9027584. Throughput: 0: 11996.7. Samples: 9024044. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:10:49,742][1096160] Avg episode reward: [(0, '4849.350')] [2023-03-10 20:10:51,403][1096443] Updated weights for policy 0, policy_version 17680 (0.0005) [2023-03-10 20:10:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 9089024. Throughput: 0: 12016.4. Samples: 9061664. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:10:54,742][1096160] Avg episode reward: [(0, '4849.034')] [2023-03-10 20:10:54,934][1096443] Updated weights for policy 0, policy_version 17760 (0.0005) [2023-03-10 20:10:58,397][1096443] Updated weights for policy 0, policy_version 17840 (0.0005) [2023-03-10 20:10:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 9150464. Throughput: 0: 11978.5. Samples: 9132140. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:10:59,742][1096160] Avg episode reward: [(0, '4850.430')] [2023-03-10 20:10:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000017872_9150464.pth... [2023-03-10 20:10:59,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000017168_8790016.pth [2023-03-10 20:11:01,767][1096443] Updated weights for policy 0, policy_version 17920 (0.0005) [2023-03-10 20:11:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 9207808. Throughput: 0: 11995.8. Samples: 9206040. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:11:04,742][1096160] Avg episode reward: [(0, '4853.737')] [2023-03-10 20:11:05,130][1096443] Updated weights for policy 0, policy_version 18000 (0.0005) [2023-03-10 20:11:08,544][1096443] Updated weights for policy 0, policy_version 18080 (0.0005) [2023-03-10 20:11:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 9269248. Throughput: 0: 11925.5. Samples: 9240712. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:11:09,742][1096160] Avg episode reward: [(0, '4845.809')] [2023-03-10 20:11:11,846][1096443] Updated weights for policy 0, policy_version 18160 (0.0004) [2023-03-10 20:11:14,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 9326592. Throughput: 0: 11991.3. Samples: 9313248. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:11:14,742][1096160] Avg episode reward: [(0, '4852.374')] [2023-03-10 20:11:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000018216_9326592.pth... [2023-03-10 20:11:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000017512_8966144.pth [2023-03-10 20:11:15,552][1096443] Updated weights for policy 0, policy_version 18240 (0.0005) [2023-03-10 20:11:18,904][1096443] Updated weights for policy 0, policy_version 18320 (0.0005) [2023-03-10 20:11:19,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 9388032. Throughput: 0: 12016.2. Samples: 9384000. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:11:19,742][1096160] Avg episode reward: [(0, '4842.819')] [2023-03-10 20:11:22,175][1096443] Updated weights for policy 0, policy_version 18400 (0.0005) [2023-03-10 20:11:24,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 9453568. Throughput: 0: 12073.9. Samples: 9420888. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:11:24,742][1096160] Avg episode reward: [(0, '4838.108')] [2023-03-10 20:11:25,572][1096443] Updated weights for policy 0, policy_version 18480 (0.0005) [2023-03-10 20:11:29,103][1096443] Updated weights for policy 0, policy_version 18560 (0.0005) [2023-03-10 20:11:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 9510912. Throughput: 0: 12078.7. Samples: 9493408. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:11:29,742][1096160] Avg episode reward: [(0, '4847.586')] [2023-03-10 20:11:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000018576_9510912.pth... [2023-03-10 20:11:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000017872_9150464.pth [2023-03-10 20:11:32,431][1096443] Updated weights for policy 0, policy_version 18640 (0.0005) [2023-03-10 20:11:34,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 9568256. Throughput: 0: 12002.8. Samples: 9564172. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:11:34,742][1096160] Avg episode reward: [(0, '4842.671')] [2023-03-10 20:11:35,842][1096443] Updated weights for policy 0, policy_version 18720 (0.0004) [2023-03-10 20:11:39,340][1096443] Updated weights for policy 0, policy_version 18800 (0.0004) [2023-03-10 20:11:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 9629696. Throughput: 0: 11987.2. Samples: 9601088. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:11:39,742][1096160] Avg episode reward: [(0, '4836.453')] [2023-03-10 20:11:42,592][1096443] Updated weights for policy 0, policy_version 18880 (0.0005) [2023-03-10 20:11:44,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 9691136. Throughput: 0: 12052.9. Samples: 9674520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:11:44,742][1096160] Avg episode reward: [(0, '4841.693')] [2023-03-10 20:11:44,747][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000018928_9691136.pth... [2023-03-10 20:11:44,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000018216_9326592.pth [2023-03-10 20:11:45,920][1096443] Updated weights for policy 0, policy_version 18960 (0.0005) [2023-03-10 20:11:49,334][1096443] Updated weights for policy 0, policy_version 19040 (0.0005) [2023-03-10 20:11:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 9752576. Throughput: 0: 12048.0. Samples: 9748200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:11:49,742][1096160] Avg episode reward: [(0, '4842.974')] [2023-03-10 20:11:52,937][1096443] Updated weights for policy 0, policy_version 19120 (0.0005) [2023-03-10 20:11:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 9809920. Throughput: 0: 12012.1. Samples: 9781256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:11:54,742][1096160] Avg episode reward: [(0, '4834.092')] [2023-03-10 20:11:56,354][1096443] Updated weights for policy 0, policy_version 19200 (0.0005) [2023-03-10 20:11:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 9867264. Throughput: 0: 11970.8. Samples: 9851936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:11:59,742][1096160] Avg episode reward: [(0, '4822.080')] [2023-03-10 20:11:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000019272_9867264.pth... [2023-03-10 20:11:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000018576_9510912.pth [2023-03-10 20:11:59,900][1096443] Updated weights for policy 0, policy_version 19280 (0.0004) [2023-03-10 20:12:03,232][1096443] Updated weights for policy 0, policy_version 19360 (0.0005) [2023-03-10 20:12:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 9928704. Throughput: 0: 12012.3. Samples: 9924556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:04,742][1096160] Avg episode reward: [(0, '4833.608')] [2023-03-10 20:12:06,555][1096443] Updated weights for policy 0, policy_version 19440 (0.0004) [2023-03-10 20:12:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 9986048. Throughput: 0: 12010.9. Samples: 9961380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:09,742][1096160] Avg episode reward: [(0, '4843.119')] [2023-03-10 20:12:10,067][1096443] Updated weights for policy 0, policy_version 19520 (0.0005) [2023-03-10 20:12:13,495][1096443] Updated weights for policy 0, policy_version 19600 (0.0005) [2023-03-10 20:12:14,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 10047488. Throughput: 0: 11967.6. Samples: 10031948. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:12:14,742][1096160] Avg episode reward: [(0, '4844.449')] [2023-03-10 20:12:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000019624_10047488.pth... [2023-03-10 20:12:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000018928_9691136.pth [2023-03-10 20:12:16,808][1096443] Updated weights for policy 0, policy_version 19680 (0.0005) [2023-03-10 20:12:19,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 10108928. Throughput: 0: 12006.6. Samples: 10104468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:12:19,742][1096160] Avg episode reward: [(0, '4850.672')] [2023-03-10 20:12:20,377][1096443] Updated weights for policy 0, policy_version 19760 (0.0005) [2023-03-10 20:12:23,899][1096443] Updated weights for policy 0, policy_version 19840 (0.0005) [2023-03-10 20:12:24,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 10166272. Throughput: 0: 11934.8. Samples: 10138156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:24,742][1096160] Avg episode reward: [(0, '4850.446')] [2023-03-10 20:12:27,334][1096443] Updated weights for policy 0, policy_version 19920 (0.0005) [2023-03-10 20:12:29,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 10223616. Throughput: 0: 11904.5. Samples: 10210220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:29,742][1096160] Avg episode reward: [(0, '4847.839')] [2023-03-10 20:12:29,756][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000019976_10227712.pth... [2023-03-10 20:12:29,758][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000019272_9867264.pth [2023-03-10 20:12:30,762][1096443] Updated weights for policy 0, policy_version 20000 (0.0005) [2023-03-10 20:12:34,201][1096443] Updated weights for policy 0, policy_version 20080 (0.0005) [2023-03-10 20:12:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 10285056. Throughput: 0: 11840.5. Samples: 10281024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:34,742][1096160] Avg episode reward: [(0, '4851.451')] [2023-03-10 20:12:37,639][1096443] Updated weights for policy 0, policy_version 20160 (0.0004) [2023-03-10 20:12:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 10346496. Throughput: 0: 11923.3. Samples: 10317804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:39,742][1096160] Avg episode reward: [(0, '4845.601')] [2023-03-10 20:12:40,913][1096443] Updated weights for policy 0, policy_version 20240 (0.0005) [2023-03-10 20:12:44,354][1096443] Updated weights for policy 0, policy_version 20320 (0.0005) [2023-03-10 20:12:44,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11946.6, 300 sec: 11968.6). Total num frames: 10407936. Throughput: 0: 11983.2. Samples: 10391180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:44,743][1096160] Avg episode reward: [(0, '4837.698')] [2023-03-10 20:12:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000020328_10407936.pth... [2023-03-10 20:12:44,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000019624_10047488.pth [2023-03-10 20:12:47,753][1096443] Updated weights for policy 0, policy_version 20400 (0.0005) [2023-03-10 20:12:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 10469376. Throughput: 0: 11965.6. Samples: 10463008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:49,742][1096160] Avg episode reward: [(0, '4836.284')] [2023-03-10 20:12:50,950][1096443] Updated weights for policy 0, policy_version 20480 (0.0005) [2023-03-10 20:12:54,322][1096443] Updated weights for policy 0, policy_version 20560 (0.0004) [2023-03-10 20:12:54,742][1096160] Fps is (10 sec: 12288.3, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 10530816. Throughput: 0: 12002.2. Samples: 10501480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:54,742][1096160] Avg episode reward: [(0, '4838.752')] [2023-03-10 20:12:57,905][1096443] Updated weights for policy 0, policy_version 20640 (0.0005) [2023-03-10 20:12:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 10588160. Throughput: 0: 11994.9. Samples: 10571720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:12:59,742][1096160] Avg episode reward: [(0, '4835.753')] [2023-03-10 20:12:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000020680_10588160.pth... [2023-03-10 20:12:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000019976_10227712.pth [2023-03-10 20:13:01,488][1096443] Updated weights for policy 0, policy_version 20720 (0.0004) [2023-03-10 20:13:04,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 10645504. Throughput: 0: 11932.0. Samples: 10641408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:13:04,742][1096160] Avg episode reward: [(0, '4835.108')] [2023-03-10 20:13:04,955][1096443] Updated weights for policy 0, policy_version 20800 (0.0005) [2023-03-10 20:13:08,507][1096443] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-10 20:13:09,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 10702848. Throughput: 0: 11913.2. Samples: 10674248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:13:09,742][1096160] Avg episode reward: [(0, '4839.999')] [2023-03-10 20:13:11,821][1096443] Updated weights for policy 0, policy_version 20960 (0.0005) [2023-03-10 20:13:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 10768384. Throughput: 0: 11967.7. Samples: 10748768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:13:14,742][1096160] Avg episode reward: [(0, '4841.349')] [2023-03-10 20:13:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000021032_10768384.pth... [2023-03-10 20:13:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000020328_10407936.pth [2023-03-10 20:13:15,131][1096443] Updated weights for policy 0, policy_version 21040 (0.0005) [2023-03-10 20:13:18,617][1096443] Updated weights for policy 0, policy_version 21120 (0.0006) [2023-03-10 20:13:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.6, 300 sec: 11968.6). Total num frames: 10825728. Throughput: 0: 11990.2. Samples: 10820584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:13:19,742][1096160] Avg episode reward: [(0, '4837.609')] [2023-03-10 20:13:21,985][1096443] Updated weights for policy 0, policy_version 21200 (0.0005) [2023-03-10 20:13:24,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 10883072. Throughput: 0: 12002.7. Samples: 10857924. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:13:24,742][1096160] Avg episode reward: [(0, '4837.636')] [2023-03-10 20:13:25,646][1096443] Updated weights for policy 0, policy_version 21280 (0.0004) [2023-03-10 20:13:29,052][1096443] Updated weights for policy 0, policy_version 21360 (0.0004) [2023-03-10 20:13:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 10944512. Throughput: 0: 11902.5. Samples: 10926792. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:13:29,742][1096160] Avg episode reward: [(0, '4839.369')] [2023-03-10 20:13:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000021376_10944512.pth... [2023-03-10 20:13:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000020680_10588160.pth [2023-03-10 20:13:32,395][1096443] Updated weights for policy 0, policy_version 21440 (0.0005) [2023-03-10 20:13:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 11001856. Throughput: 0: 11882.8. Samples: 10997736. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:13:34,742][1096160] Avg episode reward: [(0, '4839.820')] [2023-03-10 20:13:36,104][1096443] Updated weights for policy 0, policy_version 21520 (0.0005) [2023-03-10 20:13:39,587][1096443] Updated weights for policy 0, policy_version 21600 (0.0005) [2023-03-10 20:13:39,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 11059200. Throughput: 0: 11769.9. Samples: 11031124. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:13:39,742][1096160] Avg episode reward: [(0, '4842.199')] [2023-03-10 20:13:43,008][1096443] Updated weights for policy 0, policy_version 21680 (0.0005) [2023-03-10 20:13:44,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 11120640. Throughput: 0: 11817.0. Samples: 11103488. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:13:44,742][1096160] Avg episode reward: [(0, '4847.078')] [2023-03-10 20:13:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000021720_11120640.pth... [2023-03-10 20:13:44,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000021032_10768384.pth [2023-03-10 20:13:46,492][1096443] Updated weights for policy 0, policy_version 21760 (0.0004) [2023-03-10 20:13:49,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 11177984. Throughput: 0: 11834.3. Samples: 11173952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:13:49,742][1096160] Avg episode reward: [(0, '4841.236')] [2023-03-10 20:13:49,880][1096443] Updated weights for policy 0, policy_version 21840 (0.0004) [2023-03-10 20:13:53,233][1096443] Updated weights for policy 0, policy_version 21920 (0.0005) [2023-03-10 20:13:54,742][1096160] Fps is (10 sec: 11878.6, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 11239424. Throughput: 0: 11923.7. Samples: 11210816. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:13:54,742][1096160] Avg episode reward: [(0, '4842.071')] [2023-03-10 20:13:56,644][1096443] Updated weights for policy 0, policy_version 22000 (0.0005) [2023-03-10 20:13:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 11300864. Throughput: 0: 11903.1. Samples: 11284408. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:13:59,742][1096160] Avg episode reward: [(0, '4843.023')] [2023-03-10 20:13:59,756][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000022080_11304960.pth... [2023-03-10 20:13:59,756][1096443] Updated weights for policy 0, policy_version 22080 (0.0004) [2023-03-10 20:13:59,758][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000021376_10944512.pth [2023-03-10 20:14:03,144][1096443] Updated weights for policy 0, policy_version 22160 (0.0005) [2023-03-10 20:14:04,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 11362304. Throughput: 0: 11964.6. Samples: 11358988. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:14:04,742][1096160] Avg episode reward: [(0, '4837.561')] [2023-03-10 20:14:06,259][1096443] Updated weights for policy 0, policy_version 22240 (0.0004) [2023-03-10 20:14:09,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 11423744. Throughput: 0: 11990.9. Samples: 11397516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:09,742][1096160] Avg episode reward: [(0, '4828.288')] [2023-03-10 20:14:09,930][1096443] Updated weights for policy 0, policy_version 22320 (0.0004) [2023-03-10 20:14:13,387][1096443] Updated weights for policy 0, policy_version 22400 (0.0005) [2023-03-10 20:14:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 11481088. Throughput: 0: 12012.8. Samples: 11467368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:14,742][1096160] Avg episode reward: [(0, '4842.490')] [2023-03-10 20:14:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000022424_11481088.pth... [2023-03-10 20:14:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000021720_11120640.pth [2023-03-10 20:14:16,982][1096443] Updated weights for policy 0, policy_version 22480 (0.0005) [2023-03-10 20:14:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 11542528. Throughput: 0: 11983.8. Samples: 11537008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:19,742][1096160] Avg episode reward: [(0, '4832.417')] [2023-03-10 20:14:20,377][1096443] Updated weights for policy 0, policy_version 22560 (0.0005) [2023-03-10 20:14:23,725][1096443] Updated weights for policy 0, policy_version 22640 (0.0005) [2023-03-10 20:14:24,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 11599872. Throughput: 0: 12018.5. Samples: 11571956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:24,742][1096160] Avg episode reward: [(0, '4834.603')] [2023-03-10 20:14:27,149][1096443] Updated weights for policy 0, policy_version 22720 (0.0005) [2023-03-10 20:14:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 11661312. Throughput: 0: 12051.7. Samples: 11645812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:29,742][1096160] Avg episode reward: [(0, '4833.411')] [2023-03-10 20:14:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000022776_11661312.pth... [2023-03-10 20:14:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000022080_11304960.pth [2023-03-10 20:14:30,634][1096443] Updated weights for policy 0, policy_version 22800 (0.0005) [2023-03-10 20:14:34,197][1096443] Updated weights for policy 0, policy_version 22880 (0.0004) [2023-03-10 20:14:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 11718656. Throughput: 0: 12014.9. Samples: 11714624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:34,742][1096160] Avg episode reward: [(0, '4831.758')] [2023-03-10 20:14:37,494][1096443] Updated weights for policy 0, policy_version 22960 (0.0004) [2023-03-10 20:14:39,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 11780096. Throughput: 0: 12013.7. Samples: 11751432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:39,742][1096160] Avg episode reward: [(0, '4824.768')] [2023-03-10 20:14:40,819][1096443] Updated weights for policy 0, policy_version 23040 (0.0005) [2023-03-10 20:14:44,073][1096443] Updated weights for policy 0, policy_version 23120 (0.0004) [2023-03-10 20:14:44,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12015.0, 300 sec: 11968.7). Total num frames: 11841536. Throughput: 0: 12053.4. Samples: 11826808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:44,742][1096160] Avg episode reward: [(0, '4842.408')] [2023-03-10 20:14:44,802][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000023136_11845632.pth... [2023-03-10 20:14:44,804][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000022424_11481088.pth [2023-03-10 20:14:47,460][1096443] Updated weights for policy 0, policy_version 23200 (0.0005) [2023-03-10 20:14:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 11902976. Throughput: 0: 12013.4. Samples: 11899592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:49,742][1096160] Avg episode reward: [(0, '4830.628')] [2023-03-10 20:14:50,935][1096443] Updated weights for policy 0, policy_version 23280 (0.0005) [2023-03-10 20:14:54,354][1096443] Updated weights for policy 0, policy_version 23360 (0.0005) [2023-03-10 20:14:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 11964416. Throughput: 0: 11955.4. Samples: 11935508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:54,742][1096160] Avg episode reward: [(0, '4827.973')] [2023-03-10 20:14:57,830][1096443] Updated weights for policy 0, policy_version 23440 (0.0005) [2023-03-10 20:14:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 12021760. Throughput: 0: 11958.0. Samples: 12005480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:14:59,742][1096160] Avg episode reward: [(0, '4829.869')] [2023-03-10 20:14:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000023480_12021760.pth... [2023-03-10 20:14:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000022776_11661312.pth [2023-03-10 20:15:01,329][1096443] Updated weights for policy 0, policy_version 23520 (0.0005) [2023-03-10 20:15:04,565][1096443] Updated weights for policy 0, policy_version 23600 (0.0004) [2023-03-10 20:15:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 12083200. Throughput: 0: 12046.6. Samples: 12079104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:15:04,742][1096160] Avg episode reward: [(0, '4819.627')] [2023-03-10 20:15:07,990][1096443] Updated weights for policy 0, policy_version 23680 (0.0005) [2023-03-10 20:15:09,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 12144640. Throughput: 0: 12085.7. Samples: 12115812. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:15:09,742][1096160] Avg episode reward: [(0, '4829.727')] [2023-03-10 20:15:11,388][1096443] Updated weights for policy 0, policy_version 23760 (0.0005) [2023-03-10 20:15:14,720][1096443] Updated weights for policy 0, policy_version 23840 (0.0005) [2023-03-10 20:15:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 12206080. Throughput: 0: 12043.7. Samples: 12187780. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:15:14,742][1096160] Avg episode reward: [(0, '4825.390')] [2023-03-10 20:15:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000023840_12206080.pth... [2023-03-10 20:15:14,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000023136_11845632.pth [2023-03-10 20:15:18,281][1096443] Updated weights for policy 0, policy_version 23920 (0.0005) [2023-03-10 20:15:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 12263424. Throughput: 0: 12104.5. Samples: 12259328. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:15:19,742][1096160] Avg episode reward: [(0, '4810.180')] [2023-03-10 20:15:21,529][1096443] Updated weights for policy 0, policy_version 24000 (0.0005) [2023-03-10 20:15:24,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 12324864. Throughput: 0: 12107.2. Samples: 12296256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:15:24,742][1096160] Avg episode reward: [(0, '4806.103')] [2023-03-10 20:15:24,926][1096443] Updated weights for policy 0, policy_version 24080 (0.0005) [2023-03-10 20:15:28,419][1096443] Updated weights for policy 0, policy_version 24160 (0.0005) [2023-03-10 20:15:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 12382208. Throughput: 0: 12006.4. Samples: 12367096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:15:29,742][1096160] Avg episode reward: [(0, '4793.078')] [2023-03-10 20:15:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000024184_12382208.pth... [2023-03-10 20:15:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000023480_12021760.pth [2023-03-10 20:15:31,944][1096443] Updated weights for policy 0, policy_version 24240 (0.0005) [2023-03-10 20:15:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 12443648. Throughput: 0: 11999.1. Samples: 12439552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:15:34,742][1096160] Avg episode reward: [(0, '4824.046')] [2023-03-10 20:15:35,200][1096443] Updated weights for policy 0, policy_version 24320 (0.0005) [2023-03-10 20:15:38,756][1096443] Updated weights for policy 0, policy_version 24400 (0.0005) [2023-03-10 20:15:39,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 12500992. Throughput: 0: 12014.9. Samples: 12476176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:15:39,742][1096160] Avg episode reward: [(0, '4818.286')] [2023-03-10 20:15:42,332][1096443] Updated weights for policy 0, policy_version 24480 (0.0004) [2023-03-10 20:15:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 12562432. Throughput: 0: 11989.7. Samples: 12545016. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:15:44,742][1096160] Avg episode reward: [(0, '4806.715')] [2023-03-10 20:15:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000024536_12562432.pth... [2023-03-10 20:15:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000023840_12206080.pth [2023-03-10 20:15:45,764][1096443] Updated weights for policy 0, policy_version 24560 (0.0005) [2023-03-10 20:15:49,196][1096443] Updated weights for policy 0, policy_version 24640 (0.0005) [2023-03-10 20:15:49,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 12619776. Throughput: 0: 11925.4. Samples: 12615744. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:15:49,742][1096160] Avg episode reward: [(0, '4806.027')] [2023-03-10 20:15:52,675][1096443] Updated weights for policy 0, policy_version 24720 (0.0004) [2023-03-10 20:15:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 12681216. Throughput: 0: 11918.5. Samples: 12652148. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:15:54,742][1096160] Avg episode reward: [(0, '4831.238')] [2023-03-10 20:15:55,962][1096443] Updated weights for policy 0, policy_version 24800 (0.0004) [2023-03-10 20:15:59,287][1096443] Updated weights for policy 0, policy_version 24880 (0.0005) [2023-03-10 20:15:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 12742656. Throughput: 0: 11965.9. Samples: 12726248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:15:59,742][1096160] Avg episode reward: [(0, '4833.841')] [2023-03-10 20:15:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000024888_12742656.pth... [2023-03-10 20:15:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000024184_12382208.pth [2023-03-10 20:16:02,677][1096443] Updated weights for policy 0, policy_version 24960 (0.0005) [2023-03-10 20:16:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 12804096. Throughput: 0: 12003.3. Samples: 12799476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:16:04,742][1096160] Avg episode reward: [(0, '4839.389')] [2023-03-10 20:16:06,025][1096443] Updated weights for policy 0, policy_version 25040 (0.0005) [2023-03-10 20:16:09,338][1096443] Updated weights for policy 0, policy_version 25120 (0.0005) [2023-03-10 20:16:09,742][1096160] Fps is (10 sec: 12288.2, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 12865536. Throughput: 0: 12009.4. Samples: 12836680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:16:09,742][1096160] Avg episode reward: [(0, '4820.703')] [2023-03-10 20:16:12,656][1096443] Updated weights for policy 0, policy_version 25200 (0.0005) [2023-03-10 20:16:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 12926976. Throughput: 0: 12075.6. Samples: 12910500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:16:14,742][1096160] Avg episode reward: [(0, '4833.238')] [2023-03-10 20:16:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000025248_12926976.pth... [2023-03-10 20:16:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000024536_12562432.pth [2023-03-10 20:16:16,045][1096443] Updated weights for policy 0, policy_version 25280 (0.0006) [2023-03-10 20:16:19,571][1096443] Updated weights for policy 0, policy_version 25360 (0.0005) [2023-03-10 20:16:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 12984320. Throughput: 0: 12012.4. Samples: 12980108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:16:19,742][1096160] Avg episode reward: [(0, '4839.854')] [2023-03-10 20:16:23,103][1096443] Updated weights for policy 0, policy_version 25440 (0.0005) [2023-03-10 20:16:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 13041664. Throughput: 0: 11990.0. Samples: 13015728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:16:24,742][1096160] Avg episode reward: [(0, '4839.142')] [2023-03-10 20:16:26,474][1096443] Updated weights for policy 0, policy_version 25520 (0.0005) [2023-03-10 20:16:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 13103104. Throughput: 0: 12044.9. Samples: 13087036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:16:29,742][1096160] Avg episode reward: [(0, '4835.467')] [2023-03-10 20:16:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000025592_13103104.pth... [2023-03-10 20:16:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000024888_12742656.pth [2023-03-10 20:16:29,924][1096443] Updated weights for policy 0, policy_version 25600 (0.0005) [2023-03-10 20:16:33,420][1096443] Updated weights for policy 0, policy_version 25680 (0.0005) [2023-03-10 20:16:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 13160448. Throughput: 0: 12035.7. Samples: 13157352. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:16:34,742][1096160] Avg episode reward: [(0, '4833.933')] [2023-03-10 20:16:36,882][1096443] Updated weights for policy 0, policy_version 25760 (0.0005) [2023-03-10 20:16:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 13221888. Throughput: 0: 12024.3. Samples: 13193240. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:16:39,742][1096160] Avg episode reward: [(0, '4832.723')] [2023-03-10 20:16:40,072][1096443] Updated weights for policy 0, policy_version 25840 (0.0005) [2023-03-10 20:16:43,200][1096443] Updated weights for policy 0, policy_version 25920 (0.0005) [2023-03-10 20:16:44,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 13287424. Throughput: 0: 12107.9. Samples: 13271104. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:16:44,742][1096160] Avg episode reward: [(0, '4843.378')] [2023-03-10 20:16:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000025952_13287424.pth... [2023-03-10 20:16:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000025248_12926976.pth [2023-03-10 20:16:46,588][1096443] Updated weights for policy 0, policy_version 26000 (0.0005) [2023-03-10 20:16:49,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12151.4, 300 sec: 11996.4). Total num frames: 13348864. Throughput: 0: 12116.0. Samples: 13344696. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:16:49,742][1096160] Avg episode reward: [(0, '4838.545')] [2023-03-10 20:16:49,977][1096443] Updated weights for policy 0, policy_version 26080 (0.0005) [2023-03-10 20:16:53,386][1096443] Updated weights for policy 0, policy_version 26160 (0.0006) [2023-03-10 20:16:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 13410304. Throughput: 0: 12078.0. Samples: 13380192. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:16:54,742][1096160] Avg episode reward: [(0, '4830.883')] [2023-03-10 20:16:56,727][1096443] Updated weights for policy 0, policy_version 26240 (0.0005) [2023-03-10 20:16:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 13467648. Throughput: 0: 12029.4. Samples: 13451824. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:16:59,742][1096160] Avg episode reward: [(0, '4834.097')] [2023-03-10 20:16:59,779][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000026312_13471744.pth... [2023-03-10 20:16:59,781][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000025592_13103104.pth [2023-03-10 20:17:00,091][1096443] Updated weights for policy 0, policy_version 26320 (0.0005) [2023-03-10 20:17:03,471][1096443] Updated weights for policy 0, policy_version 26400 (0.0005) [2023-03-10 20:17:04,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 13529088. Throughput: 0: 12139.2. Samples: 13526372. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:17:04,742][1096160] Avg episode reward: [(0, '4838.598')] [2023-03-10 20:17:06,873][1096443] Updated weights for policy 0, policy_version 26480 (0.0005) [2023-03-10 20:17:09,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 13590528. Throughput: 0: 12137.8. Samples: 13561928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:17:09,742][1096160] Avg episode reward: [(0, '4838.503')] [2023-03-10 20:17:10,285][1096443] Updated weights for policy 0, policy_version 26560 (0.0004) [2023-03-10 20:17:13,599][1096443] Updated weights for policy 0, policy_version 26640 (0.0005) [2023-03-10 20:17:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 13651968. Throughput: 0: 12188.7. Samples: 13635528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:17:14,742][1096160] Avg episode reward: [(0, '4786.457')] [2023-03-10 20:17:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000026664_13651968.pth... [2023-03-10 20:17:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000025952_13287424.pth [2023-03-10 20:17:16,989][1096443] Updated weights for policy 0, policy_version 26720 (0.0005) [2023-03-10 20:17:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 13709312. Throughput: 0: 12195.0. Samples: 13706128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:17:19,742][1096160] Avg episode reward: [(0, '4846.496')] [2023-03-10 20:17:20,556][1096443] Updated weights for policy 0, policy_version 26800 (0.0005) [2023-03-10 20:17:23,901][1096443] Updated weights for policy 0, policy_version 26880 (0.0005) [2023-03-10 20:17:24,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 13770752. Throughput: 0: 12197.9. Samples: 13742144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:17:24,742][1096160] Avg episode reward: [(0, '4839.548')] [2023-03-10 20:17:27,173][1096443] Updated weights for policy 0, policy_version 26960 (0.0004) [2023-03-10 20:17:29,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12038.1). Total num frames: 13836288. Throughput: 0: 12193.8. Samples: 13819824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:17:29,742][1096160] Avg episode reward: [(0, '4843.392')] [2023-03-10 20:17:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000027024_13836288.pth... [2023-03-10 20:17:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000026312_13471744.pth [2023-03-10 20:17:30,298][1096443] Updated weights for policy 0, policy_version 27040 (0.0005) [2023-03-10 20:17:33,713][1096443] Updated weights for policy 0, policy_version 27120 (0.0005) [2023-03-10 20:17:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 13893632. Throughput: 0: 12145.7. Samples: 13891252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:17:34,742][1096160] Avg episode reward: [(0, '4829.239')] [2023-03-10 20:17:37,181][1096443] Updated weights for policy 0, policy_version 27200 (0.0004) [2023-03-10 20:17:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 13955072. Throughput: 0: 12139.6. Samples: 13926472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:17:39,742][1096160] Avg episode reward: [(0, '4839.121')] [2023-03-10 20:17:40,531][1096443] Updated weights for policy 0, policy_version 27280 (0.0005) [2023-03-10 20:17:43,833][1096443] Updated weights for policy 0, policy_version 27360 (0.0005) [2023-03-10 20:17:44,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 14016512. Throughput: 0: 12192.9. Samples: 14000504. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:17:44,742][1096160] Avg episode reward: [(0, '4844.909')] [2023-03-10 20:17:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000027376_14016512.pth... [2023-03-10 20:17:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000026664_13651968.pth [2023-03-10 20:17:47,229][1096443] Updated weights for policy 0, policy_version 27440 (0.0004) [2023-03-10 20:17:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 14073856. Throughput: 0: 12106.1. Samples: 14071148. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:17:49,742][1096160] Avg episode reward: [(0, '4837.416')] [2023-03-10 20:17:50,842][1096443] Updated weights for policy 0, policy_version 27520 (0.0004) [2023-03-10 20:17:54,055][1096443] Updated weights for policy 0, policy_version 27600 (0.0004) [2023-03-10 20:17:54,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 14139392. Throughput: 0: 12180.3. Samples: 14110040. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:17:54,742][1096160] Avg episode reward: [(0, '4835.764')] [2023-03-10 20:17:57,528][1096443] Updated weights for policy 0, policy_version 27680 (0.0004) [2023-03-10 20:17:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 14196736. Throughput: 0: 12113.3. Samples: 14180624. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:17:59,742][1096160] Avg episode reward: [(0, '4840.333')] [2023-03-10 20:17:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000027728_14196736.pth... [2023-03-10 20:17:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000027024_13836288.pth [2023-03-10 20:18:00,763][1096443] Updated weights for policy 0, policy_version 27760 (0.0005) [2023-03-10 20:18:04,151][1096443] Updated weights for policy 0, policy_version 27840 (0.0004) [2023-03-10 20:18:04,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 14258176. Throughput: 0: 12187.1. Samples: 14254548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:18:04,742][1096160] Avg episode reward: [(0, '4839.410')] [2023-03-10 20:18:07,616][1096443] Updated weights for policy 0, policy_version 27920 (0.0005) [2023-03-10 20:18:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 14319616. Throughput: 0: 12193.7. Samples: 14290860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:18:09,742][1096160] Avg episode reward: [(0, '4844.447')] [2023-03-10 20:18:11,046][1096443] Updated weights for policy 0, policy_version 28000 (0.0005) [2023-03-10 20:18:14,483][1096443] Updated weights for policy 0, policy_version 28080 (0.0005) [2023-03-10 20:18:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 14376960. Throughput: 0: 12045.4. Samples: 14361868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:18:14,742][1096160] Avg episode reward: [(0, '4829.659')] [2023-03-10 20:18:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000028080_14376960.pth... [2023-03-10 20:18:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000027376_14016512.pth [2023-03-10 20:18:17,945][1096443] Updated weights for policy 0, policy_version 28160 (0.0005) [2023-03-10 20:18:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 14438400. Throughput: 0: 12058.8. Samples: 14433900. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:18:19,742][1096160] Avg episode reward: [(0, '4849.281')] [2023-03-10 20:18:21,433][1096443] Updated weights for policy 0, policy_version 28240 (0.0005) [2023-03-10 20:18:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 14495744. Throughput: 0: 12013.5. Samples: 14467080. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:18:24,742][1096160] Avg episode reward: [(0, '4843.409')] [2023-03-10 20:18:24,938][1096443] Updated weights for policy 0, policy_version 28320 (0.0004) [2023-03-10 20:18:28,318][1096443] Updated weights for policy 0, policy_version 28400 (0.0004) [2023-03-10 20:18:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 14557184. Throughput: 0: 11996.7. Samples: 14540356. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:18:29,742][1096160] Avg episode reward: [(0, '4855.502')] [2023-03-10 20:18:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000028432_14557184.pth... [2023-03-10 20:18:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000027728_14196736.pth [2023-03-10 20:18:31,768][1096443] Updated weights for policy 0, policy_version 28480 (0.0005) [2023-03-10 20:18:34,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12015.0, 300 sec: 12052.0). Total num frames: 14614528. Throughput: 0: 11984.1. Samples: 14610432. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:18:34,742][1096160] Avg episode reward: [(0, '4846.848')] [2023-03-10 20:18:35,306][1096443] Updated weights for policy 0, policy_version 28560 (0.0004) [2023-03-10 20:18:38,473][1096443] Updated weights for policy 0, policy_version 28640 (0.0005) [2023-03-10 20:18:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 14675968. Throughput: 0: 11955.7. Samples: 14648044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:18:39,742][1096160] Avg episode reward: [(0, '4841.901')] [2023-03-10 20:18:41,879][1096443] Updated weights for policy 0, policy_version 28720 (0.0005) [2023-03-10 20:18:44,741][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 14737408. Throughput: 0: 12010.5. Samples: 14721096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:18:44,754][1096160] Avg episode reward: [(0, '4850.598')] [2023-03-10 20:18:44,756][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000028784_14737408.pth... [2023-03-10 20:18:44,758][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000028080_14376960.pth [2023-03-10 20:18:45,242][1096443] Updated weights for policy 0, policy_version 28800 (0.0005) [2023-03-10 20:18:48,702][1096443] Updated weights for policy 0, policy_version 28880 (0.0005) [2023-03-10 20:18:49,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 14798848. Throughput: 0: 11953.7. Samples: 14792464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:18:49,752][1096160] Avg episode reward: [(0, '4841.917')] [2023-03-10 20:18:52,228][1096443] Updated weights for policy 0, policy_version 28960 (0.0005) [2023-03-10 20:18:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 14856192. Throughput: 0: 11927.2. Samples: 14827584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:18:54,753][1096160] Avg episode reward: [(0, '4847.520')] [2023-03-10 20:18:55,648][1096443] Updated weights for policy 0, policy_version 29040 (0.0005) [2023-03-10 20:18:59,005][1096443] Updated weights for policy 0, policy_version 29120 (0.0005) [2023-03-10 20:18:59,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 14917632. Throughput: 0: 11954.7. Samples: 14899832. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:18:59,742][1096160] Avg episode reward: [(0, '4847.488')] [2023-03-10 20:18:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000029136_14917632.pth... [2023-03-10 20:18:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000028432_14557184.pth [2023-03-10 20:19:02,461][1096443] Updated weights for policy 0, policy_version 29200 (0.0005) [2023-03-10 20:19:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 14979072. Throughput: 0: 11944.9. Samples: 14971420. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:19:04,742][1096160] Avg episode reward: [(0, '4851.302')] [2023-03-10 20:19:05,697][1096443] Updated weights for policy 0, policy_version 29280 (0.0005) [2023-03-10 20:19:09,153][1096443] Updated weights for policy 0, policy_version 29360 (0.0005) [2023-03-10 20:19:09,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 15036416. Throughput: 0: 12026.9. Samples: 15008288. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:19:09,742][1096160] Avg episode reward: [(0, '4847.249')] [2023-03-10 20:19:12,615][1096443] Updated weights for policy 0, policy_version 29440 (0.0004) [2023-03-10 20:19:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 15097856. Throughput: 0: 12024.0. Samples: 15081436. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:19:14,742][1096160] Avg episode reward: [(0, '4850.323')] [2023-03-10 20:19:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000029488_15097856.pth... [2023-03-10 20:19:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000028784_14737408.pth [2023-03-10 20:19:15,930][1096443] Updated weights for policy 0, policy_version 29520 (0.0005) [2023-03-10 20:19:19,362][1096443] Updated weights for policy 0, policy_version 29600 (0.0004) [2023-03-10 20:19:19,741][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 15155200. Throughput: 0: 12077.4. Samples: 15153916. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:19:19,742][1096160] Avg episode reward: [(0, '4844.727')] [2023-03-10 20:19:22,912][1096443] Updated weights for policy 0, policy_version 29680 (0.0005) [2023-03-10 20:19:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 15216640. Throughput: 0: 11999.7. Samples: 15188032. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:19:24,742][1096160] Avg episode reward: [(0, '4804.020')] [2023-03-10 20:19:26,217][1096443] Updated weights for policy 0, policy_version 29760 (0.0005) [2023-03-10 20:19:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 15273984. Throughput: 0: 12012.5. Samples: 15261660. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:19:29,742][1096160] Avg episode reward: [(0, '4820.783')] [2023-03-10 20:19:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000029832_15273984.pth... [2023-03-10 20:19:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000029136_14917632.pth [2023-03-10 20:19:29,795][1096443] Updated weights for policy 0, policy_version 29840 (0.0005) [2023-03-10 20:19:33,172][1096443] Updated weights for policy 0, policy_version 29920 (0.0004) [2023-03-10 20:19:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 15335424. Throughput: 0: 11975.0. Samples: 15331340. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:19:34,742][1096160] Avg episode reward: [(0, '4850.372')] [2023-03-10 20:19:36,625][1096443] Updated weights for policy 0, policy_version 30000 (0.0005) [2023-03-10 20:19:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 15392768. Throughput: 0: 12004.3. Samples: 15367776. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:19:39,752][1096160] Avg episode reward: [(0, '4847.896')] [2023-03-10 20:19:40,284][1096443] Updated weights for policy 0, policy_version 30080 (0.0005) [2023-03-10 20:19:43,675][1096443] Updated weights for policy 0, policy_version 30160 (0.0005) [2023-03-10 20:19:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 15450112. Throughput: 0: 11932.8. Samples: 15436808. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:19:44,742][1096160] Avg episode reward: [(0, '4845.628')] [2023-03-10 20:19:44,776][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000030184_15454208.pth... [2023-03-10 20:19:44,778][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000029488_15097856.pth [2023-03-10 20:19:47,060][1096443] Updated weights for policy 0, policy_version 30240 (0.0005) [2023-03-10 20:19:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 15511552. Throughput: 0: 11933.5. Samples: 15508428. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:19:49,742][1096160] Avg episode reward: [(0, '4825.100')] [2023-03-10 20:19:50,566][1096443] Updated weights for policy 0, policy_version 30320 (0.0005) [2023-03-10 20:19:53,954][1096443] Updated weights for policy 0, policy_version 30400 (0.0005) [2023-03-10 20:19:54,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 15572992. Throughput: 0: 11910.7. Samples: 15544272. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:19:54,742][1096160] Avg episode reward: [(0, '4819.059')] [2023-03-10 20:19:57,480][1096443] Updated weights for policy 0, policy_version 30480 (0.0005) [2023-03-10 20:19:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 15630336. Throughput: 0: 11855.0. Samples: 15614912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:19:59,742][1096160] Avg episode reward: [(0, '4852.539')] [2023-03-10 20:19:59,754][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000030536_15634432.pth... [2023-03-10 20:19:59,756][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000029832_15273984.pth [2023-03-10 20:20:00,843][1096443] Updated weights for policy 0, policy_version 30560 (0.0005) [2023-03-10 20:20:04,243][1096443] Updated weights for policy 0, policy_version 30640 (0.0005) [2023-03-10 20:20:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 15691776. Throughput: 0: 11858.4. Samples: 15687544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:20:04,742][1096160] Avg episode reward: [(0, '4850.349')] [2023-03-10 20:20:07,825][1096443] Updated weights for policy 0, policy_version 30720 (0.0005) [2023-03-10 20:20:09,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 15749120. Throughput: 0: 11854.0. Samples: 15721460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:20:09,742][1096160] Avg episode reward: [(0, '4852.816')] [2023-03-10 20:20:10,951][1096443] Updated weights for policy 0, policy_version 30800 (0.0005) [2023-03-10 20:20:14,390][1096443] Updated weights for policy 0, policy_version 30880 (0.0005) [2023-03-10 20:20:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 15814656. Throughput: 0: 11886.0. Samples: 15796532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:20:14,742][1096160] Avg episode reward: [(0, '4847.651')] [2023-03-10 20:20:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000030888_15814656.pth... [2023-03-10 20:20:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000030184_15454208.pth [2023-03-10 20:20:17,745][1096443] Updated weights for policy 0, policy_version 30960 (0.0004) [2023-03-10 20:20:19,741][1096160] Fps is (10 sec: 12697.6, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 15876096. Throughput: 0: 11991.2. Samples: 15870944. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:20:19,742][1096160] Avg episode reward: [(0, '4847.354')] [2023-03-10 20:20:20,921][1096443] Updated weights for policy 0, policy_version 31040 (0.0004) [2023-03-10 20:20:24,131][1096443] Updated weights for policy 0, policy_version 31120 (0.0005) [2023-03-10 20:20:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 15937536. Throughput: 0: 12057.7. Samples: 15910372. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:20:24,742][1096160] Avg episode reward: [(0, '4854.396')] [2023-03-10 20:20:27,655][1096443] Updated weights for policy 0, policy_version 31200 (0.0005) [2023-03-10 20:20:29,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 15998976. Throughput: 0: 12095.9. Samples: 15981120. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:20:29,742][1096160] Avg episode reward: [(0, '4851.559')] [2023-03-10 20:20:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000031248_15998976.pth... [2023-03-10 20:20:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000030536_15634432.pth [2023-03-10 20:20:31,015][1096443] Updated weights for policy 0, policy_version 31280 (0.0005) [2023-03-10 20:20:34,270][1096443] Updated weights for policy 0, policy_version 31360 (0.0005) [2023-03-10 20:20:34,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 16060416. Throughput: 0: 12172.2. Samples: 16056176. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:20:34,742][1096160] Avg episode reward: [(0, '4852.030')] [2023-03-10 20:20:37,753][1096443] Updated weights for policy 0, policy_version 31440 (0.0005) [2023-03-10 20:20:39,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 16117760. Throughput: 0: 12161.1. Samples: 16091520. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:20:39,742][1096160] Avg episode reward: [(0, '4854.138')] [2023-03-10 20:20:41,231][1096443] Updated weights for policy 0, policy_version 31520 (0.0005) [2023-03-10 20:20:44,376][1096443] Updated weights for policy 0, policy_version 31600 (0.0005) [2023-03-10 20:20:44,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 16183296. Throughput: 0: 12218.5. Samples: 16164744. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:20:44,742][1096160] Avg episode reward: [(0, '4851.001')] [2023-03-10 20:20:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000031608_16183296.pth... [2023-03-10 20:20:44,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000030888_15814656.pth [2023-03-10 20:20:47,754][1096443] Updated weights for policy 0, policy_version 31680 (0.0005) [2023-03-10 20:20:49,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 16240640. Throughput: 0: 12216.1. Samples: 16237268. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:20:49,742][1096160] Avg episode reward: [(0, '4854.929')] [2023-03-10 20:20:51,200][1096443] Updated weights for policy 0, policy_version 31760 (0.0005) [2023-03-10 20:20:54,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 16297984. Throughput: 0: 12265.0. Samples: 16273388. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:20:54,742][1096160] Avg episode reward: [(0, '4855.732')] [2023-03-10 20:20:54,788][1096443] Updated weights for policy 0, policy_version 31840 (0.0004) [2023-03-10 20:20:58,163][1096443] Updated weights for policy 0, policy_version 31920 (0.0005) [2023-03-10 20:20:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 16359424. Throughput: 0: 12151.5. Samples: 16343348. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:20:59,742][1096160] Avg episode reward: [(0, '4854.631')] [2023-03-10 20:20:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000031952_16359424.pth... [2023-03-10 20:20:59,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000031248_15998976.pth [2023-03-10 20:21:01,651][1096443] Updated weights for policy 0, policy_version 32000 (0.0005) [2023-03-10 20:21:04,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 16420864. Throughput: 0: 12127.1. Samples: 16416664. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:21:04,742][1096160] Avg episode reward: [(0, '4852.121')] [2023-03-10 20:21:04,928][1096443] Updated weights for policy 0, policy_version 32080 (0.0005) [2023-03-10 20:21:08,281][1096443] Updated weights for policy 0, policy_version 32160 (0.0005) [2023-03-10 20:21:09,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12052.0). Total num frames: 16482304. Throughput: 0: 12072.5. Samples: 16453632. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:21:09,742][1096160] Avg episode reward: [(0, '4856.991')] [2023-03-10 20:21:11,727][1096443] Updated weights for policy 0, policy_version 32240 (0.0005) [2023-03-10 20:21:14,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 16539648. Throughput: 0: 12111.6. Samples: 16526144. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:21:14,742][1096160] Avg episode reward: [(0, '4857.996')] [2023-03-10 20:21:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000032304_16539648.pth... [2023-03-10 20:21:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000031608_16183296.pth [2023-03-10 20:21:15,114][1096443] Updated weights for policy 0, policy_version 32320 (0.0005) [2023-03-10 20:21:18,582][1096443] Updated weights for policy 0, policy_version 32400 (0.0005) [2023-03-10 20:21:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 16601088. Throughput: 0: 12010.6. Samples: 16596652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:21:19,742][1096160] Avg episode reward: [(0, '4851.411')] [2023-03-10 20:21:22,035][1096443] Updated weights for policy 0, policy_version 32480 (0.0004) [2023-03-10 20:21:24,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 16662528. Throughput: 0: 12014.7. Samples: 16632184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:21:24,742][1096160] Avg episode reward: [(0, '4847.202')] [2023-03-10 20:21:25,351][1096443] Updated weights for policy 0, policy_version 32560 (0.0005) [2023-03-10 20:21:28,815][1096443] Updated weights for policy 0, policy_version 32640 (0.0005) [2023-03-10 20:21:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 16719872. Throughput: 0: 11995.9. Samples: 16704560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:21:29,756][1096160] Avg episode reward: [(0, '4832.169')] [2023-03-10 20:21:29,759][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000032656_16719872.pth... [2023-03-10 20:21:29,761][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000031952_16359424.pth [2023-03-10 20:21:32,413][1096443] Updated weights for policy 0, policy_version 32720 (0.0005) [2023-03-10 20:21:34,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 16777216. Throughput: 0: 11909.2. Samples: 16773184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:21:34,742][1096160] Avg episode reward: [(0, '4845.565')] [2023-03-10 20:21:36,024][1096443] Updated weights for policy 0, policy_version 32800 (0.0005) [2023-03-10 20:21:39,520][1096443] Updated weights for policy 0, policy_version 32880 (0.0005) [2023-03-10 20:21:39,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.6, 300 sec: 12024.2). Total num frames: 16834560. Throughput: 0: 11869.1. Samples: 16807496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:21:39,742][1096160] Avg episode reward: [(0, '4846.258')] [2023-03-10 20:21:42,856][1096443] Updated weights for policy 0, policy_version 32960 (0.0005) [2023-03-10 20:21:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 16896000. Throughput: 0: 11933.4. Samples: 16880352. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:21:44,742][1096160] Avg episode reward: [(0, '4827.570')] [2023-03-10 20:21:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000033000_16896000.pth... [2023-03-10 20:21:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000032304_16539648.pth [2023-03-10 20:21:46,161][1096443] Updated weights for policy 0, policy_version 33040 (0.0005) [2023-03-10 20:21:49,693][1096443] Updated weights for policy 0, policy_version 33120 (0.0005) [2023-03-10 20:21:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.6, 300 sec: 12024.2). Total num frames: 16957440. Throughput: 0: 11923.7. Samples: 16953232. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:21:49,742][1096160] Avg episode reward: [(0, '4800.400')] [2023-03-10 20:21:53,123][1096443] Updated weights for policy 0, policy_version 33200 (0.0005) [2023-03-10 20:21:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 17014784. Throughput: 0: 11842.4. Samples: 16986540. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:21:54,742][1096160] Avg episode reward: [(0, '4790.561')] [2023-03-10 20:21:56,644][1096443] Updated weights for policy 0, policy_version 33280 (0.0005) [2023-03-10 20:21:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.6, 300 sec: 12024.2). Total num frames: 17076224. Throughput: 0: 11864.5. Samples: 17060048. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:21:59,742][1096160] Avg episode reward: [(0, '4808.699')] [2023-03-10 20:21:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000033352_17076224.pth... [2023-03-10 20:21:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000032656_16719872.pth [2023-03-10 20:21:59,926][1096443] Updated weights for policy 0, policy_version 33360 (0.0005) [2023-03-10 20:22:03,497][1096443] Updated weights for policy 0, policy_version 33440 (0.0005) [2023-03-10 20:22:04,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 17133568. Throughput: 0: 11840.6. Samples: 17129480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:22:04,742][1096160] Avg episode reward: [(0, '4840.107')] [2023-03-10 20:22:07,080][1096443] Updated weights for policy 0, policy_version 33520 (0.0005) [2023-03-10 20:22:09,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11810.1, 300 sec: 11996.4). Total num frames: 17190912. Throughput: 0: 11813.0. Samples: 17163768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:22:09,742][1096160] Avg episode reward: [(0, '4844.219')] [2023-03-10 20:22:10,445][1096443] Updated weights for policy 0, policy_version 33600 (0.0005) [2023-03-10 20:22:13,920][1096443] Updated weights for policy 0, policy_version 33680 (0.0005) [2023-03-10 20:22:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 17252352. Throughput: 0: 11809.3. Samples: 17235976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:22:14,742][1096160] Avg episode reward: [(0, '4842.623')] [2023-03-10 20:22:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000033696_17252352.pth... [2023-03-10 20:22:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000033000_16896000.pth [2023-03-10 20:22:17,363][1096443] Updated weights for policy 0, policy_version 33760 (0.0005) [2023-03-10 20:22:19,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.2, 300 sec: 11996.4). Total num frames: 17309696. Throughput: 0: 11850.3. Samples: 17306448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:22:19,742][1096160] Avg episode reward: [(0, '4839.251')] [2023-03-10 20:22:20,775][1096443] Updated weights for policy 0, policy_version 33840 (0.0005) [2023-03-10 20:22:24,391][1096443] Updated weights for policy 0, policy_version 33920 (0.0005) [2023-03-10 20:22:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 17371136. Throughput: 0: 11887.9. Samples: 17342452. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:22:24,742][1096160] Avg episode reward: [(0, '4841.178')] [2023-03-10 20:22:27,863][1096443] Updated weights for policy 0, policy_version 34000 (0.0005) [2023-03-10 20:22:29,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 17428480. Throughput: 0: 11817.9. Samples: 17412160. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:22:29,742][1096160] Avg episode reward: [(0, '4830.029')] [2023-03-10 20:22:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000034040_17428480.pth... [2023-03-10 20:22:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000033352_17076224.pth [2023-03-10 20:22:31,095][1096443] Updated weights for policy 0, policy_version 34080 (0.0004) [2023-03-10 20:22:34,320][1096443] Updated weights for policy 0, policy_version 34160 (0.0005) [2023-03-10 20:22:34,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 17494016. Throughput: 0: 11914.3. Samples: 17489376. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:22:34,742][1096160] Avg episode reward: [(0, '4847.211')] [2023-03-10 20:22:37,791][1096443] Updated weights for policy 0, policy_version 34240 (0.0004) [2023-03-10 20:22:39,741][1096160] Fps is (10 sec: 12697.8, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 17555456. Throughput: 0: 11935.1. Samples: 17523620. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:22:39,742][1096160] Avg episode reward: [(0, '4845.032')] [2023-03-10 20:22:41,010][1096443] Updated weights for policy 0, policy_version 34320 (0.0005) [2023-03-10 20:22:44,180][1096443] Updated weights for policy 0, policy_version 34400 (0.0005) [2023-03-10 20:22:44,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 17616896. Throughput: 0: 12006.1. Samples: 17600320. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:22:44,742][1096160] Avg episode reward: [(0, '4845.836')] [2023-03-10 20:22:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000034408_17616896.pth... [2023-03-10 20:22:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000033696_17252352.pth [2023-03-10 20:22:47,866][1096443] Updated weights for policy 0, policy_version 34480 (0.0004) [2023-03-10 20:22:49,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 17674240. Throughput: 0: 12016.2. Samples: 17670208. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:22:49,742][1096160] Avg episode reward: [(0, '4852.301')] [2023-03-10 20:22:51,250][1096443] Updated weights for policy 0, policy_version 34560 (0.0005) [2023-03-10 20:22:54,636][1096443] Updated weights for policy 0, policy_version 34640 (0.0005) [2023-03-10 20:22:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 17735680. Throughput: 0: 12071.7. Samples: 17706996. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:22:54,742][1096160] Avg episode reward: [(0, '4837.673')] [2023-03-10 20:22:58,229][1096443] Updated weights for policy 0, policy_version 34720 (0.0005) [2023-03-10 20:22:59,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 17793024. Throughput: 0: 12014.9. Samples: 17776648. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:22:59,742][1096160] Avg episode reward: [(0, '4832.190')] [2023-03-10 20:22:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000034752_17793024.pth... [2023-03-10 20:22:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000034040_17428480.pth [2023-03-10 20:23:01,774][1096443] Updated weights for policy 0, policy_version 34800 (0.0005) [2023-03-10 20:23:04,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 17850368. Throughput: 0: 12004.5. Samples: 17846652. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:23:04,742][1096160] Avg episode reward: [(0, '4824.134')] [2023-03-10 20:23:05,178][1096443] Updated weights for policy 0, policy_version 34880 (0.0005) [2023-03-10 20:23:08,676][1096443] Updated weights for policy 0, policy_version 34960 (0.0005) [2023-03-10 20:23:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 17911808. Throughput: 0: 12015.4. Samples: 17883144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:23:09,742][1096160] Avg episode reward: [(0, '4840.680')] [2023-03-10 20:23:12,054][1096443] Updated weights for policy 0, policy_version 35040 (0.0004) [2023-03-10 20:23:14,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 17969152. Throughput: 0: 12015.0. Samples: 17952832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:23:14,742][1096160] Avg episode reward: [(0, '4837.943')] [2023-03-10 20:23:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000035096_17969152.pth... [2023-03-10 20:23:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000034408_17616896.pth [2023-03-10 20:23:15,554][1096443] Updated weights for policy 0, policy_version 35120 (0.0005) [2023-03-10 20:23:18,891][1096443] Updated weights for policy 0, policy_version 35200 (0.0005) [2023-03-10 20:23:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 18030592. Throughput: 0: 11937.4. Samples: 18026560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:23:19,742][1096160] Avg episode reward: [(0, '4850.676')] [2023-03-10 20:23:22,263][1096443] Updated weights for policy 0, policy_version 35280 (0.0005) [2023-03-10 20:23:24,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 18092032. Throughput: 0: 11993.0. Samples: 18063304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:23:24,742][1096160] Avg episode reward: [(0, '4841.422')] [2023-03-10 20:23:25,656][1096443] Updated weights for policy 0, policy_version 35360 (0.0005) [2023-03-10 20:23:29,147][1096443] Updated weights for policy 0, policy_version 35440 (0.0004) [2023-03-10 20:23:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 18149376. Throughput: 0: 11857.5. Samples: 18133908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:23:29,742][1096160] Avg episode reward: [(0, '4855.409')] [2023-03-10 20:23:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000035448_18149376.pth... [2023-03-10 20:23:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000034752_17793024.pth [2023-03-10 20:23:32,645][1096443] Updated weights for policy 0, policy_version 35520 (0.0004) [2023-03-10 20:23:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11982.5). Total num frames: 18210816. Throughput: 0: 11866.6. Samples: 18204204. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:23:34,742][1096160] Avg episode reward: [(0, '4853.727')] [2023-03-10 20:23:36,224][1096443] Updated weights for policy 0, policy_version 35600 (0.0005) [2023-03-10 20:23:39,650][1096443] Updated weights for policy 0, policy_version 35680 (0.0005) [2023-03-10 20:23:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11968.6). Total num frames: 18268160. Throughput: 0: 11832.0. Samples: 18239436. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:23:39,742][1096160] Avg episode reward: [(0, '4859.052')] [2023-03-10 20:23:43,053][1096443] Updated weights for policy 0, policy_version 35760 (0.0005) [2023-03-10 20:23:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11954.8). Total num frames: 18325504. Throughput: 0: 11883.6. Samples: 18311408. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:23:44,742][1096160] Avg episode reward: [(0, '4856.740')] [2023-03-10 20:23:44,785][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000035800_18329600.pth... [2023-03-10 20:23:44,787][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000035096_17969152.pth [2023-03-10 20:23:46,414][1096443] Updated weights for policy 0, policy_version 35840 (0.0005) [2023-03-10 20:23:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 18386944. Throughput: 0: 11913.7. Samples: 18382768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:23:49,742][1096160] Avg episode reward: [(0, '4854.695')] [2023-03-10 20:23:49,929][1096443] Updated weights for policy 0, policy_version 35920 (0.0005) [2023-03-10 20:23:53,426][1096443] Updated weights for policy 0, policy_version 36000 (0.0005) [2023-03-10 20:23:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11954.8). Total num frames: 18444288. Throughput: 0: 11896.0. Samples: 18418464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:23:54,742][1096160] Avg episode reward: [(0, '4844.574')] [2023-03-10 20:23:56,902][1096443] Updated weights for policy 0, policy_version 36080 (0.0005) [2023-03-10 20:23:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 18505728. Throughput: 0: 11922.5. Samples: 18489344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:23:59,742][1096160] Avg episode reward: [(0, '4856.220')] [2023-03-10 20:23:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000036144_18505728.pth... [2023-03-10 20:23:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000035448_18149376.pth [2023-03-10 20:24:00,291][1096443] Updated weights for policy 0, policy_version 36160 (0.0005) [2023-03-10 20:24:03,547][1096443] Updated weights for policy 0, policy_version 36240 (0.0005) [2023-03-10 20:24:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 18567168. Throughput: 0: 11922.8. Samples: 18563084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:24:04,742][1096160] Avg episode reward: [(0, '4856.675')] [2023-03-10 20:24:06,874][1096443] Updated weights for policy 0, policy_version 36320 (0.0004) [2023-03-10 20:24:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 18628608. Throughput: 0: 11926.6. Samples: 18600000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:24:09,742][1096160] Avg episode reward: [(0, '4852.989')] [2023-03-10 20:24:10,265][1096443] Updated weights for policy 0, policy_version 36400 (0.0005) [2023-03-10 20:24:13,624][1096443] Updated weights for policy 0, policy_version 36480 (0.0005) [2023-03-10 20:24:14,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 18690048. Throughput: 0: 11993.1. Samples: 18673600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:24:14,742][1096160] Avg episode reward: [(0, '4852.097')] [2023-03-10 20:24:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000036504_18690048.pth... [2023-03-10 20:24:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000035800_18329600.pth [2023-03-10 20:24:17,019][1096443] Updated weights for policy 0, policy_version 36560 (0.0005) [2023-03-10 20:24:19,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 18751488. Throughput: 0: 12033.2. Samples: 18745696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:24:19,742][1096160] Avg episode reward: [(0, '4856.660')] [2023-03-10 20:24:20,331][1096443] Updated weights for policy 0, policy_version 36640 (0.0005) [2023-03-10 20:24:23,692][1096443] Updated weights for policy 0, policy_version 36720 (0.0004) [2023-03-10 20:24:24,742][1096160] Fps is (10 sec: 12288.2, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 18812928. Throughput: 0: 12083.9. Samples: 18783212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:24:24,742][1096160] Avg episode reward: [(0, '4857.549')] [2023-03-10 20:24:27,110][1096443] Updated weights for policy 0, policy_version 36800 (0.0004) [2023-03-10 20:24:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 18870272. Throughput: 0: 12058.6. Samples: 18854044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:24:29,742][1096160] Avg episode reward: [(0, '4848.764')] [2023-03-10 20:24:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000036856_18870272.pth... [2023-03-10 20:24:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000036144_18505728.pth [2023-03-10 20:24:30,650][1096443] Updated weights for policy 0, policy_version 36880 (0.0004) [2023-03-10 20:24:34,064][1096443] Updated weights for policy 0, policy_version 36960 (0.0005) [2023-03-10 20:24:34,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12015.0, 300 sec: 11996.4). Total num frames: 18931712. Throughput: 0: 12064.0. Samples: 18925648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:24:34,742][1096160] Avg episode reward: [(0, '4857.477')] [2023-03-10 20:24:37,022][1096443] Updated weights for policy 0, policy_version 37040 (0.0005) [2023-03-10 20:24:39,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 18997248. Throughput: 0: 12204.2. Samples: 18967656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:24:39,743][1096160] Avg episode reward: [(0, '4854.977')] [2023-03-10 20:24:40,470][1096443] Updated weights for policy 0, policy_version 37120 (0.0005) [2023-03-10 20:24:43,951][1096443] Updated weights for policy 0, policy_version 37200 (0.0005) [2023-03-10 20:24:44,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 19054592. Throughput: 0: 12197.2. Samples: 19038220. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:24:44,742][1096160] Avg episode reward: [(0, '4861.938')] [2023-03-10 20:24:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000037216_19054592.pth... [2023-03-10 20:24:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000036504_18690048.pth [2023-03-10 20:24:44,748][1096399] Saving new best policy, reward=4861.938! [2023-03-10 20:24:47,302][1096443] Updated weights for policy 0, policy_version 37280 (0.0005) [2023-03-10 20:24:49,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 19116032. Throughput: 0: 12153.4. Samples: 19109984. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:24:49,742][1096160] Avg episode reward: [(0, '4858.146')] [2023-03-10 20:24:50,765][1096443] Updated weights for policy 0, policy_version 37360 (0.0005) [2023-03-10 20:24:53,820][1096443] Updated weights for policy 0, policy_version 37440 (0.0005) [2023-03-10 20:24:54,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 19177472. Throughput: 0: 12193.3. Samples: 19148696. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:24:54,742][1096160] Avg episode reward: [(0, '4855.206')] [2023-03-10 20:24:57,187][1096443] Updated weights for policy 0, policy_version 37520 (0.0005) [2023-03-10 20:24:59,742][1096160] Fps is (10 sec: 12287.7, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 19238912. Throughput: 0: 12198.6. Samples: 19222536. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:24:59,742][1096160] Avg episode reward: [(0, '4857.725')] [2023-03-10 20:24:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000037576_19238912.pth... [2023-03-10 20:24:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000036856_18870272.pth [2023-03-10 20:25:00,626][1096443] Updated weights for policy 0, policy_version 37600 (0.0005) [2023-03-10 20:25:04,137][1096443] Updated weights for policy 0, policy_version 37680 (0.0005) [2023-03-10 20:25:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 19296256. Throughput: 0: 12158.9. Samples: 19292844. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:25:04,742][1096160] Avg episode reward: [(0, '4858.019')] [2023-03-10 20:25:07,540][1096443] Updated weights for policy 0, policy_version 37760 (0.0005) [2023-03-10 20:25:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 19357696. Throughput: 0: 12130.6. Samples: 19329088. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:25:09,742][1096160] Avg episode reward: [(0, '4855.623')] [2023-03-10 20:25:10,973][1096443] Updated weights for policy 0, policy_version 37840 (0.0005) [2023-03-10 20:25:14,426][1096443] Updated weights for policy 0, policy_version 37920 (0.0004) [2023-03-10 20:25:14,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 19415040. Throughput: 0: 12148.2. Samples: 19400712. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:25:14,742][1096160] Avg episode reward: [(0, '4856.341')] [2023-03-10 20:25:14,747][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000037928_19419136.pth... [2023-03-10 20:25:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000037216_19054592.pth [2023-03-10 20:25:17,907][1096443] Updated weights for policy 0, policy_version 38000 (0.0005) [2023-03-10 20:25:19,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 19476480. Throughput: 0: 12131.7. Samples: 19471576. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:25:19,742][1096160] Avg episode reward: [(0, '4858.786')] [2023-03-10 20:25:21,435][1096443] Updated weights for policy 0, policy_version 38080 (0.0005) [2023-03-10 20:25:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 19533824. Throughput: 0: 11951.4. Samples: 19505468. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:25:24,742][1096160] Avg episode reward: [(0, '4853.049')] [2023-03-10 20:25:24,862][1096443] Updated weights for policy 0, policy_version 38160 (0.0005) [2023-03-10 20:25:28,071][1096443] Updated weights for policy 0, policy_version 38240 (0.0005) [2023-03-10 20:25:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 19595264. Throughput: 0: 12053.2. Samples: 19580612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:25:29,742][1096160] Avg episode reward: [(0, '4846.153')] [2023-03-10 20:25:29,759][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000038280_19599360.pth... [2023-03-10 20:25:29,762][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000037576_19238912.pth [2023-03-10 20:25:31,570][1096443] Updated weights for policy 0, policy_version 38320 (0.0005) [2023-03-10 20:25:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 19656704. Throughput: 0: 12028.3. Samples: 19651260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:25:34,742][1096160] Avg episode reward: [(0, '4852.000')] [2023-03-10 20:25:35,105][1096443] Updated weights for policy 0, policy_version 38400 (0.0005) [2023-03-10 20:25:38,337][1096443] Updated weights for policy 0, policy_version 38480 (0.0005) [2023-03-10 20:25:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 19718144. Throughput: 0: 11986.7. Samples: 19688096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:25:39,742][1096160] Avg episode reward: [(0, '4849.245')] [2023-03-10 20:25:41,648][1096443] Updated weights for policy 0, policy_version 38560 (0.0005) [2023-03-10 20:25:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11982.5). Total num frames: 19775488. Throughput: 0: 11984.5. Samples: 19761836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:25:44,742][1096160] Avg episode reward: [(0, '4845.938')] [2023-03-10 20:25:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000038624_19775488.pth... [2023-03-10 20:25:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000037928_19419136.pth [2023-03-10 20:25:45,157][1096443] Updated weights for policy 0, policy_version 38640 (0.0005) [2023-03-10 20:25:48,417][1096443] Updated weights for policy 0, policy_version 38720 (0.0005) [2023-03-10 20:25:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 19836928. Throughput: 0: 12024.3. Samples: 19833936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:25:49,742][1096160] Avg episode reward: [(0, '4842.207')] [2023-03-10 20:25:51,810][1096443] Updated weights for policy 0, policy_version 38800 (0.0005) [2023-03-10 20:25:54,742][1096160] Fps is (10 sec: 12697.4, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 19902464. Throughput: 0: 12042.0. Samples: 19870980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:25:54,742][1096160] Avg episode reward: [(0, '4856.158')] [2023-03-10 20:25:55,015][1096443] Updated weights for policy 0, policy_version 38880 (0.0005) [2023-03-10 20:25:58,220][1096443] Updated weights for policy 0, policy_version 38960 (0.0005) [2023-03-10 20:25:59,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 19963904. Throughput: 0: 12151.9. Samples: 19947548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:25:59,742][1096160] Avg episode reward: [(0, '4857.907')] [2023-03-10 20:25:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000038992_19963904.pth... [2023-03-10 20:25:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000038280_19599360.pth [2023-03-10 20:26:01,588][1096443] Updated weights for policy 0, policy_version 39040 (0.0005) [2023-03-10 20:26:04,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 20025344. Throughput: 0: 12205.4. Samples: 20020820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:04,742][1096160] Avg episode reward: [(0, '4854.095')] [2023-03-10 20:26:04,887][1096443] Updated weights for policy 0, policy_version 39120 (0.0005) [2023-03-10 20:26:08,243][1096443] Updated weights for policy 0, policy_version 39200 (0.0005) [2023-03-10 20:26:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 20086784. Throughput: 0: 12277.0. Samples: 20057932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:09,742][1096160] Avg episode reward: [(0, '4853.778')] [2023-03-10 20:26:11,533][1096443] Updated weights for policy 0, policy_version 39280 (0.0005) [2023-03-10 20:26:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 20148224. Throughput: 0: 12249.5. Samples: 20131840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:14,742][1096160] Avg episode reward: [(0, '4857.176')] [2023-03-10 20:26:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000039352_20148224.pth... [2023-03-10 20:26:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000038624_19775488.pth [2023-03-10 20:26:14,922][1096443] Updated weights for policy 0, policy_version 39360 (0.0004) [2023-03-10 20:26:18,140][1096443] Updated weights for policy 0, policy_version 39440 (0.0004) [2023-03-10 20:26:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 20209664. Throughput: 0: 12377.1. Samples: 20208228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:19,742][1096160] Avg episode reward: [(0, '4857.950')] [2023-03-10 20:26:21,288][1096443] Updated weights for policy 0, policy_version 39520 (0.0004) [2023-03-10 20:26:24,590][1096443] Updated weights for policy 0, policy_version 39600 (0.0005) [2023-03-10 20:26:24,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12356.3, 300 sec: 12052.0). Total num frames: 20275200. Throughput: 0: 12412.4. Samples: 20246656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:24,742][1096160] Avg episode reward: [(0, '4845.853')] [2023-03-10 20:26:28,062][1096443] Updated weights for policy 0, policy_version 39680 (0.0005) [2023-03-10 20:26:29,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12356.3, 300 sec: 12065.8). Total num frames: 20336640. Throughput: 0: 12371.3. Samples: 20318548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:29,742][1096160] Avg episode reward: [(0, '4847.498')] [2023-03-10 20:26:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000039720_20336640.pth... [2023-03-10 20:26:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000038992_19963904.pth [2023-03-10 20:26:31,395][1096443] Updated weights for policy 0, policy_version 39760 (0.0005) [2023-03-10 20:26:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 20393984. Throughput: 0: 12356.1. Samples: 20389960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:34,742][1096160] Avg episode reward: [(0, '4855.053')] [2023-03-10 20:26:34,925][1096443] Updated weights for policy 0, policy_version 39840 (0.0005) [2023-03-10 20:26:38,450][1096443] Updated weights for policy 0, policy_version 39920 (0.0005) [2023-03-10 20:26:39,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 12219.7, 300 sec: 12052.0). Total num frames: 20451328. Throughput: 0: 12322.6. Samples: 20425496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:39,742][1096160] Avg episode reward: [(0, '4853.210')] [2023-03-10 20:26:41,843][1096443] Updated weights for policy 0, policy_version 40000 (0.0005) [2023-03-10 20:26:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12288.0, 300 sec: 12052.0). Total num frames: 20512768. Throughput: 0: 12236.9. Samples: 20498208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:44,742][1096160] Avg episode reward: [(0, '4850.140')] [2023-03-10 20:26:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000040064_20512768.pth... [2023-03-10 20:26:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000039352_20148224.pth [2023-03-10 20:26:45,349][1096443] Updated weights for policy 0, policy_version 40080 (0.0004) [2023-03-10 20:26:48,614][1096443] Updated weights for policy 0, policy_version 40160 (0.0005) [2023-03-10 20:26:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12065.8). Total num frames: 20574208. Throughput: 0: 12204.4. Samples: 20570020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:49,742][1096160] Avg episode reward: [(0, '4851.317')] [2023-03-10 20:26:52,146][1096443] Updated weights for policy 0, policy_version 40240 (0.0004) [2023-03-10 20:26:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 20631552. Throughput: 0: 12121.1. Samples: 20603380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:54,742][1096160] Avg episode reward: [(0, '4856.462')] [2023-03-10 20:26:55,621][1096443] Updated weights for policy 0, policy_version 40320 (0.0005) [2023-03-10 20:26:59,013][1096443] Updated weights for policy 0, policy_version 40400 (0.0005) [2023-03-10 20:26:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 20692992. Throughput: 0: 12016.7. Samples: 20672592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:26:59,742][1096160] Avg episode reward: [(0, '4849.589')] [2023-03-10 20:26:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000040416_20692992.pth... [2023-03-10 20:26:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000039720_20336640.pth [2023-03-10 20:27:02,329][1096443] Updated weights for policy 0, policy_version 40480 (0.0006) [2023-03-10 20:27:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 20750336. Throughput: 0: 11959.7. Samples: 20746416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:27:04,742][1096160] Avg episode reward: [(0, '4850.949')] [2023-03-10 20:27:05,922][1096443] Updated weights for policy 0, policy_version 40560 (0.0005) [2023-03-10 20:27:09,490][1096443] Updated weights for policy 0, policy_version 40640 (0.0005) [2023-03-10 20:27:09,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 12015.0, 300 sec: 12052.0). Total num frames: 20807680. Throughput: 0: 11883.0. Samples: 20781392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:27:09,742][1096160] Avg episode reward: [(0, '4852.011')] [2023-03-10 20:27:13,004][1096443] Updated weights for policy 0, policy_version 40720 (0.0005) [2023-03-10 20:27:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 20869120. Throughput: 0: 11850.0. Samples: 20851796. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:27:14,742][1096160] Avg episode reward: [(0, '4850.874')] [2023-03-10 20:27:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000040760_20869120.pth... [2023-03-10 20:27:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000040064_20512768.pth [2023-03-10 20:27:16,427][1096443] Updated weights for policy 0, policy_version 40800 (0.0005) [2023-03-10 20:27:19,717][1096443] Updated weights for policy 0, policy_version 40880 (0.0004) [2023-03-10 20:27:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 20930560. Throughput: 0: 11866.9. Samples: 20923972. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:27:19,742][1096160] Avg episode reward: [(0, '4843.502')] [2023-03-10 20:27:23,059][1096443] Updated weights for policy 0, policy_version 40960 (0.0005) [2023-03-10 20:27:24,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 20987904. Throughput: 0: 11897.5. Samples: 20960884. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:27:24,742][1096160] Avg episode reward: [(0, '4835.228')] [2023-03-10 20:27:26,420][1096443] Updated weights for policy 0, policy_version 41040 (0.0005) [2023-03-10 20:27:29,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.2, 300 sec: 12038.1). Total num frames: 21045248. Throughput: 0: 11875.0. Samples: 21032580. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:27:29,742][1096160] Avg episode reward: [(0, '4852.324')] [2023-03-10 20:27:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000041112_21049344.pth... [2023-03-10 20:27:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000040416_20692992.pth [2023-03-10 20:27:30,097][1096443] Updated weights for policy 0, policy_version 41120 (0.0005) [2023-03-10 20:27:33,366][1096443] Updated weights for policy 0, policy_version 41200 (0.0005) [2023-03-10 20:27:34,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 21110784. Throughput: 0: 11892.4. Samples: 21105180. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:27:34,742][1096160] Avg episode reward: [(0, '4856.089')] [2023-03-10 20:27:36,829][1096443] Updated weights for policy 0, policy_version 41280 (0.0005) [2023-03-10 20:27:39,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 21168128. Throughput: 0: 11917.1. Samples: 21139648. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:27:39,742][1096160] Avg episode reward: [(0, '4846.074')] [2023-03-10 20:27:40,387][1096443] Updated weights for policy 0, policy_version 41360 (0.0005) [2023-03-10 20:27:43,840][1096443] Updated weights for policy 0, policy_version 41440 (0.0005) [2023-03-10 20:27:44,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 21225472. Throughput: 0: 11941.2. Samples: 21209944. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:27:44,742][1096160] Avg episode reward: [(0, '4840.665')] [2023-03-10 20:27:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000041456_21225472.pth... [2023-03-10 20:27:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000040760_20869120.pth [2023-03-10 20:27:47,477][1096443] Updated weights for policy 0, policy_version 41520 (0.0005) [2023-03-10 20:27:49,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 21282816. Throughput: 0: 11830.4. Samples: 21278784. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:27:49,742][1096160] Avg episode reward: [(0, '4847.901')] [2023-03-10 20:27:51,063][1096443] Updated weights for policy 0, policy_version 41600 (0.0004) [2023-03-10 20:27:54,617][1096443] Updated weights for policy 0, policy_version 41680 (0.0005) [2023-03-10 20:27:54,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 21340160. Throughput: 0: 11792.7. Samples: 21312064. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:27:54,742][1096160] Avg episode reward: [(0, '4855.243')] [2023-03-10 20:27:57,989][1096443] Updated weights for policy 0, policy_version 41760 (0.0005) [2023-03-10 20:27:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 12038.1). Total num frames: 21401600. Throughput: 0: 11838.0. Samples: 21384504. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:27:59,742][1096160] Avg episode reward: [(0, '4849.879')] [2023-03-10 20:27:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000041800_21401600.pth... [2023-03-10 20:27:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000041112_21049344.pth [2023-03-10 20:28:01,378][1096443] Updated weights for policy 0, policy_version 41840 (0.0005) [2023-03-10 20:28:04,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 21458944. Throughput: 0: 11823.7. Samples: 21456040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:28:04,742][1096160] Avg episode reward: [(0, '4851.519')] [2023-03-10 20:28:04,879][1096443] Updated weights for policy 0, policy_version 41920 (0.0005) [2023-03-10 20:28:08,421][1096443] Updated weights for policy 0, policy_version 42000 (0.0005) [2023-03-10 20:28:09,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 21516288. Throughput: 0: 11745.7. Samples: 21489440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:28:09,742][1096160] Avg episode reward: [(0, '4855.195')] [2023-03-10 20:28:11,895][1096443] Updated weights for policy 0, policy_version 42080 (0.0005) [2023-03-10 20:28:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 21577728. Throughput: 0: 11744.0. Samples: 21561060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:28:14,742][1096160] Avg episode reward: [(0, '4851.605')] [2023-03-10 20:28:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000042144_21577728.pth... [2023-03-10 20:28:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000041456_21225472.pth [2023-03-10 20:28:15,372][1096443] Updated weights for policy 0, policy_version 42160 (0.0004) [2023-03-10 20:28:18,849][1096443] Updated weights for policy 0, policy_version 42240 (0.0005) [2023-03-10 20:28:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 12010.3). Total num frames: 21635072. Throughput: 0: 11685.8. Samples: 21631040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:28:19,742][1096160] Avg episode reward: [(0, '4847.907')] [2023-03-10 20:28:22,398][1096443] Updated weights for policy 0, policy_version 42320 (0.0005) [2023-03-10 20:28:24,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 21696512. Throughput: 0: 11684.6. Samples: 21665456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:28:24,742][1096160] Avg episode reward: [(0, '4850.380')] [2023-03-10 20:28:25,550][1096443] Updated weights for policy 0, policy_version 42400 (0.0004) [2023-03-10 20:28:28,941][1096443] Updated weights for policy 0, policy_version 42480 (0.0004) [2023-03-10 20:28:29,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 21757952. Throughput: 0: 11812.5. Samples: 21741508. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:28:29,742][1096160] Avg episode reward: [(0, '4855.086')] [2023-03-10 20:28:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000042496_21757952.pth... [2023-03-10 20:28:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000041800_21401600.pth [2023-03-10 20:28:32,350][1096443] Updated weights for policy 0, policy_version 42560 (0.0005) [2023-03-10 20:28:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 12024.2). Total num frames: 21815296. Throughput: 0: 11876.4. Samples: 21813220. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:28:34,742][1096160] Avg episode reward: [(0, '4856.772')] [2023-03-10 20:28:35,795][1096443] Updated weights for policy 0, policy_version 42640 (0.0005) [2023-03-10 20:28:39,126][1096443] Updated weights for policy 0, policy_version 42720 (0.0005) [2023-03-10 20:28:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 12038.1). Total num frames: 21876736. Throughput: 0: 11942.1. Samples: 21849460. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:28:39,742][1096160] Avg episode reward: [(0, '4856.919')] [2023-03-10 20:28:42,552][1096443] Updated weights for policy 0, policy_version 42800 (0.0005) [2023-03-10 20:28:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 21934080. Throughput: 0: 11937.8. Samples: 21921704. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:28:44,742][1096160] Avg episode reward: [(0, '4849.917')] [2023-03-10 20:28:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000042848_21938176.pth... [2023-03-10 20:28:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000042144_21577728.pth [2023-03-10 20:28:46,121][1096443] Updated weights for policy 0, policy_version 42880 (0.0005) [2023-03-10 20:28:49,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11810.2, 300 sec: 12024.2). Total num frames: 21991424. Throughput: 0: 11832.2. Samples: 21988488. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:28:49,752][1096160] Avg episode reward: [(0, '4836.430')] [2023-03-10 20:28:49,783][1096443] Updated weights for policy 0, policy_version 42960 (0.0005) [2023-03-10 20:28:53,204][1096443] Updated weights for policy 0, policy_version 43040 (0.0004) [2023-03-10 20:28:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 22052864. Throughput: 0: 11894.0. Samples: 22024668. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:28:54,753][1096160] Avg episode reward: [(0, '4838.506')] [2023-03-10 20:28:56,545][1096443] Updated weights for policy 0, policy_version 43120 (0.0004) [2023-03-10 20:28:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 22114304. Throughput: 0: 11919.5. Samples: 22097436. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:28:59,742][1096160] Avg episode reward: [(0, '4841.530')] [2023-03-10 20:28:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000043192_22114304.pth... [2023-03-10 20:28:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000042496_21757952.pth [2023-03-10 20:28:59,914][1096443] Updated weights for policy 0, policy_version 43200 (0.0004) [2023-03-10 20:29:03,094][1096443] Updated weights for policy 0, policy_version 43280 (0.0005) [2023-03-10 20:29:04,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 22175744. Throughput: 0: 12019.8. Samples: 22171928. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:29:04,742][1096160] Avg episode reward: [(0, '4845.185')] [2023-03-10 20:29:06,442][1096443] Updated weights for policy 0, policy_version 43360 (0.0004) [2023-03-10 20:29:09,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 22237184. Throughput: 0: 12117.5. Samples: 22210744. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:29:09,742][1096160] Avg episode reward: [(0, '4857.040')] [2023-03-10 20:29:09,778][1096443] Updated weights for policy 0, policy_version 43440 (0.0005) [2023-03-10 20:29:13,241][1096443] Updated weights for policy 0, policy_version 43520 (0.0005) [2023-03-10 20:29:14,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 22298624. Throughput: 0: 12013.7. Samples: 22282128. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:29:14,742][1096160] Avg episode reward: [(0, '4851.347')] [2023-03-10 20:29:14,747][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000043552_22298624.pth... [2023-03-10 20:29:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000042848_21938176.pth [2023-03-10 20:29:16,559][1096443] Updated weights for policy 0, policy_version 43600 (0.0005) [2023-03-10 20:29:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 22360064. Throughput: 0: 12058.8. Samples: 22355868. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:29:19,742][1096160] Avg episode reward: [(0, '4861.168')] [2023-03-10 20:29:20,026][1096443] Updated weights for policy 0, policy_version 43680 (0.0005) [2023-03-10 20:29:23,386][1096443] Updated weights for policy 0, policy_version 43760 (0.0005) [2023-03-10 20:29:24,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 22417408. Throughput: 0: 12012.7. Samples: 22390032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:29:24,742][1096160] Avg episode reward: [(0, '4856.776')] [2023-03-10 20:29:26,870][1096443] Updated weights for policy 0, policy_version 43840 (0.0006) [2023-03-10 20:29:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 22478848. Throughput: 0: 12014.6. Samples: 22462360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:29:29,742][1096160] Avg episode reward: [(0, '4858.214')] [2023-03-10 20:29:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000043904_22478848.pth... [2023-03-10 20:29:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000043192_22114304.pth [2023-03-10 20:29:30,309][1096443] Updated weights for policy 0, policy_version 43920 (0.0005) [2023-03-10 20:29:33,695][1096443] Updated weights for policy 0, policy_version 44000 (0.0005) [2023-03-10 20:29:34,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 22540288. Throughput: 0: 12155.3. Samples: 22535476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:29:34,742][1096160] Avg episode reward: [(0, '4858.388')] [2023-03-10 20:29:37,113][1096443] Updated weights for policy 0, policy_version 44080 (0.0005) [2023-03-10 20:29:39,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 22597632. Throughput: 0: 12116.5. Samples: 22569912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:29:39,742][1096160] Avg episode reward: [(0, '4853.878')] [2023-03-10 20:29:40,435][1096443] Updated weights for policy 0, policy_version 44160 (0.0005) [2023-03-10 20:29:43,807][1096443] Updated weights for policy 0, policy_version 44240 (0.0005) [2023-03-10 20:29:44,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 22659072. Throughput: 0: 12125.0. Samples: 22643060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:29:44,756][1096160] Avg episode reward: [(0, '4853.789')] [2023-03-10 20:29:44,759][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000044256_22659072.pth... [2023-03-10 20:29:44,760][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000043552_22298624.pth [2023-03-10 20:29:47,118][1096443] Updated weights for policy 0, policy_version 44320 (0.0005) [2023-03-10 20:29:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 12010.3). Total num frames: 22720512. Throughput: 0: 12155.9. Samples: 22718944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:29:49,753][1096160] Avg episode reward: [(0, '4855.571')] [2023-03-10 20:29:50,490][1096443] Updated weights for policy 0, policy_version 44400 (0.0005) [2023-03-10 20:29:53,886][1096443] Updated weights for policy 0, policy_version 44480 (0.0005) [2023-03-10 20:29:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 22781952. Throughput: 0: 12057.8. Samples: 22753344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:29:54,753][1096160] Avg episode reward: [(0, '4848.840')] [2023-03-10 20:29:57,256][1096443] Updated weights for policy 0, policy_version 44560 (0.0005) [2023-03-10 20:29:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 22839296. Throughput: 0: 12105.5. Samples: 22826876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:29:59,742][1096160] Avg episode reward: [(0, '4853.471')] [2023-03-10 20:29:59,791][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000044616_22843392.pth... [2023-03-10 20:29:59,793][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000043904_22478848.pth [2023-03-10 20:30:00,751][1096443] Updated weights for policy 0, policy_version 44640 (0.0005) [2023-03-10 20:30:04,135][1096443] Updated weights for policy 0, policy_version 44720 (0.0005) [2023-03-10 20:30:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 22900736. Throughput: 0: 12035.4. Samples: 22897460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:30:04,742][1096160] Avg episode reward: [(0, '4846.695')] [2023-03-10 20:30:07,439][1096443] Updated weights for policy 0, policy_version 44800 (0.0005) [2023-03-10 20:30:09,742][1096160] Fps is (10 sec: 12697.7, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 22966272. Throughput: 0: 12108.2. Samples: 22934900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:30:09,742][1096160] Avg episode reward: [(0, '4841.846')] [2023-03-10 20:30:10,793][1096443] Updated weights for policy 0, policy_version 44880 (0.0005) [2023-03-10 20:30:14,091][1096443] Updated weights for policy 0, policy_version 44960 (0.0004) [2023-03-10 20:30:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 23023616. Throughput: 0: 12136.7. Samples: 23008512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:30:14,742][1096160] Avg episode reward: [(0, '4843.570')] [2023-03-10 20:30:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000044968_23023616.pth... [2023-03-10 20:30:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000044256_22659072.pth [2023-03-10 20:30:17,470][1096443] Updated weights for policy 0, policy_version 45040 (0.0005) [2023-03-10 20:30:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 23085056. Throughput: 0: 12120.9. Samples: 23080916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:30:19,742][1096160] Avg episode reward: [(0, '4837.675')] [2023-03-10 20:30:20,978][1096443] Updated weights for policy 0, policy_version 45120 (0.0005) [2023-03-10 20:30:24,383][1096443] Updated weights for policy 0, policy_version 45200 (0.0004) [2023-03-10 20:30:24,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12151.4, 300 sec: 12038.1). Total num frames: 23146496. Throughput: 0: 12169.8. Samples: 23117556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:30:24,742][1096160] Avg episode reward: [(0, '4842.974')] [2023-03-10 20:30:27,788][1096443] Updated weights for policy 0, policy_version 45280 (0.0005) [2023-03-10 20:30:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 23207936. Throughput: 0: 12142.0. Samples: 23189452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:30:29,742][1096160] Avg episode reward: [(0, '4842.518')] [2023-03-10 20:30:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000045328_23207936.pth... [2023-03-10 20:30:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000044616_22843392.pth [2023-03-10 20:30:31,084][1096443] Updated weights for policy 0, policy_version 45360 (0.0005) [2023-03-10 20:30:34,732][1096443] Updated weights for policy 0, policy_version 45440 (0.0004) [2023-03-10 20:30:34,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 23265280. Throughput: 0: 12018.5. Samples: 23259776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:30:34,742][1096160] Avg episode reward: [(0, '4856.778')] [2023-03-10 20:30:38,002][1096443] Updated weights for policy 0, policy_version 45520 (0.0004) [2023-03-10 20:30:39,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 23322624. Throughput: 0: 12085.8. Samples: 23297204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:30:39,742][1096160] Avg episode reward: [(0, '4846.633')] [2023-03-10 20:30:41,505][1096443] Updated weights for policy 0, policy_version 45600 (0.0005) [2023-03-10 20:30:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 23384064. Throughput: 0: 12019.3. Samples: 23367744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:30:44,742][1096160] Avg episode reward: [(0, '4848.339')] [2023-03-10 20:30:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000045672_23384064.pth... [2023-03-10 20:30:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000044968_23023616.pth [2023-03-10 20:30:44,916][1096443] Updated weights for policy 0, policy_version 45680 (0.0005) [2023-03-10 20:30:48,388][1096443] Updated weights for policy 0, policy_version 45760 (0.0005) [2023-03-10 20:30:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 23441408. Throughput: 0: 12030.6. Samples: 23438836. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:30:49,742][1096160] Avg episode reward: [(0, '4844.979')] [2023-03-10 20:30:51,916][1096443] Updated weights for policy 0, policy_version 45840 (0.0005) [2023-03-10 20:30:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 23502848. Throughput: 0: 11985.3. Samples: 23474240. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:30:54,742][1096160] Avg episode reward: [(0, '4851.363')] [2023-03-10 20:30:55,330][1096443] Updated weights for policy 0, policy_version 45920 (0.0005) [2023-03-10 20:30:58,656][1096443] Updated weights for policy 0, policy_version 46000 (0.0005) [2023-03-10 20:30:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 23564288. Throughput: 0: 11965.9. Samples: 23546980. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:30:59,742][1096160] Avg episode reward: [(0, '4848.541')] [2023-03-10 20:30:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000046024_23564288.pth... [2023-03-10 20:30:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000045328_23207936.pth [2023-03-10 20:31:02,043][1096443] Updated weights for policy 0, policy_version 46080 (0.0005) [2023-03-10 20:31:04,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 23621632. Throughput: 0: 11939.3. Samples: 23618184. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:31:04,742][1096160] Avg episode reward: [(0, '4850.068')] [2023-03-10 20:31:05,494][1096443] Updated weights for policy 0, policy_version 46160 (0.0004) [2023-03-10 20:31:08,971][1096443] Updated weights for policy 0, policy_version 46240 (0.0004) [2023-03-10 20:31:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11982.5). Total num frames: 23683072. Throughput: 0: 11931.3. Samples: 23654464. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:31:09,753][1096160] Avg episode reward: [(0, '4852.864')] [2023-03-10 20:31:12,328][1096443] Updated weights for policy 0, policy_version 46320 (0.0005) [2023-03-10 20:31:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 23744512. Throughput: 0: 11970.5. Samples: 23728124. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:31:14,742][1096160] Avg episode reward: [(0, '4853.253')] [2023-03-10 20:31:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000046376_23744512.pth... [2023-03-10 20:31:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000045672_23384064.pth [2023-03-10 20:31:15,644][1096443] Updated weights for policy 0, policy_version 46400 (0.0005) [2023-03-10 20:31:19,156][1096443] Updated weights for policy 0, policy_version 46480 (0.0005) [2023-03-10 20:31:19,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 23801856. Throughput: 0: 11963.9. Samples: 23798152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:31:19,742][1096160] Avg episode reward: [(0, '4847.767')] [2023-03-10 20:31:22,594][1096443] Updated weights for policy 0, policy_version 46560 (0.0005) [2023-03-10 20:31:24,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 23863296. Throughput: 0: 11943.0. Samples: 23834640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:31:24,742][1096160] Avg episode reward: [(0, '4855.345')] [2023-03-10 20:31:25,945][1096443] Updated weights for policy 0, policy_version 46640 (0.0005) [2023-03-10 20:31:29,297][1096443] Updated weights for policy 0, policy_version 46720 (0.0005) [2023-03-10 20:31:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 23924736. Throughput: 0: 12010.8. Samples: 23908232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:31:29,742][1096160] Avg episode reward: [(0, '4854.774')] [2023-03-10 20:31:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000046728_23924736.pth... [2023-03-10 20:31:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000046024_23564288.pth [2023-03-10 20:31:32,751][1096443] Updated weights for policy 0, policy_version 46800 (0.0005) [2023-03-10 20:31:34,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 23982080. Throughput: 0: 12048.8. Samples: 23981032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:31:34,742][1096160] Avg episode reward: [(0, '4855.357')] [2023-03-10 20:31:36,025][1096443] Updated weights for policy 0, policy_version 46880 (0.0004) [2023-03-10 20:31:39,648][1096443] Updated weights for policy 0, policy_version 46960 (0.0005) [2023-03-10 20:31:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 24043520. Throughput: 0: 12020.3. Samples: 24015156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:31:39,742][1096160] Avg episode reward: [(0, '4844.291')] [2023-03-10 20:31:43,151][1096443] Updated weights for policy 0, policy_version 47040 (0.0005) [2023-03-10 20:31:44,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 24100864. Throughput: 0: 11953.5. Samples: 24084888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:31:44,745][1096160] Avg episode reward: [(0, '4851.158')] [2023-03-10 20:31:44,748][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000047072_24100864.pth... [2023-03-10 20:31:44,751][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000046376_23744512.pth [2023-03-10 20:31:46,558][1096443] Updated weights for policy 0, policy_version 47120 (0.0005) [2023-03-10 20:31:49,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 24162304. Throughput: 0: 12009.5. Samples: 24158612. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:31:49,753][1096160] Avg episode reward: [(0, '4848.744')] [2023-03-10 20:31:49,819][1096443] Updated weights for policy 0, policy_version 47200 (0.0006) [2023-03-10 20:31:53,094][1096443] Updated weights for policy 0, policy_version 47280 (0.0005) [2023-03-10 20:31:54,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 24223744. Throughput: 0: 12015.0. Samples: 24195136. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:31:54,742][1096160] Avg episode reward: [(0, '4856.607')] [2023-03-10 20:31:56,626][1096443] Updated weights for policy 0, policy_version 47360 (0.0005) [2023-03-10 20:31:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12015.0, 300 sec: 11982.5). Total num frames: 24285184. Throughput: 0: 12009.4. Samples: 24268548. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:31:59,742][1096160] Avg episode reward: [(0, '4843.972')] [2023-03-10 20:31:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000047432_24285184.pth... [2023-03-10 20:31:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000046728_23924736.pth [2023-03-10 20:31:59,894][1096443] Updated weights for policy 0, policy_version 47440 (0.0005) [2023-03-10 20:32:03,330][1096443] Updated weights for policy 0, policy_version 47520 (0.0005) [2023-03-10 20:32:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 24346624. Throughput: 0: 12096.8. Samples: 24342508. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:32:04,742][1096160] Avg episode reward: [(0, '4815.002')] [2023-03-10 20:32:06,631][1096443] Updated weights for policy 0, policy_version 47600 (0.0005) [2023-03-10 20:32:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 11982.5). Total num frames: 24403968. Throughput: 0: 12089.3. Samples: 24378656. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:32:09,742][1096160] Avg episode reward: [(0, '4836.498')] [2023-03-10 20:32:10,181][1096443] Updated weights for policy 0, policy_version 47680 (0.0005) [2023-03-10 20:32:13,690][1096443] Updated weights for policy 0, policy_version 47760 (0.0005) [2023-03-10 20:32:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 24465408. Throughput: 0: 12014.9. Samples: 24448900. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:32:14,742][1096160] Avg episode reward: [(0, '4855.972')] [2023-03-10 20:32:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000047784_24465408.pth... [2023-03-10 20:32:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000047072_24100864.pth [2023-03-10 20:32:17,128][1096443] Updated weights for policy 0, policy_version 47840 (0.0005) [2023-03-10 20:32:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 24522752. Throughput: 0: 11956.0. Samples: 24519052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:32:19,742][1096160] Avg episode reward: [(0, '4853.146')] [2023-03-10 20:32:20,439][1096443] Updated weights for policy 0, policy_version 47920 (0.0005) [2023-03-10 20:32:23,752][1096443] Updated weights for policy 0, policy_version 48000 (0.0005) [2023-03-10 20:32:24,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11996.4). Total num frames: 24584192. Throughput: 0: 12017.3. Samples: 24555932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:32:24,742][1096160] Avg episode reward: [(0, '4855.425')] [2023-03-10 20:32:27,098][1096443] Updated weights for policy 0, policy_version 48080 (0.0004) [2023-03-10 20:32:29,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 24645632. Throughput: 0: 12098.3. Samples: 24629312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:32:29,742][1096160] Avg episode reward: [(0, '4853.233')] [2023-03-10 20:32:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000048136_24645632.pth... [2023-03-10 20:32:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000047432_24285184.pth [2023-03-10 20:32:30,570][1096443] Updated weights for policy 0, policy_version 48160 (0.0005) [2023-03-10 20:32:33,953][1096443] Updated weights for policy 0, policy_version 48240 (0.0005) [2023-03-10 20:32:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 24707072. Throughput: 0: 12093.0. Samples: 24702800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:32:34,742][1096160] Avg episode reward: [(0, '4857.334')] [2023-03-10 20:32:37,332][1096443] Updated weights for policy 0, policy_version 48320 (0.0005) [2023-03-10 20:32:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 24768512. Throughput: 0: 12092.0. Samples: 24739276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:32:39,753][1096160] Avg episode reward: [(0, '4850.934')] [2023-03-10 20:32:40,729][1096443] Updated weights for policy 0, policy_version 48400 (0.0005) [2023-03-10 20:32:43,987][1096443] Updated weights for policy 0, policy_version 48480 (0.0004) [2023-03-10 20:32:44,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 24829952. Throughput: 0: 12093.9. Samples: 24812772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:32:44,755][1096160] Avg episode reward: [(0, '4850.287')] [2023-03-10 20:32:44,758][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000048496_24829952.pth... [2023-03-10 20:32:44,760][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000047784_24465408.pth [2023-03-10 20:32:47,236][1096443] Updated weights for policy 0, policy_version 48560 (0.0005) [2023-03-10 20:32:49,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 24891392. Throughput: 0: 12083.7. Samples: 24886272. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:32:49,753][1096160] Avg episode reward: [(0, '4856.191')] [2023-03-10 20:32:50,835][1096443] Updated weights for policy 0, policy_version 48640 (0.0005) [2023-03-10 20:32:54,100][1096443] Updated weights for policy 0, policy_version 48720 (0.0005) [2023-03-10 20:32:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 24948736. Throughput: 0: 12076.1. Samples: 24922080. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:32:54,742][1096160] Avg episode reward: [(0, '4858.006')] [2023-03-10 20:32:57,522][1096443] Updated weights for policy 0, policy_version 48800 (0.0005) [2023-03-10 20:32:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 25010176. Throughput: 0: 12138.3. Samples: 24995124. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:32:59,742][1096160] Avg episode reward: [(0, '4859.231')] [2023-03-10 20:32:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000048848_25010176.pth... [2023-03-10 20:32:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000048136_24645632.pth [2023-03-10 20:33:01,062][1096443] Updated weights for policy 0, policy_version 48880 (0.0005) [2023-03-10 20:33:04,303][1096443] Updated weights for policy 0, policy_version 48960 (0.0005) [2023-03-10 20:33:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 25071616. Throughput: 0: 12171.6. Samples: 25066776. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:33:04,742][1096160] Avg episode reward: [(0, '4855.505')] [2023-03-10 20:33:07,529][1096443] Updated weights for policy 0, policy_version 49040 (0.0005) [2023-03-10 20:33:09,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12052.0). Total num frames: 25133056. Throughput: 0: 12182.1. Samples: 25104128. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:33:09,744][1096160] Avg episode reward: [(0, '4856.988')] [2023-03-10 20:33:10,890][1096443] Updated weights for policy 0, policy_version 49120 (0.0005) [2023-03-10 20:33:14,434][1096443] Updated weights for policy 0, policy_version 49200 (0.0005) [2023-03-10 20:33:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 25190400. Throughput: 0: 12191.8. Samples: 25177944. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 20:33:14,742][1096160] Avg episode reward: [(0, '4854.919')] [2023-03-10 20:33:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000049200_25190400.pth... [2023-03-10 20:33:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000048496_24829952.pth [2023-03-10 20:33:18,060][1096443] Updated weights for policy 0, policy_version 49280 (0.0004) [2023-03-10 20:33:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 25247744. Throughput: 0: 12036.7. Samples: 25244452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:33:19,742][1096160] Avg episode reward: [(0, '4856.828')] [2023-03-10 20:33:21,487][1096443] Updated weights for policy 0, policy_version 49360 (0.0005) [2023-03-10 20:33:24,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 25309184. Throughput: 0: 12065.3. Samples: 25282212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:33:24,742][1096160] Avg episode reward: [(0, '4857.798')] [2023-03-10 20:33:24,851][1096443] Updated weights for policy 0, policy_version 49440 (0.0005) [2023-03-10 20:33:28,366][1096443] Updated weights for policy 0, policy_version 49520 (0.0005) [2023-03-10 20:33:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 25366528. Throughput: 0: 12005.4. Samples: 25353016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:33:29,742][1096160] Avg episode reward: [(0, '4854.315')] [2023-03-10 20:33:29,790][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000049552_25370624.pth... [2023-03-10 20:33:29,792][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000048848_25010176.pth [2023-03-10 20:33:31,839][1096443] Updated weights for policy 0, policy_version 49600 (0.0005) [2023-03-10 20:33:34,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 25427968. Throughput: 0: 11945.0. Samples: 25423796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:33:34,742][1096160] Avg episode reward: [(0, '4852.191')] [2023-03-10 20:33:35,381][1096443] Updated weights for policy 0, policy_version 49680 (0.0005) [2023-03-10 20:33:39,039][1096443] Updated weights for policy 0, policy_version 49760 (0.0005) [2023-03-10 20:33:39,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 25481216. Throughput: 0: 11906.1. Samples: 25457852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:33:39,744][1096160] Avg episode reward: [(0, '4850.166')] [2023-03-10 20:33:42,375][1096443] Updated weights for policy 0, policy_version 49840 (0.0005) [2023-03-10 20:33:44,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 25542656. Throughput: 0: 11871.4. Samples: 25529336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:33:44,756][1096160] Avg episode reward: [(0, '4854.506')] [2023-03-10 20:33:44,790][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000049896_25546752.pth... [2023-03-10 20:33:44,792][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000049200_25190400.pth [2023-03-10 20:33:45,843][1096443] Updated weights for policy 0, policy_version 49920 (0.0004) [2023-03-10 20:33:49,336][1096443] Updated weights for policy 0, policy_version 50000 (0.0005) [2023-03-10 20:33:49,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 25604096. Throughput: 0: 11824.8. Samples: 25598888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:33:49,742][1096160] Avg episode reward: [(0, '4849.070')] [2023-03-10 20:33:52,754][1096443] Updated weights for policy 0, policy_version 50080 (0.0005) [2023-03-10 20:33:54,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 25661440. Throughput: 0: 11778.8. Samples: 25634172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:33:54,742][1096160] Avg episode reward: [(0, '4854.711')] [2023-03-10 20:33:56,233][1096443] Updated weights for policy 0, policy_version 50160 (0.0004) [2023-03-10 20:33:59,298][1096443] Updated weights for policy 0, policy_version 50240 (0.0004) [2023-03-10 20:33:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 25726976. Throughput: 0: 11796.7. Samples: 25708796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:33:59,742][1096160] Avg episode reward: [(0, '4847.371')] [2023-03-10 20:33:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000050248_25726976.pth... [2023-03-10 20:33:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000049552_25370624.pth [2023-03-10 20:34:02,680][1096443] Updated weights for policy 0, policy_version 50320 (0.0005) [2023-03-10 20:34:04,742][1096160] Fps is (10 sec: 12697.4, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 25788416. Throughput: 0: 11927.7. Samples: 25781200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:04,742][1096160] Avg episode reward: [(0, '4848.638')] [2023-03-10 20:34:06,086][1096443] Updated weights for policy 0, policy_version 50400 (0.0005) [2023-03-10 20:34:09,491][1096443] Updated weights for policy 0, policy_version 50480 (0.0005) [2023-03-10 20:34:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 25845760. Throughput: 0: 11893.4. Samples: 25817416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:09,742][1096160] Avg episode reward: [(0, '4856.816')] [2023-03-10 20:34:12,883][1096443] Updated weights for policy 0, policy_version 50560 (0.0005) [2023-03-10 20:34:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 25907200. Throughput: 0: 11958.4. Samples: 25891144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:14,742][1096160] Avg episode reward: [(0, '4856.988')] [2023-03-10 20:34:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000050600_25907200.pth... [2023-03-10 20:34:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000049896_25546752.pth [2023-03-10 20:34:16,246][1096443] Updated weights for policy 0, policy_version 50640 (0.0005) [2023-03-10 20:34:19,566][1096443] Updated weights for policy 0, policy_version 50720 (0.0004) [2023-03-10 20:34:19,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 25968640. Throughput: 0: 12015.8. Samples: 25964508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:19,742][1096160] Avg episode reward: [(0, '4859.928')] [2023-03-10 20:34:23,124][1096443] Updated weights for policy 0, policy_version 50800 (0.0004) [2023-03-10 20:34:24,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 26025984. Throughput: 0: 12025.7. Samples: 25999008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:24,742][1096160] Avg episode reward: [(0, '4855.485')] [2023-03-10 20:34:26,739][1096443] Updated weights for policy 0, policy_version 50880 (0.0005) [2023-03-10 20:34:29,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 26083328. Throughput: 0: 11955.4. Samples: 26067328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:29,742][1096160] Avg episode reward: [(0, '4852.690')] [2023-03-10 20:34:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000050944_26083328.pth... [2023-03-10 20:34:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000050248_25726976.pth [2023-03-10 20:34:30,269][1096443] Updated weights for policy 0, policy_version 50960 (0.0005) [2023-03-10 20:34:33,586][1096443] Updated weights for policy 0, policy_version 51040 (0.0004) [2023-03-10 20:34:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 26144768. Throughput: 0: 12039.7. Samples: 26140676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:34,742][1096160] Avg episode reward: [(0, '4850.205')] [2023-03-10 20:34:36,943][1096443] Updated weights for policy 0, policy_version 51120 (0.0006) [2023-03-10 20:34:39,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 26202112. Throughput: 0: 12063.1. Samples: 26177012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:39,742][1096160] Avg episode reward: [(0, '4850.725')] [2023-03-10 20:34:40,504][1096443] Updated weights for policy 0, policy_version 51200 (0.0005) [2023-03-10 20:34:43,824][1096443] Updated weights for policy 0, policy_version 51280 (0.0005) [2023-03-10 20:34:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 12010.3). Total num frames: 26263552. Throughput: 0: 11965.4. Samples: 26247240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:44,742][1096160] Avg episode reward: [(0, '4852.139')] [2023-03-10 20:34:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000051296_26263552.pth... [2023-03-10 20:34:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000050600_25907200.pth [2023-03-10 20:34:47,265][1096443] Updated weights for policy 0, policy_version 51360 (0.0005) [2023-03-10 20:34:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 26324992. Throughput: 0: 11988.2. Samples: 26320668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:49,742][1096160] Avg episode reward: [(0, '4848.009')] [2023-03-10 20:34:50,766][1096443] Updated weights for policy 0, policy_version 51440 (0.0005) [2023-03-10 20:34:54,274][1096443] Updated weights for policy 0, policy_version 51520 (0.0005) [2023-03-10 20:34:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 26382336. Throughput: 0: 11918.1. Samples: 26353728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:54,742][1096160] Avg episode reward: [(0, '4831.992')] [2023-03-10 20:34:57,852][1096443] Updated weights for policy 0, policy_version 51600 (0.0005) [2023-03-10 20:34:59,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 26439680. Throughput: 0: 11825.6. Samples: 26423296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:34:59,742][1096160] Avg episode reward: [(0, '4840.972')] [2023-03-10 20:34:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000051640_26439680.pth... [2023-03-10 20:34:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000050944_26083328.pth [2023-03-10 20:35:01,382][1096443] Updated weights for policy 0, policy_version 51680 (0.0005) [2023-03-10 20:35:04,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11968.7). Total num frames: 26497024. Throughput: 0: 11796.5. Samples: 26495352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:04,742][1096160] Avg episode reward: [(0, '4841.183')] [2023-03-10 20:35:04,841][1096443] Updated weights for policy 0, policy_version 51760 (0.0005) [2023-03-10 20:35:08,373][1096443] Updated weights for policy 0, policy_version 51840 (0.0005) [2023-03-10 20:35:09,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11810.2, 300 sec: 11968.7). Total num frames: 26554368. Throughput: 0: 11792.0. Samples: 26529648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:09,742][1096160] Avg episode reward: [(0, '4846.080')] [2023-03-10 20:35:11,872][1096443] Updated weights for policy 0, policy_version 51920 (0.0004) [2023-03-10 20:35:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11968.6). Total num frames: 26615808. Throughput: 0: 11806.9. Samples: 26598640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:14,742][1096160] Avg episode reward: [(0, '4845.623')] [2023-03-10 20:35:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000051984_26615808.pth... [2023-03-10 20:35:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000051296_26263552.pth [2023-03-10 20:35:15,165][1096443] Updated weights for policy 0, policy_version 52000 (0.0005) [2023-03-10 20:35:18,631][1096443] Updated weights for policy 0, policy_version 52080 (0.0004) [2023-03-10 20:35:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11968.7). Total num frames: 26677248. Throughput: 0: 11815.9. Samples: 26672392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:19,742][1096160] Avg episode reward: [(0, '4847.290')] [2023-03-10 20:35:22,019][1096443] Updated weights for policy 0, policy_version 52160 (0.0004) [2023-03-10 20:35:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 26738688. Throughput: 0: 11817.0. Samples: 26708776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:24,742][1096160] Avg episode reward: [(0, '4857.476')] [2023-03-10 20:35:25,360][1096443] Updated weights for policy 0, policy_version 52240 (0.0005) [2023-03-10 20:35:28,852][1096443] Updated weights for policy 0, policy_version 52320 (0.0005) [2023-03-10 20:35:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11968.6). Total num frames: 26796032. Throughput: 0: 11859.0. Samples: 26780896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:29,742][1096160] Avg episode reward: [(0, '4839.529')] [2023-03-10 20:35:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000052336_26796032.pth... [2023-03-10 20:35:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000051640_26439680.pth [2023-03-10 20:35:32,448][1096443] Updated weights for policy 0, policy_version 52400 (0.0004) [2023-03-10 20:35:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 26857472. Throughput: 0: 11754.9. Samples: 26849640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:34,742][1096160] Avg episode reward: [(0, '4834.521')] [2023-03-10 20:35:35,730][1096443] Updated weights for policy 0, policy_version 52480 (0.0005) [2023-03-10 20:35:39,140][1096443] Updated weights for policy 0, policy_version 52560 (0.0004) [2023-03-10 20:35:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 26914816. Throughput: 0: 11833.1. Samples: 26886216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:39,742][1096160] Avg episode reward: [(0, '4851.371')] [2023-03-10 20:35:42,659][1096443] Updated weights for policy 0, policy_version 52640 (0.0004) [2023-03-10 20:35:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 26976256. Throughput: 0: 11915.9. Samples: 26959512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:44,742][1096160] Avg episode reward: [(0, '4859.105')] [2023-03-10 20:35:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000052688_26976256.pth... [2023-03-10 20:35:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000051984_26615808.pth [2023-03-10 20:35:45,989][1096443] Updated weights for policy 0, policy_version 52720 (0.0005) [2023-03-10 20:35:49,450][1096443] Updated weights for policy 0, policy_version 52800 (0.0004) [2023-03-10 20:35:49,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 27037696. Throughput: 0: 11898.0. Samples: 27030760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:49,742][1096160] Avg episode reward: [(0, '4856.952')] [2023-03-10 20:35:52,829][1096443] Updated weights for policy 0, policy_version 52880 (0.0005) [2023-03-10 20:35:54,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 27095040. Throughput: 0: 11939.5. Samples: 27066924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:54,742][1096160] Avg episode reward: [(0, '4861.020')] [2023-03-10 20:35:56,184][1096443] Updated weights for policy 0, policy_version 52960 (0.0005) [2023-03-10 20:35:59,447][1096443] Updated weights for policy 0, policy_version 53040 (0.0005) [2023-03-10 20:35:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 27156480. Throughput: 0: 12046.1. Samples: 27140716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:35:59,742][1096160] Avg episode reward: [(0, '4857.241')] [2023-03-10 20:35:59,802][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000053048_27160576.pth... [2023-03-10 20:35:59,803][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000052336_26796032.pth [2023-03-10 20:36:03,138][1096443] Updated weights for policy 0, policy_version 53120 (0.0005) [2023-03-10 20:36:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 27213824. Throughput: 0: 11947.1. Samples: 27210012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:36:04,742][1096160] Avg episode reward: [(0, '4855.008')] [2023-03-10 20:36:06,779][1096443] Updated weights for policy 0, policy_version 53200 (0.0005) [2023-03-10 20:36:09,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 27271168. Throughput: 0: 11861.9. Samples: 27242560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:36:09,742][1096160] Avg episode reward: [(0, '4858.473')] [2023-03-10 20:36:10,315][1096443] Updated weights for policy 0, policy_version 53280 (0.0004) [2023-03-10 20:36:13,822][1096443] Updated weights for policy 0, policy_version 53360 (0.0005) [2023-03-10 20:36:14,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 27328512. Throughput: 0: 11806.7. Samples: 27312200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:36:14,742][1096160] Avg episode reward: [(0, '4856.791')] [2023-03-10 20:36:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000053376_27328512.pth... [2023-03-10 20:36:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000052688_26976256.pth [2023-03-10 20:36:17,182][1096443] Updated weights for policy 0, policy_version 53440 (0.0004) [2023-03-10 20:36:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 27389952. Throughput: 0: 11913.7. Samples: 27385756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:36:19,742][1096160] Avg episode reward: [(0, '4854.077')] [2023-03-10 20:36:20,585][1096443] Updated weights for policy 0, policy_version 53520 (0.0005) [2023-03-10 20:36:24,245][1096443] Updated weights for policy 0, policy_version 53600 (0.0004) [2023-03-10 20:36:24,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 27447296. Throughput: 0: 11872.4. Samples: 27420472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:36:24,742][1096160] Avg episode reward: [(0, '4856.776')] [2023-03-10 20:36:27,466][1096443] Updated weights for policy 0, policy_version 53680 (0.0005) [2023-03-10 20:36:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 27508736. Throughput: 0: 11864.4. Samples: 27493412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:36:29,742][1096160] Avg episode reward: [(0, '4855.285')] [2023-03-10 20:36:29,757][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000053736_27512832.pth... [2023-03-10 20:36:29,759][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000053048_27160576.pth [2023-03-10 20:36:30,822][1096443] Updated weights for policy 0, policy_version 53760 (0.0005) [2023-03-10 20:36:34,341][1096443] Updated weights for policy 0, policy_version 53840 (0.0005) [2023-03-10 20:36:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 27570176. Throughput: 0: 11878.2. Samples: 27565280. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:36:34,742][1096160] Avg episode reward: [(0, '4858.022')] [2023-03-10 20:36:37,826][1096443] Updated weights for policy 0, policy_version 53920 (0.0005) [2023-03-10 20:36:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 27627520. Throughput: 0: 11858.5. Samples: 27600556. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:36:39,742][1096160] Avg episode reward: [(0, '4855.274')] [2023-03-10 20:36:41,215][1096443] Updated weights for policy 0, policy_version 54000 (0.0005) [2023-03-10 20:36:44,718][1096443] Updated weights for policy 0, policy_version 54080 (0.0005) [2023-03-10 20:36:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 27688960. Throughput: 0: 11810.4. Samples: 27672184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:36:44,742][1096160] Avg episode reward: [(0, '4857.431')] [2023-03-10 20:36:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000054080_27688960.pth... [2023-03-10 20:36:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000053376_27328512.pth [2023-03-10 20:36:48,155][1096443] Updated weights for policy 0, policy_version 54160 (0.0004) [2023-03-10 20:36:49,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 27746304. Throughput: 0: 11840.7. Samples: 27742844. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:36:49,752][1096160] Avg episode reward: [(0, '4855.086')] [2023-03-10 20:36:51,332][1096443] Updated weights for policy 0, policy_version 54240 (0.0004) [2023-03-10 20:36:54,611][1096443] Updated weights for policy 0, policy_version 54320 (0.0005) [2023-03-10 20:36:54,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 27811840. Throughput: 0: 11982.9. Samples: 27781792. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:36:54,752][1096160] Avg episode reward: [(0, '4855.669')] [2023-03-10 20:36:57,801][1096443] Updated weights for policy 0, policy_version 54400 (0.0004) [2023-03-10 20:36:59,742][1096160] Fps is (10 sec: 13107.0, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 27877376. Throughput: 0: 12152.2. Samples: 27859048. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:36:59,742][1096160] Avg episode reward: [(0, '4857.351')] [2023-03-10 20:36:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000054448_27877376.pth... [2023-03-10 20:36:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000053736_27512832.pth [2023-03-10 20:37:00,958][1096443] Updated weights for policy 0, policy_version 54480 (0.0005) [2023-03-10 20:37:04,433][1096443] Updated weights for policy 0, policy_version 54560 (0.0005) [2023-03-10 20:37:04,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 27938816. Throughput: 0: 12155.2. Samples: 27932740. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:37:04,742][1096160] Avg episode reward: [(0, '4855.739')] [2023-03-10 20:37:07,954][1096443] Updated weights for policy 0, policy_version 54640 (0.0004) [2023-03-10 20:37:09,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 27992064. Throughput: 0: 12155.7. Samples: 27967480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:09,742][1096160] Avg episode reward: [(0, '4855.726')] [2023-03-10 20:37:11,471][1096443] Updated weights for policy 0, policy_version 54720 (0.0005) [2023-03-10 20:37:14,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 28053504. Throughput: 0: 12097.3. Samples: 28037788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:14,742][1096160] Avg episode reward: [(0, '4862.107')] [2023-03-10 20:37:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000054792_28053504.pth... [2023-03-10 20:37:14,745][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000054080_27688960.pth [2023-03-10 20:37:14,746][1096399] Saving new best policy, reward=4862.107! [2023-03-10 20:37:14,799][1096443] Updated weights for policy 0, policy_version 54800 (0.0005) [2023-03-10 20:37:18,021][1096443] Updated weights for policy 0, policy_version 54880 (0.0004) [2023-03-10 20:37:19,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 28114944. Throughput: 0: 12143.1. Samples: 28111716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:19,742][1096160] Avg episode reward: [(0, '4857.516')] [2023-03-10 20:37:21,566][1096443] Updated weights for policy 0, policy_version 54960 (0.0005) [2023-03-10 20:37:24,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 28176384. Throughput: 0: 12158.2. Samples: 28147676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:24,742][1096160] Avg episode reward: [(0, '4848.920')] [2023-03-10 20:37:24,834][1096443] Updated weights for policy 0, policy_version 55040 (0.0004) [2023-03-10 20:37:28,131][1096443] Updated weights for policy 0, policy_version 55120 (0.0005) [2023-03-10 20:37:29,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12151.5, 300 sec: 11968.6). Total num frames: 28237824. Throughput: 0: 12223.0. Samples: 28222220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:29,742][1096160] Avg episode reward: [(0, '4857.220')] [2023-03-10 20:37:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000055152_28237824.pth... [2023-03-10 20:37:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000054448_27877376.pth [2023-03-10 20:37:31,594][1096443] Updated weights for policy 0, policy_version 55200 (0.0005) [2023-03-10 20:37:34,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11968.7). Total num frames: 28299264. Throughput: 0: 12282.7. Samples: 28295568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:34,742][1096160] Avg episode reward: [(0, '4852.253')] [2023-03-10 20:37:34,809][1096443] Updated weights for policy 0, policy_version 55280 (0.0005) [2023-03-10 20:37:38,193][1096443] Updated weights for policy 0, policy_version 55360 (0.0004) [2023-03-10 20:37:39,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 11968.7). Total num frames: 28360704. Throughput: 0: 12230.6. Samples: 28332168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:39,742][1096160] Avg episode reward: [(0, '4858.665')] [2023-03-10 20:37:41,396][1096443] Updated weights for policy 0, policy_version 55440 (0.0005) [2023-03-10 20:37:44,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 11968.6). Total num frames: 28422144. Throughput: 0: 12203.5. Samples: 28408204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:44,742][1096160] Avg episode reward: [(0, '4856.206')] [2023-03-10 20:37:44,754][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000055520_28426240.pth... [2023-03-10 20:37:44,755][1096443] Updated weights for policy 0, policy_version 55520 (0.0005) [2023-03-10 20:37:44,756][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000054792_28053504.pth [2023-03-10 20:37:48,299][1096443] Updated weights for policy 0, policy_version 55600 (0.0004) [2023-03-10 20:37:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 11982.5). Total num frames: 28483584. Throughput: 0: 12146.8. Samples: 28479344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:49,742][1096160] Avg episode reward: [(0, '4857.768')] [2023-03-10 20:37:51,494][1096443] Updated weights for policy 0, policy_version 55680 (0.0005) [2023-03-10 20:37:54,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 12219.7, 300 sec: 11982.5). Total num frames: 28545024. Throughput: 0: 12210.0. Samples: 28516928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:54,742][1096160] Avg episode reward: [(0, '4857.438')] [2023-03-10 20:37:54,813][1096443] Updated weights for policy 0, policy_version 55760 (0.0005) [2023-03-10 20:37:58,350][1096443] Updated weights for policy 0, policy_version 55840 (0.0004) [2023-03-10 20:37:59,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 28602368. Throughput: 0: 12259.9. Samples: 28589484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:37:59,742][1096160] Avg episode reward: [(0, '4856.395')] [2023-03-10 20:37:59,755][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000055872_28606464.pth... [2023-03-10 20:37:59,757][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000055152_28237824.pth [2023-03-10 20:38:01,660][1096443] Updated weights for policy 0, policy_version 55920 (0.0005) [2023-03-10 20:38:04,742][1096160] Fps is (10 sec: 12287.7, 60 sec: 12151.4, 300 sec: 11982.5). Total num frames: 28667904. Throughput: 0: 12244.2. Samples: 28662708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:04,742][1096160] Avg episode reward: [(0, '4854.810')] [2023-03-10 20:38:04,970][1096443] Updated weights for policy 0, policy_version 56000 (0.0004) [2023-03-10 20:38:08,303][1096443] Updated weights for policy 0, policy_version 56080 (0.0005) [2023-03-10 20:38:09,741][1096160] Fps is (10 sec: 12697.6, 60 sec: 12288.0, 300 sec: 11996.4). Total num frames: 28729344. Throughput: 0: 12236.4. Samples: 28698312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:09,742][1096160] Avg episode reward: [(0, '4856.843')] [2023-03-10 20:38:11,566][1096443] Updated weights for policy 0, policy_version 56160 (0.0005) [2023-03-10 20:38:14,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 11996.4). Total num frames: 28786688. Throughput: 0: 12261.0. Samples: 28773964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:14,742][1096160] Avg episode reward: [(0, '4858.002')] [2023-03-10 20:38:14,763][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000056232_28790784.pth... [2023-03-10 20:38:14,764][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000055520_28426240.pth [2023-03-10 20:38:15,109][1096443] Updated weights for policy 0, policy_version 56240 (0.0005) [2023-03-10 20:38:18,416][1096443] Updated weights for policy 0, policy_version 56320 (0.0004) [2023-03-10 20:38:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12010.3). Total num frames: 28852224. Throughput: 0: 12263.0. Samples: 28847404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:19,742][1096160] Avg episode reward: [(0, '4858.653')] [2023-03-10 20:38:21,640][1096443] Updated weights for policy 0, policy_version 56400 (0.0005) [2023-03-10 20:38:24,742][1096160] Fps is (10 sec: 12697.7, 60 sec: 12288.0, 300 sec: 12024.2). Total num frames: 28913664. Throughput: 0: 12277.3. Samples: 28884644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:24,742][1096160] Avg episode reward: [(0, '4857.007')] [2023-03-10 20:38:24,964][1096443] Updated weights for policy 0, policy_version 56480 (0.0005) [2023-03-10 20:38:28,435][1096443] Updated weights for policy 0, policy_version 56560 (0.0005) [2023-03-10 20:38:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12010.3). Total num frames: 28971008. Throughput: 0: 12170.1. Samples: 28955860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:29,742][1096160] Avg episode reward: [(0, '4852.774')] [2023-03-10 20:38:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000056584_28971008.pth... [2023-03-10 20:38:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000055872_28606464.pth [2023-03-10 20:38:31,984][1096443] Updated weights for policy 0, policy_version 56640 (0.0005) [2023-03-10 20:38:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 12038.1). Total num frames: 29032448. Throughput: 0: 12198.2. Samples: 29028264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:34,742][1096160] Avg episode reward: [(0, '4857.466')] [2023-03-10 20:38:35,320][1096443] Updated weights for policy 0, policy_version 56720 (0.0005) [2023-03-10 20:38:38,585][1096443] Updated weights for policy 0, policy_version 56800 (0.0005) [2023-03-10 20:38:39,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12038.1). Total num frames: 29093888. Throughput: 0: 12173.3. Samples: 29064728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:39,753][1096160] Avg episode reward: [(0, '4857.735')] [2023-03-10 20:38:42,071][1096443] Updated weights for policy 0, policy_version 56880 (0.0005) [2023-03-10 20:38:44,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 29151232. Throughput: 0: 12136.8. Samples: 29135640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:44,755][1096160] Avg episode reward: [(0, '4851.211')] [2023-03-10 20:38:44,758][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000056936_29151232.pth... [2023-03-10 20:38:44,760][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000056232_28790784.pth [2023-03-10 20:38:45,679][1096443] Updated weights for policy 0, policy_version 56960 (0.0004) [2023-03-10 20:38:48,946][1096443] Updated weights for policy 0, policy_version 57040 (0.0004) [2023-03-10 20:38:49,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 29212672. Throughput: 0: 12128.7. Samples: 29208496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:49,752][1096160] Avg episode reward: [(0, '4857.905')] [2023-03-10 20:38:52,162][1096443] Updated weights for policy 0, policy_version 57120 (0.0005) [2023-03-10 20:38:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 29274112. Throughput: 0: 12164.6. Samples: 29245720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:54,742][1096160] Avg episode reward: [(0, '4857.358')] [2023-03-10 20:38:55,477][1096443] Updated weights for policy 0, policy_version 57200 (0.0005) [2023-03-10 20:38:58,949][1096443] Updated weights for policy 0, policy_version 57280 (0.0004) [2023-03-10 20:38:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 29335552. Throughput: 0: 12114.0. Samples: 29319096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:38:59,742][1096160] Avg episode reward: [(0, '4860.005')] [2023-03-10 20:38:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000057296_29335552.pth... [2023-03-10 20:38:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000056584_28971008.pth [2023-03-10 20:39:02,107][1096443] Updated weights for policy 0, policy_version 57360 (0.0004) [2023-03-10 20:39:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 29396992. Throughput: 0: 12124.5. Samples: 29393008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:04,743][1096160] Avg episode reward: [(0, '4860.173')] [2023-03-10 20:39:05,551][1096443] Updated weights for policy 0, policy_version 57440 (0.0005) [2023-03-10 20:39:08,965][1096443] Updated weights for policy 0, policy_version 57520 (0.0005) [2023-03-10 20:39:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.4, 300 sec: 12038.1). Total num frames: 29458432. Throughput: 0: 12099.9. Samples: 29429140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:09,753][1096160] Avg episode reward: [(0, '4860.316')] [2023-03-10 20:39:12,206][1096443] Updated weights for policy 0, policy_version 57600 (0.0005) [2023-03-10 20:39:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12038.1). Total num frames: 29519872. Throughput: 0: 12170.9. Samples: 29503552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:14,742][1096160] Avg episode reward: [(0, '4859.388')] [2023-03-10 20:39:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000057656_29519872.pth... [2023-03-10 20:39:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000056936_29151232.pth [2023-03-10 20:39:15,626][1096443] Updated weights for policy 0, policy_version 57680 (0.0005) [2023-03-10 20:39:19,112][1096443] Updated weights for policy 0, policy_version 57760 (0.0005) [2023-03-10 20:39:19,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 29577216. Throughput: 0: 12135.7. Samples: 29574368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:19,742][1096160] Avg episode reward: [(0, '4860.430')] [2023-03-10 20:39:22,483][1096443] Updated weights for policy 0, policy_version 57840 (0.0005) [2023-03-10 20:39:24,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 29638656. Throughput: 0: 12118.2. Samples: 29610048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:24,742][1096160] Avg episode reward: [(0, '4859.412')] [2023-03-10 20:39:25,915][1096443] Updated weights for policy 0, policy_version 57920 (0.0005) [2023-03-10 20:39:29,084][1096443] Updated weights for policy 0, policy_version 58000 (0.0005) [2023-03-10 20:39:29,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 29704192. Throughput: 0: 12180.8. Samples: 29683776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:29,742][1096160] Avg episode reward: [(0, '4861.260')] [2023-03-10 20:39:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000058016_29704192.pth... [2023-03-10 20:39:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000057296_29335552.pth [2023-03-10 20:39:32,433][1096443] Updated weights for policy 0, policy_version 58080 (0.0005) [2023-03-10 20:39:34,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 29761536. Throughput: 0: 12212.7. Samples: 29758068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:34,742][1096160] Avg episode reward: [(0, '4862.774')] [2023-03-10 20:39:34,742][1096399] Saving new best policy, reward=4862.774! [2023-03-10 20:39:35,801][1096443] Updated weights for policy 0, policy_version 58160 (0.0005) [2023-03-10 20:39:39,345][1096443] Updated weights for policy 0, policy_version 58240 (0.0004) [2023-03-10 20:39:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 29822976. Throughput: 0: 12198.5. Samples: 29794652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:39,753][1096160] Avg episode reward: [(0, '4857.235')] [2023-03-10 20:39:42,658][1096443] Updated weights for policy 0, policy_version 58320 (0.0005) [2023-03-10 20:39:44,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 29884416. Throughput: 0: 12178.6. Samples: 29867132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:44,756][1096160] Avg episode reward: [(0, '4857.632')] [2023-03-10 20:39:44,759][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000058368_29884416.pth... [2023-03-10 20:39:44,762][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000057656_29519872.pth [2023-03-10 20:39:46,032][1096443] Updated weights for policy 0, policy_version 58400 (0.0006) [2023-03-10 20:39:49,399][1096443] Updated weights for policy 0, policy_version 58480 (0.0005) [2023-03-10 20:39:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 29945856. Throughput: 0: 12132.1. Samples: 29938952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:49,742][1096160] Avg episode reward: [(0, '4856.145')] [2023-03-10 20:39:52,661][1096443] Updated weights for policy 0, policy_version 58560 (0.0005) [2023-03-10 20:39:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 30003200. Throughput: 0: 12186.0. Samples: 29977508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:54,742][1096160] Avg episode reward: [(0, '4857.944')] [2023-03-10 20:39:55,992][1096443] Updated weights for policy 0, policy_version 58640 (0.0005) [2023-03-10 20:39:59,454][1096443] Updated weights for policy 0, policy_version 58720 (0.0005) [2023-03-10 20:39:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 30064640. Throughput: 0: 12156.3. Samples: 30050584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:39:59,742][1096160] Avg episode reward: [(0, '4856.055')] [2023-03-10 20:39:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000058720_30064640.pth... [2023-03-10 20:39:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000058016_29704192.pth [2023-03-10 20:40:03,125][1096443] Updated weights for policy 0, policy_version 58800 (0.0006) [2023-03-10 20:40:04,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 30126080. Throughput: 0: 12085.6. Samples: 30118220. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:40:04,742][1096160] Avg episode reward: [(0, '4854.309')] [2023-03-10 20:40:06,319][1096443] Updated weights for policy 0, policy_version 58880 (0.0005) [2023-03-10 20:40:09,634][1096443] Updated weights for policy 0, policy_version 58960 (0.0005) [2023-03-10 20:40:09,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 30187520. Throughput: 0: 12173.7. Samples: 30157864. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:40:09,742][1096160] Avg episode reward: [(0, '4861.796')] [2023-03-10 20:40:12,774][1096443] Updated weights for policy 0, policy_version 59040 (0.0005) [2023-03-10 20:40:14,742][1096160] Fps is (10 sec: 12697.4, 60 sec: 12219.7, 300 sec: 12121.4). Total num frames: 30253056. Throughput: 0: 12248.8. Samples: 30234972. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:40:14,742][1096160] Avg episode reward: [(0, '4857.946')] [2023-03-10 20:40:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000059088_30253056.pth... [2023-03-10 20:40:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000058368_29884416.pth [2023-03-10 20:40:15,873][1096443] Updated weights for policy 0, policy_version 59120 (0.0005) [2023-03-10 20:40:19,322][1096443] Updated weights for policy 0, policy_version 59200 (0.0005) [2023-03-10 20:40:19,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12288.0, 300 sec: 12121.4). Total num frames: 30314496. Throughput: 0: 12263.4. Samples: 30309920. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:40:19,742][1096160] Avg episode reward: [(0, '4855.004')] [2023-03-10 20:40:22,793][1096443] Updated weights for policy 0, policy_version 59280 (0.0005) [2023-03-10 20:40:24,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12219.8, 300 sec: 12121.4). Total num frames: 30371840. Throughput: 0: 12197.5. Samples: 30343536. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:40:24,742][1096160] Avg episode reward: [(0, '4856.649')] [2023-03-10 20:40:26,228][1096443] Updated weights for policy 0, policy_version 59360 (0.0005) [2023-03-10 20:40:29,663][1096443] Updated weights for policy 0, policy_version 59440 (0.0005) [2023-03-10 20:40:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 30433280. Throughput: 0: 12216.7. Samples: 30416884. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:40:29,742][1096160] Avg episode reward: [(0, '4859.299')] [2023-03-10 20:40:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000059440_30433280.pth... [2023-03-10 20:40:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000058720_30064640.pth [2023-03-10 20:40:33,028][1096443] Updated weights for policy 0, policy_version 59520 (0.0005) [2023-03-10 20:40:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 30490624. Throughput: 0: 12210.0. Samples: 30488404. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:40:34,742][1096160] Avg episode reward: [(0, '4857.668')] [2023-03-10 20:40:36,545][1096443] Updated weights for policy 0, policy_version 59600 (0.0005) [2023-03-10 20:40:39,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 30547968. Throughput: 0: 12124.0. Samples: 30523088. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:40:39,742][1096160] Avg episode reward: [(0, '4858.085')] [2023-03-10 20:40:40,187][1096443] Updated weights for policy 0, policy_version 59680 (0.0004) [2023-03-10 20:40:43,589][1096443] Updated weights for policy 0, policy_version 59760 (0.0005) [2023-03-10 20:40:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 30609408. Throughput: 0: 12055.7. Samples: 30593088. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:40:44,742][1096160] Avg episode reward: [(0, '4856.458')] [2023-03-10 20:40:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000059784_30609408.pth... [2023-03-10 20:40:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000059088_30253056.pth [2023-03-10 20:40:47,120][1096443] Updated weights for policy 0, policy_version 59840 (0.0005) [2023-03-10 20:40:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 30666752. Throughput: 0: 12112.9. Samples: 30663300. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:40:49,742][1096160] Avg episode reward: [(0, '4857.588')] [2023-03-10 20:40:50,443][1096443] Updated weights for policy 0, policy_version 59920 (0.0005) [2023-03-10 20:40:53,793][1096443] Updated weights for policy 0, policy_version 60000 (0.0005) [2023-03-10 20:40:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 30728192. Throughput: 0: 12050.5. Samples: 30700136. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:40:54,742][1096160] Avg episode reward: [(0, '4860.147')] [2023-03-10 20:40:57,328][1096443] Updated weights for policy 0, policy_version 60080 (0.0006) [2023-03-10 20:40:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 30789632. Throughput: 0: 11943.5. Samples: 30772428. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:40:59,742][1096160] Avg episode reward: [(0, '4860.005')] [2023-03-10 20:40:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000060136_30789632.pth... [2023-03-10 20:40:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000059440_30433280.pth [2023-03-10 20:41:00,704][1096443] Updated weights for policy 0, policy_version 60160 (0.0005) [2023-03-10 20:41:04,016][1096443] Updated weights for policy 0, policy_version 60240 (0.0005) [2023-03-10 20:41:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12135.3). Total num frames: 30851072. Throughput: 0: 11908.4. Samples: 30845796. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:41:04,742][1096160] Avg episode reward: [(0, '4859.636')] [2023-03-10 20:41:07,378][1096443] Updated weights for policy 0, policy_version 60320 (0.0005) [2023-03-10 20:41:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12135.3). Total num frames: 30908416. Throughput: 0: 11954.9. Samples: 30881508. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:41:09,742][1096160] Avg episode reward: [(0, '4860.271')] [2023-03-10 20:41:10,855][1096443] Updated weights for policy 0, policy_version 60400 (0.0005) [2023-03-10 20:41:14,249][1096443] Updated weights for policy 0, policy_version 60480 (0.0006) [2023-03-10 20:41:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12135.3). Total num frames: 30969856. Throughput: 0: 11925.8. Samples: 30953544. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 20:41:14,742][1096160] Avg episode reward: [(0, '4861.865')] [2023-03-10 20:41:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000060488_30969856.pth... [2023-03-10 20:41:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000059784_30609408.pth [2023-03-10 20:41:17,661][1096443] Updated weights for policy 0, policy_version 60560 (0.0005) [2023-03-10 20:41:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12149.2). Total num frames: 31031296. Throughput: 0: 11955.5. Samples: 31026400. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:41:19,742][1096160] Avg episode reward: [(0, '4861.972')] [2023-03-10 20:41:20,895][1096443] Updated weights for policy 0, policy_version 60640 (0.0005) [2023-03-10 20:41:24,284][1096443] Updated weights for policy 0, policy_version 60720 (0.0005) [2023-03-10 20:41:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12149.2). Total num frames: 31092736. Throughput: 0: 12014.2. Samples: 31063728. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:41:24,742][1096160] Avg episode reward: [(0, '4861.874')] [2023-03-10 20:41:27,700][1096443] Updated weights for policy 0, policy_version 60800 (0.0005) [2023-03-10 20:41:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12135.3). Total num frames: 31150080. Throughput: 0: 12050.1. Samples: 31135344. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:41:29,742][1096160] Avg episode reward: [(0, '4862.053')] [2023-03-10 20:41:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000060840_31150080.pth... [2023-03-10 20:41:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000060136_30789632.pth [2023-03-10 20:41:31,143][1096443] Updated weights for policy 0, policy_version 60880 (0.0005) [2023-03-10 20:41:34,579][1096443] Updated weights for policy 0, policy_version 60960 (0.0004) [2023-03-10 20:41:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12149.2). Total num frames: 31211520. Throughput: 0: 12093.1. Samples: 31207488. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:41:34,742][1096160] Avg episode reward: [(0, '4860.347')] [2023-03-10 20:41:38,051][1096443] Updated weights for policy 0, policy_version 61040 (0.0004) [2023-03-10 20:41:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12149.2). Total num frames: 31272960. Throughput: 0: 12078.2. Samples: 31243656. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:41:39,742][1096160] Avg episode reward: [(0, '4857.677')] [2023-03-10 20:41:41,265][1096443] Updated weights for policy 0, policy_version 61120 (0.0005) [2023-03-10 20:41:44,594][1096443] Updated weights for policy 0, policy_version 61200 (0.0005) [2023-03-10 20:41:44,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12163.0). Total num frames: 31334400. Throughput: 0: 12124.2. Samples: 31318016. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:41:44,742][1096160] Avg episode reward: [(0, '4861.661')] [2023-03-10 20:41:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000061200_31334400.pth... [2023-03-10 20:41:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000060488_30969856.pth [2023-03-10 20:41:47,894][1096443] Updated weights for policy 0, policy_version 61280 (0.0005) [2023-03-10 20:41:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12149.1). Total num frames: 31395840. Throughput: 0: 12132.4. Samples: 31391756. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:41:49,742][1096160] Avg episode reward: [(0, '4863.754')] [2023-03-10 20:41:49,743][1096399] Saving new best policy, reward=4863.754! [2023-03-10 20:41:51,169][1096443] Updated weights for policy 0, policy_version 61360 (0.0005) [2023-03-10 20:41:54,477][1096443] Updated weights for policy 0, policy_version 61440 (0.0004) [2023-03-10 20:41:54,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 12151.5, 300 sec: 12135.3). Total num frames: 31457280. Throughput: 0: 12162.6. Samples: 31428824. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 20:41:54,742][1096160] Avg episode reward: [(0, '4863.777')] [2023-03-10 20:41:54,742][1096399] Saving new best policy, reward=4863.777! [2023-03-10 20:41:57,856][1096443] Updated weights for policy 0, policy_version 61520 (0.0005) [2023-03-10 20:41:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12135.3). Total num frames: 31518720. Throughput: 0: 12197.0. Samples: 31502408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:41:59,742][1096160] Avg episode reward: [(0, '4860.493')] [2023-03-10 20:41:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000061560_31518720.pth... [2023-03-10 20:41:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000060840_31150080.pth [2023-03-10 20:42:01,467][1096443] Updated weights for policy 0, policy_version 61600 (0.0005) [2023-03-10 20:42:04,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12149.2). Total num frames: 31576064. Throughput: 0: 12125.2. Samples: 31572032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:42:04,752][1096160] Avg episode reward: [(0, '4860.761')] [2023-03-10 20:42:04,829][1096443] Updated weights for policy 0, policy_version 61680 (0.0004) [2023-03-10 20:42:08,388][1096443] Updated weights for policy 0, policy_version 61760 (0.0005) [2023-03-10 20:42:09,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 12135.3). Total num frames: 31633408. Throughput: 0: 12110.0. Samples: 31608676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:42:09,753][1096160] Avg episode reward: [(0, '4861.393')] [2023-03-10 20:42:11,912][1096443] Updated weights for policy 0, policy_version 61840 (0.0005) [2023-03-10 20:42:14,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 12083.2, 300 sec: 12135.3). Total num frames: 31694848. Throughput: 0: 12053.1. Samples: 31677736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:42:14,742][1096160] Avg episode reward: [(0, '4859.123')] [2023-03-10 20:42:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000061904_31694848.pth... [2023-03-10 20:42:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000061200_31334400.pth [2023-03-10 20:42:15,354][1096443] Updated weights for policy 0, policy_version 61920 (0.0005) [2023-03-10 20:42:18,736][1096443] Updated weights for policy 0, policy_version 62000 (0.0005) [2023-03-10 20:42:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 31752192. Throughput: 0: 12035.2. Samples: 31749072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:42:19,742][1096160] Avg episode reward: [(0, '4862.702')] [2023-03-10 20:42:22,007][1096443] Updated weights for policy 0, policy_version 62080 (0.0005) [2023-03-10 20:42:24,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12135.3). Total num frames: 31817728. Throughput: 0: 12105.6. Samples: 31788408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:42:24,742][1096160] Avg episode reward: [(0, '4859.564')] [2023-03-10 20:42:25,235][1096443] Updated weights for policy 0, policy_version 62160 (0.0004) [2023-03-10 20:42:28,623][1096443] Updated weights for policy 0, policy_version 62240 (0.0004) [2023-03-10 20:42:29,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12151.5, 300 sec: 12135.3). Total num frames: 31879168. Throughput: 0: 12106.0. Samples: 31862784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:42:29,742][1096160] Avg episode reward: [(0, '4859.405')] [2023-03-10 20:42:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000062264_31879168.pth... [2023-03-10 20:42:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000061560_31518720.pth [2023-03-10 20:42:31,657][1096443] Updated weights for policy 0, policy_version 62320 (0.0004) [2023-03-10 20:42:34,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12149.2). Total num frames: 31944704. Throughput: 0: 12169.8. Samples: 31939396. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:42:34,742][1096160] Avg episode reward: [(0, '4857.945')] [2023-03-10 20:42:34,939][1096443] Updated weights for policy 0, policy_version 62400 (0.0005) [2023-03-10 20:42:38,286][1096443] Updated weights for policy 0, policy_version 62480 (0.0005) [2023-03-10 20:42:39,741][1096160] Fps is (10 sec: 12697.8, 60 sec: 12219.7, 300 sec: 12149.2). Total num frames: 32006144. Throughput: 0: 12160.3. Samples: 31976036. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:42:39,742][1096160] Avg episode reward: [(0, '4857.125')] [2023-03-10 20:42:41,658][1096443] Updated weights for policy 0, policy_version 62560 (0.0005) [2023-03-10 20:42:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12135.3). Total num frames: 32063488. Throughput: 0: 12162.4. Samples: 32049716. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:42:44,742][1096160] Avg episode reward: [(0, '4854.584')] [2023-03-10 20:42:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000062624_32063488.pth... [2023-03-10 20:42:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000061904_31694848.pth [2023-03-10 20:42:45,118][1096443] Updated weights for policy 0, policy_version 62640 (0.0005) [2023-03-10 20:42:48,424][1096443] Updated weights for policy 0, policy_version 62720 (0.0005) [2023-03-10 20:42:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12149.1). Total num frames: 32129024. Throughput: 0: 12256.0. Samples: 32123552. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:42:49,742][1096160] Avg episode reward: [(0, '4858.710')] [2023-03-10 20:42:51,756][1096443] Updated weights for policy 0, policy_version 62800 (0.0006) [2023-03-10 20:42:54,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12149.2). Total num frames: 32186368. Throughput: 0: 12236.4. Samples: 32159316. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:42:54,742][1096160] Avg episode reward: [(0, '4859.097')] [2023-03-10 20:42:55,198][1096443] Updated weights for policy 0, policy_version 62880 (0.0005) [2023-03-10 20:42:58,821][1096443] Updated weights for policy 0, policy_version 62960 (0.0005) [2023-03-10 20:42:59,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 32243712. Throughput: 0: 12270.0. Samples: 32229884. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:42:59,742][1096160] Avg episode reward: [(0, '4862.200')] [2023-03-10 20:42:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000062976_32243712.pth... [2023-03-10 20:42:59,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000062264_31879168.pth [2023-03-10 20:43:02,130][1096443] Updated weights for policy 0, policy_version 63040 (0.0005) [2023-03-10 20:43:04,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12121.4). Total num frames: 32305152. Throughput: 0: 12267.9. Samples: 32301128. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:04,752][1096160] Avg episode reward: [(0, '4857.758')] [2023-03-10 20:43:05,462][1096443] Updated weights for policy 0, policy_version 63120 (0.0005) [2023-03-10 20:43:08,858][1096443] Updated weights for policy 0, policy_version 63200 (0.0005) [2023-03-10 20:43:09,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 32366592. Throughput: 0: 12240.6. Samples: 32339236. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:09,753][1096160] Avg episode reward: [(0, '4855.107')] [2023-03-10 20:43:12,459][1096443] Updated weights for policy 0, policy_version 63280 (0.0005) [2023-03-10 20:43:14,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 32423936. Throughput: 0: 12107.5. Samples: 32407624. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:14,742][1096160] Avg episode reward: [(0, '4860.206')] [2023-03-10 20:43:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000063328_32423936.pth... [2023-03-10 20:43:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000062624_32063488.pth [2023-03-10 20:43:15,929][1096443] Updated weights for policy 0, policy_version 63360 (0.0005) [2023-03-10 20:43:19,327][1096443] Updated weights for policy 0, policy_version 63440 (0.0004) [2023-03-10 20:43:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 32485376. Throughput: 0: 12017.7. Samples: 32480192. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:19,742][1096160] Avg episode reward: [(0, '4857.099')] [2023-03-10 20:43:22,540][1096443] Updated weights for policy 0, policy_version 63520 (0.0004) [2023-03-10 20:43:24,742][1096160] Fps is (10 sec: 12697.8, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 32550912. Throughput: 0: 12046.8. Samples: 32518144. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:24,742][1096160] Avg episode reward: [(0, '4859.325')] [2023-03-10 20:43:25,681][1096443] Updated weights for policy 0, policy_version 63600 (0.0005) [2023-03-10 20:43:29,088][1096443] Updated weights for policy 0, policy_version 63680 (0.0005) [2023-03-10 20:43:29,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 12135.3). Total num frames: 32612352. Throughput: 0: 12088.3. Samples: 32593688. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:29,742][1096160] Avg episode reward: [(0, '4857.283')] [2023-03-10 20:43:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000063696_32612352.pth... [2023-03-10 20:43:29,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000062976_32243712.pth [2023-03-10 20:43:32,449][1096443] Updated weights for policy 0, policy_version 63760 (0.0005) [2023-03-10 20:43:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 32669696. Throughput: 0: 12045.9. Samples: 32665616. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:34,742][1096160] Avg episode reward: [(0, '4860.534')] [2023-03-10 20:43:36,024][1096443] Updated weights for policy 0, policy_version 63840 (0.0004) [2023-03-10 20:43:39,271][1096443] Updated weights for policy 0, policy_version 63920 (0.0004) [2023-03-10 20:43:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12135.3). Total num frames: 32731136. Throughput: 0: 12044.1. Samples: 32701300. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:39,742][1096160] Avg episode reward: [(0, '4858.528')] [2023-03-10 20:43:42,690][1096443] Updated weights for policy 0, policy_version 64000 (0.0004) [2023-03-10 20:43:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 32788480. Throughput: 0: 12109.2. Samples: 32774800. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:44,754][1096160] Avg episode reward: [(0, '4859.323')] [2023-03-10 20:43:44,765][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000064048_32792576.pth... [2023-03-10 20:43:44,767][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000063328_32423936.pth [2023-03-10 20:43:46,105][1096443] Updated weights for policy 0, policy_version 64080 (0.0005) [2023-03-10 20:43:49,572][1096443] Updated weights for policy 0, policy_version 64160 (0.0005) [2023-03-10 20:43:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12121.4). Total num frames: 32849920. Throughput: 0: 12104.3. Samples: 32845824. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 20:43:49,753][1096160] Avg episode reward: [(0, '4861.499')] [2023-03-10 20:43:53,047][1096443] Updated weights for policy 0, policy_version 64240 (0.0005) [2023-03-10 20:43:54,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 32907264. Throughput: 0: 12062.3. Samples: 32882036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:43:54,742][1096160] Avg episode reward: [(0, '4861.727')] [2023-03-10 20:43:56,551][1096443] Updated weights for policy 0, policy_version 64320 (0.0004) [2023-03-10 20:43:59,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 32964608. Throughput: 0: 12068.8. Samples: 32950716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:43:59,742][1096160] Avg episode reward: [(0, '4858.257')] [2023-03-10 20:43:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000064384_32964608.pth... [2023-03-10 20:43:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000063696_32612352.pth [2023-03-10 20:44:00,151][1096443] Updated weights for policy 0, policy_version 64400 (0.0005) [2023-03-10 20:44:03,703][1096443] Updated weights for policy 0, policy_version 64480 (0.0005) [2023-03-10 20:44:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 33026048. Throughput: 0: 11992.8. Samples: 33019868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:04,742][1096160] Avg episode reward: [(0, '4854.719')] [2023-03-10 20:44:07,105][1096443] Updated weights for policy 0, policy_version 64560 (0.0005) [2023-03-10 20:44:09,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 33083392. Throughput: 0: 11950.5. Samples: 33055916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:09,742][1096160] Avg episode reward: [(0, '4856.295')] [2023-03-10 20:44:10,775][1096443] Updated weights for policy 0, policy_version 64640 (0.0005) [2023-03-10 20:44:14,342][1096443] Updated weights for policy 0, policy_version 64720 (0.0004) [2023-03-10 20:44:14,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 33140736. Throughput: 0: 11794.0. Samples: 33124416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:14,742][1096160] Avg episode reward: [(0, '4857.073')] [2023-03-10 20:44:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000064728_33140736.pth... [2023-03-10 20:44:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000064048_32792576.pth [2023-03-10 20:44:17,918][1096443] Updated weights for policy 0, policy_version 64800 (0.0005) [2023-03-10 20:44:19,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 33198080. Throughput: 0: 11737.7. Samples: 33193812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:19,742][1096160] Avg episode reward: [(0, '4853.467')] [2023-03-10 20:44:21,375][1096443] Updated weights for policy 0, policy_version 64880 (0.0004) [2023-03-10 20:44:24,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11741.9, 300 sec: 12038.1). Total num frames: 33255424. Throughput: 0: 11719.3. Samples: 33228668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:24,742][1096160] Avg episode reward: [(0, '4854.480')] [2023-03-10 20:44:24,873][1096443] Updated weights for policy 0, policy_version 64960 (0.0004) [2023-03-10 20:44:28,201][1096443] Updated weights for policy 0, policy_version 65040 (0.0005) [2023-03-10 20:44:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 12052.0). Total num frames: 33316864. Throughput: 0: 11683.2. Samples: 33300544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:29,742][1096160] Avg episode reward: [(0, '4852.457')] [2023-03-10 20:44:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000065072_33316864.pth... [2023-03-10 20:44:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000064384_32964608.pth [2023-03-10 20:44:31,537][1096443] Updated weights for policy 0, policy_version 65120 (0.0005) [2023-03-10 20:44:34,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11810.1, 300 sec: 12052.0). Total num frames: 33378304. Throughput: 0: 11711.2. Samples: 33372828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:34,742][1096160] Avg episode reward: [(0, '4852.014')] [2023-03-10 20:44:35,054][1096443] Updated weights for policy 0, policy_version 65200 (0.0005) [2023-03-10 20:44:38,536][1096443] Updated weights for policy 0, policy_version 65280 (0.0005) [2023-03-10 20:44:39,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 12038.1). Total num frames: 33435648. Throughput: 0: 11679.5. Samples: 33407612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:39,742][1096160] Avg episode reward: [(0, '4856.903')] [2023-03-10 20:44:41,869][1096443] Updated weights for policy 0, policy_version 65360 (0.0005) [2023-03-10 20:44:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 12038.1). Total num frames: 33497088. Throughput: 0: 11778.9. Samples: 33480768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:44,746][1096160] Avg episode reward: [(0, '4842.084')] [2023-03-10 20:44:44,781][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000065432_33501184.pth... [2023-03-10 20:44:44,783][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000064728_33140736.pth [2023-03-10 20:44:45,113][1096443] Updated weights for policy 0, policy_version 65440 (0.0005) [2023-03-10 20:44:48,539][1096443] Updated weights for policy 0, policy_version 65520 (0.0005) [2023-03-10 20:44:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 12052.0). Total num frames: 33558528. Throughput: 0: 11878.7. Samples: 33554408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:49,742][1096160] Avg episode reward: [(0, '4850.360')] [2023-03-10 20:44:51,897][1096443] Updated weights for policy 0, policy_version 65600 (0.0004) [2023-03-10 20:44:54,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 12038.1). Total num frames: 33615872. Throughput: 0: 11894.6. Samples: 33591172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:54,742][1096160] Avg episode reward: [(0, '4852.154')] [2023-03-10 20:44:55,509][1096443] Updated weights for policy 0, policy_version 65680 (0.0005) [2023-03-10 20:44:59,124][1096443] Updated weights for policy 0, policy_version 65760 (0.0005) [2023-03-10 20:44:59,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 12024.2). Total num frames: 33673216. Throughput: 0: 11884.6. Samples: 33659220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:44:59,742][1096160] Avg episode reward: [(0, '4849.608')] [2023-03-10 20:44:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000065768_33673216.pth... [2023-03-10 20:44:59,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000065072_33316864.pth [2023-03-10 20:45:02,415][1096443] Updated weights for policy 0, policy_version 65840 (0.0005) [2023-03-10 20:45:04,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 33738752. Throughput: 0: 11998.7. Samples: 33733752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:45:04,742][1096160] Avg episode reward: [(0, '4851.625')] [2023-03-10 20:45:05,740][1096443] Updated weights for policy 0, policy_version 65920 (0.0005) [2023-03-10 20:45:09,199][1096443] Updated weights for policy 0, policy_version 66000 (0.0005) [2023-03-10 20:45:09,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 33796096. Throughput: 0: 12018.5. Samples: 33769500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:45:09,742][1096160] Avg episode reward: [(0, '4847.246')] [2023-03-10 20:45:12,738][1096443] Updated weights for policy 0, policy_version 66080 (0.0005) [2023-03-10 20:45:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 33857536. Throughput: 0: 11943.1. Samples: 33837984. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:45:14,742][1096160] Avg episode reward: [(0, '4852.980')] [2023-03-10 20:45:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000066128_33857536.pth... [2023-03-10 20:45:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000065432_33501184.pth [2023-03-10 20:45:16,086][1096443] Updated weights for policy 0, policy_version 66160 (0.0005) [2023-03-10 20:45:19,593][1096443] Updated weights for policy 0, policy_version 66240 (0.0004) [2023-03-10 20:45:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 33914880. Throughput: 0: 11949.3. Samples: 33910548. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:45:19,742][1096160] Avg episode reward: [(0, '4855.554')] [2023-03-10 20:45:23,103][1096443] Updated weights for policy 0, policy_version 66320 (0.0005) [2023-03-10 20:45:24,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 33972224. Throughput: 0: 11950.8. Samples: 33945396. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:45:24,742][1096160] Avg episode reward: [(0, '4858.496')] [2023-03-10 20:45:26,485][1096443] Updated weights for policy 0, policy_version 66400 (0.0004) [2023-03-10 20:45:29,467][1096443] Updated weights for policy 0, policy_version 66480 (0.0004) [2023-03-10 20:45:29,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 34037760. Throughput: 0: 11964.1. Samples: 34019152. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:45:29,742][1096160] Avg episode reward: [(0, '4854.811')] [2023-03-10 20:45:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000066480_34037760.pth... [2023-03-10 20:45:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000065768_33673216.pth [2023-03-10 20:45:32,815][1096443] Updated weights for policy 0, policy_version 66560 (0.0005) [2023-03-10 20:45:34,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 34099200. Throughput: 0: 12016.9. Samples: 34095168. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:45:34,742][1096160] Avg episode reward: [(0, '4858.557')] [2023-03-10 20:45:36,383][1096443] Updated weights for policy 0, policy_version 66640 (0.0004) [2023-03-10 20:45:39,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 34156544. Throughput: 0: 11969.6. Samples: 34129804. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:45:39,752][1096160] Avg episode reward: [(0, '4858.051')] [2023-03-10 20:45:39,822][1096443] Updated weights for policy 0, policy_version 66720 (0.0004) [2023-03-10 20:45:43,407][1096443] Updated weights for policy 0, policy_version 66800 (0.0004) [2023-03-10 20:45:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 34213888. Throughput: 0: 12015.9. Samples: 34199936. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:45:44,742][1096160] Avg episode reward: [(0, '4857.839')] [2023-03-10 20:45:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000066824_34213888.pth... [2023-03-10 20:45:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000066128_33857536.pth [2023-03-10 20:45:46,902][1096443] Updated weights for policy 0, policy_version 66880 (0.0004) [2023-03-10 20:45:49,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 34271232. Throughput: 0: 11885.4. Samples: 34268592. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:45:49,742][1096160] Avg episode reward: [(0, '4851.079')] [2023-03-10 20:45:50,538][1096443] Updated weights for policy 0, policy_version 66960 (0.0005) [2023-03-10 20:45:53,850][1096443] Updated weights for policy 0, policy_version 67040 (0.0005) [2023-03-10 20:45:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.6, 300 sec: 12010.3). Total num frames: 34332672. Throughput: 0: 11901.4. Samples: 34305064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:45:54,742][1096160] Avg episode reward: [(0, '4859.872')] [2023-03-10 20:45:57,376][1096443] Updated weights for policy 0, policy_version 67120 (0.0005) [2023-03-10 20:45:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 34390016. Throughput: 0: 11942.9. Samples: 34375412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:45:59,742][1096160] Avg episode reward: [(0, '4860.327')] [2023-03-10 20:45:59,772][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000067176_34394112.pth... [2023-03-10 20:45:59,773][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000066480_34037760.pth [2023-03-10 20:46:00,786][1096443] Updated weights for policy 0, policy_version 67200 (0.0005) [2023-03-10 20:46:04,356][1096443] Updated weights for policy 0, policy_version 67280 (0.0004) [2023-03-10 20:46:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 34451456. Throughput: 0: 11914.9. Samples: 34446720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:46:04,753][1096160] Avg episode reward: [(0, '4856.672')] [2023-03-10 20:46:07,740][1096443] Updated weights for policy 0, policy_version 67360 (0.0005) [2023-03-10 20:46:09,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.6, 300 sec: 12010.3). Total num frames: 34512896. Throughput: 0: 11949.9. Samples: 34483144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:46:09,753][1096160] Avg episode reward: [(0, '4859.344')] [2023-03-10 20:46:11,082][1096443] Updated weights for policy 0, policy_version 67440 (0.0005) [2023-03-10 20:46:14,246][1096443] Updated weights for policy 0, policy_version 67520 (0.0005) [2023-03-10 20:46:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 34574336. Throughput: 0: 11953.9. Samples: 34557080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:46:14,742][1096160] Avg episode reward: [(0, '4858.071')] [2023-03-10 20:46:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000067528_34574336.pth... [2023-03-10 20:46:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000066824_34213888.pth [2023-03-10 20:46:17,701][1096443] Updated weights for policy 0, policy_version 67600 (0.0005) [2023-03-10 20:46:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 34635776. Throughput: 0: 11919.5. Samples: 34631544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:46:19,742][1096160] Avg episode reward: [(0, '4860.601')] [2023-03-10 20:46:20,978][1096443] Updated weights for policy 0, policy_version 67680 (0.0004) [2023-03-10 20:46:24,223][1096443] Updated weights for policy 0, policy_version 67760 (0.0005) [2023-03-10 20:46:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 34697216. Throughput: 0: 11951.4. Samples: 34667620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:46:24,742][1096160] Avg episode reward: [(0, '4858.336')] [2023-03-10 20:46:27,507][1096443] Updated weights for policy 0, policy_version 67840 (0.0005) [2023-03-10 20:46:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 34758656. Throughput: 0: 12053.6. Samples: 34742352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:46:29,742][1096160] Avg episode reward: [(0, '4852.199')] [2023-03-10 20:46:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000067888_34758656.pth... [2023-03-10 20:46:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000067176_34394112.pth [2023-03-10 20:46:30,892][1096443] Updated weights for policy 0, policy_version 67920 (0.0005) [2023-03-10 20:46:34,118][1096443] Updated weights for policy 0, policy_version 68000 (0.0005) [2023-03-10 20:46:34,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 34820096. Throughput: 0: 12191.0. Samples: 34817188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:46:34,742][1096160] Avg episode reward: [(0, '4853.709')] [2023-03-10 20:46:37,544][1096443] Updated weights for policy 0, policy_version 68080 (0.0005) [2023-03-10 20:46:39,741][1096160] Fps is (10 sec: 12288.3, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 34881536. Throughput: 0: 12175.0. Samples: 34852936. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:46:39,742][1096160] Avg episode reward: [(0, '4857.007')] [2023-03-10 20:46:40,910][1096443] Updated weights for policy 0, policy_version 68160 (0.0005) [2023-03-10 20:46:44,382][1096443] Updated weights for policy 0, policy_version 68240 (0.0005) [2023-03-10 20:46:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 34938880. Throughput: 0: 12232.3. Samples: 34925868. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:46:44,742][1096160] Avg episode reward: [(0, '4854.957')] [2023-03-10 20:46:44,767][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000068248_34942976.pth... [2023-03-10 20:46:44,769][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000067528_34574336.pth [2023-03-10 20:46:47,808][1096443] Updated weights for policy 0, policy_version 68320 (0.0005) [2023-03-10 20:46:49,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 35000320. Throughput: 0: 12231.0. Samples: 34997112. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:46:49,742][1096160] Avg episode reward: [(0, '4859.376')] [2023-03-10 20:46:51,264][1096443] Updated weights for policy 0, policy_version 68400 (0.0005) [2023-03-10 20:46:54,741][1096443] Updated weights for policy 0, policy_version 68480 (0.0005) [2023-03-10 20:46:54,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 35061760. Throughput: 0: 12200.8. Samples: 35032180. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:46:54,742][1096160] Avg episode reward: [(0, '4855.297')] [2023-03-10 20:46:58,306][1096443] Updated weights for policy 0, policy_version 68560 (0.0005) [2023-03-10 20:46:59,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 12151.4, 300 sec: 12010.3). Total num frames: 35119104. Throughput: 0: 12115.7. Samples: 35102288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:46:59,742][1096160] Avg episode reward: [(0, '4856.453')] [2023-03-10 20:46:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000068592_35119104.pth... [2023-03-10 20:46:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000067888_34758656.pth [2023-03-10 20:47:01,634][1096443] Updated weights for policy 0, policy_version 68640 (0.0005) [2023-03-10 20:47:04,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 35176448. Throughput: 0: 12031.7. Samples: 35172968. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:47:04,742][1096160] Avg episode reward: [(0, '4855.759')] [2023-03-10 20:47:05,229][1096443] Updated weights for policy 0, policy_version 68720 (0.0005) [2023-03-10 20:47:08,694][1096443] Updated weights for policy 0, policy_version 68800 (0.0004) [2023-03-10 20:47:09,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 35233792. Throughput: 0: 12035.7. Samples: 35209224. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:47:09,742][1096160] Avg episode reward: [(0, '4860.709')] [2023-03-10 20:47:12,238][1096443] Updated weights for policy 0, policy_version 68880 (0.0005) [2023-03-10 20:47:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 35295232. Throughput: 0: 11922.3. Samples: 35278856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 20:47:14,742][1096160] Avg episode reward: [(0, '4860.684')] [2023-03-10 20:47:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000068936_35295232.pth... [2023-03-10 20:47:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000068248_34942976.pth [2023-03-10 20:47:15,644][1096443] Updated weights for policy 0, policy_version 68960 (0.0004) [2023-03-10 20:47:19,218][1096443] Updated weights for policy 0, policy_version 69040 (0.0005) [2023-03-10 20:47:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 35352576. Throughput: 0: 11806.8. Samples: 35348492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:47:19,742][1096160] Avg episode reward: [(0, '4855.826')] [2023-03-10 20:47:22,608][1096443] Updated weights for policy 0, policy_version 69120 (0.0005) [2023-03-10 20:47:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 35414016. Throughput: 0: 11830.8. Samples: 35385324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:47:24,742][1096160] Avg episode reward: [(0, '4858.611')] [2023-03-10 20:47:26,160][1096443] Updated weights for policy 0, policy_version 69200 (0.0005) [2023-03-10 20:47:29,439][1096443] Updated weights for policy 0, policy_version 69280 (0.0005) [2023-03-10 20:47:29,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 35475456. Throughput: 0: 11760.0. Samples: 35455068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:47:29,742][1096160] Avg episode reward: [(0, '4857.610')] [2023-03-10 20:47:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000069288_35475456.pth... [2023-03-10 20:47:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000068592_35119104.pth [2023-03-10 20:47:32,760][1096443] Updated weights for policy 0, policy_version 69360 (0.0004) [2023-03-10 20:47:34,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 35532800. Throughput: 0: 11849.4. Samples: 35530336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:47:34,742][1096160] Avg episode reward: [(0, '4855.942')] [2023-03-10 20:47:36,139][1096443] Updated weights for policy 0, policy_version 69440 (0.0004) [2023-03-10 20:47:39,686][1096443] Updated weights for policy 0, policy_version 69520 (0.0005) [2023-03-10 20:47:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 35594240. Throughput: 0: 11854.6. Samples: 35565640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:47:39,742][1096160] Avg episode reward: [(0, '4854.641')] [2023-03-10 20:47:42,973][1096443] Updated weights for policy 0, policy_version 69600 (0.0005) [2023-03-10 20:47:44,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 35655680. Throughput: 0: 11930.9. Samples: 35639180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:47:44,742][1096160] Avg episode reward: [(0, '4851.392')] [2023-03-10 20:47:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000069640_35655680.pth... [2023-03-10 20:47:44,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000068936_35295232.pth [2023-03-10 20:47:46,331][1096443] Updated weights for policy 0, policy_version 69680 (0.0005) [2023-03-10 20:47:49,659][1096443] Updated weights for policy 0, policy_version 69760 (0.0004) [2023-03-10 20:47:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.6, 300 sec: 11968.6). Total num frames: 35717120. Throughput: 0: 11985.7. Samples: 35712324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:47:49,742][1096160] Avg episode reward: [(0, '4845.338')] [2023-03-10 20:47:52,903][1096443] Updated weights for policy 0, policy_version 69840 (0.0005) [2023-03-10 20:47:54,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 35778560. Throughput: 0: 12012.6. Samples: 35749792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:47:54,742][1096160] Avg episode reward: [(0, '4851.102')] [2023-03-10 20:47:56,417][1096443] Updated weights for policy 0, policy_version 69920 (0.0005) [2023-03-10 20:47:59,665][1096443] Updated weights for policy 0, policy_version 70000 (0.0005) [2023-03-10 20:47:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 35840000. Throughput: 0: 12098.8. Samples: 35823304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:47:59,742][1096160] Avg episode reward: [(0, '4853.372')] [2023-03-10 20:47:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000070000_35840000.pth... [2023-03-10 20:47:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000069288_35475456.pth [2023-03-10 20:48:03,065][1096443] Updated weights for policy 0, policy_version 70080 (0.0005) [2023-03-10 20:48:04,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 35901440. Throughput: 0: 12158.9. Samples: 35895644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:48:04,742][1096160] Avg episode reward: [(0, '4848.081')] [2023-03-10 20:48:06,500][1096443] Updated weights for policy 0, policy_version 70160 (0.0005) [2023-03-10 20:48:09,706][1096443] Updated weights for policy 0, policy_version 70240 (0.0005) [2023-03-10 20:48:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11996.4). Total num frames: 35962880. Throughput: 0: 12147.3. Samples: 35931952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:48:09,742][1096160] Avg episode reward: [(0, '4838.345')] [2023-03-10 20:48:13,216][1096443] Updated weights for policy 0, policy_version 70320 (0.0005) [2023-03-10 20:48:14,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 36020224. Throughput: 0: 12196.4. Samples: 36003904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:48:14,742][1096160] Avg episode reward: [(0, '4843.486')] [2023-03-10 20:48:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000070352_36020224.pth... [2023-03-10 20:48:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000069640_35655680.pth [2023-03-10 20:48:16,621][1096443] Updated weights for policy 0, policy_version 70400 (0.0005) [2023-03-10 20:48:19,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 11968.6). Total num frames: 36081664. Throughput: 0: 12154.5. Samples: 36077288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:48:19,742][1096160] Avg episode reward: [(0, '4836.526')] [2023-03-10 20:48:20,015][1096443] Updated weights for policy 0, policy_version 70480 (0.0005) [2023-03-10 20:48:23,516][1096443] Updated weights for policy 0, policy_version 70560 (0.0005) [2023-03-10 20:48:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11954.8). Total num frames: 36139008. Throughput: 0: 12138.7. Samples: 36111880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:48:24,742][1096160] Avg episode reward: [(0, '4827.794')] [2023-03-10 20:48:26,897][1096443] Updated weights for policy 0, policy_version 70640 (0.0004) [2023-03-10 20:48:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11968.6). Total num frames: 36200448. Throughput: 0: 12096.5. Samples: 36183520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:48:29,742][1096160] Avg episode reward: [(0, '4826.101')] [2023-03-10 20:48:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000070704_36200448.pth... [2023-03-10 20:48:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000070000_35840000.pth [2023-03-10 20:48:30,398][1096443] Updated weights for policy 0, policy_version 70720 (0.0004) [2023-03-10 20:48:33,787][1096443] Updated weights for policy 0, policy_version 70800 (0.0005) [2023-03-10 20:48:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11954.8). Total num frames: 36257792. Throughput: 0: 12055.6. Samples: 36254824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:48:34,742][1096160] Avg episode reward: [(0, '4846.379')] [2023-03-10 20:48:37,134][1096443] Updated weights for policy 0, policy_version 70880 (0.0005) [2023-03-10 20:48:39,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 36319232. Throughput: 0: 12032.1. Samples: 36291236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:48:39,742][1096160] Avg episode reward: [(0, '4851.378')] [2023-03-10 20:48:40,508][1096443] Updated weights for policy 0, policy_version 70960 (0.0005) [2023-03-10 20:48:44,057][1096443] Updated weights for policy 0, policy_version 71040 (0.0005) [2023-03-10 20:48:44,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 36380672. Throughput: 0: 12018.3. Samples: 36364124. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:48:44,742][1096160] Avg episode reward: [(0, '4842.045')] [2023-03-10 20:48:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000071056_36380672.pth... [2023-03-10 20:48:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000070352_36020224.pth [2023-03-10 20:48:47,113][1096443] Updated weights for policy 0, policy_version 71120 (0.0004) [2023-03-10 20:48:49,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 36442112. Throughput: 0: 12053.0. Samples: 36438028. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:48:49,742][1096160] Avg episode reward: [(0, '4850.270')] [2023-03-10 20:48:50,630][1096443] Updated weights for policy 0, policy_version 71200 (0.0005) [2023-03-10 20:48:53,974][1096443] Updated weights for policy 0, policy_version 71280 (0.0005) [2023-03-10 20:48:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 36499456. Throughput: 0: 12064.0. Samples: 36474832. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:48:54,742][1096160] Avg episode reward: [(0, '4825.794')] [2023-03-10 20:48:57,490][1096443] Updated weights for policy 0, policy_version 71360 (0.0004) [2023-03-10 20:48:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 36560896. Throughput: 0: 12013.3. Samples: 36544504. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:48:59,742][1096160] Avg episode reward: [(0, '4824.004')] [2023-03-10 20:48:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000071408_36560896.pth... [2023-03-10 20:48:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000070704_36200448.pth [2023-03-10 20:49:00,866][1096443] Updated weights for policy 0, policy_version 71440 (0.0004) [2023-03-10 20:49:04,194][1096443] Updated weights for policy 0, policy_version 71520 (0.0005) [2023-03-10 20:49:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 36622336. Throughput: 0: 12022.7. Samples: 36618312. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:49:04,742][1096160] Avg episode reward: [(0, '4839.362')] [2023-03-10 20:49:07,480][1096443] Updated weights for policy 0, policy_version 71600 (0.0005) [2023-03-10 20:49:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 36683776. Throughput: 0: 12083.2. Samples: 36655624. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:49:09,742][1096160] Avg episode reward: [(0, '4844.289')] [2023-03-10 20:49:10,958][1096443] Updated weights for policy 0, policy_version 71680 (0.0004) [2023-03-10 20:49:14,379][1096443] Updated weights for policy 0, policy_version 71760 (0.0004) [2023-03-10 20:49:14,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 36741120. Throughput: 0: 12098.2. Samples: 36727940. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:49:14,742][1096160] Avg episode reward: [(0, '4849.764')] [2023-03-10 20:49:14,751][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000071768_36745216.pth... [2023-03-10 20:49:14,753][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000071056_36380672.pth [2023-03-10 20:49:17,795][1096443] Updated weights for policy 0, policy_version 71840 (0.0004) [2023-03-10 20:49:19,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 36802560. Throughput: 0: 12097.3. Samples: 36799200. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:49:19,742][1096160] Avg episode reward: [(0, '4851.971')] [2023-03-10 20:49:21,246][1096443] Updated weights for policy 0, policy_version 71920 (0.0005) [2023-03-10 20:49:24,515][1096443] Updated weights for policy 0, policy_version 72000 (0.0005) [2023-03-10 20:49:24,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 36864000. Throughput: 0: 12091.1. Samples: 36835336. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 20:49:24,742][1096160] Avg episode reward: [(0, '4851.086')] [2023-03-10 20:49:28,079][1096443] Updated weights for policy 0, policy_version 72080 (0.0004) [2023-03-10 20:49:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 36921344. Throughput: 0: 12064.7. Samples: 36907036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:49:29,742][1096160] Avg episode reward: [(0, '4836.110')] [2023-03-10 20:49:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000072112_36921344.pth... [2023-03-10 20:49:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000071408_36560896.pth [2023-03-10 20:49:31,576][1096443] Updated weights for policy 0, policy_version 72160 (0.0004) [2023-03-10 20:49:34,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 36982784. Throughput: 0: 12005.3. Samples: 36978268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:49:34,742][1096160] Avg episode reward: [(0, '4848.736')] [2023-03-10 20:49:35,018][1096443] Updated weights for policy 0, policy_version 72240 (0.0005) [2023-03-10 20:49:38,386][1096443] Updated weights for policy 0, policy_version 72320 (0.0005) [2023-03-10 20:49:39,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 37044224. Throughput: 0: 12009.0. Samples: 37015236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:49:39,742][1096160] Avg episode reward: [(0, '4850.390')] [2023-03-10 20:49:41,758][1096443] Updated weights for policy 0, policy_version 72400 (0.0006) [2023-03-10 20:49:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 37101568. Throughput: 0: 12016.5. Samples: 37085248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:49:44,742][1096160] Avg episode reward: [(0, '4845.584')] [2023-03-10 20:49:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000072464_37101568.pth... [2023-03-10 20:49:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000071768_36745216.pth [2023-03-10 20:49:45,219][1096443] Updated weights for policy 0, policy_version 72480 (0.0005) [2023-03-10 20:49:48,825][1096443] Updated weights for policy 0, policy_version 72560 (0.0005) [2023-03-10 20:49:49,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 37158912. Throughput: 0: 11934.1. Samples: 37155344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:49:49,742][1096160] Avg episode reward: [(0, '4850.967')] [2023-03-10 20:49:52,215][1096443] Updated weights for policy 0, policy_version 72640 (0.0005) [2023-03-10 20:49:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 37220352. Throughput: 0: 11912.7. Samples: 37191696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:49:54,742][1096160] Avg episode reward: [(0, '4850.728')] [2023-03-10 20:49:55,594][1096443] Updated weights for policy 0, policy_version 72720 (0.0004) [2023-03-10 20:49:58,802][1096443] Updated weights for policy 0, policy_version 72800 (0.0005) [2023-03-10 20:49:59,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 37281792. Throughput: 0: 11945.2. Samples: 37265472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:49:59,742][1096160] Avg episode reward: [(0, '4837.365')] [2023-03-10 20:49:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000072816_37281792.pth... [2023-03-10 20:49:59,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000072112_36921344.pth [2023-03-10 20:50:02,352][1096443] Updated weights for policy 0, policy_version 72880 (0.0004) [2023-03-10 20:50:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 37339136. Throughput: 0: 11926.9. Samples: 37335912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:04,742][1096160] Avg episode reward: [(0, '4843.298')] [2023-03-10 20:50:05,864][1096443] Updated weights for policy 0, policy_version 72960 (0.0005) [2023-03-10 20:50:09,241][1096443] Updated weights for policy 0, policy_version 73040 (0.0004) [2023-03-10 20:50:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 37400576. Throughput: 0: 11923.9. Samples: 37371912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:09,742][1096160] Avg episode reward: [(0, '4847.989')] [2023-03-10 20:50:12,726][1096443] Updated weights for policy 0, policy_version 73120 (0.0004) [2023-03-10 20:50:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 37462016. Throughput: 0: 11917.8. Samples: 37443340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:14,742][1096160] Avg episode reward: [(0, '4845.688')] [2023-03-10 20:50:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000073168_37462016.pth... [2023-03-10 20:50:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000072464_37101568.pth [2023-03-10 20:50:16,052][1096443] Updated weights for policy 0, policy_version 73200 (0.0005) [2023-03-10 20:50:19,313][1096443] Updated weights for policy 0, policy_version 73280 (0.0005) [2023-03-10 20:50:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 37523456. Throughput: 0: 12019.2. Samples: 37519132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:19,742][1096160] Avg episode reward: [(0, '4849.232')] [2023-03-10 20:50:22,557][1096443] Updated weights for policy 0, policy_version 73360 (0.0005) [2023-03-10 20:50:24,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 37584896. Throughput: 0: 12018.7. Samples: 37556076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:24,742][1096160] Avg episode reward: [(0, '4849.426')] [2023-03-10 20:50:25,932][1096443] Updated weights for policy 0, policy_version 73440 (0.0004) [2023-03-10 20:50:29,249][1096443] Updated weights for policy 0, policy_version 73520 (0.0005) [2023-03-10 20:50:29,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 37646336. Throughput: 0: 12101.1. Samples: 37629796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:29,742][1096160] Avg episode reward: [(0, '4835.129')] [2023-03-10 20:50:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000073528_37646336.pth... [2023-03-10 20:50:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000072816_37281792.pth [2023-03-10 20:50:32,597][1096443] Updated weights for policy 0, policy_version 73600 (0.0005) [2023-03-10 20:50:34,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 37707776. Throughput: 0: 12185.1. Samples: 37703676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:34,742][1096160] Avg episode reward: [(0, '4850.082')] [2023-03-10 20:50:35,971][1096443] Updated weights for policy 0, policy_version 73680 (0.0005) [2023-03-10 20:50:39,312][1096443] Updated weights for policy 0, policy_version 73760 (0.0005) [2023-03-10 20:50:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 37769216. Throughput: 0: 12196.9. Samples: 37740556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:39,742][1096160] Avg episode reward: [(0, '4855.304')] [2023-03-10 20:50:42,466][1096443] Updated weights for policy 0, policy_version 73840 (0.0004) [2023-03-10 20:50:44,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 37830656. Throughput: 0: 12197.1. Samples: 37814344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:44,742][1096160] Avg episode reward: [(0, '4852.491')] [2023-03-10 20:50:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000073888_37830656.pth... [2023-03-10 20:50:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000073168_37462016.pth [2023-03-10 20:50:45,895][1096443] Updated weights for policy 0, policy_version 73920 (0.0004) [2023-03-10 20:50:49,169][1096443] Updated weights for policy 0, policy_version 74000 (0.0005) [2023-03-10 20:50:49,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 37892096. Throughput: 0: 12272.6. Samples: 37888180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:49,742][1096160] Avg episode reward: [(0, '4854.341')] [2023-03-10 20:50:52,549][1096443] Updated weights for policy 0, policy_version 74080 (0.0005) [2023-03-10 20:50:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 37953536. Throughput: 0: 12289.2. Samples: 37924928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:54,742][1096160] Avg episode reward: [(0, '4847.722')] [2023-03-10 20:50:55,812][1096443] Updated weights for policy 0, policy_version 74160 (0.0004) [2023-03-10 20:50:59,076][1096443] Updated weights for policy 0, policy_version 74240 (0.0005) [2023-03-10 20:50:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 38014976. Throughput: 0: 12395.7. Samples: 38001144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:50:59,742][1096160] Avg episode reward: [(0, '4840.494')] [2023-03-10 20:50:59,789][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000074256_38019072.pth... [2023-03-10 20:50:59,790][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000073528_37646336.pth [2023-03-10 20:51:02,504][1096443] Updated weights for policy 0, policy_version 74320 (0.0005) [2023-03-10 20:51:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 38076416. Throughput: 0: 12294.5. Samples: 38072384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:04,743][1096160] Avg episode reward: [(0, '4845.545')] [2023-03-10 20:51:05,835][1096443] Updated weights for policy 0, policy_version 74400 (0.0005) [2023-03-10 20:51:09,240][1096443] Updated weights for policy 0, policy_version 74480 (0.0005) [2023-03-10 20:51:09,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 38137856. Throughput: 0: 12296.1. Samples: 38109400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:09,742][1096160] Avg episode reward: [(0, '4854.751')] [2023-03-10 20:51:12,476][1096443] Updated weights for policy 0, policy_version 74560 (0.0005) [2023-03-10 20:51:14,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12356.3, 300 sec: 12093.6). Total num frames: 38203392. Throughput: 0: 12306.6. Samples: 38183596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:14,742][1096160] Avg episode reward: [(0, '4846.661')] [2023-03-10 20:51:14,747][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000074616_38203392.pth... [2023-03-10 20:51:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000073888_37830656.pth [2023-03-10 20:51:15,732][1096443] Updated weights for policy 0, policy_version 74640 (0.0004) [2023-03-10 20:51:19,153][1096443] Updated weights for policy 0, policy_version 74720 (0.0005) [2023-03-10 20:51:19,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 38260736. Throughput: 0: 12298.0. Samples: 38257084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:19,742][1096160] Avg episode reward: [(0, '4838.713')] [2023-03-10 20:51:22,344][1096443] Updated weights for policy 0, policy_version 74800 (0.0005) [2023-03-10 20:51:24,741][1096160] Fps is (10 sec: 12288.3, 60 sec: 12356.3, 300 sec: 12093.6). Total num frames: 38326272. Throughput: 0: 12339.6. Samples: 38295836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:24,742][1096160] Avg episode reward: [(0, '4841.710')] [2023-03-10 20:51:25,744][1096443] Updated weights for policy 0, policy_version 74880 (0.0004) [2023-03-10 20:51:29,078][1096443] Updated weights for policy 0, policy_version 74960 (0.0005) [2023-03-10 20:51:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 38383616. Throughput: 0: 12362.2. Samples: 38370644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:29,742][1096160] Avg episode reward: [(0, '4841.004')] [2023-03-10 20:51:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000074968_38383616.pth... [2023-03-10 20:51:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000074256_38019072.pth [2023-03-10 20:51:32,533][1096443] Updated weights for policy 0, policy_version 75040 (0.0005) [2023-03-10 20:51:34,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 12288.0, 300 sec: 12079.7). Total num frames: 38445056. Throughput: 0: 12282.9. Samples: 38440912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:34,742][1096160] Avg episode reward: [(0, '4845.234')] [2023-03-10 20:51:35,958][1096443] Updated weights for policy 0, policy_version 75120 (0.0005) [2023-03-10 20:51:39,214][1096443] Updated weights for policy 0, policy_version 75200 (0.0005) [2023-03-10 20:51:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12093.6). Total num frames: 38506496. Throughput: 0: 12275.6. Samples: 38477332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:39,742][1096160] Avg episode reward: [(0, '4839.646')] [2023-03-10 20:51:42,736][1096443] Updated weights for policy 0, policy_version 75280 (0.0005) [2023-03-10 20:51:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 38563840. Throughput: 0: 12177.6. Samples: 38549136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:44,742][1096160] Avg episode reward: [(0, '4843.989')] [2023-03-10 20:51:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000075320_38563840.pth... [2023-03-10 20:51:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000074616_38203392.pth [2023-03-10 20:51:45,981][1096443] Updated weights for policy 0, policy_version 75360 (0.0005) [2023-03-10 20:51:49,355][1096443] Updated weights for policy 0, policy_version 75440 (0.0005) [2023-03-10 20:51:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12093.6). Total num frames: 38629376. Throughput: 0: 12249.3. Samples: 38623604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:49,742][1096160] Avg episode reward: [(0, '4844.009')] [2023-03-10 20:51:52,682][1096443] Updated weights for policy 0, policy_version 75520 (0.0005) [2023-03-10 20:51:54,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 38686720. Throughput: 0: 12260.6. Samples: 38661128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:54,742][1096160] Avg episode reward: [(0, '4844.763')] [2023-03-10 20:51:56,216][1096443] Updated weights for policy 0, policy_version 75600 (0.0005) [2023-03-10 20:51:59,661][1096443] Updated weights for policy 0, policy_version 75680 (0.0004) [2023-03-10 20:51:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 38748160. Throughput: 0: 12178.1. Samples: 38731608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:51:59,742][1096160] Avg episode reward: [(0, '4845.081')] [2023-03-10 20:51:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000075680_38748160.pth... [2023-03-10 20:51:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000074968_38383616.pth [2023-03-10 20:52:02,972][1096443] Updated weights for policy 0, policy_version 75760 (0.0005) [2023-03-10 20:52:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 38805504. Throughput: 0: 12122.3. Samples: 38802588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:04,742][1096160] Avg episode reward: [(0, '4844.343')] [2023-03-10 20:52:06,467][1096443] Updated weights for policy 0, policy_version 75840 (0.0005) [2023-03-10 20:52:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 38866944. Throughput: 0: 12062.3. Samples: 38838640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:09,742][1096160] Avg episode reward: [(0, '4849.530')] [2023-03-10 20:52:10,009][1096443] Updated weights for policy 0, policy_version 75920 (0.0005) [2023-03-10 20:52:13,254][1096443] Updated weights for policy 0, policy_version 76000 (0.0005) [2023-03-10 20:52:14,742][1096160] Fps is (10 sec: 12287.7, 60 sec: 12083.2, 300 sec: 12121.4). Total num frames: 38928384. Throughput: 0: 12030.1. Samples: 38912000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:14,743][1096160] Avg episode reward: [(0, '4839.290')] [2023-03-10 20:52:14,747][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000076032_38928384.pth... [2023-03-10 20:52:14,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000075320_38563840.pth [2023-03-10 20:52:16,644][1096443] Updated weights for policy 0, policy_version 76080 (0.0005) [2023-03-10 20:52:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 12121.4). Total num frames: 38989824. Throughput: 0: 12037.3. Samples: 38982592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:19,742][1096160] Avg episode reward: [(0, '4839.420')] [2023-03-10 20:52:20,122][1096443] Updated weights for policy 0, policy_version 76160 (0.0005) [2023-03-10 20:52:23,696][1096443] Updated weights for policy 0, policy_version 76240 (0.0005) [2023-03-10 20:52:24,742][1096160] Fps is (10 sec: 11878.7, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 39047168. Throughput: 0: 12017.8. Samples: 39018132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:24,742][1096160] Avg episode reward: [(0, '4844.732')] [2023-03-10 20:52:27,039][1096443] Updated weights for policy 0, policy_version 76320 (0.0005) [2023-03-10 20:52:29,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 39104512. Throughput: 0: 11990.6. Samples: 39088712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:29,742][1096160] Avg episode reward: [(0, '4845.262')] [2023-03-10 20:52:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000076376_39104512.pth... [2023-03-10 20:52:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000075680_38748160.pth [2023-03-10 20:52:30,485][1096443] Updated weights for policy 0, policy_version 76400 (0.0005) [2023-03-10 20:52:33,916][1096443] Updated weights for policy 0, policy_version 76480 (0.0005) [2023-03-10 20:52:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 12107.5). Total num frames: 39165952. Throughput: 0: 11962.6. Samples: 39161920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:34,742][1096160] Avg episode reward: [(0, '4844.159')] [2023-03-10 20:52:37,344][1096443] Updated weights for policy 0, policy_version 76560 (0.0004) [2023-03-10 20:52:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12093.6). Total num frames: 39223296. Throughput: 0: 11943.4. Samples: 39198580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:39,742][1096160] Avg episode reward: [(0, '4852.313')] [2023-03-10 20:52:40,799][1096443] Updated weights for policy 0, policy_version 76640 (0.0005) [2023-03-10 20:52:44,046][1096443] Updated weights for policy 0, policy_version 76720 (0.0005) [2023-03-10 20:52:44,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 39288832. Throughput: 0: 11956.1. Samples: 39269632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:44,742][1096160] Avg episode reward: [(0, '4843.683')] [2023-03-10 20:52:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000076736_39288832.pth... [2023-03-10 20:52:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000076032_38928384.pth [2023-03-10 20:52:47,268][1096443] Updated weights for policy 0, policy_version 76800 (0.0005) [2023-03-10 20:52:49,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12107.5). Total num frames: 39350272. Throughput: 0: 12078.5. Samples: 39346120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:49,742][1096160] Avg episode reward: [(0, '4853.095')] [2023-03-10 20:52:50,560][1096443] Updated weights for policy 0, policy_version 76880 (0.0004) [2023-03-10 20:52:53,925][1096443] Updated weights for policy 0, policy_version 76960 (0.0005) [2023-03-10 20:52:54,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 39411712. Throughput: 0: 12099.2. Samples: 39383104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:54,742][1096160] Avg episode reward: [(0, '4845.722')] [2023-03-10 20:52:57,090][1096443] Updated weights for policy 0, policy_version 77040 (0.0005) [2023-03-10 20:52:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 39473152. Throughput: 0: 12110.4. Samples: 39456968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:52:59,742][1096160] Avg episode reward: [(0, '4846.766')] [2023-03-10 20:52:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000077096_39473152.pth... [2023-03-10 20:52:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000076376_39104512.pth [2023-03-10 20:53:00,651][1096443] Updated weights for policy 0, policy_version 77120 (0.0005) [2023-03-10 20:53:03,998][1096443] Updated weights for policy 0, policy_version 77200 (0.0004) [2023-03-10 20:53:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 39530496. Throughput: 0: 12156.9. Samples: 39529652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:04,742][1096160] Avg episode reward: [(0, '4851.972')] [2023-03-10 20:53:07,564][1096443] Updated weights for policy 0, policy_version 77280 (0.0004) [2023-03-10 20:53:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12107.5). Total num frames: 39591936. Throughput: 0: 12115.5. Samples: 39563328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:09,742][1096160] Avg episode reward: [(0, '4850.758')] [2023-03-10 20:53:11,009][1096443] Updated weights for policy 0, policy_version 77360 (0.0005) [2023-03-10 20:53:14,556][1096443] Updated weights for policy 0, policy_version 77440 (0.0005) [2023-03-10 20:53:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 12093.6). Total num frames: 39649280. Throughput: 0: 12093.2. Samples: 39632908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:14,742][1096160] Avg episode reward: [(0, '4850.344')] [2023-03-10 20:53:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000077440_39649280.pth... [2023-03-10 20:53:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000076736_39288832.pth [2023-03-10 20:53:18,042][1096443] Updated weights for policy 0, policy_version 77520 (0.0004) [2023-03-10 20:53:19,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11946.7, 300 sec: 12093.6). Total num frames: 39706624. Throughput: 0: 12017.1. Samples: 39702692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:19,742][1096160] Avg episode reward: [(0, '4848.311')] [2023-03-10 20:53:21,445][1096443] Updated weights for policy 0, policy_version 77600 (0.0005) [2023-03-10 20:53:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 39768064. Throughput: 0: 12018.3. Samples: 39739404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:24,742][1096160] Avg episode reward: [(0, '4849.028')] [2023-03-10 20:53:24,950][1096443] Updated weights for policy 0, policy_version 77680 (0.0005) [2023-03-10 20:53:28,424][1096443] Updated weights for policy 0, policy_version 77760 (0.0005) [2023-03-10 20:53:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 39825408. Throughput: 0: 12032.3. Samples: 39811084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:29,742][1096160] Avg episode reward: [(0, '4856.454')] [2023-03-10 20:53:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000077784_39825408.pth... [2023-03-10 20:53:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000077096_39473152.pth [2023-03-10 20:53:31,931][1096443] Updated weights for policy 0, policy_version 77840 (0.0005) [2023-03-10 20:53:34,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 39886848. Throughput: 0: 11918.6. Samples: 39882456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:34,742][1096160] Avg episode reward: [(0, '4853.544')] [2023-03-10 20:53:35,370][1096443] Updated weights for policy 0, policy_version 77920 (0.0005) [2023-03-10 20:53:38,814][1096443] Updated weights for policy 0, policy_version 78000 (0.0005) [2023-03-10 20:53:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 39944192. Throughput: 0: 11849.0. Samples: 39916312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:39,742][1096160] Avg episode reward: [(0, '4849.427')] [2023-03-10 20:53:42,315][1096443] Updated weights for policy 0, policy_version 78080 (0.0005) [2023-03-10 20:53:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 40005632. Throughput: 0: 11806.3. Samples: 39988252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:44,742][1096160] Avg episode reward: [(0, '4851.648')] [2023-03-10 20:53:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000078136_40005632.pth... [2023-03-10 20:53:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000077440_39649280.pth [2023-03-10 20:53:45,672][1096443] Updated weights for policy 0, policy_version 78160 (0.0004) [2023-03-10 20:53:49,238][1096443] Updated weights for policy 0, policy_version 78240 (0.0005) [2023-03-10 20:53:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12079.7). Total num frames: 40062976. Throughput: 0: 11760.8. Samples: 40058888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:49,742][1096160] Avg episode reward: [(0, '4846.997')] [2023-03-10 20:53:52,649][1096443] Updated weights for policy 0, policy_version 78320 (0.0005) [2023-03-10 20:53:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12079.7). Total num frames: 40124416. Throughput: 0: 11829.6. Samples: 40095660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:54,742][1096160] Avg episode reward: [(0, '4836.914')] [2023-03-10 20:53:56,008][1096443] Updated weights for policy 0, policy_version 78400 (0.0005) [2023-03-10 20:53:59,322][1096443] Updated weights for policy 0, policy_version 78480 (0.0005) [2023-03-10 20:53:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 12079.7). Total num frames: 40185856. Throughput: 0: 11867.2. Samples: 40166932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:53:59,742][1096160] Avg episode reward: [(0, '4856.657')] [2023-03-10 20:53:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000078488_40185856.pth... [2023-03-10 20:53:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000077784_39825408.pth [2023-03-10 20:54:02,707][1096443] Updated weights for policy 0, policy_version 78560 (0.0005) [2023-03-10 20:54:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 40243200. Throughput: 0: 11935.8. Samples: 40239800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:04,742][1096160] Avg episode reward: [(0, '4855.059')] [2023-03-10 20:54:06,158][1096443] Updated weights for policy 0, policy_version 78640 (0.0005) [2023-03-10 20:54:09,597][1096443] Updated weights for policy 0, policy_version 78720 (0.0005) [2023-03-10 20:54:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12079.7). Total num frames: 40304640. Throughput: 0: 11925.1. Samples: 40276032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:09,742][1096160] Avg episode reward: [(0, '4856.737')] [2023-03-10 20:54:13,047][1096443] Updated weights for policy 0, policy_version 78800 (0.0004) [2023-03-10 20:54:14,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 40361984. Throughput: 0: 11931.4. Samples: 40347996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:14,742][1096160] Avg episode reward: [(0, '4859.390')] [2023-03-10 20:54:14,767][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000078840_40366080.pth... [2023-03-10 20:54:14,769][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000078136_40005632.pth [2023-03-10 20:54:16,457][1096443] Updated weights for policy 0, policy_version 78880 (0.0005) [2023-03-10 20:54:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 40423424. Throughput: 0: 11925.9. Samples: 40419124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:19,742][1096160] Avg episode reward: [(0, '4856.260')] [2023-03-10 20:54:20,081][1096443] Updated weights for policy 0, policy_version 78960 (0.0005) [2023-03-10 20:54:23,620][1096443] Updated weights for policy 0, policy_version 79040 (0.0005) [2023-03-10 20:54:24,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 40480768. Throughput: 0: 11907.7. Samples: 40452160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:24,742][1096160] Avg episode reward: [(0, '4857.438')] [2023-03-10 20:54:26,858][1096443] Updated weights for policy 0, policy_version 79120 (0.0005) [2023-03-10 20:54:29,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 40538112. Throughput: 0: 11934.1. Samples: 40525284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:29,742][1096160] Avg episode reward: [(0, '4852.643')] [2023-03-10 20:54:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000079176_40538112.pth... [2023-03-10 20:54:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000078488_40185856.pth [2023-03-10 20:54:30,448][1096443] Updated weights for policy 0, policy_version 79200 (0.0004) [2023-03-10 20:54:33,822][1096443] Updated weights for policy 0, policy_version 79280 (0.0005) [2023-03-10 20:54:34,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11878.4, 300 sec: 12052.0). Total num frames: 40599552. Throughput: 0: 11925.1. Samples: 40595520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:34,742][1096160] Avg episode reward: [(0, '4850.500')] [2023-03-10 20:54:37,253][1096443] Updated weights for policy 0, policy_version 79360 (0.0004) [2023-03-10 20:54:39,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 40660992. Throughput: 0: 11925.1. Samples: 40632288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:39,742][1096160] Avg episode reward: [(0, '4842.733')] [2023-03-10 20:54:40,616][1096443] Updated weights for policy 0, policy_version 79440 (0.0004) [2023-03-10 20:54:44,055][1096443] Updated weights for policy 0, policy_version 79520 (0.0005) [2023-03-10 20:54:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 40718336. Throughput: 0: 11949.3. Samples: 40704652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:44,742][1096160] Avg episode reward: [(0, '4850.427')] [2023-03-10 20:54:44,783][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000079536_40722432.pth... [2023-03-10 20:54:44,785][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000078840_40366080.pth [2023-03-10 20:54:47,439][1096443] Updated weights for policy 0, policy_version 79600 (0.0005) [2023-03-10 20:54:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 40779776. Throughput: 0: 11950.6. Samples: 40777576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:49,742][1096160] Avg episode reward: [(0, '4856.427')] [2023-03-10 20:54:50,774][1096443] Updated weights for policy 0, policy_version 79680 (0.0004) [2023-03-10 20:54:54,245][1096443] Updated weights for policy 0, policy_version 79760 (0.0004) [2023-03-10 20:54:54,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 40841216. Throughput: 0: 11924.1. Samples: 40812616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:54,742][1096160] Avg episode reward: [(0, '4855.024')] [2023-03-10 20:54:57,668][1096443] Updated weights for policy 0, policy_version 79840 (0.0004) [2023-03-10 20:54:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 40902656. Throughput: 0: 11922.7. Samples: 40884520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:54:59,742][1096160] Avg episode reward: [(0, '4856.771')] [2023-03-10 20:54:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000079888_40902656.pth... [2023-03-10 20:54:59,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000079176_40538112.pth [2023-03-10 20:55:01,039][1096443] Updated weights for policy 0, policy_version 79920 (0.0004) [2023-03-10 20:55:04,399][1096443] Updated weights for policy 0, policy_version 80000 (0.0004) [2023-03-10 20:55:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 40960000. Throughput: 0: 11958.4. Samples: 40957252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:04,742][1096160] Avg episode reward: [(0, '4847.258')] [2023-03-10 20:55:07,687][1096443] Updated weights for policy 0, policy_version 80080 (0.0004) [2023-03-10 20:55:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 41025536. Throughput: 0: 12072.4. Samples: 40995420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:09,742][1096160] Avg episode reward: [(0, '4851.164')] [2023-03-10 20:55:11,047][1096443] Updated weights for policy 0, policy_version 80160 (0.0005) [2023-03-10 20:55:14,705][1096443] Updated weights for policy 0, policy_version 80240 (0.0005) [2023-03-10 20:55:14,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 41082880. Throughput: 0: 12028.3. Samples: 41066560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:14,742][1096160] Avg episode reward: [(0, '4847.917')] [2023-03-10 20:55:14,747][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000080240_41082880.pth... [2023-03-10 20:55:14,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000079536_40722432.pth [2023-03-10 20:55:18,275][1096443] Updated weights for policy 0, policy_version 80320 (0.0005) [2023-03-10 20:55:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 41140224. Throughput: 0: 12003.9. Samples: 41135696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:19,742][1096160] Avg episode reward: [(0, '4846.906')] [2023-03-10 20:55:21,771][1096443] Updated weights for policy 0, policy_version 80400 (0.0005) [2023-03-10 20:55:24,742][1096160] Fps is (10 sec: 11469.0, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 41197568. Throughput: 0: 11932.7. Samples: 41169260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:24,742][1096160] Avg episode reward: [(0, '4841.778')] [2023-03-10 20:55:25,152][1096443] Updated weights for policy 0, policy_version 80480 (0.0005) [2023-03-10 20:55:28,657][1096443] Updated weights for policy 0, policy_version 80560 (0.0005) [2023-03-10 20:55:29,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 41259008. Throughput: 0: 11951.0. Samples: 41242444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:29,742][1096160] Avg episode reward: [(0, '4852.862')] [2023-03-10 20:55:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000080584_41259008.pth... [2023-03-10 20:55:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000079888_40902656.pth [2023-03-10 20:55:32,019][1096443] Updated weights for policy 0, policy_version 80640 (0.0004) [2023-03-10 20:55:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 41320448. Throughput: 0: 11905.9. Samples: 41313340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:34,742][1096160] Avg episode reward: [(0, '4858.483')] [2023-03-10 20:55:35,378][1096443] Updated weights for policy 0, policy_version 80720 (0.0004) [2023-03-10 20:55:38,436][1096443] Updated weights for policy 0, policy_version 80800 (0.0004) [2023-03-10 20:55:39,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12015.0, 300 sec: 12038.1). Total num frames: 41381888. Throughput: 0: 12012.5. Samples: 41353176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:39,742][1096160] Avg episode reward: [(0, '4847.648')] [2023-03-10 20:55:41,896][1096443] Updated weights for policy 0, policy_version 80880 (0.0004) [2023-03-10 20:55:44,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12015.0, 300 sec: 12024.2). Total num frames: 41439232. Throughput: 0: 12051.0. Samples: 41426812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:44,742][1096160] Avg episode reward: [(0, '4837.194')] [2023-03-10 20:55:44,777][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000080944_41443328.pth... [2023-03-10 20:55:44,778][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000080240_41082880.pth [2023-03-10 20:55:45,400][1096443] Updated weights for policy 0, policy_version 80960 (0.0005) [2023-03-10 20:55:48,612][1096443] Updated weights for policy 0, policy_version 81040 (0.0005) [2023-03-10 20:55:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 41504768. Throughput: 0: 12074.3. Samples: 41500596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:49,742][1096160] Avg episode reward: [(0, '4855.577')] [2023-03-10 20:55:52,005][1096443] Updated weights for policy 0, policy_version 81120 (0.0004) [2023-03-10 20:55:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 41562112. Throughput: 0: 12029.4. Samples: 41536744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:54,742][1096160] Avg episode reward: [(0, '4859.852')] [2023-03-10 20:55:55,411][1096443] Updated weights for policy 0, policy_version 81200 (0.0005) [2023-03-10 20:55:58,576][1096443] Updated weights for policy 0, policy_version 81280 (0.0005) [2023-03-10 20:55:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 41627648. Throughput: 0: 12102.0. Samples: 41611148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:55:59,742][1096160] Avg episode reward: [(0, '4862.502')] [2023-03-10 20:55:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000081304_41627648.pth... [2023-03-10 20:55:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000080584_41259008.pth [2023-03-10 20:56:01,957][1096443] Updated weights for policy 0, policy_version 81360 (0.0005) [2023-03-10 20:56:04,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 41689088. Throughput: 0: 12198.3. Samples: 41684620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:04,742][1096160] Avg episode reward: [(0, '4855.945')] [2023-03-10 20:56:05,401][1096443] Updated weights for policy 0, policy_version 81440 (0.0005) [2023-03-10 20:56:08,791][1096443] Updated weights for policy 0, policy_version 81520 (0.0005) [2023-03-10 20:56:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 41746432. Throughput: 0: 12219.7. Samples: 41719148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:09,742][1096160] Avg episode reward: [(0, '4853.556')] [2023-03-10 20:56:12,423][1096443] Updated weights for policy 0, policy_version 81600 (0.0005) [2023-03-10 20:56:14,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12015.0, 300 sec: 12010.3). Total num frames: 41803776. Throughput: 0: 12142.6. Samples: 41788864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:14,742][1096160] Avg episode reward: [(0, '4850.025')] [2023-03-10 20:56:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000081648_41803776.pth... [2023-03-10 20:56:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000080944_41443328.pth [2023-03-10 20:56:15,917][1096443] Updated weights for policy 0, policy_version 81680 (0.0005) [2023-03-10 20:56:19,233][1096443] Updated weights for policy 0, policy_version 81760 (0.0005) [2023-03-10 20:56:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 41865216. Throughput: 0: 12173.2. Samples: 41861132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:19,742][1096160] Avg episode reward: [(0, '4857.277')] [2023-03-10 20:56:22,490][1096443] Updated weights for policy 0, policy_version 81840 (0.0005) [2023-03-10 20:56:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 41926656. Throughput: 0: 12108.2. Samples: 41898048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:24,742][1096160] Avg episode reward: [(0, '4855.134')] [2023-03-10 20:56:25,916][1096443] Updated weights for policy 0, policy_version 81920 (0.0005) [2023-03-10 20:56:29,325][1096443] Updated weights for policy 0, policy_version 82000 (0.0005) [2023-03-10 20:56:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 12010.3). Total num frames: 41988096. Throughput: 0: 12086.3. Samples: 41970696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:29,742][1096160] Avg episode reward: [(0, '4852.979')] [2023-03-10 20:56:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000082008_41988096.pth... [2023-03-10 20:56:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000081304_41627648.pth [2023-03-10 20:56:32,606][1096443] Updated weights for policy 0, policy_version 82080 (0.0005) [2023-03-10 20:56:34,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 42053632. Throughput: 0: 12145.9. Samples: 42047160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:34,742][1096160] Avg episode reward: [(0, '4851.988')] [2023-03-10 20:56:35,697][1096443] Updated weights for policy 0, policy_version 82160 (0.0004) [2023-03-10 20:56:39,151][1096443] Updated weights for policy 0, policy_version 82240 (0.0005) [2023-03-10 20:56:39,742][1096160] Fps is (10 sec: 12288.2, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 42110976. Throughput: 0: 12156.6. Samples: 42083792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:39,742][1096160] Avg episode reward: [(0, '4842.742')] [2023-03-10 20:56:42,802][1096443] Updated weights for policy 0, policy_version 82320 (0.0005) [2023-03-10 20:56:44,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12151.5, 300 sec: 11996.4). Total num frames: 42168320. Throughput: 0: 12019.1. Samples: 42152008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:44,742][1096160] Avg episode reward: [(0, '4850.627')] [2023-03-10 20:56:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000082360_42168320.pth... [2023-03-10 20:56:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000081648_41803776.pth [2023-03-10 20:56:46,277][1096443] Updated weights for policy 0, policy_version 82400 (0.0005) [2023-03-10 20:56:49,673][1096443] Updated weights for policy 0, policy_version 82480 (0.0005) [2023-03-10 20:56:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 42229760. Throughput: 0: 12015.8. Samples: 42225332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:49,744][1096160] Avg episode reward: [(0, '4854.701')] [2023-03-10 20:56:52,735][1096443] Updated weights for policy 0, policy_version 82560 (0.0005) [2023-03-10 20:56:54,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 42291200. Throughput: 0: 12078.7. Samples: 42262688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:54,753][1096160] Avg episode reward: [(0, '4847.764')] [2023-03-10 20:56:56,259][1096443] Updated weights for policy 0, policy_version 82640 (0.0005) [2023-03-10 20:56:59,710][1096443] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-10 20:56:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 42352640. Throughput: 0: 12163.2. Samples: 42336208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:56:59,754][1096160] Avg episode reward: [(0, '4849.217')] [2023-03-10 20:56:59,758][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000082720_42352640.pth... [2023-03-10 20:56:59,761][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000082008_41988096.pth [2023-03-10 20:57:02,884][1096443] Updated weights for policy 0, policy_version 82800 (0.0005) [2023-03-10 20:57:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 42414080. Throughput: 0: 12196.5. Samples: 42409976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:04,742][1096160] Avg episode reward: [(0, '4854.679')] [2023-03-10 20:57:06,003][1096443] Updated weights for policy 0, policy_version 82880 (0.0005) [2023-03-10 20:57:09,345][1096443] Updated weights for policy 0, policy_version 82960 (0.0004) [2023-03-10 20:57:09,742][1096160] Fps is (10 sec: 12697.7, 60 sec: 12219.7, 300 sec: 12038.1). Total num frames: 42479616. Throughput: 0: 12275.4. Samples: 42450440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:09,742][1096160] Avg episode reward: [(0, '4857.224')] [2023-03-10 20:57:12,383][1096443] Updated weights for policy 0, policy_version 83040 (0.0005) [2023-03-10 20:57:14,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12288.0, 300 sec: 12038.1). Total num frames: 42541056. Throughput: 0: 12373.3. Samples: 42527492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:14,742][1096160] Avg episode reward: [(0, '4852.124')] [2023-03-10 20:57:14,759][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000083096_42545152.pth... [2023-03-10 20:57:14,761][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000082360_42168320.pth [2023-03-10 20:57:15,734][1096443] Updated weights for policy 0, policy_version 83120 (0.0005) [2023-03-10 20:57:19,225][1096443] Updated weights for policy 0, policy_version 83200 (0.0005) [2023-03-10 20:57:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12288.0, 300 sec: 12052.0). Total num frames: 42602496. Throughput: 0: 12250.0. Samples: 42598408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:19,742][1096160] Avg episode reward: [(0, '4847.237')] [2023-03-10 20:57:22,752][1096443] Updated weights for policy 0, policy_version 83280 (0.0004) [2023-03-10 20:57:24,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12052.0). Total num frames: 42659840. Throughput: 0: 12200.2. Samples: 42632800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:24,742][1096160] Avg episode reward: [(0, '4847.966')] [2023-03-10 20:57:26,094][1096443] Updated weights for policy 0, policy_version 83360 (0.0005) [2023-03-10 20:57:29,616][1096443] Updated weights for policy 0, policy_version 83440 (0.0006) [2023-03-10 20:57:29,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12219.8, 300 sec: 12052.0). Total num frames: 42721280. Throughput: 0: 12293.9. Samples: 42705232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:29,742][1096160] Avg episode reward: [(0, '4854.439')] [2023-03-10 20:57:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000083440_42721280.pth... [2023-03-10 20:57:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000082720_42352640.pth [2023-03-10 20:57:33,048][1096443] Updated weights for policy 0, policy_version 83520 (0.0005) [2023-03-10 20:57:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 42778624. Throughput: 0: 12215.7. Samples: 42775040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:34,742][1096160] Avg episode reward: [(0, '4848.324')] [2023-03-10 20:57:36,425][1096443] Updated weights for policy 0, policy_version 83600 (0.0005) [2023-03-10 20:57:39,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 42840064. Throughput: 0: 12233.0. Samples: 42813172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:39,742][1096160] Avg episode reward: [(0, '4850.661')] [2023-03-10 20:57:39,847][1096443] Updated weights for policy 0, policy_version 83680 (0.0005) [2023-03-10 20:57:43,196][1096443] Updated weights for policy 0, policy_version 83760 (0.0004) [2023-03-10 20:57:44,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12038.1). Total num frames: 42901504. Throughput: 0: 12199.5. Samples: 42885184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:44,742][1096160] Avg episode reward: [(0, '4856.226')] [2023-03-10 20:57:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000083792_42901504.pth... [2023-03-10 20:57:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000083096_42545152.pth [2023-03-10 20:57:46,782][1096443] Updated weights for policy 0, policy_version 83840 (0.0004) [2023-03-10 20:57:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 42958848. Throughput: 0: 12107.9. Samples: 42954832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:49,742][1096160] Avg episode reward: [(0, '4853.210')] [2023-03-10 20:57:50,245][1096443] Updated weights for policy 0, policy_version 83920 (0.0005) [2023-03-10 20:57:53,512][1096443] Updated weights for policy 0, policy_version 84000 (0.0005) [2023-03-10 20:57:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12038.1). Total num frames: 43024384. Throughput: 0: 12026.1. Samples: 42991616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:54,742][1096160] Avg episode reward: [(0, '4847.008')] [2023-03-10 20:57:56,663][1096443] Updated weights for policy 0, policy_version 84080 (0.0004) [2023-03-10 20:57:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 43081728. Throughput: 0: 12029.1. Samples: 43068800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:57:59,742][1096160] Avg episode reward: [(0, '4845.496')] [2023-03-10 20:57:59,783][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000084152_43085824.pth... [2023-03-10 20:57:59,785][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000083440_42721280.pth [2023-03-10 20:58:00,129][1096443] Updated weights for policy 0, policy_version 84160 (0.0005) [2023-03-10 20:58:03,468][1096443] Updated weights for policy 0, policy_version 84240 (0.0005) [2023-03-10 20:58:04,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 43143168. Throughput: 0: 12026.4. Samples: 43139596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:04,742][1096160] Avg episode reward: [(0, '4850.138')] [2023-03-10 20:58:06,933][1096443] Updated weights for policy 0, policy_version 84320 (0.0005) [2023-03-10 20:58:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 43204608. Throughput: 0: 12069.7. Samples: 43175936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:09,742][1096160] Avg episode reward: [(0, '4855.981')] [2023-03-10 20:58:10,303][1096443] Updated weights for policy 0, policy_version 84400 (0.0004) [2023-03-10 20:58:13,636][1096443] Updated weights for policy 0, policy_version 84480 (0.0005) [2023-03-10 20:58:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 43261952. Throughput: 0: 12098.5. Samples: 43249664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:14,742][1096160] Avg episode reward: [(0, '4852.661')] [2023-03-10 20:58:14,788][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000084504_43266048.pth... [2023-03-10 20:58:14,789][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000083792_42901504.pth [2023-03-10 20:58:17,201][1096443] Updated weights for policy 0, policy_version 84560 (0.0005) [2023-03-10 20:58:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 43323392. Throughput: 0: 12087.5. Samples: 43318976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:19,742][1096160] Avg episode reward: [(0, '4850.771')] [2023-03-10 20:58:20,667][1096443] Updated weights for policy 0, policy_version 84640 (0.0005) [2023-03-10 20:58:24,099][1096443] Updated weights for policy 0, policy_version 84720 (0.0004) [2023-03-10 20:58:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 43384832. Throughput: 0: 12043.7. Samples: 43355140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:24,742][1096160] Avg episode reward: [(0, '4847.222')] [2023-03-10 20:58:27,342][1096443] Updated weights for policy 0, policy_version 84800 (0.0004) [2023-03-10 20:58:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 43442176. Throughput: 0: 12076.5. Samples: 43428628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:29,742][1096160] Avg episode reward: [(0, '4859.434')] [2023-03-10 20:58:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000084856_43446272.pth... [2023-03-10 20:58:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000084152_43085824.pth [2023-03-10 20:58:30,722][1096443] Updated weights for policy 0, policy_version 84880 (0.0004) [2023-03-10 20:58:34,280][1096443] Updated weights for policy 0, policy_version 84960 (0.0005) [2023-03-10 20:58:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 43503616. Throughput: 0: 12104.2. Samples: 43499520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:34,742][1096160] Avg episode reward: [(0, '4849.728')] [2023-03-10 20:58:37,728][1096443] Updated weights for policy 0, policy_version 85040 (0.0004) [2023-03-10 20:58:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 43560960. Throughput: 0: 12061.0. Samples: 43534360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:39,742][1096160] Avg episode reward: [(0, '4853.817')] [2023-03-10 20:58:41,078][1096443] Updated weights for policy 0, policy_version 85120 (0.0005) [2023-03-10 20:58:44,444][1096443] Updated weights for policy 0, policy_version 85200 (0.0005) [2023-03-10 20:58:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 43622400. Throughput: 0: 11945.0. Samples: 43606324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:44,742][1096160] Avg episode reward: [(0, '4853.668')] [2023-03-10 20:58:44,762][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000085208_43626496.pth... [2023-03-10 20:58:44,763][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000084504_43266048.pth [2023-03-10 20:58:47,652][1096443] Updated weights for policy 0, policy_version 85280 (0.0005) [2023-03-10 20:58:49,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 43683840. Throughput: 0: 12053.4. Samples: 43682000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:49,742][1096160] Avg episode reward: [(0, '4854.375')] [2023-03-10 20:58:51,345][1096443] Updated weights for policy 0, policy_version 85360 (0.0005) [2023-03-10 20:58:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 43741184. Throughput: 0: 11975.1. Samples: 43714816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:54,742][1096160] Avg episode reward: [(0, '4845.320')] [2023-03-10 20:58:54,903][1096443] Updated weights for policy 0, policy_version 85440 (0.0005) [2023-03-10 20:58:58,196][1096443] Updated weights for policy 0, policy_version 85520 (0.0005) [2023-03-10 20:58:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 43802624. Throughput: 0: 11925.3. Samples: 43786304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:58:59,742][1096160] Avg episode reward: [(0, '4846.873')] [2023-03-10 20:58:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000085552_43802624.pth... [2023-03-10 20:58:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000084856_43446272.pth [2023-03-10 20:59:01,810][1096443] Updated weights for policy 0, policy_version 85600 (0.0005) [2023-03-10 20:59:04,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 43859968. Throughput: 0: 11932.5. Samples: 43855936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:04,742][1096160] Avg episode reward: [(0, '4848.365')] [2023-03-10 20:59:05,144][1096443] Updated weights for policy 0, policy_version 85680 (0.0005) [2023-03-10 20:59:08,449][1096443] Updated weights for policy 0, policy_version 85760 (0.0004) [2023-03-10 20:59:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 43921408. Throughput: 0: 11950.6. Samples: 43892916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:09,742][1096160] Avg episode reward: [(0, '4849.042')] [2023-03-10 20:59:11,873][1096443] Updated weights for policy 0, policy_version 85840 (0.0004) [2023-03-10 20:59:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 43982848. Throughput: 0: 11950.3. Samples: 43966392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:14,742][1096160] Avg episode reward: [(0, '4846.632')] [2023-03-10 20:59:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000085904_43982848.pth... [2023-03-10 20:59:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000085208_43626496.pth [2023-03-10 20:59:15,263][1096443] Updated weights for policy 0, policy_version 85920 (0.0005) [2023-03-10 20:59:18,651][1096443] Updated weights for policy 0, policy_version 86000 (0.0005) [2023-03-10 20:59:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 44044288. Throughput: 0: 11994.1. Samples: 44039252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:19,742][1096160] Avg episode reward: [(0, '4843.336')] [2023-03-10 20:59:22,040][1096443] Updated weights for policy 0, policy_version 86080 (0.0005) [2023-03-10 20:59:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 44105728. Throughput: 0: 12023.4. Samples: 44075412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:24,742][1096160] Avg episode reward: [(0, '4844.107')] [2023-03-10 20:59:25,344][1096443] Updated weights for policy 0, policy_version 86160 (0.0005) [2023-03-10 20:59:28,678][1096443] Updated weights for policy 0, policy_version 86240 (0.0005) [2023-03-10 20:59:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 44167168. Throughput: 0: 12064.2. Samples: 44149212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:29,742][1096160] Avg episode reward: [(0, '4844.834')] [2023-03-10 20:59:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000086264_44167168.pth... [2023-03-10 20:59:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000085552_43802624.pth [2023-03-10 20:59:32,030][1096443] Updated weights for policy 0, policy_version 86320 (0.0004) [2023-03-10 20:59:34,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 44224512. Throughput: 0: 11994.2. Samples: 44221740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:34,742][1096160] Avg episode reward: [(0, '4849.158')] [2023-03-10 20:59:35,448][1096443] Updated weights for policy 0, policy_version 86400 (0.0005) [2023-03-10 20:59:38,659][1096443] Updated weights for policy 0, policy_version 86480 (0.0005) [2023-03-10 20:59:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12107.5). Total num frames: 44290048. Throughput: 0: 12130.8. Samples: 44260700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:39,742][1096160] Avg episode reward: [(0, '4844.305')] [2023-03-10 20:59:42,182][1096443] Updated weights for policy 0, policy_version 86560 (0.0005) [2023-03-10 20:59:44,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 44347392. Throughput: 0: 12104.7. Samples: 44331016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:44,742][1096160] Avg episode reward: [(0, '4845.246')] [2023-03-10 20:59:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000086616_44347392.pth... [2023-03-10 20:59:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000085904_43982848.pth [2023-03-10 20:59:45,623][1096443] Updated weights for policy 0, policy_version 86640 (0.0005) [2023-03-10 20:59:49,010][1096443] Updated weights for policy 0, policy_version 86720 (0.0005) [2023-03-10 20:59:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 44408832. Throughput: 0: 12171.7. Samples: 44403664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:49,742][1096160] Avg episode reward: [(0, '4845.974')] [2023-03-10 20:59:52,359][1096443] Updated weights for policy 0, policy_version 86800 (0.0005) [2023-03-10 20:59:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 44466176. Throughput: 0: 12168.7. Samples: 44440508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 20:59:54,742][1096160] Avg episode reward: [(0, '4846.296')] [2023-03-10 20:59:55,883][1096443] Updated weights for policy 0, policy_version 86880 (0.0005) [2023-03-10 20:59:59,487][1096443] Updated weights for policy 0, policy_version 86960 (0.0005) [2023-03-10 20:59:59,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 12015.0, 300 sec: 12079.7). Total num frames: 44523520. Throughput: 0: 12067.8. Samples: 44509440. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 20:59:59,742][1096160] Avg episode reward: [(0, '4851.815')] [2023-03-10 20:59:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000086960_44523520.pth... [2023-03-10 20:59:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000086264_44167168.pth [2023-03-10 21:00:02,605][1096443] Updated weights for policy 0, policy_version 87040 (0.0004) [2023-03-10 21:00:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 44589056. Throughput: 0: 12126.8. Samples: 44584960. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:04,742][1096160] Avg episode reward: [(0, '4844.330')] [2023-03-10 21:00:06,040][1096443] Updated weights for policy 0, policy_version 87120 (0.0005) [2023-03-10 21:00:09,498][1096443] Updated weights for policy 0, policy_version 87200 (0.0005) [2023-03-10 21:00:09,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 44646400. Throughput: 0: 12090.4. Samples: 44619480. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:09,742][1096160] Avg episode reward: [(0, '4846.101')] [2023-03-10 21:00:12,740][1096443] Updated weights for policy 0, policy_version 87280 (0.0005) [2023-03-10 21:00:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 44707840. Throughput: 0: 12091.9. Samples: 44693348. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:14,742][1096160] Avg episode reward: [(0, '4836.920')] [2023-03-10 21:00:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000087320_44707840.pth... [2023-03-10 21:00:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000086616_44347392.pth [2023-03-10 21:00:16,345][1096443] Updated weights for policy 0, policy_version 87360 (0.0005) [2023-03-10 21:00:19,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 44765184. Throughput: 0: 11995.8. Samples: 44761552. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:19,742][1096160] Avg episode reward: [(0, '4845.010')] [2023-03-10 21:00:19,848][1096443] Updated weights for policy 0, policy_version 87440 (0.0005) [2023-03-10 21:00:23,489][1096443] Updated weights for policy 0, policy_version 87520 (0.0004) [2023-03-10 21:00:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12079.7). Total num frames: 44822528. Throughput: 0: 11892.2. Samples: 44795848. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:24,742][1096160] Avg episode reward: [(0, '4842.676')] [2023-03-10 21:00:26,998][1096443] Updated weights for policy 0, policy_version 87600 (0.0005) [2023-03-10 21:00:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 12065.8). Total num frames: 44879872. Throughput: 0: 11879.1. Samples: 44865576. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:29,742][1096160] Avg episode reward: [(0, '4841.810')] [2023-03-10 21:00:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000087656_44879872.pth... [2023-03-10 21:00:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000086960_44523520.pth [2023-03-10 21:00:30,467][1096443] Updated weights for policy 0, policy_version 87680 (0.0005) [2023-03-10 21:00:33,774][1096443] Updated weights for policy 0, policy_version 87760 (0.0005) [2023-03-10 21:00:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12065.8). Total num frames: 44941312. Throughput: 0: 11883.8. Samples: 44938436. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:34,742][1096160] Avg episode reward: [(0, '4848.953')] [2023-03-10 21:00:37,217][1096443] Updated weights for policy 0, policy_version 87840 (0.0005) [2023-03-10 21:00:39,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 12079.7). Total num frames: 45002752. Throughput: 0: 11858.6. Samples: 44974144. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:39,742][1096160] Avg episode reward: [(0, '4848.371')] [2023-03-10 21:00:40,472][1096443] Updated weights for policy 0, policy_version 87920 (0.0005) [2023-03-10 21:00:43,640][1096443] Updated weights for policy 0, policy_version 88000 (0.0005) [2023-03-10 21:00:44,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12079.7). Total num frames: 45068288. Throughput: 0: 12013.2. Samples: 45050036. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:44,742][1096160] Avg episode reward: [(0, '4849.134')] [2023-03-10 21:00:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000088024_45068288.pth... [2023-03-10 21:00:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000087320_44707840.pth [2023-03-10 21:00:47,034][1096443] Updated weights for policy 0, policy_version 88080 (0.0006) [2023-03-10 21:00:49,742][1096160] Fps is (10 sec: 12697.4, 60 sec: 12014.9, 300 sec: 12093.6). Total num frames: 45129728. Throughput: 0: 11969.5. Samples: 45123588. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:00:49,742][1096160] Avg episode reward: [(0, '4849.375')] [2023-03-10 21:00:50,429][1096443] Updated weights for policy 0, policy_version 88160 (0.0005) [2023-03-10 21:00:53,752][1096443] Updated weights for policy 0, policy_version 88240 (0.0005) [2023-03-10 21:00:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 12065.8). Total num frames: 45187072. Throughput: 0: 12015.6. Samples: 45160184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:00:54,742][1096160] Avg episode reward: [(0, '4845.467')] [2023-03-10 21:00:57,208][1096443] Updated weights for policy 0, policy_version 88320 (0.0004) [2023-03-10 21:00:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 45248512. Throughput: 0: 11974.5. Samples: 45232200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:00:59,742][1096160] Avg episode reward: [(0, '4849.088')] [2023-03-10 21:00:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000088376_45248512.pth... [2023-03-10 21:00:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000087656_44879872.pth [2023-03-10 21:01:00,588][1096443] Updated weights for policy 0, policy_version 88400 (0.0005) [2023-03-10 21:01:03,933][1096443] Updated weights for policy 0, policy_version 88480 (0.0005) [2023-03-10 21:01:04,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12015.0, 300 sec: 12079.7). Total num frames: 45309952. Throughput: 0: 12094.9. Samples: 45305824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:04,742][1096160] Avg episode reward: [(0, '4849.298')] [2023-03-10 21:01:07,309][1096443] Updated weights for policy 0, policy_version 88560 (0.0005) [2023-03-10 21:01:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 45371392. Throughput: 0: 12149.9. Samples: 45342596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:09,742][1096160] Avg episode reward: [(0, '4853.822')] [2023-03-10 21:01:10,684][1096443] Updated weights for policy 0, policy_version 88640 (0.0005) [2023-03-10 21:01:13,910][1096443] Updated weights for policy 0, policy_version 88720 (0.0004) [2023-03-10 21:01:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 45432832. Throughput: 0: 12223.9. Samples: 45415652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:14,742][1096160] Avg episode reward: [(0, '4848.893')] [2023-03-10 21:01:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000088736_45432832.pth... [2023-03-10 21:01:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000088024_45068288.pth [2023-03-10 21:01:17,418][1096443] Updated weights for policy 0, policy_version 88800 (0.0005) [2023-03-10 21:01:19,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 45494272. Throughput: 0: 12262.3. Samples: 45490240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:19,742][1096160] Avg episode reward: [(0, '4846.803')] [2023-03-10 21:01:20,497][1096443] Updated weights for policy 0, policy_version 88880 (0.0004) [2023-03-10 21:01:23,973][1096443] Updated weights for policy 0, policy_version 88960 (0.0005) [2023-03-10 21:01:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 45555712. Throughput: 0: 12285.9. Samples: 45527008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:24,753][1096160] Avg episode reward: [(0, '4855.476')] [2023-03-10 21:01:27,414][1096443] Updated weights for policy 0, policy_version 89040 (0.0005) [2023-03-10 21:01:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 12065.8). Total num frames: 45613056. Throughput: 0: 12160.6. Samples: 45597264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:29,742][1096160] Avg episode reward: [(0, '4858.182')] [2023-03-10 21:01:29,773][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000089096_45617152.pth... [2023-03-10 21:01:29,775][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000088376_45248512.pth [2023-03-10 21:01:30,796][1096443] Updated weights for policy 0, policy_version 89120 (0.0005) [2023-03-10 21:01:34,172][1096443] Updated weights for policy 0, policy_version 89200 (0.0005) [2023-03-10 21:01:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12079.7). Total num frames: 45674496. Throughput: 0: 12154.4. Samples: 45670536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:34,742][1096160] Avg episode reward: [(0, '4857.821')] [2023-03-10 21:01:37,271][1096443] Updated weights for policy 0, policy_version 89280 (0.0005) [2023-03-10 21:01:39,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12288.0, 300 sec: 12107.5). Total num frames: 45740032. Throughput: 0: 12235.1. Samples: 45710764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:39,742][1096160] Avg episode reward: [(0, '4859.803')] [2023-03-10 21:01:40,637][1096443] Updated weights for policy 0, policy_version 89360 (0.0005) [2023-03-10 21:01:43,995][1096443] Updated weights for policy 0, policy_version 89440 (0.0005) [2023-03-10 21:01:44,742][1096160] Fps is (10 sec: 12697.4, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 45801472. Throughput: 0: 12283.8. Samples: 45784972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:44,742][1096160] Avg episode reward: [(0, '4843.160')] [2023-03-10 21:01:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000089456_45801472.pth... [2023-03-10 21:01:44,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000088736_45432832.pth [2023-03-10 21:01:47,405][1096443] Updated weights for policy 0, policy_version 89520 (0.0005) [2023-03-10 21:01:49,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12107.5). Total num frames: 45862912. Throughput: 0: 12253.1. Samples: 45857216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:49,742][1096160] Avg episode reward: [(0, '4851.215')] [2023-03-10 21:01:50,703][1096443] Updated weights for policy 0, policy_version 89600 (0.0005) [2023-03-10 21:01:54,213][1096443] Updated weights for policy 0, policy_version 89680 (0.0005) [2023-03-10 21:01:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 45920256. Throughput: 0: 12225.3. Samples: 45892732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:54,742][1096160] Avg episode reward: [(0, '4846.263')] [2023-03-10 21:01:57,513][1096443] Updated weights for policy 0, policy_version 89760 (0.0004) [2023-03-10 21:01:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 45981696. Throughput: 0: 12228.6. Samples: 45965940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:01:59,742][1096160] Avg episode reward: [(0, '4839.195')] [2023-03-10 21:01:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000089808_45981696.pth... [2023-03-10 21:01:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000089096_45617152.pth [2023-03-10 21:02:01,040][1096443] Updated weights for policy 0, policy_version 89840 (0.0005) [2023-03-10 21:02:04,554][1096443] Updated weights for policy 0, policy_version 89920 (0.0005) [2023-03-10 21:02:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 46039040. Throughput: 0: 12106.1. Samples: 46035016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:02:04,742][1096160] Avg episode reward: [(0, '4843.461')] [2023-03-10 21:02:07,850][1096443] Updated weights for policy 0, policy_version 90000 (0.0004) [2023-03-10 21:02:09,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12065.8). Total num frames: 46100480. Throughput: 0: 12114.4. Samples: 46072156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:02:09,742][1096160] Avg episode reward: [(0, '4847.537')] [2023-03-10 21:02:11,229][1096443] Updated weights for policy 0, policy_version 90080 (0.0005) [2023-03-10 21:02:14,544][1096443] Updated weights for policy 0, policy_version 90160 (0.0005) [2023-03-10 21:02:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 12065.8). Total num frames: 46161920. Throughput: 0: 12182.3. Samples: 46145468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:02:14,742][1096160] Avg episode reward: [(0, '4855.650')] [2023-03-10 21:02:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000090160_46161920.pth... [2023-03-10 21:02:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000089456_45801472.pth [2023-03-10 21:02:17,770][1096443] Updated weights for policy 0, policy_version 90240 (0.0005) [2023-03-10 21:02:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 46223360. Throughput: 0: 12225.7. Samples: 46220692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:02:19,742][1096160] Avg episode reward: [(0, '4845.268')] [2023-03-10 21:02:21,214][1096443] Updated weights for policy 0, policy_version 90320 (0.0004) [2023-03-10 21:02:24,479][1096443] Updated weights for policy 0, policy_version 90400 (0.0005) [2023-03-10 21:02:24,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12079.7). Total num frames: 46284800. Throughput: 0: 12146.9. Samples: 46257376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:02:24,742][1096160] Avg episode reward: [(0, '4853.081')] [2023-03-10 21:02:27,845][1096443] Updated weights for policy 0, policy_version 90480 (0.0005) [2023-03-10 21:02:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 46346240. Throughput: 0: 12138.0. Samples: 46331180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:02:29,742][1096160] Avg episode reward: [(0, '4856.079')] [2023-03-10 21:02:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000090520_46346240.pth... [2023-03-10 21:02:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000089808_45981696.pth [2023-03-10 21:02:31,237][1096443] Updated weights for policy 0, policy_version 90560 (0.0004) [2023-03-10 21:02:34,577][1096443] Updated weights for policy 0, policy_version 90640 (0.0005) [2023-03-10 21:02:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 12093.6). Total num frames: 46407680. Throughput: 0: 12141.7. Samples: 46403592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:02:34,742][1096160] Avg episode reward: [(0, '4856.678')] [2023-03-10 21:02:37,814][1096443] Updated weights for policy 0, policy_version 90720 (0.0005) [2023-03-10 21:02:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 46469120. Throughput: 0: 12181.4. Samples: 46440896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:02:39,742][1096160] Avg episode reward: [(0, '4853.463')] [2023-03-10 21:02:41,413][1096443] Updated weights for policy 0, policy_version 90800 (0.0005) [2023-03-10 21:02:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12093.6). Total num frames: 46526464. Throughput: 0: 12120.6. Samples: 46511368. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:02:44,742][1096160] Avg episode reward: [(0, '4849.592')] [2023-03-10 21:02:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000090872_46526464.pth... [2023-03-10 21:02:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000090160_46161920.pth [2023-03-10 21:02:44,956][1096443] Updated weights for policy 0, policy_version 90880 (0.0005) [2023-03-10 21:02:48,313][1096443] Updated weights for policy 0, policy_version 90960 (0.0005) [2023-03-10 21:02:49,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 46587904. Throughput: 0: 12190.7. Samples: 46583596. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:02:49,742][1096160] Avg episode reward: [(0, '4856.142')] [2023-03-10 21:02:51,626][1096443] Updated weights for policy 0, policy_version 91040 (0.0005) [2023-03-10 21:02:54,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12093.6). Total num frames: 46649344. Throughput: 0: 12184.0. Samples: 46620436. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:02:54,742][1096160] Avg episode reward: [(0, '4860.415')] [2023-03-10 21:02:54,937][1096443] Updated weights for policy 0, policy_version 91120 (0.0005) [2023-03-10 21:02:58,444][1096443] Updated weights for policy 0, policy_version 91200 (0.0005) [2023-03-10 21:02:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 46706688. Throughput: 0: 12123.2. Samples: 46691012. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:02:59,742][1096160] Avg episode reward: [(0, '4856.288')] [2023-03-10 21:02:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000091224_46706688.pth... [2023-03-10 21:02:59,745][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000090520_46346240.pth [2023-03-10 21:03:01,980][1096443] Updated weights for policy 0, policy_version 91280 (0.0005) [2023-03-10 21:03:04,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 46764032. Throughput: 0: 11988.2. Samples: 46760160. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:03:04,742][1096160] Avg episode reward: [(0, '4858.404')] [2023-03-10 21:03:05,536][1096443] Updated weights for policy 0, policy_version 91360 (0.0005) [2023-03-10 21:03:09,054][1096443] Updated weights for policy 0, policy_version 91440 (0.0005) [2023-03-10 21:03:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12079.7). Total num frames: 46825472. Throughput: 0: 11984.3. Samples: 46796668. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:03:09,742][1096160] Avg episode reward: [(0, '4859.138')] [2023-03-10 21:03:12,616][1096443] Updated weights for policy 0, policy_version 91520 (0.0005) [2023-03-10 21:03:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12052.0). Total num frames: 46878720. Throughput: 0: 11883.8. Samples: 46865948. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:03:14,742][1096160] Avg episode reward: [(0, '4855.846')] [2023-03-10 21:03:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000091560_46878720.pth... [2023-03-10 21:03:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000090872_46526464.pth [2023-03-10 21:03:16,291][1096443] Updated weights for policy 0, policy_version 91600 (0.0004) [2023-03-10 21:03:19,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11878.4, 300 sec: 12038.1). Total num frames: 46936064. Throughput: 0: 11772.6. Samples: 46933360. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:03:19,742][1096160] Avg episode reward: [(0, '4858.410')] [2023-03-10 21:03:19,764][1096443] Updated weights for policy 0, policy_version 91680 (0.0004) [2023-03-10 21:03:23,257][1096443] Updated weights for policy 0, policy_version 91760 (0.0005) [2023-03-10 21:03:24,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 12038.1). Total num frames: 46993408. Throughput: 0: 11733.5. Samples: 46968904. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:03:24,742][1096160] Avg episode reward: [(0, '4861.067')] [2023-03-10 21:03:26,992][1096443] Updated weights for policy 0, policy_version 91840 (0.0005) [2023-03-10 21:03:29,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 12024.2). Total num frames: 47050752. Throughput: 0: 11710.0. Samples: 47038320. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:03:29,742][1096160] Avg episode reward: [(0, '4857.651')] [2023-03-10 21:03:29,756][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000091904_47054848.pth... [2023-03-10 21:03:29,759][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000091224_46706688.pth [2023-03-10 21:03:30,401][1096443] Updated weights for policy 0, policy_version 91920 (0.0005) [2023-03-10 21:03:33,574][1096443] Updated weights for policy 0, policy_version 92000 (0.0005) [2023-03-10 21:03:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 12052.0). Total num frames: 47116288. Throughput: 0: 11737.3. Samples: 47111776. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:03:34,742][1096160] Avg episode reward: [(0, '4855.990')] [2023-03-10 21:03:37,165][1096443] Updated weights for policy 0, policy_version 92080 (0.0004) [2023-03-10 21:03:39,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11741.8, 300 sec: 12038.1). Total num frames: 47173632. Throughput: 0: 11661.0. Samples: 47145184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:03:39,742][1096160] Avg episode reward: [(0, '4860.516')] [2023-03-10 21:03:40,702][1096443] Updated weights for policy 0, policy_version 92160 (0.0005) [2023-03-10 21:03:44,242][1096443] Updated weights for policy 0, policy_version 92240 (0.0005) [2023-03-10 21:03:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 12024.2). Total num frames: 47230976. Throughput: 0: 11635.3. Samples: 47214600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:03:44,742][1096160] Avg episode reward: [(0, '4859.698')] [2023-03-10 21:03:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000092248_47230976.pth... [2023-03-10 21:03:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000091560_46878720.pth [2023-03-10 21:03:48,004][1096443] Updated weights for policy 0, policy_version 92320 (0.0004) [2023-03-10 21:03:49,742][1096160] Fps is (10 sec: 11059.4, 60 sec: 11605.3, 300 sec: 12010.3). Total num frames: 47284224. Throughput: 0: 11588.0. Samples: 47281620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:03:49,742][1096160] Avg episode reward: [(0, '4859.087')] [2023-03-10 21:03:51,445][1096443] Updated weights for policy 0, policy_version 92400 (0.0005) [2023-03-10 21:03:54,742][1096160] Fps is (10 sec: 11059.3, 60 sec: 11537.1, 300 sec: 11996.4). Total num frames: 47341568. Throughput: 0: 11564.2. Samples: 47317056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:03:54,742][1096160] Avg episode reward: [(0, '4857.044')] [2023-03-10 21:03:55,129][1096443] Updated weights for policy 0, policy_version 92480 (0.0005) [2023-03-10 21:03:58,601][1096443] Updated weights for policy 0, policy_version 92560 (0.0004) [2023-03-10 21:03:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 12010.3). Total num frames: 47403008. Throughput: 0: 11570.7. Samples: 47386632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:03:59,742][1096160] Avg episode reward: [(0, '4861.123')] [2023-03-10 21:03:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000092584_47403008.pth... [2023-03-10 21:03:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000091904_47054848.pth [2023-03-10 21:04:02,052][1096443] Updated weights for policy 0, policy_version 92640 (0.0005) [2023-03-10 21:04:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11996.4). Total num frames: 47460352. Throughput: 0: 11682.5. Samples: 47459072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:04:04,742][1096160] Avg episode reward: [(0, '4859.920')] [2023-03-10 21:04:05,435][1096443] Updated weights for policy 0, policy_version 92720 (0.0005) [2023-03-10 21:04:08,747][1096443] Updated weights for policy 0, policy_version 92800 (0.0005) [2023-03-10 21:04:09,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11673.6, 300 sec: 12010.3). Total num frames: 47525888. Throughput: 0: 11713.4. Samples: 47496008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:04:09,753][1096160] Avg episode reward: [(0, '4858.087')] [2023-03-10 21:04:12,061][1096443] Updated weights for policy 0, policy_version 92880 (0.0005) [2023-03-10 21:04:14,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11741.9, 300 sec: 11996.4). Total num frames: 47583232. Throughput: 0: 11787.5. Samples: 47568756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:04:14,742][1096160] Avg episode reward: [(0, '4859.304')] [2023-03-10 21:04:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000092936_47583232.pth... [2023-03-10 21:04:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000092248_47230976.pth [2023-03-10 21:04:15,562][1096443] Updated weights for policy 0, policy_version 92960 (0.0005) [2023-03-10 21:04:18,900][1096443] Updated weights for policy 0, policy_version 93040 (0.0004) [2023-03-10 21:04:19,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11996.4). Total num frames: 47644672. Throughput: 0: 11758.4. Samples: 47640904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:04:19,742][1096160] Avg episode reward: [(0, '4857.877')] [2023-03-10 21:04:22,305][1096443] Updated weights for policy 0, policy_version 93120 (0.0004) [2023-03-10 21:04:24,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 47706112. Throughput: 0: 11821.0. Samples: 47677128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:04:24,742][1096160] Avg episode reward: [(0, '4855.653')] [2023-03-10 21:04:25,783][1096443] Updated weights for policy 0, policy_version 93200 (0.0005) [2023-03-10 21:04:29,218][1096443] Updated weights for policy 0, policy_version 93280 (0.0004) [2023-03-10 21:04:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11996.4). Total num frames: 47763456. Throughput: 0: 11836.8. Samples: 47747256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:04:29,742][1096160] Avg episode reward: [(0, '4856.158')] [2023-03-10 21:04:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000093288_47763456.pth... [2023-03-10 21:04:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000092584_47403008.pth [2023-03-10 21:04:32,719][1096443] Updated weights for policy 0, policy_version 93360 (0.0005) [2023-03-10 21:04:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 47824896. Throughput: 0: 11952.6. Samples: 47819488. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:04:34,742][1096160] Avg episode reward: [(0, '4858.494')] [2023-03-10 21:04:35,974][1096443] Updated weights for policy 0, policy_version 93440 (0.0005) [2023-03-10 21:04:39,169][1096443] Updated weights for policy 0, policy_version 93520 (0.0005) [2023-03-10 21:04:39,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 47890432. Throughput: 0: 11989.4. Samples: 47856580. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:04:39,742][1096160] Avg episode reward: [(0, '4857.396')] [2023-03-10 21:04:42,433][1096443] Updated weights for policy 0, policy_version 93600 (0.0005) [2023-03-10 21:04:44,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 47947776. Throughput: 0: 12119.7. Samples: 47932016. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:04:44,742][1096160] Avg episode reward: [(0, '4857.608')] [2023-03-10 21:04:44,759][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000093656_47951872.pth... [2023-03-10 21:04:44,761][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000092936_47583232.pth [2023-03-10 21:04:45,925][1096443] Updated weights for policy 0, policy_version 93680 (0.0005) [2023-03-10 21:04:49,348][1096443] Updated weights for policy 0, policy_version 93760 (0.0005) [2023-03-10 21:04:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 48009216. Throughput: 0: 12110.6. Samples: 48004048. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:04:49,742][1096160] Avg episode reward: [(0, '4855.511')] [2023-03-10 21:04:52,820][1096443] Updated weights for policy 0, policy_version 93840 (0.0005) [2023-03-10 21:04:54,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 48066560. Throughput: 0: 12043.4. Samples: 48037960. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:04:54,742][1096160] Avg episode reward: [(0, '4860.215')] [2023-03-10 21:04:56,159][1096443] Updated weights for policy 0, policy_version 93920 (0.0005) [2023-03-10 21:04:59,674][1096443] Updated weights for policy 0, policy_version 94000 (0.0005) [2023-03-10 21:04:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 48128000. Throughput: 0: 12062.1. Samples: 48111552. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:04:59,742][1096160] Avg episode reward: [(0, '4855.816')] [2023-03-10 21:04:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000094000_48128000.pth... [2023-03-10 21:04:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000093288_47763456.pth [2023-03-10 21:05:03,357][1096443] Updated weights for policy 0, policy_version 94080 (0.0004) [2023-03-10 21:05:04,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 48181248. Throughput: 0: 11952.4. Samples: 48178760. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:05:04,742][1096160] Avg episode reward: [(0, '4852.060')] [2023-03-10 21:05:06,938][1096443] Updated weights for policy 0, policy_version 94160 (0.0005) [2023-03-10 21:05:09,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 48242688. Throughput: 0: 11930.4. Samples: 48213996. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:05:09,742][1096160] Avg episode reward: [(0, '4857.878')] [2023-03-10 21:05:10,275][1096443] Updated weights for policy 0, policy_version 94240 (0.0005) [2023-03-10 21:05:13,672][1096443] Updated weights for policy 0, policy_version 94320 (0.0004) [2023-03-10 21:05:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 48304128. Throughput: 0: 12010.8. Samples: 48287744. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:05:14,742][1096160] Avg episode reward: [(0, '4860.311')] [2023-03-10 21:05:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000094344_48304128.pth... [2023-03-10 21:05:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000093656_47951872.pth [2023-03-10 21:05:17,205][1096443] Updated weights for policy 0, policy_version 94400 (0.0005) [2023-03-10 21:05:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 48361472. Throughput: 0: 11954.5. Samples: 48357440. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:05:19,742][1096160] Avg episode reward: [(0, '4857.477')] [2023-03-10 21:05:20,635][1096443] Updated weights for policy 0, policy_version 94480 (0.0006) [2023-03-10 21:05:23,945][1096443] Updated weights for policy 0, policy_version 94560 (0.0005) [2023-03-10 21:05:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 48422912. Throughput: 0: 11948.3. Samples: 48394252. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:05:24,742][1096160] Avg episode reward: [(0, '4856.825')] [2023-03-10 21:05:27,221][1096443] Updated weights for policy 0, policy_version 94640 (0.0005) [2023-03-10 21:05:29,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 48484352. Throughput: 0: 11909.2. Samples: 48467932. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:05:29,742][1096160] Avg episode reward: [(0, '4854.244')] [2023-03-10 21:05:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000094696_48484352.pth... [2023-03-10 21:05:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000094000_48128000.pth [2023-03-10 21:05:30,667][1096443] Updated weights for policy 0, policy_version 94720 (0.0004) [2023-03-10 21:05:34,222][1096443] Updated weights for policy 0, policy_version 94800 (0.0005) [2023-03-10 21:05:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 48541696. Throughput: 0: 11856.9. Samples: 48537608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:05:34,742][1096160] Avg episode reward: [(0, '4855.334')] [2023-03-10 21:05:37,643][1096443] Updated weights for policy 0, policy_version 94880 (0.0005) [2023-03-10 21:05:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 48603136. Throughput: 0: 11918.3. Samples: 48574284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:05:39,742][1096160] Avg episode reward: [(0, '4859.480')] [2023-03-10 21:05:41,005][1096443] Updated weights for policy 0, policy_version 94960 (0.0004) [2023-03-10 21:05:44,290][1096443] Updated weights for policy 0, policy_version 95040 (0.0005) [2023-03-10 21:05:44,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.6, 300 sec: 11982.5). Total num frames: 48664576. Throughput: 0: 11925.3. Samples: 48648192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:05:44,742][1096160] Avg episode reward: [(0, '4857.551')] [2023-03-10 21:05:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000095048_48664576.pth... [2023-03-10 21:05:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000094344_48304128.pth [2023-03-10 21:05:47,715][1096443] Updated weights for policy 0, policy_version 95120 (0.0005) [2023-03-10 21:05:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 48726016. Throughput: 0: 12059.5. Samples: 48721440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:05:49,742][1096160] Avg episode reward: [(0, '4858.072')] [2023-03-10 21:05:50,943][1096443] Updated weights for policy 0, policy_version 95200 (0.0005) [2023-03-10 21:05:54,347][1096443] Updated weights for policy 0, policy_version 95280 (0.0005) [2023-03-10 21:05:54,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 48787456. Throughput: 0: 12072.7. Samples: 48757268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:05:54,742][1096160] Avg episode reward: [(0, '4859.725')] [2023-03-10 21:05:57,936][1096443] Updated weights for policy 0, policy_version 95360 (0.0005) [2023-03-10 21:05:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 48844800. Throughput: 0: 12015.1. Samples: 48828424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:05:59,754][1096160] Avg episode reward: [(0, '4857.394')] [2023-03-10 21:05:59,758][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000095400_48844800.pth... [2023-03-10 21:05:59,760][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000094696_48484352.pth [2023-03-10 21:06:01,393][1096443] Updated weights for policy 0, policy_version 95440 (0.0005) [2023-03-10 21:06:04,692][1096443] Updated weights for policy 0, policy_version 95520 (0.0005) [2023-03-10 21:06:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 48906240. Throughput: 0: 12074.2. Samples: 48900780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:04,742][1096160] Avg episode reward: [(0, '4859.133')] [2023-03-10 21:06:08,290][1096443] Updated weights for policy 0, policy_version 95600 (0.0005) [2023-03-10 21:06:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 48963584. Throughput: 0: 12017.8. Samples: 48935052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:09,742][1096160] Avg episode reward: [(0, '4860.995')] [2023-03-10 21:06:11,617][1096443] Updated weights for policy 0, policy_version 95680 (0.0005) [2023-03-10 21:06:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 49025024. Throughput: 0: 11972.5. Samples: 49006692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:14,742][1096160] Avg episode reward: [(0, '4860.927')] [2023-03-10 21:06:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000095752_49025024.pth... [2023-03-10 21:06:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000095048_48664576.pth [2023-03-10 21:06:15,123][1096443] Updated weights for policy 0, policy_version 95760 (0.0005) [2023-03-10 21:06:18,534][1096443] Updated weights for policy 0, policy_version 95840 (0.0004) [2023-03-10 21:06:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 49082368. Throughput: 0: 12016.2. Samples: 49078336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:19,742][1096160] Avg episode reward: [(0, '4862.302')] [2023-03-10 21:06:22,062][1096443] Updated weights for policy 0, policy_version 95920 (0.0005) [2023-03-10 21:06:24,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 49139712. Throughput: 0: 11970.5. Samples: 49112956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:24,742][1096160] Avg episode reward: [(0, '4862.328')] [2023-03-10 21:06:25,398][1096443] Updated weights for policy 0, policy_version 96000 (0.0005) [2023-03-10 21:06:28,827][1096443] Updated weights for policy 0, policy_version 96080 (0.0005) [2023-03-10 21:06:29,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 49201152. Throughput: 0: 11950.9. Samples: 49185980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:29,742][1096160] Avg episode reward: [(0, '4858.707')] [2023-03-10 21:06:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000096096_49201152.pth... [2023-03-10 21:06:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000095400_48844800.pth [2023-03-10 21:06:32,408][1096443] Updated weights for policy 0, policy_version 96160 (0.0005) [2023-03-10 21:06:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 49258496. Throughput: 0: 11845.0. Samples: 49254464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:34,742][1096160] Avg episode reward: [(0, '4858.355')] [2023-03-10 21:06:35,978][1096443] Updated weights for policy 0, policy_version 96240 (0.0005) [2023-03-10 21:06:39,403][1096443] Updated weights for policy 0, policy_version 96320 (0.0004) [2023-03-10 21:06:39,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 49315840. Throughput: 0: 11859.1. Samples: 49290928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:39,742][1096160] Avg episode reward: [(0, '4863.565')] [2023-03-10 21:06:42,839][1096443] Updated weights for policy 0, policy_version 96400 (0.0005) [2023-03-10 21:06:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 49377280. Throughput: 0: 11834.1. Samples: 49360960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:44,742][1096160] Avg episode reward: [(0, '4854.603')] [2023-03-10 21:06:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000096440_49377280.pth... [2023-03-10 21:06:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000095752_49025024.pth [2023-03-10 21:06:46,295][1096443] Updated weights for policy 0, policy_version 96480 (0.0004) [2023-03-10 21:06:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 49434624. Throughput: 0: 11772.4. Samples: 49430536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:49,742][1096160] Avg episode reward: [(0, '4856.430')] [2023-03-10 21:06:50,014][1096443] Updated weights for policy 0, policy_version 96560 (0.0005) [2023-03-10 21:06:53,595][1096443] Updated weights for policy 0, policy_version 96640 (0.0005) [2023-03-10 21:06:54,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11899.2). Total num frames: 49491968. Throughput: 0: 11775.3. Samples: 49464940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:54,742][1096160] Avg episode reward: [(0, '4860.230')] [2023-03-10 21:06:57,142][1096443] Updated weights for policy 0, policy_version 96720 (0.0005) [2023-03-10 21:06:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11899.2). Total num frames: 49549312. Throughput: 0: 11696.9. Samples: 49533052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:06:59,742][1096160] Avg episode reward: [(0, '4854.796')] [2023-03-10 21:06:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000096776_49549312.pth... [2023-03-10 21:06:59,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000096096_49201152.pth [2023-03-10 21:07:00,582][1096443] Updated weights for policy 0, policy_version 96800 (0.0005) [2023-03-10 21:07:03,886][1096443] Updated weights for policy 0, policy_version 96880 (0.0005) [2023-03-10 21:07:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11899.2). Total num frames: 49610752. Throughput: 0: 11740.5. Samples: 49606656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:04,742][1096160] Avg episode reward: [(0, '4861.874')] [2023-03-10 21:07:07,316][1096443] Updated weights for policy 0, policy_version 96960 (0.0004) [2023-03-10 21:07:09,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11885.3). Total num frames: 49668096. Throughput: 0: 11781.3. Samples: 49643116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:09,742][1096160] Avg episode reward: [(0, '4856.870')] [2023-03-10 21:07:10,704][1096443] Updated weights for policy 0, policy_version 97040 (0.0005) [2023-03-10 21:07:14,031][1096443] Updated weights for policy 0, policy_version 97120 (0.0005) [2023-03-10 21:07:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11885.3). Total num frames: 49729536. Throughput: 0: 11777.3. Samples: 49715960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:14,742][1096160] Avg episode reward: [(0, '4862.867')] [2023-03-10 21:07:14,786][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000097136_49733632.pth... [2023-03-10 21:07:14,788][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000096440_49377280.pth [2023-03-10 21:07:17,739][1096443] Updated weights for policy 0, policy_version 97200 (0.0005) [2023-03-10 21:07:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11871.5). Total num frames: 49786880. Throughput: 0: 11763.6. Samples: 49783828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:19,742][1096160] Avg episode reward: [(0, '4857.431')] [2023-03-10 21:07:21,280][1096443] Updated weights for policy 0, policy_version 97280 (0.0005) [2023-03-10 21:07:24,671][1096443] Updated weights for policy 0, policy_version 97360 (0.0005) [2023-03-10 21:07:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 49848320. Throughput: 0: 11704.7. Samples: 49817640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:24,742][1096160] Avg episode reward: [(0, '4853.039')] [2023-03-10 21:07:28,284][1096443] Updated weights for policy 0, policy_version 97440 (0.0005) [2023-03-10 21:07:29,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 49901568. Throughput: 0: 11739.8. Samples: 49889252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:29,742][1096160] Avg episode reward: [(0, '4860.700')] [2023-03-10 21:07:29,757][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000097472_49905664.pth... [2023-03-10 21:07:29,771][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000096776_49549312.pth [2023-03-10 21:07:31,850][1096443] Updated weights for policy 0, policy_version 97520 (0.0005) [2023-03-10 21:07:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 49967104. Throughput: 0: 11797.6. Samples: 49961428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:34,742][1096160] Avg episode reward: [(0, '4863.144')] [2023-03-10 21:07:34,971][1096443] Updated weights for policy 0, policy_version 97600 (0.0004) [2023-03-10 21:07:38,489][1096443] Updated weights for policy 0, policy_version 97680 (0.0005) [2023-03-10 21:07:39,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 50024448. Throughput: 0: 11847.3. Samples: 49998068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:39,742][1096160] Avg episode reward: [(0, '4856.919')] [2023-03-10 21:07:41,979][1096443] Updated weights for policy 0, policy_version 97760 (0.0005) [2023-03-10 21:07:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 50085888. Throughput: 0: 11869.2. Samples: 50067168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:44,742][1096160] Avg episode reward: [(0, '4861.454')] [2023-03-10 21:07:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000097824_50085888.pth... [2023-03-10 21:07:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000097136_49733632.pth [2023-03-10 21:07:45,321][1096443] Updated weights for policy 0, policy_version 97840 (0.0005) [2023-03-10 21:07:48,776][1096443] Updated weights for policy 0, policy_version 97920 (0.0004) [2023-03-10 21:07:49,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11810.2, 300 sec: 11843.7). Total num frames: 50143232. Throughput: 0: 11842.6. Samples: 50139572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:49,752][1096160] Avg episode reward: [(0, '4862.105')] [2023-03-10 21:07:52,139][1096443] Updated weights for policy 0, policy_version 98000 (0.0005) [2023-03-10 21:07:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 50204672. Throughput: 0: 11857.2. Samples: 50176692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:54,742][1096160] Avg episode reward: [(0, '4858.693')] [2023-03-10 21:07:55,526][1096443] Updated weights for policy 0, policy_version 98080 (0.0005) [2023-03-10 21:07:58,976][1096443] Updated weights for policy 0, policy_version 98160 (0.0005) [2023-03-10 21:07:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 50266112. Throughput: 0: 11851.5. Samples: 50249280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:07:59,742][1096160] Avg episode reward: [(0, '4863.096')] [2023-03-10 21:07:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000098176_50266112.pth... [2023-03-10 21:07:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000097472_49905664.pth [2023-03-10 21:08:02,225][1096443] Updated weights for policy 0, policy_version 98240 (0.0005) [2023-03-10 21:08:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 50327552. Throughput: 0: 11998.1. Samples: 50323744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:08:04,742][1096160] Avg episode reward: [(0, '4860.332')] [2023-03-10 21:08:05,548][1096443] Updated weights for policy 0, policy_version 98320 (0.0005) [2023-03-10 21:08:09,002][1096443] Updated weights for policy 0, policy_version 98400 (0.0005) [2023-03-10 21:08:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.6, 300 sec: 11885.3). Total num frames: 50384896. Throughput: 0: 12061.0. Samples: 50360384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:08:09,742][1096160] Avg episode reward: [(0, '4862.631')] [2023-03-10 21:08:12,563][1096443] Updated weights for policy 0, policy_version 98480 (0.0005) [2023-03-10 21:08:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 50446336. Throughput: 0: 12015.4. Samples: 50429948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:08:14,742][1096160] Avg episode reward: [(0, '4861.040')] [2023-03-10 21:08:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000098528_50446336.pth... [2023-03-10 21:08:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000097824_50085888.pth [2023-03-10 21:08:16,135][1096443] Updated weights for policy 0, policy_version 98560 (0.0005) [2023-03-10 21:08:19,640][1096443] Updated weights for policy 0, policy_version 98640 (0.0004) [2023-03-10 21:08:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 50503680. Throughput: 0: 11958.2. Samples: 50499548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:08:19,742][1096160] Avg episode reward: [(0, '4859.812')] [2023-03-10 21:08:23,218][1096443] Updated weights for policy 0, policy_version 98720 (0.0005) [2023-03-10 21:08:24,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 50561024. Throughput: 0: 11873.0. Samples: 50532352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:08:24,742][1096160] Avg episode reward: [(0, '4861.865')] [2023-03-10 21:08:26,666][1096443] Updated weights for policy 0, policy_version 98800 (0.0004) [2023-03-10 21:08:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 50618368. Throughput: 0: 11894.6. Samples: 50602424. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:08:29,742][1096160] Avg episode reward: [(0, '4858.900')] [2023-03-10 21:08:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000098864_50618368.pth... [2023-03-10 21:08:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000098176_50266112.pth [2023-03-10 21:08:30,153][1096443] Updated weights for policy 0, policy_version 98880 (0.0005) [2023-03-10 21:08:33,565][1096443] Updated weights for policy 0, policy_version 98960 (0.0005) [2023-03-10 21:08:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 50679808. Throughput: 0: 11915.6. Samples: 50675776. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:08:34,742][1096160] Avg episode reward: [(0, '4856.354')] [2023-03-10 21:08:36,812][1096443] Updated weights for policy 0, policy_version 99040 (0.0005) [2023-03-10 21:08:39,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 50741248. Throughput: 0: 11913.2. Samples: 50712784. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:08:39,742][1096160] Avg episode reward: [(0, '4854.008')] [2023-03-10 21:08:40,091][1096443] Updated weights for policy 0, policy_version 99120 (0.0005) [2023-03-10 21:08:43,276][1096443] Updated weights for policy 0, policy_version 99200 (0.0005) [2023-03-10 21:08:44,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 50806784. Throughput: 0: 12023.5. Samples: 50790336. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:08:44,742][1096160] Avg episode reward: [(0, '4854.244')] [2023-03-10 21:08:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000099232_50806784.pth... [2023-03-10 21:08:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000098528_50446336.pth [2023-03-10 21:08:46,495][1096443] Updated weights for policy 0, policy_version 99280 (0.0005) [2023-03-10 21:08:49,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12083.2, 300 sec: 11954.8). Total num frames: 50868224. Throughput: 0: 12010.0. Samples: 50864192. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:08:49,742][1096160] Avg episode reward: [(0, '4858.648')] [2023-03-10 21:08:49,940][1096443] Updated weights for policy 0, policy_version 99360 (0.0005) [2023-03-10 21:08:53,458][1096443] Updated weights for policy 0, policy_version 99440 (0.0004) [2023-03-10 21:08:54,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 50925568. Throughput: 0: 11976.9. Samples: 50899344. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:08:54,742][1096160] Avg episode reward: [(0, '4858.749')] [2023-03-10 21:08:56,718][1096443] Updated weights for policy 0, policy_version 99520 (0.0005) [2023-03-10 21:08:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 50987008. Throughput: 0: 12036.2. Samples: 50971576. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:08:59,742][1096160] Avg episode reward: [(0, '4855.197')] [2023-03-10 21:08:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000099584_50987008.pth... [2023-03-10 21:08:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000098864_50618368.pth [2023-03-10 21:09:00,224][1096443] Updated weights for policy 0, policy_version 99600 (0.0005) [2023-03-10 21:09:03,796][1096443] Updated weights for policy 0, policy_version 99680 (0.0005) [2023-03-10 21:09:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 51044352. Throughput: 0: 12018.2. Samples: 51040364. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:09:04,742][1096160] Avg episode reward: [(0, '4850.267')] [2023-03-10 21:09:07,386][1096443] Updated weights for policy 0, policy_version 99760 (0.0005) [2023-03-10 21:09:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 51105792. Throughput: 0: 12082.2. Samples: 51076052. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:09:09,742][1096160] Avg episode reward: [(0, '4854.128')] [2023-03-10 21:09:10,746][1096443] Updated weights for policy 0, policy_version 99840 (0.0005) [2023-03-10 21:09:14,308][1096443] Updated weights for policy 0, policy_version 99920 (0.0005) [2023-03-10 21:09:14,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 51163136. Throughput: 0: 12096.3. Samples: 51146760. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:09:14,742][1096160] Avg episode reward: [(0, '4854.113')] [2023-03-10 21:09:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000099928_51163136.pth... [2023-03-10 21:09:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000099232_50806784.pth [2023-03-10 21:09:17,907][1096443] Updated weights for policy 0, policy_version 100000 (0.0005) [2023-03-10 21:09:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 51220480. Throughput: 0: 12001.7. Samples: 51215852. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:09:19,742][1096160] Avg episode reward: [(0, '4853.486')] [2023-03-10 21:09:21,556][1096443] Updated weights for policy 0, policy_version 100080 (0.0005) [2023-03-10 21:09:24,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 51277824. Throughput: 0: 11913.4. Samples: 51248888. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:09:24,753][1096160] Avg episode reward: [(0, '4853.527')] [2023-03-10 21:09:25,053][1096443] Updated weights for policy 0, policy_version 100160 (0.0005) [2023-03-10 21:09:28,586][1096443] Updated weights for policy 0, policy_version 100240 (0.0005) [2023-03-10 21:09:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 51335168. Throughput: 0: 11743.3. Samples: 51318784. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:09:29,755][1096160] Avg episode reward: [(0, '4855.104')] [2023-03-10 21:09:29,759][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000100264_51335168.pth... [2023-03-10 21:09:29,761][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000099584_50987008.pth [2023-03-10 21:09:32,182][1096443] Updated weights for policy 0, policy_version 100320 (0.0005) [2023-03-10 21:09:34,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 51392512. Throughput: 0: 11649.7. Samples: 51388428. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:09:34,753][1096160] Avg episode reward: [(0, '4857.565')] [2023-03-10 21:09:35,661][1096443] Updated weights for policy 0, policy_version 100400 (0.0005) [2023-03-10 21:09:38,965][1096443] Updated weights for policy 0, policy_version 100480 (0.0005) [2023-03-10 21:09:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 51453952. Throughput: 0: 11687.4. Samples: 51425280. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:09:39,742][1096160] Avg episode reward: [(0, '4860.280')] [2023-03-10 21:09:42,659][1096443] Updated weights for policy 0, policy_version 100560 (0.0005) [2023-03-10 21:09:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11871.5). Total num frames: 51511296. Throughput: 0: 11599.7. Samples: 51493564. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:09:44,742][1096160] Avg episode reward: [(0, '4857.880')] [2023-03-10 21:09:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000100608_51511296.pth... [2023-03-10 21:09:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000099928_51163136.pth [2023-03-10 21:09:46,160][1096443] Updated weights for policy 0, policy_version 100640 (0.0005) [2023-03-10 21:09:49,679][1096443] Updated weights for policy 0, policy_version 100720 (0.0004) [2023-03-10 21:09:49,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11871.5). Total num frames: 51568640. Throughput: 0: 11640.6. Samples: 51564192. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:09:49,742][1096160] Avg episode reward: [(0, '4854.862')] [2023-03-10 21:09:53,011][1096443] Updated weights for policy 0, policy_version 100800 (0.0005) [2023-03-10 21:09:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.8, 300 sec: 11871.5). Total num frames: 51630080. Throughput: 0: 11662.0. Samples: 51600840. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:09:54,742][1096160] Avg episode reward: [(0, '4856.363')] [2023-03-10 21:09:56,432][1096443] Updated weights for policy 0, policy_version 100880 (0.0004) [2023-03-10 21:09:59,597][1096443] Updated weights for policy 0, policy_version 100960 (0.0005) [2023-03-10 21:09:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11741.9, 300 sec: 11899.2). Total num frames: 51691520. Throughput: 0: 11696.5. Samples: 51673104. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:09:59,742][1096160] Avg episode reward: [(0, '4856.840')] [2023-03-10 21:09:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000100960_51691520.pth... [2023-03-10 21:09:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000100264_51335168.pth [2023-03-10 21:10:02,860][1096443] Updated weights for policy 0, policy_version 101040 (0.0005) [2023-03-10 21:10:04,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 51752960. Throughput: 0: 11844.9. Samples: 51748872. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:10:04,742][1096160] Avg episode reward: [(0, '4857.793')] [2023-03-10 21:10:06,214][1096443] Updated weights for policy 0, policy_version 101120 (0.0005) [2023-03-10 21:10:09,570][1096443] Updated weights for policy 0, policy_version 101200 (0.0005) [2023-03-10 21:10:09,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 51814400. Throughput: 0: 11929.3. Samples: 51785708. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:10:09,742][1096160] Avg episode reward: [(0, '4856.644')] [2023-03-10 21:10:12,850][1096443] Updated weights for policy 0, policy_version 101280 (0.0005) [2023-03-10 21:10:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 51875840. Throughput: 0: 12016.4. Samples: 51859520. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:10:14,742][1096160] Avg episode reward: [(0, '4857.571')] [2023-03-10 21:10:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000101320_51875840.pth... [2023-03-10 21:10:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000100608_51511296.pth [2023-03-10 21:10:16,482][1096443] Updated weights for policy 0, policy_version 101360 (0.0005) [2023-03-10 21:10:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 51929088. Throughput: 0: 11986.7. Samples: 51927828. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:10:19,742][1096160] Avg episode reward: [(0, '4859.249')] [2023-03-10 21:10:20,085][1096443] Updated weights for policy 0, policy_version 101440 (0.0005) [2023-03-10 21:10:23,502][1096443] Updated weights for policy 0, policy_version 101520 (0.0005) [2023-03-10 21:10:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 51990528. Throughput: 0: 11981.0. Samples: 51964424. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:10:24,753][1096160] Avg episode reward: [(0, '4852.848')] [2023-03-10 21:10:27,028][1096443] Updated weights for policy 0, policy_version 101600 (0.0005) [2023-03-10 21:10:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 52047872. Throughput: 0: 11962.8. Samples: 52031892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:10:29,755][1096160] Avg episode reward: [(0, '4858.348')] [2023-03-10 21:10:29,759][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000101656_52047872.pth... [2023-03-10 21:10:29,762][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000100960_51691520.pth [2023-03-10 21:10:30,582][1096443] Updated weights for policy 0, policy_version 101680 (0.0005) [2023-03-10 21:10:33,882][1096443] Updated weights for policy 0, policy_version 101760 (0.0005) [2023-03-10 21:10:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11885.3). Total num frames: 52109312. Throughput: 0: 12024.2. Samples: 52105280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:10:34,742][1096160] Avg episode reward: [(0, '4854.787')] [2023-03-10 21:10:37,319][1096443] Updated weights for policy 0, policy_version 101840 (0.0005) [2023-03-10 21:10:39,742][1096160] Fps is (10 sec: 11878.6, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 52166656. Throughput: 0: 12004.9. Samples: 52141060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:10:39,742][1096160] Avg episode reward: [(0, '4853.841')] [2023-03-10 21:10:40,848][1096443] Updated weights for policy 0, policy_version 101920 (0.0005) [2023-03-10 21:10:44,173][1096443] Updated weights for policy 0, policy_version 102000 (0.0005) [2023-03-10 21:10:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 52228096. Throughput: 0: 11978.0. Samples: 52212112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:10:44,742][1096160] Avg episode reward: [(0, '4858.066')] [2023-03-10 21:10:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000102008_52228096.pth... [2023-03-10 21:10:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000101320_51875840.pth [2023-03-10 21:10:47,586][1096443] Updated weights for policy 0, policy_version 102080 (0.0004) [2023-03-10 21:10:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11871.5). Total num frames: 52289536. Throughput: 0: 11916.3. Samples: 52285104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:10:49,742][1096160] Avg episode reward: [(0, '4854.630')] [2023-03-10 21:10:51,073][1096443] Updated weights for policy 0, policy_version 102160 (0.0004) [2023-03-10 21:10:54,467][1096443] Updated weights for policy 0, policy_version 102240 (0.0004) [2023-03-10 21:10:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 52346880. Throughput: 0: 11873.3. Samples: 52320008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:10:54,742][1096160] Avg episode reward: [(0, '4856.602')] [2023-03-10 21:10:58,000][1096443] Updated weights for policy 0, policy_version 102320 (0.0005) [2023-03-10 21:10:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11871.5). Total num frames: 52408320. Throughput: 0: 11817.8. Samples: 52391320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:10:59,742][1096160] Avg episode reward: [(0, '4856.003')] [2023-03-10 21:10:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000102360_52408320.pth... [2023-03-10 21:10:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000101656_52047872.pth [2023-03-10 21:11:01,209][1096443] Updated weights for policy 0, policy_version 102400 (0.0005) [2023-03-10 21:11:04,741][1096443] Updated weights for policy 0, policy_version 102480 (0.0005) [2023-03-10 21:11:04,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11885.3). Total num frames: 52469760. Throughput: 0: 11918.5. Samples: 52464160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:04,742][1096160] Avg episode reward: [(0, '4855.706')] [2023-03-10 21:11:08,233][1096443] Updated weights for policy 0, policy_version 102560 (0.0005) [2023-03-10 21:11:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 52527104. Throughput: 0: 11868.6. Samples: 52498512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:09,742][1096160] Avg episode reward: [(0, '4855.735')] [2023-03-10 21:11:11,640][1096443] Updated weights for policy 0, policy_version 102640 (0.0005) [2023-03-10 21:11:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 52588544. Throughput: 0: 11979.7. Samples: 52570976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:14,742][1096160] Avg episode reward: [(0, '4855.100')] [2023-03-10 21:11:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000102712_52588544.pth... [2023-03-10 21:11:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000102008_52228096.pth [2023-03-10 21:11:14,925][1096443] Updated weights for policy 0, policy_version 102720 (0.0005) [2023-03-10 21:11:18,493][1096443] Updated weights for policy 0, policy_version 102800 (0.0005) [2023-03-10 21:11:19,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11885.3). Total num frames: 52645888. Throughput: 0: 11925.0. Samples: 52641904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:19,742][1096160] Avg episode reward: [(0, '4859.692')] [2023-03-10 21:11:21,917][1096443] Updated weights for policy 0, policy_version 102880 (0.0005) [2023-03-10 21:11:24,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 52703232. Throughput: 0: 11944.5. Samples: 52678564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:24,753][1096160] Avg episode reward: [(0, '4856.146')] [2023-03-10 21:11:25,631][1096443] Updated weights for policy 0, policy_version 102960 (0.0005) [2023-03-10 21:11:28,888][1096443] Updated weights for policy 0, policy_version 103040 (0.0005) [2023-03-10 21:11:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11885.3). Total num frames: 52764672. Throughput: 0: 11916.6. Samples: 52748360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:29,755][1096160] Avg episode reward: [(0, '4857.231')] [2023-03-10 21:11:29,758][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000103056_52764672.pth... [2023-03-10 21:11:29,761][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000102360_52408320.pth [2023-03-10 21:11:32,473][1096443] Updated weights for policy 0, policy_version 103120 (0.0005) [2023-03-10 21:11:34,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 52822016. Throughput: 0: 11840.0. Samples: 52817904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:34,742][1096160] Avg episode reward: [(0, '4860.031')] [2023-03-10 21:11:35,864][1096443] Updated weights for policy 0, policy_version 103200 (0.0005) [2023-03-10 21:11:39,171][1096443] Updated weights for policy 0, policy_version 103280 (0.0005) [2023-03-10 21:11:39,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.6, 300 sec: 11885.3). Total num frames: 52883456. Throughput: 0: 11939.7. Samples: 52857296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:39,742][1096160] Avg episode reward: [(0, '4856.773')] [2023-03-10 21:11:42,606][1096443] Updated weights for policy 0, policy_version 103360 (0.0005) [2023-03-10 21:11:44,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 52944896. Throughput: 0: 11936.6. Samples: 52928464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:44,742][1096160] Avg episode reward: [(0, '4850.052')] [2023-03-10 21:11:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000103408_52944896.pth... [2023-03-10 21:11:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000102712_52588544.pth [2023-03-10 21:11:45,908][1096443] Updated weights for policy 0, policy_version 103440 (0.0005) [2023-03-10 21:11:49,324][1096443] Updated weights for policy 0, policy_version 103520 (0.0005) [2023-03-10 21:11:49,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 53006336. Throughput: 0: 11954.0. Samples: 53002092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:49,742][1096160] Avg episode reward: [(0, '4855.487')] [2023-03-10 21:11:52,749][1096443] Updated weights for policy 0, policy_version 103600 (0.0005) [2023-03-10 21:11:54,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 53063680. Throughput: 0: 11941.4. Samples: 53035872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:54,742][1096160] Avg episode reward: [(0, '4852.051')] [2023-03-10 21:11:56,167][1096443] Updated weights for policy 0, policy_version 103680 (0.0005) [2023-03-10 21:11:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 53121024. Throughput: 0: 11950.7. Samples: 53108756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:11:59,742][1096160] Avg episode reward: [(0, '4849.906')] [2023-03-10 21:11:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000103752_53121024.pth... [2023-03-10 21:11:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000103056_52764672.pth [2023-03-10 21:11:59,817][1096443] Updated weights for policy 0, policy_version 103760 (0.0005) [2023-03-10 21:12:03,235][1096443] Updated weights for policy 0, policy_version 103840 (0.0005) [2023-03-10 21:12:04,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 53178368. Throughput: 0: 11907.0. Samples: 53177720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:12:04,742][1096160] Avg episode reward: [(0, '4845.113')] [2023-03-10 21:12:06,835][1096443] Updated weights for policy 0, policy_version 103920 (0.0005) [2023-03-10 21:12:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 53239808. Throughput: 0: 11835.1. Samples: 53211144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:12:09,742][1096160] Avg episode reward: [(0, '4848.599')] [2023-03-10 21:12:10,263][1096443] Updated weights for policy 0, policy_version 104000 (0.0005) [2023-03-10 21:12:13,546][1096443] Updated weights for policy 0, policy_version 104080 (0.0005) [2023-03-10 21:12:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 53301248. Throughput: 0: 11922.8. Samples: 53284888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:12:14,742][1096160] Avg episode reward: [(0, '4848.277')] [2023-03-10 21:12:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000104104_53301248.pth... [2023-03-10 21:12:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000103408_52944896.pth [2023-03-10 21:12:17,053][1096443] Updated weights for policy 0, policy_version 104160 (0.0005) [2023-03-10 21:12:19,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 53358592. Throughput: 0: 11954.2. Samples: 53355844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:12:19,742][1096160] Avg episode reward: [(0, '4848.374')] [2023-03-10 21:12:20,428][1096443] Updated weights for policy 0, policy_version 104240 (0.0006) [2023-03-10 21:12:23,828][1096443] Updated weights for policy 0, policy_version 104320 (0.0005) [2023-03-10 21:12:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 53420032. Throughput: 0: 11896.5. Samples: 53392636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:12:24,753][1096160] Avg episode reward: [(0, '4849.539')] [2023-03-10 21:12:27,213][1096443] Updated weights for policy 0, policy_version 104400 (0.0005) [2023-03-10 21:12:29,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 53481472. Throughput: 0: 11922.3. Samples: 53464968. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:12:29,742][1096160] Avg episode reward: [(0, '4847.173')] [2023-03-10 21:12:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000104456_53481472.pth... [2023-03-10 21:12:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000103752_53121024.pth [2023-03-10 21:12:30,780][1096443] Updated weights for policy 0, policy_version 104480 (0.0006) [2023-03-10 21:12:34,314][1096443] Updated weights for policy 0, policy_version 104560 (0.0004) [2023-03-10 21:12:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 53538816. Throughput: 0: 11821.6. Samples: 53534064. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:12:34,742][1096160] Avg episode reward: [(0, '4854.238')] [2023-03-10 21:12:37,644][1096443] Updated weights for policy 0, policy_version 104640 (0.0005) [2023-03-10 21:12:39,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 53596160. Throughput: 0: 11904.7. Samples: 53571584. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:12:39,742][1096160] Avg episode reward: [(0, '4861.005')] [2023-03-10 21:12:41,084][1096443] Updated weights for policy 0, policy_version 104720 (0.0004) [2023-03-10 21:12:44,646][1096443] Updated weights for policy 0, policy_version 104800 (0.0005) [2023-03-10 21:12:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 53657600. Throughput: 0: 11831.3. Samples: 53641164. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:12:44,742][1096160] Avg episode reward: [(0, '4859.262')] [2023-03-10 21:12:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000104800_53657600.pth... [2023-03-10 21:12:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000104104_53301248.pth [2023-03-10 21:12:48,139][1096443] Updated weights for policy 0, policy_version 104880 (0.0005) [2023-03-10 21:12:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 53714944. Throughput: 0: 11861.9. Samples: 53711508. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:12:49,742][1096160] Avg episode reward: [(0, '4852.940')] [2023-03-10 21:12:51,576][1096443] Updated weights for policy 0, policy_version 104960 (0.0005) [2023-03-10 21:12:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 53776384. Throughput: 0: 11925.1. Samples: 53747776. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:12:54,742][1096160] Avg episode reward: [(0, '4854.551')] [2023-03-10 21:12:54,906][1096443] Updated weights for policy 0, policy_version 105040 (0.0005) [2023-03-10 21:12:58,324][1096443] Updated weights for policy 0, policy_version 105120 (0.0005) [2023-03-10 21:12:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 53837824. Throughput: 0: 11920.8. Samples: 53821324. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:12:59,742][1096160] Avg episode reward: [(0, '4855.104')] [2023-03-10 21:12:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000105152_53837824.pth... [2023-03-10 21:12:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000104456_53481472.pth [2023-03-10 21:13:01,683][1096443] Updated weights for policy 0, policy_version 105200 (0.0005) [2023-03-10 21:13:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11913.1). Total num frames: 53899264. Throughput: 0: 11916.0. Samples: 53892064. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:13:04,742][1096160] Avg episode reward: [(0, '4857.080')] [2023-03-10 21:13:05,107][1096443] Updated weights for policy 0, policy_version 105280 (0.0005) [2023-03-10 21:13:08,406][1096443] Updated weights for policy 0, policy_version 105360 (0.0005) [2023-03-10 21:13:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11899.2). Total num frames: 53956608. Throughput: 0: 11897.2. Samples: 53928008. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:13:09,742][1096160] Avg episode reward: [(0, '4854.619')] [2023-03-10 21:13:11,722][1096443] Updated weights for policy 0, policy_version 105440 (0.0005) [2023-03-10 21:13:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 54018048. Throughput: 0: 11945.5. Samples: 54002516. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:13:14,742][1096160] Avg episode reward: [(0, '4856.130')] [2023-03-10 21:13:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000105504_54018048.pth... [2023-03-10 21:13:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000104800_53657600.pth [2023-03-10 21:13:15,102][1096443] Updated weights for policy 0, policy_version 105520 (0.0004) [2023-03-10 21:13:18,530][1096443] Updated weights for policy 0, policy_version 105600 (0.0005) [2023-03-10 21:13:19,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11927.0). Total num frames: 54079488. Throughput: 0: 12031.0. Samples: 54075456. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:13:19,742][1096160] Avg episode reward: [(0, '4852.347')] [2023-03-10 21:13:21,884][1096443] Updated weights for policy 0, policy_version 105680 (0.0005) [2023-03-10 21:13:24,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 54140928. Throughput: 0: 12019.1. Samples: 54112444. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:13:24,753][1096160] Avg episode reward: [(0, '4854.267')] [2023-03-10 21:13:25,209][1096443] Updated weights for policy 0, policy_version 105760 (0.0005) [2023-03-10 21:13:28,459][1096443] Updated weights for policy 0, policy_version 105840 (0.0004) [2023-03-10 21:13:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 54202368. Throughput: 0: 12120.4. Samples: 54186580. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:13:29,755][1096160] Avg episode reward: [(0, '4858.466')] [2023-03-10 21:13:29,759][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000105864_54202368.pth... [2023-03-10 21:13:29,761][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000105152_53837824.pth [2023-03-10 21:13:31,953][1096443] Updated weights for policy 0, policy_version 105920 (0.0005) [2023-03-10 21:13:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11927.0). Total num frames: 54259712. Throughput: 0: 12125.1. Samples: 54257136. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:13:34,753][1096160] Avg episode reward: [(0, '4853.017')] [2023-03-10 21:13:35,499][1096443] Updated weights for policy 0, policy_version 106000 (0.0004) [2023-03-10 21:13:38,809][1096443] Updated weights for policy 0, policy_version 106080 (0.0005) [2023-03-10 21:13:39,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 12083.2, 300 sec: 11913.1). Total num frames: 54321152. Throughput: 0: 12135.1. Samples: 54293856. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:13:39,742][1096160] Avg episode reward: [(0, '4861.176')] [2023-03-10 21:13:42,372][1096443] Updated weights for policy 0, policy_version 106160 (0.0005) [2023-03-10 21:13:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11899.2). Total num frames: 54378496. Throughput: 0: 12042.0. Samples: 54363212. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:13:44,742][1096160] Avg episode reward: [(0, '4859.946')] [2023-03-10 21:13:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000106208_54378496.pth... [2023-03-10 21:13:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000105504_54018048.pth [2023-03-10 21:13:45,857][1096443] Updated weights for policy 0, policy_version 106240 (0.0005) [2023-03-10 21:13:49,438][1096443] Updated weights for policy 0, policy_version 106320 (0.0004) [2023-03-10 21:13:49,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 12015.0, 300 sec: 11899.2). Total num frames: 54435840. Throughput: 0: 12023.8. Samples: 54433136. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:13:49,742][1096160] Avg episode reward: [(0, '4855.187')] [2023-03-10 21:13:52,755][1096443] Updated weights for policy 0, policy_version 106400 (0.0004) [2023-03-10 21:13:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11913.1). Total num frames: 54501376. Throughput: 0: 12033.3. Samples: 54469508. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:13:54,742][1096160] Avg episode reward: [(0, '4856.885')] [2023-03-10 21:13:56,012][1096443] Updated weights for policy 0, policy_version 106480 (0.0005) [2023-03-10 21:13:59,432][1096443] Updated weights for policy 0, policy_version 106560 (0.0005) [2023-03-10 21:13:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12014.9, 300 sec: 11913.1). Total num frames: 54558720. Throughput: 0: 12048.6. Samples: 54544704. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:13:59,742][1096160] Avg episode reward: [(0, '4858.477')] [2023-03-10 21:13:59,749][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000106568_54562816.pth... [2023-03-10 21:13:59,752][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000105864_54202368.pth [2023-03-10 21:14:02,745][1096443] Updated weights for policy 0, policy_version 106640 (0.0004) [2023-03-10 21:14:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11913.1). Total num frames: 54620160. Throughput: 0: 12015.1. Samples: 54616136. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:14:04,742][1096160] Avg episode reward: [(0, '4862.321')] [2023-03-10 21:14:06,284][1096443] Updated weights for policy 0, policy_version 106720 (0.0004) [2023-03-10 21:14:09,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11913.1). Total num frames: 54677504. Throughput: 0: 12007.7. Samples: 54652788. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:14:09,742][1096160] Avg episode reward: [(0, '4859.642')] [2023-03-10 21:14:09,809][1096443] Updated weights for policy 0, policy_version 106800 (0.0004) [2023-03-10 21:14:13,193][1096443] Updated weights for policy 0, policy_version 106880 (0.0005) [2023-03-10 21:14:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11927.0). Total num frames: 54738944. Throughput: 0: 11912.3. Samples: 54722632. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:14:14,742][1096160] Avg episode reward: [(0, '4860.193')] [2023-03-10 21:14:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000106912_54738944.pth... [2023-03-10 21:14:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000106208_54378496.pth [2023-03-10 21:14:16,602][1096443] Updated weights for policy 0, policy_version 106960 (0.0004) [2023-03-10 21:14:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11940.9). Total num frames: 54800384. Throughput: 0: 11981.1. Samples: 54796288. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:14:19,742][1096160] Avg episode reward: [(0, '4854.636')] [2023-03-10 21:14:19,931][1096443] Updated weights for policy 0, policy_version 107040 (0.0005) [2023-03-10 21:14:23,239][1096443] Updated weights for policy 0, policy_version 107120 (0.0005) [2023-03-10 21:14:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 54861824. Throughput: 0: 11932.0. Samples: 54830796. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:14:24,753][1096160] Avg episode reward: [(0, '4860.910')] [2023-03-10 21:14:26,627][1096443] Updated weights for policy 0, policy_version 107200 (0.0005) [2023-03-10 21:14:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 54919168. Throughput: 0: 12036.5. Samples: 54904856. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:14:29,754][1096160] Avg episode reward: [(0, '4855.379')] [2023-03-10 21:14:29,757][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000107264_54919168.pth... [2023-03-10 21:14:29,760][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000106568_54562816.pth [2023-03-10 21:14:30,207][1096443] Updated weights for policy 0, policy_version 107280 (0.0005) [2023-03-10 21:14:33,607][1096443] Updated weights for policy 0, policy_version 107360 (0.0005) [2023-03-10 21:14:34,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11954.8). Total num frames: 54980608. Throughput: 0: 12075.0. Samples: 54976512. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:14:34,752][1096160] Avg episode reward: [(0, '4850.384')] [2023-03-10 21:14:36,901][1096443] Updated weights for policy 0, policy_version 107440 (0.0005) [2023-03-10 21:14:39,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 55042048. Throughput: 0: 12084.7. Samples: 55013320. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:14:39,742][1096160] Avg episode reward: [(0, '4852.907')] [2023-03-10 21:14:40,180][1096443] Updated weights for policy 0, policy_version 107520 (0.0005) [2023-03-10 21:14:43,408][1096443] Updated weights for policy 0, policy_version 107600 (0.0005) [2023-03-10 21:14:44,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 55103488. Throughput: 0: 12102.1. Samples: 55089300. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:14:44,742][1096160] Avg episode reward: [(0, '4854.236')] [2023-03-10 21:14:44,772][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000107632_55107584.pth... [2023-03-10 21:14:44,774][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000106912_54738944.pth [2023-03-10 21:14:46,715][1096443] Updated weights for policy 0, policy_version 107680 (0.0005) [2023-03-10 21:14:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.4, 300 sec: 11982.5). Total num frames: 55164928. Throughput: 0: 12112.9. Samples: 55161216. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:14:49,742][1096160] Avg episode reward: [(0, '4854.456')] [2023-03-10 21:14:50,096][1096443] Updated weights for policy 0, policy_version 107760 (0.0005) [2023-03-10 21:14:53,358][1096443] Updated weights for policy 0, policy_version 107840 (0.0005) [2023-03-10 21:14:54,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12151.5, 300 sec: 11996.4). Total num frames: 55230464. Throughput: 0: 12170.9. Samples: 55200480. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:14:54,742][1096160] Avg episode reward: [(0, '4856.735')] [2023-03-10 21:14:56,714][1096443] Updated weights for policy 0, policy_version 107920 (0.0005) [2023-03-10 21:14:59,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12219.7, 300 sec: 11996.4). Total num frames: 55291904. Throughput: 0: 12279.7. Samples: 55275220. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:14:59,742][1096160] Avg episode reward: [(0, '4855.987')] [2023-03-10 21:14:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000107992_55291904.pth... [2023-03-10 21:14:59,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000107264_54919168.pth [2023-03-10 21:14:59,867][1096443] Updated weights for policy 0, policy_version 108000 (0.0005) [2023-03-10 21:15:03,234][1096443] Updated weights for policy 0, policy_version 108080 (0.0005) [2023-03-10 21:15:04,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 11996.4). Total num frames: 55353344. Throughput: 0: 12289.6. Samples: 55349320. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:15:04,742][1096160] Avg episode reward: [(0, '4854.512')] [2023-03-10 21:15:06,534][1096443] Updated weights for policy 0, policy_version 108160 (0.0005) [2023-03-10 21:15:09,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12288.0, 300 sec: 11996.4). Total num frames: 55414784. Throughput: 0: 12349.2. Samples: 55386512. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:15:09,742][1096160] Avg episode reward: [(0, '4859.755')] [2023-03-10 21:15:10,030][1096443] Updated weights for policy 0, policy_version 108240 (0.0005) [2023-03-10 21:15:13,501][1096443] Updated weights for policy 0, policy_version 108320 (0.0005) [2023-03-10 21:15:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 12010.3). Total num frames: 55472128. Throughput: 0: 12258.9. Samples: 55456508. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:15:14,742][1096160] Avg episode reward: [(0, '4853.922')] [2023-03-10 21:15:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000108344_55472128.pth... [2023-03-10 21:15:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000107632_55107584.pth [2023-03-10 21:15:16,904][1096443] Updated weights for policy 0, policy_version 108400 (0.0005) [2023-03-10 21:15:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12219.7, 300 sec: 12010.3). Total num frames: 55533568. Throughput: 0: 12276.7. Samples: 55528964. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:15:19,743][1096160] Avg episode reward: [(0, '4852.029')] [2023-03-10 21:15:20,343][1096443] Updated weights for policy 0, policy_version 108480 (0.0005) [2023-03-10 21:15:23,883][1096443] Updated weights for policy 0, policy_version 108560 (0.0005) [2023-03-10 21:15:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 55590912. Throughput: 0: 12225.0. Samples: 55563444. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:15:24,742][1096160] Avg episode reward: [(0, '4849.139')] [2023-03-10 21:15:27,363][1096443] Updated weights for policy 0, policy_version 108640 (0.0005) [2023-03-10 21:15:29,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12219.7, 300 sec: 12010.3). Total num frames: 55652352. Throughput: 0: 12141.4. Samples: 55635664. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:15:29,742][1096160] Avg episode reward: [(0, '4858.072')] [2023-03-10 21:15:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000108696_55652352.pth... [2023-03-10 21:15:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000107992_55291904.pth [2023-03-10 21:15:30,710][1096443] Updated weights for policy 0, policy_version 108720 (0.0005) [2023-03-10 21:15:33,840][1096443] Updated weights for policy 0, policy_version 108800 (0.0004) [2023-03-10 21:15:34,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 55713792. Throughput: 0: 12202.5. Samples: 55710328. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:15:34,752][1096160] Avg episode reward: [(0, '4856.675')] [2023-03-10 21:15:37,230][1096443] Updated weights for policy 0, policy_version 108880 (0.0005) [2023-03-10 21:15:39,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12024.2). Total num frames: 55775232. Throughput: 0: 12135.1. Samples: 55746560. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:15:39,742][1096160] Avg episode reward: [(0, '4857.936')] [2023-03-10 21:15:40,687][1096443] Updated weights for policy 0, policy_version 108960 (0.0005) [2023-03-10 21:15:44,184][1096443] Updated weights for policy 0, policy_version 109040 (0.0006) [2023-03-10 21:15:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 55832576. Throughput: 0: 12026.2. Samples: 55816400. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:15:44,742][1096160] Avg episode reward: [(0, '4858.627')] [2023-03-10 21:15:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000109048_55832576.pth... [2023-03-10 21:15:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000108344_55472128.pth [2023-03-10 21:15:47,600][1096443] Updated weights for policy 0, policy_version 109120 (0.0005) [2023-03-10 21:15:49,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 55894016. Throughput: 0: 12004.5. Samples: 55889524. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:15:49,742][1096160] Avg episode reward: [(0, '4853.432')] [2023-03-10 21:15:50,993][1096443] Updated weights for policy 0, policy_version 109200 (0.0005) [2023-03-10 21:15:54,560][1096443] Updated weights for policy 0, policy_version 109280 (0.0005) [2023-03-10 21:15:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 55951360. Throughput: 0: 11954.8. Samples: 55924476. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:15:54,742][1096160] Avg episode reward: [(0, '4857.685')] [2023-03-10 21:15:57,958][1096443] Updated weights for policy 0, policy_version 109360 (0.0005) [2023-03-10 21:15:59,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 56012800. Throughput: 0: 11987.9. Samples: 55995964. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:15:59,742][1096160] Avg episode reward: [(0, '4856.879')] [2023-03-10 21:15:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000109400_56012800.pth... [2023-03-10 21:15:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000108696_55652352.pth [2023-03-10 21:16:01,419][1096443] Updated weights for policy 0, policy_version 109440 (0.0005) [2023-03-10 21:16:04,716][1096443] Updated weights for policy 0, policy_version 109520 (0.0005) [2023-03-10 21:16:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 56074240. Throughput: 0: 11975.5. Samples: 56067860. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:16:04,742][1096160] Avg episode reward: [(0, '4858.498')] [2023-03-10 21:16:07,989][1096443] Updated weights for policy 0, policy_version 109600 (0.0004) [2023-03-10 21:16:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 56131584. Throughput: 0: 12076.3. Samples: 56106876. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:16:09,742][1096160] Avg episode reward: [(0, '4854.026')] [2023-03-10 21:16:11,456][1096443] Updated weights for policy 0, policy_version 109680 (0.0005) [2023-03-10 21:16:14,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 12024.2). Total num frames: 56193024. Throughput: 0: 12023.3. Samples: 56176712. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:16:14,742][1096160] Avg episode reward: [(0, '4858.011')] [2023-03-10 21:16:14,804][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000109760_56197120.pth... [2023-03-10 21:16:14,804][1096443] Updated weights for policy 0, policy_version 109760 (0.0005) [2023-03-10 21:16:14,805][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000109048_55832576.pth [2023-03-10 21:16:18,383][1096443] Updated weights for policy 0, policy_version 109840 (0.0005) [2023-03-10 21:16:19,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 56250368. Throughput: 0: 11918.6. Samples: 56246664. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:16:19,742][1096160] Avg episode reward: [(0, '4856.633')] [2023-03-10 21:16:21,936][1096443] Updated weights for policy 0, policy_version 109920 (0.0005) [2023-03-10 21:16:24,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 56311808. Throughput: 0: 11921.3. Samples: 56283020. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:16:24,742][1096160] Avg episode reward: [(0, '4858.672')] [2023-03-10 21:16:25,333][1096443] Updated weights for policy 0, policy_version 110000 (0.0005) [2023-03-10 21:16:28,740][1096443] Updated weights for policy 0, policy_version 110080 (0.0004) [2023-03-10 21:16:29,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 56369152. Throughput: 0: 11970.3. Samples: 56355064. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:16:29,742][1096160] Avg episode reward: [(0, '4855.306')] [2023-03-10 21:16:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000110096_56369152.pth... [2023-03-10 21:16:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000109400_56012800.pth [2023-03-10 21:16:32,254][1096443] Updated weights for policy 0, policy_version 110160 (0.0004) [2023-03-10 21:16:34,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 56426496. Throughput: 0: 11902.0. Samples: 56425116. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:16:34,742][1096160] Avg episode reward: [(0, '4856.431')] [2023-03-10 21:16:35,731][1096443] Updated weights for policy 0, policy_version 110240 (0.0004) [2023-03-10 21:16:39,170][1096443] Updated weights for policy 0, policy_version 110320 (0.0004) [2023-03-10 21:16:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12010.3). Total num frames: 56487936. Throughput: 0: 11919.4. Samples: 56460848. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:16:39,742][1096160] Avg episode reward: [(0, '4860.506')] [2023-03-10 21:16:42,545][1096443] Updated weights for policy 0, policy_version 110400 (0.0004) [2023-03-10 21:16:44,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11946.6, 300 sec: 12010.3). Total num frames: 56549376. Throughput: 0: 11934.2. Samples: 56533004. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:16:44,742][1096160] Avg episode reward: [(0, '4859.016')] [2023-03-10 21:16:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000110448_56549376.pth... [2023-03-10 21:16:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000109760_56197120.pth [2023-03-10 21:16:46,033][1096443] Updated weights for policy 0, policy_version 110480 (0.0004) [2023-03-10 21:16:49,408][1096443] Updated weights for policy 0, policy_version 110560 (0.0004) [2023-03-10 21:16:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 56610816. Throughput: 0: 11923.5. Samples: 56604416. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:16:49,742][1096160] Avg episode reward: [(0, '4857.490')] [2023-03-10 21:16:52,800][1096443] Updated weights for policy 0, policy_version 110640 (0.0005) [2023-03-10 21:16:54,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 56668160. Throughput: 0: 11890.8. Samples: 56641960. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:16:54,742][1096160] Avg episode reward: [(0, '4855.169')] [2023-03-10 21:16:56,313][1096443] Updated weights for policy 0, policy_version 110720 (0.0004) [2023-03-10 21:16:59,583][1096443] Updated weights for policy 0, policy_version 110800 (0.0005) [2023-03-10 21:16:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 56729600. Throughput: 0: 11922.2. Samples: 56713212. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:16:59,742][1096160] Avg episode reward: [(0, '4860.643')] [2023-03-10 21:16:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000110800_56729600.pth... [2023-03-10 21:16:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000110096_56369152.pth [2023-03-10 21:17:02,929][1096443] Updated weights for policy 0, policy_version 110880 (0.0005) [2023-03-10 21:17:04,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 12024.2). Total num frames: 56786944. Throughput: 0: 11999.7. Samples: 56786652. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:17:04,742][1096160] Avg episode reward: [(0, '4855.868')] [2023-03-10 21:17:06,545][1096443] Updated weights for policy 0, policy_version 110960 (0.0005) [2023-03-10 21:17:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 56848384. Throughput: 0: 11925.5. Samples: 56819668. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:17:09,742][1096160] Avg episode reward: [(0, '4858.937')] [2023-03-10 21:17:09,863][1096443] Updated weights for policy 0, policy_version 111040 (0.0004) [2023-03-10 21:17:13,343][1096443] Updated weights for policy 0, policy_version 111120 (0.0005) [2023-03-10 21:17:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.6, 300 sec: 12038.1). Total num frames: 56909824. Throughput: 0: 11937.8. Samples: 56892264. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:17:14,742][1096160] Avg episode reward: [(0, '4859.651')] [2023-03-10 21:17:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000111152_56909824.pth... [2023-03-10 21:17:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000110448_56549376.pth [2023-03-10 21:17:16,674][1096443] Updated weights for policy 0, policy_version 111200 (0.0004) [2023-03-10 21:17:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 56967168. Throughput: 0: 11993.0. Samples: 56964800. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:17:19,742][1096160] Avg episode reward: [(0, '4855.471')] [2023-03-10 21:17:20,149][1096443] Updated weights for policy 0, policy_version 111280 (0.0005) [2023-03-10 21:17:23,529][1096443] Updated weights for policy 0, policy_version 111360 (0.0005) [2023-03-10 21:17:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 57028608. Throughput: 0: 11997.1. Samples: 57000720. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:17:24,742][1096160] Avg episode reward: [(0, '4860.331')] [2023-03-10 21:17:27,005][1096443] Updated weights for policy 0, policy_version 111440 (0.0005) [2023-03-10 21:17:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.6, 300 sec: 12024.2). Total num frames: 57085952. Throughput: 0: 11968.0. Samples: 57071564. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:17:29,742][1096160] Avg episode reward: [(0, '4855.616')] [2023-03-10 21:17:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000111496_57085952.pth... [2023-03-10 21:17:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000110800_56729600.pth [2023-03-10 21:17:30,626][1096443] Updated weights for policy 0, policy_version 111520 (0.0005) [2023-03-10 21:17:34,087][1096443] Updated weights for policy 0, policy_version 111600 (0.0005) [2023-03-10 21:17:34,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 57143296. Throughput: 0: 11916.0. Samples: 57140636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:17:34,742][1096160] Avg episode reward: [(0, '4859.209')] [2023-03-10 21:17:37,502][1096443] Updated weights for policy 0, policy_version 111680 (0.0005) [2023-03-10 21:17:39,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 57204736. Throughput: 0: 11870.4. Samples: 57176128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:17:39,742][1096160] Avg episode reward: [(0, '4856.906')] [2023-03-10 21:17:40,720][1096443] Updated weights for policy 0, policy_version 111760 (0.0004) [2023-03-10 21:17:43,976][1096443] Updated weights for policy 0, policy_version 111840 (0.0005) [2023-03-10 21:17:44,741][1096160] Fps is (10 sec: 12697.6, 60 sec: 12015.0, 300 sec: 12052.0). Total num frames: 57270272. Throughput: 0: 12012.7. Samples: 57253784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:17:44,742][1096160] Avg episode reward: [(0, '4856.603')] [2023-03-10 21:17:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000111856_57270272.pth... [2023-03-10 21:17:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000111152_56909824.pth [2023-03-10 21:17:47,343][1096443] Updated weights for policy 0, policy_version 111920 (0.0005) [2023-03-10 21:17:49,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 57331712. Throughput: 0: 12012.8. Samples: 57327228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:17:49,742][1096160] Avg episode reward: [(0, '4853.817')] [2023-03-10 21:17:50,648][1096443] Updated weights for policy 0, policy_version 112000 (0.0005) [2023-03-10 21:17:54,013][1096443] Updated weights for policy 0, policy_version 112080 (0.0005) [2023-03-10 21:17:54,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 57393152. Throughput: 0: 12090.0. Samples: 57363716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:17:54,742][1096160] Avg episode reward: [(0, '4848.318')] [2023-03-10 21:17:57,318][1096443] Updated weights for policy 0, policy_version 112160 (0.0005) [2023-03-10 21:17:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 57454592. Throughput: 0: 12126.2. Samples: 57437944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:17:59,742][1096160] Avg episode reward: [(0, '4850.834')] [2023-03-10 21:17:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000112216_57454592.pth... [2023-03-10 21:17:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000111496_57085952.pth [2023-03-10 21:18:00,732][1096443] Updated weights for policy 0, policy_version 112240 (0.0004) [2023-03-10 21:18:04,190][1096443] Updated weights for policy 0, policy_version 112320 (0.0005) [2023-03-10 21:18:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 57511936. Throughput: 0: 12069.2. Samples: 57507912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:18:04,742][1096160] Avg episode reward: [(0, '4851.275')] [2023-03-10 21:18:07,662][1096443] Updated weights for policy 0, policy_version 112400 (0.0005) [2023-03-10 21:18:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 57573376. Throughput: 0: 12081.2. Samples: 57544376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:18:09,742][1096160] Avg episode reward: [(0, '4857.225')] [2023-03-10 21:18:10,974][1096443] Updated weights for policy 0, policy_version 112480 (0.0005) [2023-03-10 21:18:14,394][1096443] Updated weights for policy 0, policy_version 112560 (0.0005) [2023-03-10 21:18:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 57630720. Throughput: 0: 12122.8. Samples: 57617088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:18:14,742][1096160] Avg episode reward: [(0, '4855.834')] [2023-03-10 21:18:14,749][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000112568_57634816.pth... [2023-03-10 21:18:14,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000111856_57270272.pth [2023-03-10 21:18:17,867][1096443] Updated weights for policy 0, policy_version 112640 (0.0005) [2023-03-10 21:18:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 57692160. Throughput: 0: 12164.7. Samples: 57688052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:18:19,742][1096160] Avg episode reward: [(0, '4841.632')] [2023-03-10 21:18:21,364][1096443] Updated weights for policy 0, policy_version 112720 (0.0005) [2023-03-10 21:18:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 57749504. Throughput: 0: 12154.5. Samples: 57723080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:18:24,742][1096160] Avg episode reward: [(0, '4849.311')] [2023-03-10 21:18:24,923][1096443] Updated weights for policy 0, policy_version 112800 (0.0005) [2023-03-10 21:18:28,334][1096443] Updated weights for policy 0, policy_version 112880 (0.0005) [2023-03-10 21:18:29,742][1096160] Fps is (10 sec: 11469.0, 60 sec: 12014.9, 300 sec: 12024.2). Total num frames: 57806848. Throughput: 0: 12015.0. Samples: 57794460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:18:29,742][1096160] Avg episode reward: [(0, '4852.136')] [2023-03-10 21:18:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000112904_57806848.pth... [2023-03-10 21:18:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000112216_57454592.pth [2023-03-10 21:18:31,855][1096443] Updated weights for policy 0, policy_version 112960 (0.0005) [2023-03-10 21:18:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 57868288. Throughput: 0: 11931.5. Samples: 57864144. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:18:34,742][1096160] Avg episode reward: [(0, '4854.235')] [2023-03-10 21:18:35,221][1096443] Updated weights for policy 0, policy_version 113040 (0.0005) [2023-03-10 21:18:38,421][1096443] Updated weights for policy 0, policy_version 113120 (0.0004) [2023-03-10 21:18:39,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 57929728. Throughput: 0: 11941.1. Samples: 57901064. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:18:39,742][1096160] Avg episode reward: [(0, '4857.749')] [2023-03-10 21:18:41,772][1096443] Updated weights for policy 0, policy_version 113200 (0.0005) [2023-03-10 21:18:44,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12083.2, 300 sec: 12065.8). Total num frames: 57995264. Throughput: 0: 11966.0. Samples: 57976416. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:18:44,742][1096160] Avg episode reward: [(0, '4857.849')] [2023-03-10 21:18:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000113272_57995264.pth... [2023-03-10 21:18:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000112568_57634816.pth [2023-03-10 21:18:45,064][1096443] Updated weights for policy 0, policy_version 113280 (0.0005) [2023-03-10 21:18:48,565][1096443] Updated weights for policy 0, policy_version 113360 (0.0004) [2023-03-10 21:18:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12038.1). Total num frames: 58052608. Throughput: 0: 12013.2. Samples: 58048508. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:18:49,742][1096160] Avg episode reward: [(0, '4859.001')] [2023-03-10 21:18:51,986][1096443] Updated weights for policy 0, policy_version 113440 (0.0004) [2023-03-10 21:18:54,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 58109952. Throughput: 0: 12000.6. Samples: 58084404. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:18:54,742][1096160] Avg episode reward: [(0, '4861.043')] [2023-03-10 21:18:55,456][1096443] Updated weights for policy 0, policy_version 113520 (0.0005) [2023-03-10 21:18:58,540][1096443] Updated weights for policy 0, policy_version 113600 (0.0005) [2023-03-10 21:18:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 58175488. Throughput: 0: 12044.7. Samples: 58159100. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:18:59,742][1096160] Avg episode reward: [(0, '4859.610')] [2023-03-10 21:18:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000113624_58175488.pth... [2023-03-10 21:18:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000112904_57806848.pth [2023-03-10 21:19:02,060][1096443] Updated weights for policy 0, policy_version 113680 (0.0005) [2023-03-10 21:19:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12052.0). Total num frames: 58232832. Throughput: 0: 12015.5. Samples: 58228748. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:19:04,742][1096160] Avg episode reward: [(0, '4860.116')] [2023-03-10 21:19:05,500][1096443] Updated weights for policy 0, policy_version 113760 (0.0004) [2023-03-10 21:19:09,074][1096443] Updated weights for policy 0, policy_version 113840 (0.0005) [2023-03-10 21:19:09,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 12038.1). Total num frames: 58290176. Throughput: 0: 12053.0. Samples: 58265468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:19:09,742][1096160] Avg episode reward: [(0, '4858.544')] [2023-03-10 21:19:12,589][1096443] Updated weights for policy 0, policy_version 113920 (0.0005) [2023-03-10 21:19:14,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 58347520. Throughput: 0: 12015.2. Samples: 58335144. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:19:14,742][1096160] Avg episode reward: [(0, '4862.859')] [2023-03-10 21:19:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000113968_58351616.pth... [2023-03-10 21:19:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000113272_57995264.pth [2023-03-10 21:19:16,008][1096443] Updated weights for policy 0, policy_version 114000 (0.0005) [2023-03-10 21:19:19,335][1096443] Updated weights for policy 0, policy_version 114080 (0.0005) [2023-03-10 21:19:19,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12015.0, 300 sec: 12038.1). Total num frames: 58413056. Throughput: 0: 12081.1. Samples: 58407796. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:19:19,742][1096160] Avg episode reward: [(0, '4862.232')] [2023-03-10 21:19:22,655][1096443] Updated weights for policy 0, policy_version 114160 (0.0005) [2023-03-10 21:19:24,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12083.2, 300 sec: 12052.0). Total num frames: 58474496. Throughput: 0: 12088.2. Samples: 58445036. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:19:24,742][1096160] Avg episode reward: [(0, '4863.086')] [2023-03-10 21:19:26,150][1096443] Updated weights for policy 0, policy_version 114240 (0.0005) [2023-03-10 21:19:29,555][1096443] Updated weights for policy 0, policy_version 114320 (0.0005) [2023-03-10 21:19:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 58531840. Throughput: 0: 11991.4. Samples: 58516028. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:19:29,742][1096160] Avg episode reward: [(0, '4858.974')] [2023-03-10 21:19:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000114320_58531840.pth... [2023-03-10 21:19:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000113624_58175488.pth [2023-03-10 21:19:32,831][1096443] Updated weights for policy 0, policy_version 114400 (0.0005) [2023-03-10 21:19:34,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 58593280. Throughput: 0: 12045.0. Samples: 58590532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:19:34,742][1096160] Avg episode reward: [(0, '4861.888')] [2023-03-10 21:19:36,138][1096443] Updated weights for policy 0, policy_version 114480 (0.0005) [2023-03-10 21:19:39,741][1096443] Updated weights for policy 0, policy_version 114560 (0.0005) [2023-03-10 21:19:39,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12038.1). Total num frames: 58654720. Throughput: 0: 12044.9. Samples: 58626424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:19:39,742][1096160] Avg episode reward: [(0, '4859.985')] [2023-03-10 21:19:43,214][1096443] Updated weights for policy 0, policy_version 114640 (0.0005) [2023-03-10 21:19:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 12024.2). Total num frames: 58712064. Throughput: 0: 11925.4. Samples: 58695744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:19:44,742][1096160] Avg episode reward: [(0, '4861.374')] [2023-03-10 21:19:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000114672_58712064.pth... [2023-03-10 21:19:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000113968_58351616.pth [2023-03-10 21:19:46,748][1096443] Updated weights for policy 0, policy_version 114720 (0.0004) [2023-03-10 21:19:49,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 58769408. Throughput: 0: 11979.8. Samples: 58767836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:19:49,742][1096160] Avg episode reward: [(0, '4862.920')] [2023-03-10 21:19:50,159][1096443] Updated weights for policy 0, policy_version 114800 (0.0005) [2023-03-10 21:19:53,651][1096443] Updated weights for policy 0, policy_version 114880 (0.0005) [2023-03-10 21:19:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 58830848. Throughput: 0: 11928.3. Samples: 58802240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:19:54,742][1096160] Avg episode reward: [(0, '4862.420')] [2023-03-10 21:19:57,250][1096443] Updated weights for policy 0, policy_version 114960 (0.0005) [2023-03-10 21:19:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 58888192. Throughput: 0: 11926.1. Samples: 58871816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:19:59,742][1096160] Avg episode reward: [(0, '4862.747')] [2023-03-10 21:19:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000115016_58888192.pth... [2023-03-10 21:19:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000114320_58531840.pth [2023-03-10 21:20:00,663][1096443] Updated weights for policy 0, policy_version 115040 (0.0005) [2023-03-10 21:20:03,945][1096443] Updated weights for policy 0, policy_version 115120 (0.0004) [2023-03-10 21:20:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 58949632. Throughput: 0: 11947.1. Samples: 58945416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:04,742][1096160] Avg episode reward: [(0, '4860.162')] [2023-03-10 21:20:07,362][1096443] Updated weights for policy 0, policy_version 115200 (0.0004) [2023-03-10 21:20:09,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 59011072. Throughput: 0: 11889.9. Samples: 58980080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:09,742][1096160] Avg episode reward: [(0, '4858.631')] [2023-03-10 21:20:10,625][1096443] Updated weights for policy 0, policy_version 115280 (0.0004) [2023-03-10 21:20:13,937][1096443] Updated weights for policy 0, policy_version 115360 (0.0005) [2023-03-10 21:20:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 59072512. Throughput: 0: 11985.2. Samples: 59055360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:14,742][1096160] Avg episode reward: [(0, '4861.999')] [2023-03-10 21:20:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000115376_59072512.pth... [2023-03-10 21:20:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000114672_58712064.pth [2023-03-10 21:20:17,451][1096443] Updated weights for policy 0, policy_version 115440 (0.0005) [2023-03-10 21:20:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 59129856. Throughput: 0: 11893.9. Samples: 59125760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:19,742][1096160] Avg episode reward: [(0, '4863.127')] [2023-03-10 21:20:20,891][1096443] Updated weights for policy 0, policy_version 115520 (0.0005) [2023-03-10 21:20:24,336][1096443] Updated weights for policy 0, policy_version 115600 (0.0005) [2023-03-10 21:20:24,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 59191296. Throughput: 0: 11912.6. Samples: 59162492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:24,742][1096160] Avg episode reward: [(0, '4863.934')] [2023-03-10 21:20:24,743][1096399] Saving new best policy, reward=4863.934! [2023-03-10 21:20:27,803][1096443] Updated weights for policy 0, policy_version 115680 (0.0004) [2023-03-10 21:20:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 59248640. Throughput: 0: 11924.1. Samples: 59232328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:29,742][1096160] Avg episode reward: [(0, '4861.589')] [2023-03-10 21:20:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000115720_59248640.pth... [2023-03-10 21:20:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000115016_58888192.pth [2023-03-10 21:20:31,282][1096443] Updated weights for policy 0, policy_version 115760 (0.0004) [2023-03-10 21:20:34,472][1096443] Updated weights for policy 0, policy_version 115840 (0.0004) [2023-03-10 21:20:34,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 59310080. Throughput: 0: 11988.5. Samples: 59307316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:34,742][1096160] Avg episode reward: [(0, '4863.439')] [2023-03-10 21:20:37,802][1096443] Updated weights for policy 0, policy_version 115920 (0.0005) [2023-03-10 21:20:39,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 59371520. Throughput: 0: 12025.3. Samples: 59343380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:39,742][1096160] Avg episode reward: [(0, '4857.311')] [2023-03-10 21:20:41,359][1096443] Updated weights for policy 0, policy_version 116000 (0.0005) [2023-03-10 21:20:44,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 59428864. Throughput: 0: 12042.8. Samples: 59413744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:44,742][1096160] Avg episode reward: [(0, '4860.028')] [2023-03-10 21:20:44,767][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000116080_59432960.pth... [2023-03-10 21:20:44,768][1096443] Updated weights for policy 0, policy_version 116080 (0.0005) [2023-03-10 21:20:44,769][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000115376_59072512.pth [2023-03-10 21:20:48,196][1096443] Updated weights for policy 0, policy_version 116160 (0.0005) [2023-03-10 21:20:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 59490304. Throughput: 0: 12016.7. Samples: 59486168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:49,742][1096160] Avg episode reward: [(0, '4860.175')] [2023-03-10 21:20:51,688][1096443] Updated weights for policy 0, policy_version 116240 (0.0005) [2023-03-10 21:20:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 59547648. Throughput: 0: 12032.5. Samples: 59521540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:54,742][1096160] Avg episode reward: [(0, '4856.377')] [2023-03-10 21:20:55,168][1096443] Updated weights for policy 0, policy_version 116320 (0.0005) [2023-03-10 21:20:58,610][1096443] Updated weights for policy 0, policy_version 116400 (0.0004) [2023-03-10 21:20:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 59609088. Throughput: 0: 11940.5. Samples: 59592684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:20:59,742][1096160] Avg episode reward: [(0, '4860.280')] [2023-03-10 21:20:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000116424_59609088.pth... [2023-03-10 21:20:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000115720_59248640.pth [2023-03-10 21:21:02,135][1096443] Updated weights for policy 0, policy_version 116480 (0.0005) [2023-03-10 21:21:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 59666432. Throughput: 0: 11963.4. Samples: 59664112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:21:04,742][1096160] Avg episode reward: [(0, '4859.788')] [2023-03-10 21:21:05,396][1096443] Updated weights for policy 0, policy_version 116560 (0.0004) [2023-03-10 21:21:08,931][1096443] Updated weights for policy 0, policy_version 116640 (0.0005) [2023-03-10 21:21:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 59727872. Throughput: 0: 11953.2. Samples: 59700388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:21:09,742][1096160] Avg episode reward: [(0, '4858.733')] [2023-03-10 21:21:12,509][1096443] Updated weights for policy 0, policy_version 116720 (0.0005) [2023-03-10 21:21:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 59785216. Throughput: 0: 11923.7. Samples: 59768896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:21:14,742][1096160] Avg episode reward: [(0, '4855.631')] [2023-03-10 21:21:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000116768_59785216.pth... [2023-03-10 21:21:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000116080_59432960.pth [2023-03-10 21:21:15,945][1096443] Updated weights for policy 0, policy_version 116800 (0.0005) [2023-03-10 21:21:19,482][1096443] Updated weights for policy 0, policy_version 116880 (0.0004) [2023-03-10 21:21:19,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 59842560. Throughput: 0: 11821.6. Samples: 59839288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:21:19,742][1096160] Avg episode reward: [(0, '4857.544')] [2023-03-10 21:21:22,885][1096443] Updated weights for policy 0, policy_version 116960 (0.0005) [2023-03-10 21:21:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 59904000. Throughput: 0: 11851.0. Samples: 59876676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:21:24,742][1096160] Avg episode reward: [(0, '4856.219')] [2023-03-10 21:21:26,225][1096443] Updated weights for policy 0, policy_version 117040 (0.0005) [2023-03-10 21:21:29,629][1096443] Updated weights for policy 0, policy_version 117120 (0.0005) [2023-03-10 21:21:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 59965440. Throughput: 0: 11893.3. Samples: 59948944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:21:29,742][1096160] Avg episode reward: [(0, '4854.640')] [2023-03-10 21:21:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000117120_59965440.pth... [2023-03-10 21:21:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000116424_59609088.pth [2023-03-10 21:21:32,977][1096443] Updated weights for policy 0, policy_version 117200 (0.0005) [2023-03-10 21:21:34,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.6, 300 sec: 11996.4). Total num frames: 60026880. Throughput: 0: 11871.4. Samples: 60020380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:21:34,742][1096160] Avg episode reward: [(0, '4856.684')] [2023-03-10 21:21:36,266][1096443] Updated weights for policy 0, policy_version 117280 (0.0005) [2023-03-10 21:21:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 60084224. Throughput: 0: 11955.8. Samples: 60059552. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:21:39,742][1096160] Avg episode reward: [(0, '4854.967')] [2023-03-10 21:21:39,743][1096443] Updated weights for policy 0, policy_version 117360 (0.0005) [2023-03-10 21:21:43,059][1096443] Updated weights for policy 0, policy_version 117440 (0.0005) [2023-03-10 21:21:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 60145664. Throughput: 0: 11970.4. Samples: 60131352. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:21:44,742][1096160] Avg episode reward: [(0, '4856.457')] [2023-03-10 21:21:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000117472_60145664.pth... [2023-03-10 21:21:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000116768_59785216.pth [2023-03-10 21:21:46,536][1096443] Updated weights for policy 0, policy_version 117520 (0.0005) [2023-03-10 21:21:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 60207104. Throughput: 0: 11973.0. Samples: 60202896. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:21:49,742][1096160] Avg episode reward: [(0, '4854.214')] [2023-03-10 21:21:49,944][1096443] Updated weights for policy 0, policy_version 117600 (0.0005) [2023-03-10 21:21:53,303][1096443] Updated weights for policy 0, policy_version 117680 (0.0005) [2023-03-10 21:21:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 60268544. Throughput: 0: 11988.5. Samples: 60239872. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:21:54,742][1096160] Avg episode reward: [(0, '4857.363')] [2023-03-10 21:21:56,701][1096443] Updated weights for policy 0, policy_version 117760 (0.0005) [2023-03-10 21:21:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 60329984. Throughput: 0: 12037.2. Samples: 60310572. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:21:59,742][1096160] Avg episode reward: [(0, '4856.929')] [2023-03-10 21:21:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000117832_60329984.pth... [2023-03-10 21:21:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000117120_59965440.pth [2023-03-10 21:22:00,086][1096443] Updated weights for policy 0, policy_version 117840 (0.0005) [2023-03-10 21:22:03,557][1096443] Updated weights for policy 0, policy_version 117920 (0.0005) [2023-03-10 21:22:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 60387328. Throughput: 0: 12096.9. Samples: 60383648. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:22:04,742][1096160] Avg episode reward: [(0, '4858.838')] [2023-03-10 21:22:06,614][1096443] Updated weights for policy 0, policy_version 118000 (0.0005) [2023-03-10 21:22:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11996.4). Total num frames: 60448768. Throughput: 0: 12164.9. Samples: 60424096. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:22:09,742][1096160] Avg episode reward: [(0, '4853.731')] [2023-03-10 21:22:10,140][1096443] Updated weights for policy 0, policy_version 118080 (0.0005) [2023-03-10 21:22:13,472][1096443] Updated weights for policy 0, policy_version 118160 (0.0005) [2023-03-10 21:22:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 60510208. Throughput: 0: 12132.7. Samples: 60494916. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:22:14,742][1096160] Avg episode reward: [(0, '4854.962')] [2023-03-10 21:22:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000118184_60510208.pth... [2023-03-10 21:22:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000117472_60145664.pth [2023-03-10 21:22:16,880][1096443] Updated weights for policy 0, policy_version 118240 (0.0005) [2023-03-10 21:22:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 60571648. Throughput: 0: 12158.8. Samples: 60567528. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:22:19,742][1096160] Avg episode reward: [(0, '4854.395')] [2023-03-10 21:22:20,185][1096443] Updated weights for policy 0, policy_version 118320 (0.0005) [2023-03-10 21:22:23,589][1096443] Updated weights for policy 0, policy_version 118400 (0.0005) [2023-03-10 21:22:24,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12151.5, 300 sec: 12024.2). Total num frames: 60633088. Throughput: 0: 12107.5. Samples: 60604388. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:22:24,742][1096160] Avg episode reward: [(0, '4858.410')] [2023-03-10 21:22:26,982][1096443] Updated weights for policy 0, policy_version 118480 (0.0005) [2023-03-10 21:22:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 60690432. Throughput: 0: 12115.1. Samples: 60676532. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:22:29,742][1096160] Avg episode reward: [(0, '4858.701')] [2023-03-10 21:22:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000118536_60690432.pth... [2023-03-10 21:22:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000117832_60329984.pth [2023-03-10 21:22:30,532][1096443] Updated weights for policy 0, policy_version 118560 (0.0005) [2023-03-10 21:22:33,918][1096443] Updated weights for policy 0, policy_version 118640 (0.0005) [2023-03-10 21:22:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 60751872. Throughput: 0: 12106.2. Samples: 60747676. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:22:34,742][1096160] Avg episode reward: [(0, '4855.951')] [2023-03-10 21:22:37,422][1096443] Updated weights for policy 0, policy_version 118720 (0.0005) [2023-03-10 21:22:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 60809216. Throughput: 0: 12027.8. Samples: 60781124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:22:39,742][1096160] Avg episode reward: [(0, '4854.221')] [2023-03-10 21:22:40,992][1096443] Updated weights for policy 0, policy_version 118800 (0.0005) [2023-03-10 21:22:44,139][1096443] Updated weights for policy 0, policy_version 118880 (0.0004) [2023-03-10 21:22:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 60870656. Throughput: 0: 12088.5. Samples: 60854556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:22:44,742][1096160] Avg episode reward: [(0, '4851.832')] [2023-03-10 21:22:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000118888_60870656.pth... [2023-03-10 21:22:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000118184_60510208.pth [2023-03-10 21:22:47,698][1096443] Updated weights for policy 0, policy_version 118960 (0.0005) [2023-03-10 21:22:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 60932096. Throughput: 0: 12058.3. Samples: 60926272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:22:49,742][1096160] Avg episode reward: [(0, '4847.371')] [2023-03-10 21:22:51,038][1096443] Updated weights for policy 0, policy_version 119040 (0.0005) [2023-03-10 21:22:54,434][1096443] Updated weights for policy 0, policy_version 119120 (0.0005) [2023-03-10 21:22:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 60989440. Throughput: 0: 12010.2. Samples: 60964556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:22:54,742][1096160] Avg episode reward: [(0, '4857.289')] [2023-03-10 21:22:57,962][1096443] Updated weights for policy 0, policy_version 119200 (0.0005) [2023-03-10 21:22:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 61050880. Throughput: 0: 11987.6. Samples: 61034360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:22:59,742][1096160] Avg episode reward: [(0, '4853.980')] [2023-03-10 21:22:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000119240_61050880.pth... [2023-03-10 21:22:59,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000118536_60690432.pth [2023-03-10 21:23:01,394][1096443] Updated weights for policy 0, policy_version 119280 (0.0005) [2023-03-10 21:23:04,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 61108224. Throughput: 0: 11929.7. Samples: 61104364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:04,742][1096160] Avg episode reward: [(0, '4856.375')] [2023-03-10 21:23:04,904][1096443] Updated weights for policy 0, policy_version 119360 (0.0004) [2023-03-10 21:23:08,280][1096443] Updated weights for policy 0, policy_version 119440 (0.0006) [2023-03-10 21:23:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 61169664. Throughput: 0: 11920.2. Samples: 61140796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:09,742][1096160] Avg episode reward: [(0, '4850.698')] [2023-03-10 21:23:11,818][1096443] Updated weights for policy 0, policy_version 119520 (0.0005) [2023-03-10 21:23:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 61227008. Throughput: 0: 11870.1. Samples: 61210688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:14,742][1096160] Avg episode reward: [(0, '4852.710')] [2023-03-10 21:23:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000119584_61227008.pth... [2023-03-10 21:23:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000118888_60870656.pth [2023-03-10 21:23:15,345][1096443] Updated weights for policy 0, policy_version 119600 (0.0005) [2023-03-10 21:23:18,656][1096443] Updated weights for policy 0, policy_version 119680 (0.0005) [2023-03-10 21:23:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 61288448. Throughput: 0: 11917.7. Samples: 61283972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:19,742][1096160] Avg episode reward: [(0, '4849.004')] [2023-03-10 21:23:22,215][1096443] Updated weights for policy 0, policy_version 119760 (0.0005) [2023-03-10 21:23:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11982.5). Total num frames: 61341696. Throughput: 0: 11912.5. Samples: 61317184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:24,742][1096160] Avg episode reward: [(0, '4851.023')] [2023-03-10 21:23:25,803][1096443] Updated weights for policy 0, policy_version 119840 (0.0004) [2023-03-10 21:23:29,192][1096443] Updated weights for policy 0, policy_version 119920 (0.0004) [2023-03-10 21:23:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 61403136. Throughput: 0: 11839.4. Samples: 61387328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:29,742][1096160] Avg episode reward: [(0, '4852.847')] [2023-03-10 21:23:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000119928_61403136.pth... [2023-03-10 21:23:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000119240_61050880.pth [2023-03-10 21:23:32,633][1096443] Updated weights for policy 0, policy_version 120000 (0.0004) [2023-03-10 21:23:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 61464576. Throughput: 0: 11813.6. Samples: 61457884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:34,742][1096160] Avg episode reward: [(0, '4847.846')] [2023-03-10 21:23:36,177][1096443] Updated weights for policy 0, policy_version 120080 (0.0005) [2023-03-10 21:23:39,640][1096443] Updated weights for policy 0, policy_version 120160 (0.0005) [2023-03-10 21:23:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 61521920. Throughput: 0: 11750.1. Samples: 61493312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:39,742][1096160] Avg episode reward: [(0, '4855.342')] [2023-03-10 21:23:42,928][1096443] Updated weights for policy 0, policy_version 120240 (0.0005) [2023-03-10 21:23:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 61583360. Throughput: 0: 11836.2. Samples: 61566988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:44,742][1096160] Avg episode reward: [(0, '4848.934')] [2023-03-10 21:23:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000120280_61583360.pth... [2023-03-10 21:23:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000119584_61227008.pth [2023-03-10 21:23:46,331][1096443] Updated weights for policy 0, policy_version 120320 (0.0004) [2023-03-10 21:23:49,640][1096443] Updated weights for policy 0, policy_version 120400 (0.0006) [2023-03-10 21:23:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 61644800. Throughput: 0: 11909.1. Samples: 61640276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:49,742][1096160] Avg episode reward: [(0, '4848.799')] [2023-03-10 21:23:52,995][1096443] Updated weights for policy 0, policy_version 120480 (0.0005) [2023-03-10 21:23:54,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 61702144. Throughput: 0: 11920.3. Samples: 61677208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:54,742][1096160] Avg episode reward: [(0, '4846.309')] [2023-03-10 21:23:56,401][1096443] Updated weights for policy 0, policy_version 120560 (0.0004) [2023-03-10 21:23:59,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 61763584. Throughput: 0: 11951.6. Samples: 61748508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:23:59,742][1096160] Avg episode reward: [(0, '4849.507')] [2023-03-10 21:23:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000120632_61763584.pth... [2023-03-10 21:23:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000119928_61403136.pth [2023-03-10 21:23:59,825][1096443] Updated weights for policy 0, policy_version 120640 (0.0005) [2023-03-10 21:24:03,331][1096443] Updated weights for policy 0, policy_version 120720 (0.0004) [2023-03-10 21:24:04,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 61820928. Throughput: 0: 11879.8. Samples: 61818560. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:04,742][1096160] Avg episode reward: [(0, '4846.222')] [2023-03-10 21:24:06,832][1096443] Updated weights for policy 0, policy_version 120800 (0.0004) [2023-03-10 21:24:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 61882368. Throughput: 0: 11941.1. Samples: 61854532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:09,742][1096160] Avg episode reward: [(0, '4848.533')] [2023-03-10 21:24:10,204][1096443] Updated weights for policy 0, policy_version 120880 (0.0005) [2023-03-10 21:24:13,622][1096443] Updated weights for policy 0, policy_version 120960 (0.0005) [2023-03-10 21:24:14,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12015.0, 300 sec: 11982.5). Total num frames: 61947904. Throughput: 0: 12001.9. Samples: 61927412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:14,742][1096160] Avg episode reward: [(0, '4850.851')] [2023-03-10 21:24:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000120992_61947904.pth... [2023-03-10 21:24:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000120280_61583360.pth [2023-03-10 21:24:16,693][1096443] Updated weights for policy 0, policy_version 121040 (0.0005) [2023-03-10 21:24:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 62005248. Throughput: 0: 12113.8. Samples: 62003004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:19,742][1096160] Avg episode reward: [(0, '4847.722')] [2023-03-10 21:24:20,123][1096443] Updated weights for policy 0, policy_version 121120 (0.0005) [2023-03-10 21:24:23,366][1096443] Updated weights for policy 0, policy_version 121200 (0.0005) [2023-03-10 21:24:24,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12151.4, 300 sec: 11996.4). Total num frames: 62070784. Throughput: 0: 12131.9. Samples: 62039248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:24,742][1096160] Avg episode reward: [(0, '4850.083')] [2023-03-10 21:24:26,847][1096443] Updated weights for policy 0, policy_version 121280 (0.0005) [2023-03-10 21:24:29,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 62128128. Throughput: 0: 12107.3. Samples: 62111816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:29,742][1096160] Avg episode reward: [(0, '4851.305')] [2023-03-10 21:24:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000121344_62128128.pth... [2023-03-10 21:24:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000120632_61763584.pth [2023-03-10 21:24:30,444][1096443] Updated weights for policy 0, policy_version 121360 (0.0005) [2023-03-10 21:24:33,973][1096443] Updated weights for policy 0, policy_version 121440 (0.0004) [2023-03-10 21:24:34,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12014.9, 300 sec: 11968.6). Total num frames: 62185472. Throughput: 0: 12013.4. Samples: 62180880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:34,742][1096160] Avg episode reward: [(0, '4849.891')] [2023-03-10 21:24:37,468][1096443] Updated weights for policy 0, policy_version 121520 (0.0004) [2023-03-10 21:24:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 62246912. Throughput: 0: 11974.9. Samples: 62216080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:39,742][1096160] Avg episode reward: [(0, '4848.012')] [2023-03-10 21:24:40,790][1096443] Updated weights for policy 0, policy_version 121600 (0.0005) [2023-03-10 21:24:44,195][1096443] Updated weights for policy 0, policy_version 121680 (0.0005) [2023-03-10 21:24:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 62304256. Throughput: 0: 12013.4. Samples: 62289112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:44,742][1096160] Avg episode reward: [(0, '4852.887')] [2023-03-10 21:24:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000121688_62304256.pth... [2023-03-10 21:24:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000120992_61947904.pth [2023-03-10 21:24:47,519][1096443] Updated weights for policy 0, policy_version 121760 (0.0005) [2023-03-10 21:24:49,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12015.0, 300 sec: 11982.5). Total num frames: 62365696. Throughput: 0: 12115.8. Samples: 62363772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:49,742][1096160] Avg episode reward: [(0, '4853.272')] [2023-03-10 21:24:50,774][1096443] Updated weights for policy 0, policy_version 121840 (0.0004) [2023-03-10 21:24:51,816][1096399] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000008 [2023-03-10 21:24:54,143][1096443] Updated weights for policy 0, policy_version 121920 (0.0005) [2023-03-10 21:24:54,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 62427136. Throughput: 0: 12088.8. Samples: 62398528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:54,742][1096160] Avg episode reward: [(0, '4851.432')] [2023-03-10 21:24:57,584][1096443] Updated weights for policy 0, policy_version 122000 (0.0004) [2023-03-10 21:24:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 62488576. Throughput: 0: 12107.6. Samples: 62472256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:24:59,742][1096160] Avg episode reward: [(0, '4855.291')] [2023-03-10 21:24:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000122048_62488576.pth... [2023-03-10 21:24:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000121344_62128128.pth [2023-03-10 21:25:00,931][1096443] Updated weights for policy 0, policy_version 122080 (0.0004) [2023-03-10 21:25:04,433][1096443] Updated weights for policy 0, policy_version 122160 (0.0005) [2023-03-10 21:25:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 62545920. Throughput: 0: 12027.7. Samples: 62544252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:25:04,742][1096160] Avg episode reward: [(0, '4854.480')] [2023-03-10 21:25:07,889][1096443] Updated weights for policy 0, policy_version 122240 (0.0004) [2023-03-10 21:25:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 62607360. Throughput: 0: 12008.8. Samples: 62579644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:25:09,742][1096160] Avg episode reward: [(0, '4851.296')] [2023-03-10 21:25:11,485][1096443] Updated weights for policy 0, policy_version 122320 (0.0005) [2023-03-10 21:25:14,711][1096443] Updated weights for policy 0, policy_version 122400 (0.0004) [2023-03-10 21:25:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 62668800. Throughput: 0: 11986.3. Samples: 62651200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:25:14,742][1096160] Avg episode reward: [(0, '4851.419')] [2023-03-10 21:25:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000122400_62668800.pth... [2023-03-10 21:25:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000121688_62304256.pth [2023-03-10 21:25:18,243][1096443] Updated weights for policy 0, policy_version 122480 (0.0005) [2023-03-10 21:25:19,742][1096160] Fps is (10 sec: 11878.6, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 62726144. Throughput: 0: 12002.0. Samples: 62720968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:25:19,742][1096160] Avg episode reward: [(0, '4855.986')] [2023-03-10 21:25:21,888][1096443] Updated weights for policy 0, policy_version 122560 (0.0005) [2023-03-10 21:25:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 62783488. Throughput: 0: 11973.3. Samples: 62754880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:25:24,742][1096160] Avg episode reward: [(0, '4851.007')] [2023-03-10 21:25:25,303][1096443] Updated weights for policy 0, policy_version 122640 (0.0005) [2023-03-10 21:25:28,731][1096443] Updated weights for policy 0, policy_version 122720 (0.0005) [2023-03-10 21:25:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11968.6). Total num frames: 62840832. Throughput: 0: 11949.9. Samples: 62826856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:25:29,742][1096160] Avg episode reward: [(0, '4849.496')] [2023-03-10 21:25:29,774][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000122744_62844928.pth... [2023-03-10 21:25:29,776][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000122048_62488576.pth [2023-03-10 21:25:32,210][1096443] Updated weights for policy 0, policy_version 122800 (0.0005) [2023-03-10 21:25:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 62902272. Throughput: 0: 11868.2. Samples: 62897844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:25:34,742][1096160] Avg episode reward: [(0, '4852.340')] [2023-03-10 21:25:35,584][1096443] Updated weights for policy 0, policy_version 122880 (0.0005) [2023-03-10 21:25:38,891][1096443] Updated weights for policy 0, policy_version 122960 (0.0005) [2023-03-10 21:25:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 62963712. Throughput: 0: 11924.3. Samples: 62935120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:25:39,742][1096160] Avg episode reward: [(0, '4851.589')] [2023-03-10 21:25:41,997][1096443] Updated weights for policy 0, policy_version 123040 (0.0005) [2023-03-10 21:25:44,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12015.0, 300 sec: 11982.5). Total num frames: 63025152. Throughput: 0: 11974.0. Samples: 63011084. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:25:44,742][1096160] Avg episode reward: [(0, '4852.382')] [2023-03-10 21:25:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000123096_63025152.pth... [2023-03-10 21:25:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000122400_62668800.pth [2023-03-10 21:25:45,457][1096443] Updated weights for policy 0, policy_version 123120 (0.0005) [2023-03-10 21:25:49,021][1096443] Updated weights for policy 0, policy_version 123200 (0.0005) [2023-03-10 21:25:49,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 63082496. Throughput: 0: 11928.3. Samples: 63081024. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:25:49,742][1096160] Avg episode reward: [(0, '4850.171')] [2023-03-10 21:25:52,626][1096443] Updated weights for policy 0, policy_version 123280 (0.0005) [2023-03-10 21:25:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 63143936. Throughput: 0: 11900.7. Samples: 63115176. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:25:54,742][1096160] Avg episode reward: [(0, '4854.133')] [2023-03-10 21:25:55,956][1096443] Updated weights for policy 0, policy_version 123360 (0.0006) [2023-03-10 21:25:59,329][1096443] Updated weights for policy 0, policy_version 123440 (0.0005) [2023-03-10 21:25:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 63205376. Throughput: 0: 11919.1. Samples: 63187560. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:25:59,742][1096160] Avg episode reward: [(0, '4857.201')] [2023-03-10 21:25:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000123448_63205376.pth... [2023-03-10 21:25:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000122744_62844928.pth [2023-03-10 21:26:02,680][1096443] Updated weights for policy 0, policy_version 123520 (0.0004) [2023-03-10 21:26:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 63266816. Throughput: 0: 11997.3. Samples: 63260848. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:26:04,742][1096160] Avg episode reward: [(0, '4841.180')] [2023-03-10 21:26:06,105][1096443] Updated weights for policy 0, policy_version 123600 (0.0005) [2023-03-10 21:26:09,508][1096443] Updated weights for policy 0, policy_version 123680 (0.0004) [2023-03-10 21:26:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 63324160. Throughput: 0: 12035.7. Samples: 63296488. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:26:09,742][1096160] Avg episode reward: [(0, '4843.730')] [2023-03-10 21:26:12,978][1096443] Updated weights for policy 0, policy_version 123760 (0.0005) [2023-03-10 21:26:14,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 12010.3). Total num frames: 63385600. Throughput: 0: 12044.5. Samples: 63368860. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:26:14,742][1096160] Avg episode reward: [(0, '4856.267')] [2023-03-10 21:26:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000123800_63385600.pth... [2023-03-10 21:26:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000123096_63025152.pth [2023-03-10 21:26:16,336][1096443] Updated weights for policy 0, policy_version 123840 (0.0005) [2023-03-10 21:26:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 63442944. Throughput: 0: 12023.9. Samples: 63438920. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:26:19,742][1096160] Avg episode reward: [(0, '4855.909')] [2023-03-10 21:26:19,878][1096443] Updated weights for policy 0, policy_version 123920 (0.0005) [2023-03-10 21:26:23,243][1096443] Updated weights for policy 0, policy_version 124000 (0.0005) [2023-03-10 21:26:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11996.4). Total num frames: 63504384. Throughput: 0: 12012.5. Samples: 63475684. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:26:24,742][1096160] Avg episode reward: [(0, '4856.375')] [2023-03-10 21:26:26,689][1096443] Updated weights for policy 0, policy_version 124080 (0.0005) [2023-03-10 21:26:29,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 63565824. Throughput: 0: 11962.3. Samples: 63549388. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:26:29,742][1096160] Avg episode reward: [(0, '4860.399')] [2023-03-10 21:26:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000124152_63565824.pth... [2023-03-10 21:26:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000123448_63205376.pth [2023-03-10 21:26:29,916][1096443] Updated weights for policy 0, policy_version 124160 (0.0005) [2023-03-10 21:26:33,275][1096443] Updated weights for policy 0, policy_version 124240 (0.0004) [2023-03-10 21:26:34,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 63627264. Throughput: 0: 12023.6. Samples: 63622084. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:26:34,742][1096160] Avg episode reward: [(0, '4855.773')] [2023-03-10 21:26:37,052][1096443] Updated weights for policy 0, policy_version 124320 (0.0005) [2023-03-10 21:26:39,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 63680512. Throughput: 0: 11977.2. Samples: 63654152. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:26:39,742][1096160] Avg episode reward: [(0, '4856.492')] [2023-03-10 21:26:40,745][1096443] Updated weights for policy 0, policy_version 124400 (0.0005) [2023-03-10 21:26:44,205][1096443] Updated weights for policy 0, policy_version 124480 (0.0004) [2023-03-10 21:26:44,742][1096160] Fps is (10 sec: 11059.1, 60 sec: 11878.4, 300 sec: 11968.6). Total num frames: 63737856. Throughput: 0: 11867.6. Samples: 63721604. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:26:44,742][1096160] Avg episode reward: [(0, '4859.254')] [2023-03-10 21:26:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000124488_63737856.pth... [2023-03-10 21:26:44,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000123800_63385600.pth [2023-03-10 21:26:47,672][1096443] Updated weights for policy 0, policy_version 124560 (0.0005) [2023-03-10 21:26:49,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 63795200. Throughput: 0: 11797.9. Samples: 63791752. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:26:49,742][1096160] Avg episode reward: [(0, '4859.467')] [2023-03-10 21:26:51,181][1096443] Updated weights for policy 0, policy_version 124640 (0.0005) [2023-03-10 21:26:54,611][1096443] Updated weights for policy 0, policy_version 124720 (0.0004) [2023-03-10 21:26:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 63856640. Throughput: 0: 11810.7. Samples: 63827968. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:26:54,742][1096160] Avg episode reward: [(0, '4856.912')] [2023-03-10 21:26:57,952][1096443] Updated weights for policy 0, policy_version 124800 (0.0004) [2023-03-10 21:26:59,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.2, 300 sec: 11954.8). Total num frames: 63913984. Throughput: 0: 11831.0. Samples: 63901252. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:26:59,742][1096160] Avg episode reward: [(0, '4861.633')] [2023-03-10 21:26:59,770][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000124840_63918080.pth... [2023-03-10 21:26:59,772][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000124152_63565824.pth [2023-03-10 21:27:01,577][1096443] Updated weights for policy 0, policy_version 124880 (0.0004) [2023-03-10 21:27:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11954.8). Total num frames: 63975424. Throughput: 0: 11804.4. Samples: 63970120. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:27:04,742][1096160] Avg episode reward: [(0, '4856.811')] [2023-03-10 21:27:05,060][1096443] Updated weights for policy 0, policy_version 124960 (0.0005) [2023-03-10 21:27:08,644][1096443] Updated weights for policy 0, policy_version 125040 (0.0005) [2023-03-10 21:27:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 64032768. Throughput: 0: 11774.5. Samples: 64005536. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:27:09,742][1096160] Avg episode reward: [(0, '4853.825')] [2023-03-10 21:27:12,116][1096443] Updated weights for policy 0, policy_version 125120 (0.0005) [2023-03-10 21:27:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11927.0). Total num frames: 64090112. Throughput: 0: 11686.9. Samples: 64075300. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:27:14,742][1096160] Avg episode reward: [(0, '4856.832')] [2023-03-10 21:27:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000125176_64090112.pth... [2023-03-10 21:27:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000124488_63737856.pth [2023-03-10 21:27:15,633][1096443] Updated weights for policy 0, policy_version 125200 (0.0004) [2023-03-10 21:27:19,127][1096443] Updated weights for policy 0, policy_version 125280 (0.0005) [2023-03-10 21:27:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11913.1). Total num frames: 64147456. Throughput: 0: 11597.4. Samples: 64143968. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:27:19,742][1096160] Avg episode reward: [(0, '4859.443')] [2023-03-10 21:27:22,567][1096443] Updated weights for policy 0, policy_version 125360 (0.0004) [2023-03-10 21:27:24,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11927.0). Total num frames: 64208896. Throughput: 0: 11691.9. Samples: 64180288. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:27:24,742][1096160] Avg episode reward: [(0, '4858.377')] [2023-03-10 21:27:25,755][1096443] Updated weights for policy 0, policy_version 125440 (0.0005) [2023-03-10 21:27:28,901][1096443] Updated weights for policy 0, policy_version 125520 (0.0005) [2023-03-10 21:27:29,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 64274432. Throughput: 0: 11911.2. Samples: 64257608. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:27:29,742][1096160] Avg episode reward: [(0, '4862.285')] [2023-03-10 21:27:29,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000125536_64274432.pth... [2023-03-10 21:27:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000124840_63918080.pth [2023-03-10 21:27:32,295][1096443] Updated weights for policy 0, policy_version 125600 (0.0004) [2023-03-10 21:27:34,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 11810.1, 300 sec: 11954.8). Total num frames: 64335872. Throughput: 0: 11986.7. Samples: 64331156. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:27:34,742][1096160] Avg episode reward: [(0, '4860.122')] [2023-03-10 21:27:35,677][1096443] Updated weights for policy 0, policy_version 125680 (0.0005) [2023-03-10 21:27:38,868][1096443] Updated weights for policy 0, policy_version 125760 (0.0005) [2023-03-10 21:27:39,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 64397312. Throughput: 0: 12011.5. Samples: 64368484. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:27:39,742][1096160] Avg episode reward: [(0, '4859.527')] [2023-03-10 21:27:42,289][1096443] Updated weights for policy 0, policy_version 125840 (0.0004) [2023-03-10 21:27:44,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12015.0, 300 sec: 11954.8). Total num frames: 64458752. Throughput: 0: 12016.3. Samples: 64441984. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:27:44,742][1096160] Avg episode reward: [(0, '4860.997')] [2023-03-10 21:27:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000125896_64458752.pth... [2023-03-10 21:27:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000125176_64090112.pth [2023-03-10 21:27:45,790][1096443] Updated weights for policy 0, policy_version 125920 (0.0004) [2023-03-10 21:27:49,209][1096443] Updated weights for policy 0, policy_version 126000 (0.0005) [2023-03-10 21:27:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 64516096. Throughput: 0: 12043.2. Samples: 64512064. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:27:49,742][1096160] Avg episode reward: [(0, '4864.691')] [2023-03-10 21:27:49,743][1096399] Saving new best policy, reward=4864.691! [2023-03-10 21:27:52,786][1096443] Updated weights for policy 0, policy_version 126080 (0.0005) [2023-03-10 21:27:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 64577536. Throughput: 0: 12038.7. Samples: 64547280. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:27:54,742][1096160] Avg episode reward: [(0, '4859.702')] [2023-03-10 21:27:56,220][1096443] Updated weights for policy 0, policy_version 126160 (0.0005) [2023-03-10 21:27:59,539][1096443] Updated weights for policy 0, policy_version 126240 (0.0004) [2023-03-10 21:27:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 64634880. Throughput: 0: 12068.9. Samples: 64618400. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:27:59,742][1096160] Avg episode reward: [(0, '4861.610')] [2023-03-10 21:27:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000126240_64634880.pth... [2023-03-10 21:27:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000125536_64274432.pth [2023-03-10 21:28:02,735][1096443] Updated weights for policy 0, policy_version 126320 (0.0005) [2023-03-10 21:28:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12015.0, 300 sec: 11954.8). Total num frames: 64696320. Throughput: 0: 12184.9. Samples: 64692288. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:28:04,742][1096160] Avg episode reward: [(0, '4863.274')] [2023-03-10 21:28:06,376][1096443] Updated weights for policy 0, policy_version 126400 (0.0004) [2023-03-10 21:28:09,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 64753664. Throughput: 0: 12172.3. Samples: 64728040. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:28:09,742][1096160] Avg episode reward: [(0, '4860.184')] [2023-03-10 21:28:09,884][1096443] Updated weights for policy 0, policy_version 126480 (0.0005) [2023-03-10 21:28:13,766][1096443] Updated weights for policy 0, policy_version 126560 (0.0006) [2023-03-10 21:28:14,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 64806912. Throughput: 0: 11916.0. Samples: 64793828. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:28:14,742][1096160] Avg episode reward: [(0, '4862.544')] [2023-03-10 21:28:14,750][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000126584_64811008.pth... [2023-03-10 21:28:14,752][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000125896_64458752.pth [2023-03-10 21:28:17,133][1096443] Updated weights for policy 0, policy_version 126640 (0.0004) [2023-03-10 21:28:19,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 64868352. Throughput: 0: 11846.4. Samples: 64864244. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:28:19,742][1096160] Avg episode reward: [(0, '4861.596')] [2023-03-10 21:28:20,725][1096443] Updated weights for policy 0, policy_version 126720 (0.0005) [2023-03-10 21:28:24,263][1096443] Updated weights for policy 0, policy_version 126800 (0.0005) [2023-03-10 21:28:24,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11940.9). Total num frames: 64925696. Throughput: 0: 11767.6. Samples: 64898024. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:28:24,742][1096160] Avg episode reward: [(0, '4860.261')] [2023-03-10 21:28:27,656][1096443] Updated weights for policy 0, policy_version 126880 (0.0005) [2023-03-10 21:28:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11927.0). Total num frames: 64983040. Throughput: 0: 11736.4. Samples: 64970124. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:28:29,742][1096160] Avg episode reward: [(0, '4858.900')] [2023-03-10 21:28:29,754][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000126928_64987136.pth... [2023-03-10 21:28:29,756][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000126240_64634880.pth [2023-03-10 21:28:31,189][1096443] Updated weights for policy 0, policy_version 126960 (0.0006) [2023-03-10 21:28:34,561][1096443] Updated weights for policy 0, policy_version 127040 (0.0005) [2023-03-10 21:28:34,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.2, 300 sec: 11940.9). Total num frames: 65044480. Throughput: 0: 11740.6. Samples: 65040392. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:28:34,742][1096160] Avg episode reward: [(0, '4861.292')] [2023-03-10 21:28:37,801][1096443] Updated weights for policy 0, policy_version 127120 (0.0005) [2023-03-10 21:28:39,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 11810.2, 300 sec: 11940.9). Total num frames: 65105920. Throughput: 0: 11796.2. Samples: 65078108. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:28:39,742][1096160] Avg episode reward: [(0, '4860.446')] [2023-03-10 21:28:41,265][1096443] Updated weights for policy 0, policy_version 127200 (0.0005) [2023-03-10 21:28:44,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11741.8, 300 sec: 11927.0). Total num frames: 65163264. Throughput: 0: 11774.1. Samples: 65148236. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 21:28:44,742][1096160] Avg episode reward: [(0, '4858.848')] [2023-03-10 21:28:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000127272_65163264.pth... [2023-03-10 21:28:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000126584_64811008.pth [2023-03-10 21:28:44,803][1096443] Updated weights for policy 0, policy_version 127280 (0.0005) [2023-03-10 21:28:48,181][1096443] Updated weights for policy 0, policy_version 127360 (0.0005) [2023-03-10 21:28:49,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 65224704. Throughput: 0: 11744.8. Samples: 65220804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:28:49,742][1096160] Avg episode reward: [(0, '4858.361')] [2023-03-10 21:28:51,501][1096443] Updated weights for policy 0, policy_version 127440 (0.0005) [2023-03-10 21:28:54,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 65286144. Throughput: 0: 11765.2. Samples: 65257472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:28:54,742][1096160] Avg episode reward: [(0, '4855.130')] [2023-03-10 21:28:54,980][1096443] Updated weights for policy 0, policy_version 127520 (0.0004) [2023-03-10 21:28:58,454][1096443] Updated weights for policy 0, policy_version 127600 (0.0005) [2023-03-10 21:28:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 65343488. Throughput: 0: 11870.2. Samples: 65327988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:28:59,742][1096160] Avg episode reward: [(0, '4860.083')] [2023-03-10 21:28:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000127624_65343488.pth... [2023-03-10 21:28:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000126928_64987136.pth [2023-03-10 21:29:01,963][1096443] Updated weights for policy 0, policy_version 127680 (0.0005) [2023-03-10 21:29:04,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11927.0). Total num frames: 65400832. Throughput: 0: 11875.6. Samples: 65398644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:04,742][1096160] Avg episode reward: [(0, '4858.529')] [2023-03-10 21:29:05,520][1096443] Updated weights for policy 0, policy_version 127760 (0.0005) [2023-03-10 21:29:09,052][1096443] Updated weights for policy 0, policy_version 127840 (0.0004) [2023-03-10 21:29:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 65462272. Throughput: 0: 11901.7. Samples: 65433600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:09,742][1096160] Avg episode reward: [(0, '4857.386')] [2023-03-10 21:29:12,367][1096443] Updated weights for policy 0, policy_version 127920 (0.0005) [2023-03-10 21:29:14,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 65519616. Throughput: 0: 11866.1. Samples: 65504100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:14,742][1096160] Avg episode reward: [(0, '4862.175')] [2023-03-10 21:29:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000127968_65519616.pth... [2023-03-10 21:29:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000127272_65163264.pth [2023-03-10 21:29:15,913][1096443] Updated weights for policy 0, policy_version 128000 (0.0005) [2023-03-10 21:29:19,578][1096443] Updated weights for policy 0, policy_version 128080 (0.0005) [2023-03-10 21:29:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 65576960. Throughput: 0: 11834.1. Samples: 65572928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:19,742][1096160] Avg episode reward: [(0, '4856.999')] [2023-03-10 21:29:23,077][1096443] Updated weights for policy 0, policy_version 128160 (0.0005) [2023-03-10 21:29:24,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 65634304. Throughput: 0: 11762.7. Samples: 65607432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:24,742][1096160] Avg episode reward: [(0, '4857.253')] [2023-03-10 21:29:26,679][1096443] Updated weights for policy 0, policy_version 128240 (0.0005) [2023-03-10 21:29:29,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11810.1, 300 sec: 11885.3). Total num frames: 65691648. Throughput: 0: 11725.9. Samples: 65675900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:29,742][1096160] Avg episode reward: [(0, '4859.660')] [2023-03-10 21:29:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000128304_65691648.pth... [2023-03-10 21:29:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000127624_65343488.pth [2023-03-10 21:29:30,233][1096443] Updated weights for policy 0, policy_version 128320 (0.0004) [2023-03-10 21:29:33,785][1096443] Updated weights for policy 0, policy_version 128400 (0.0005) [2023-03-10 21:29:34,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11741.9, 300 sec: 11871.5). Total num frames: 65748992. Throughput: 0: 11650.4. Samples: 65745072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:34,742][1096160] Avg episode reward: [(0, '4860.748')] [2023-03-10 21:29:37,438][1096443] Updated weights for policy 0, policy_version 128480 (0.0005) [2023-03-10 21:29:39,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11871.5). Total num frames: 65806336. Throughput: 0: 11598.8. Samples: 65779420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:39,742][1096160] Avg episode reward: [(0, '4859.693')] [2023-03-10 21:29:40,943][1096443] Updated weights for policy 0, policy_version 128560 (0.0005) [2023-03-10 21:29:44,415][1096443] Updated weights for policy 0, policy_version 128640 (0.0005) [2023-03-10 21:29:44,741][1096160] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11857.6). Total num frames: 65863680. Throughput: 0: 11593.7. Samples: 65849704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:44,742][1096160] Avg episode reward: [(0, '4857.588')] [2023-03-10 21:29:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000128640_65863680.pth... [2023-03-10 21:29:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000127968_65519616.pth [2023-03-10 21:29:47,977][1096443] Updated weights for policy 0, policy_version 128720 (0.0005) [2023-03-10 21:29:49,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11843.7). Total num frames: 65921024. Throughput: 0: 11562.8. Samples: 65918972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:49,742][1096160] Avg episode reward: [(0, '4861.190')] [2023-03-10 21:29:51,480][1096443] Updated weights for policy 0, policy_version 128800 (0.0005) [2023-03-10 21:29:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11843.7). Total num frames: 65982464. Throughput: 0: 11561.2. Samples: 65953856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:54,742][1096160] Avg episode reward: [(0, '4861.461')] [2023-03-10 21:29:54,942][1096443] Updated weights for policy 0, policy_version 128880 (0.0005) [2023-03-10 21:29:58,423][1096443] Updated weights for policy 0, policy_version 128960 (0.0005) [2023-03-10 21:29:59,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11605.3, 300 sec: 11843.7). Total num frames: 66039808. Throughput: 0: 11566.7. Samples: 66024600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:29:59,742][1096160] Avg episode reward: [(0, '4859.408')] [2023-03-10 21:29:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000128984_66039808.pth... [2023-03-10 21:29:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000128304_65691648.pth [2023-03-10 21:30:02,024][1096443] Updated weights for policy 0, policy_version 129040 (0.0005) [2023-03-10 21:30:04,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 66097152. Throughput: 0: 11580.3. Samples: 66094044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:04,742][1096160] Avg episode reward: [(0, '4857.637')] [2023-03-10 21:30:05,496][1096443] Updated weights for policy 0, policy_version 129120 (0.0005) [2023-03-10 21:30:08,954][1096443] Updated weights for policy 0, policy_version 129200 (0.0005) [2023-03-10 21:30:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 66158592. Throughput: 0: 11605.7. Samples: 66129688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:09,742][1096160] Avg episode reward: [(0, '4860.734')] [2023-03-10 21:30:12,276][1096443] Updated weights for policy 0, policy_version 129280 (0.0005) [2023-03-10 21:30:14,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11605.4, 300 sec: 11829.8). Total num frames: 66215936. Throughput: 0: 11687.7. Samples: 66201844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:14,742][1096160] Avg episode reward: [(0, '4859.197')] [2023-03-10 21:30:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000129328_66215936.pth... [2023-03-10 21:30:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000128640_65863680.pth [2023-03-10 21:30:15,840][1096443] Updated weights for policy 0, policy_version 129360 (0.0005) [2023-03-10 21:30:19,369][1096443] Updated weights for policy 0, policy_version 129440 (0.0005) [2023-03-10 21:30:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 66277376. Throughput: 0: 11725.6. Samples: 66272724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:19,742][1096160] Avg episode reward: [(0, '4861.986')] [2023-03-10 21:30:22,697][1096443] Updated weights for policy 0, policy_version 129520 (0.0005) [2023-03-10 21:30:24,741][1096160] Fps is (10 sec: 12287.9, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 66338816. Throughput: 0: 11752.1. Samples: 66308264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:24,742][1096160] Avg episode reward: [(0, '4857.668')] [2023-03-10 21:30:26,117][1096443] Updated weights for policy 0, policy_version 129600 (0.0005) [2023-03-10 21:30:29,632][1096443] Updated weights for policy 0, policy_version 129680 (0.0004) [2023-03-10 21:30:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 66396160. Throughput: 0: 11778.3. Samples: 66379728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:29,742][1096160] Avg episode reward: [(0, '4855.973')] [2023-03-10 21:30:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000129680_66396160.pth... [2023-03-10 21:30:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000128984_66039808.pth [2023-03-10 21:30:33,323][1096443] Updated weights for policy 0, policy_version 129760 (0.0005) [2023-03-10 21:30:34,742][1096160] Fps is (10 sec: 11468.6, 60 sec: 11741.8, 300 sec: 11829.8). Total num frames: 66453504. Throughput: 0: 11757.4. Samples: 66448056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:34,743][1096160] Avg episode reward: [(0, '4859.840')] [2023-03-10 21:30:36,771][1096443] Updated weights for policy 0, policy_version 129840 (0.0005) [2023-03-10 21:30:39,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 66510848. Throughput: 0: 11771.3. Samples: 66483564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:39,742][1096160] Avg episode reward: [(0, '4858.404')] [2023-03-10 21:30:40,218][1096443] Updated weights for policy 0, policy_version 129920 (0.0005) [2023-03-10 21:30:43,798][1096443] Updated weights for policy 0, policy_version 130000 (0.0005) [2023-03-10 21:30:44,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 66568192. Throughput: 0: 11759.3. Samples: 66553768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:44,742][1096160] Avg episode reward: [(0, '4857.984')] [2023-03-10 21:30:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000130016_66568192.pth... [2023-03-10 21:30:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000129328_66215936.pth [2023-03-10 21:30:47,331][1096443] Updated weights for policy 0, policy_version 130080 (0.0005) [2023-03-10 21:30:49,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 66625536. Throughput: 0: 11739.1. Samples: 66622304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:49,742][1096160] Avg episode reward: [(0, '4858.227')] [2023-03-10 21:30:50,813][1096443] Updated weights for policy 0, policy_version 130160 (0.0005) [2023-03-10 21:30:54,164][1096443] Updated weights for policy 0, policy_version 130240 (0.0005) [2023-03-10 21:30:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 66686976. Throughput: 0: 11748.5. Samples: 66658368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:54,742][1096160] Avg episode reward: [(0, '4856.550')] [2023-03-10 21:30:57,674][1096443] Updated weights for policy 0, policy_version 130320 (0.0005) [2023-03-10 21:30:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11788.2). Total num frames: 66744320. Throughput: 0: 11745.8. Samples: 66730408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:30:59,742][1096160] Avg episode reward: [(0, '4851.966')] [2023-03-10 21:30:59,751][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000130368_66748416.pth... [2023-03-10 21:30:59,753][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000129680_66396160.pth [2023-03-10 21:31:01,193][1096443] Updated weights for policy 0, policy_version 130400 (0.0005) [2023-03-10 21:31:04,701][1096443] Updated weights for policy 0, policy_version 130480 (0.0005) [2023-03-10 21:31:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 66805760. Throughput: 0: 11716.9. Samples: 66799984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:04,742][1096160] Avg episode reward: [(0, '4853.012')] [2023-03-10 21:31:08,246][1096443] Updated weights for policy 0, policy_version 130560 (0.0005) [2023-03-10 21:31:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 66863104. Throughput: 0: 11692.9. Samples: 66834444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:09,742][1096160] Avg episode reward: [(0, '4847.274')] [2023-03-10 21:31:11,647][1096443] Updated weights for policy 0, policy_version 130640 (0.0004) [2023-03-10 21:31:14,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 66924544. Throughput: 0: 11728.6. Samples: 66907512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:14,742][1096160] Avg episode reward: [(0, '4842.713')] [2023-03-10 21:31:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000130712_66924544.pth... [2023-03-10 21:31:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000130016_66568192.pth [2023-03-10 21:31:15,053][1096443] Updated weights for policy 0, policy_version 130720 (0.0004) [2023-03-10 21:31:18,544][1096443] Updated weights for policy 0, policy_version 130800 (0.0004) [2023-03-10 21:31:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 66981888. Throughput: 0: 11770.4. Samples: 66977724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:19,742][1096160] Avg episode reward: [(0, '4855.199')] [2023-03-10 21:31:21,911][1096443] Updated weights for policy 0, policy_version 130880 (0.0004) [2023-03-10 21:31:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 67043328. Throughput: 0: 11802.0. Samples: 67014656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:24,742][1096160] Avg episode reward: [(0, '4851.375')] [2023-03-10 21:31:25,268][1096443] Updated weights for policy 0, policy_version 130960 (0.0004) [2023-03-10 21:31:28,667][1096443] Updated weights for policy 0, policy_version 131040 (0.0005) [2023-03-10 21:31:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 67100672. Throughput: 0: 11873.2. Samples: 67088064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:29,742][1096160] Avg episode reward: [(0, '4860.515')] [2023-03-10 21:31:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000131064_67104768.pth... [2023-03-10 21:31:29,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000130368_66748416.pth [2023-03-10 21:31:32,162][1096443] Updated weights for policy 0, policy_version 131120 (0.0005) [2023-03-10 21:31:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.2, 300 sec: 11802.0). Total num frames: 67162112. Throughput: 0: 11902.1. Samples: 67157900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:34,742][1096160] Avg episode reward: [(0, '4860.759')] [2023-03-10 21:31:35,700][1096443] Updated weights for policy 0, policy_version 131200 (0.0005) [2023-03-10 21:31:39,091][1096443] Updated weights for policy 0, policy_version 131280 (0.0005) [2023-03-10 21:31:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 67219456. Throughput: 0: 11868.2. Samples: 67192436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:39,742][1096160] Avg episode reward: [(0, '4858.089')] [2023-03-10 21:31:42,660][1096443] Updated weights for policy 0, policy_version 131360 (0.0005) [2023-03-10 21:31:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11815.9). Total num frames: 67280896. Throughput: 0: 11855.5. Samples: 67263908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:44,742][1096160] Avg episode reward: [(0, '4859.217')] [2023-03-10 21:31:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000131408_67280896.pth... [2023-03-10 21:31:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000130712_66924544.pth [2023-03-10 21:31:46,097][1096443] Updated weights for policy 0, policy_version 131440 (0.0005) [2023-03-10 21:31:49,489][1096443] Updated weights for policy 0, policy_version 131520 (0.0005) [2023-03-10 21:31:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 67338240. Throughput: 0: 11876.2. Samples: 67334412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:49,742][1096160] Avg episode reward: [(0, '4863.005')] [2023-03-10 21:31:52,949][1096443] Updated weights for policy 0, policy_version 131600 (0.0005) [2023-03-10 21:31:54,742][1096160] Fps is (10 sec: 11878.6, 60 sec: 11878.4, 300 sec: 11815.9). Total num frames: 67399680. Throughput: 0: 11923.7. Samples: 67371008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:31:54,742][1096160] Avg episode reward: [(0, '4861.806')] [2023-03-10 21:31:56,352][1096443] Updated weights for policy 0, policy_version 131680 (0.0005) [2023-03-10 21:31:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 67457024. Throughput: 0: 11884.5. Samples: 67442316. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:31:59,742][1096160] Avg episode reward: [(0, '4862.038')] [2023-03-10 21:31:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000131752_67457024.pth... [2023-03-10 21:31:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000131064_67104768.pth [2023-03-10 21:31:59,949][1096443] Updated weights for policy 0, policy_version 131760 (0.0005) [2023-03-10 21:32:03,416][1096443] Updated weights for policy 0, policy_version 131840 (0.0005) [2023-03-10 21:32:04,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.2, 300 sec: 11802.0). Total num frames: 67514368. Throughput: 0: 11863.9. Samples: 67511600. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:04,742][1096160] Avg episode reward: [(0, '4856.948')] [2023-03-10 21:32:06,823][1096443] Updated weights for policy 0, policy_version 131920 (0.0005) [2023-03-10 21:32:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11815.9). Total num frames: 67575808. Throughput: 0: 11841.6. Samples: 67547528. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:09,742][1096160] Avg episode reward: [(0, '4858.202')] [2023-03-10 21:32:10,290][1096443] Updated weights for policy 0, policy_version 132000 (0.0005) [2023-03-10 21:32:13,811][1096443] Updated weights for policy 0, policy_version 132080 (0.0005) [2023-03-10 21:32:14,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 67633152. Throughput: 0: 11790.6. Samples: 67618640. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:14,742][1096160] Avg episode reward: [(0, '4860.896')] [2023-03-10 21:32:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000132096_67633152.pth... [2023-03-10 21:32:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000131408_67280896.pth [2023-03-10 21:32:17,319][1096443] Updated weights for policy 0, policy_version 132160 (0.0005) [2023-03-10 21:32:19,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 67690496. Throughput: 0: 11751.3. Samples: 67686708. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:19,742][1096160] Avg episode reward: [(0, '4863.417')] [2023-03-10 21:32:20,830][1096443] Updated weights for policy 0, policy_version 132240 (0.0005) [2023-03-10 21:32:24,209][1096443] Updated weights for policy 0, policy_version 132320 (0.0005) [2023-03-10 21:32:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 67751936. Throughput: 0: 11797.6. Samples: 67723328. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:24,742][1096160] Avg episode reward: [(0, '4860.487')] [2023-03-10 21:32:27,648][1096443] Updated weights for policy 0, policy_version 132400 (0.0005) [2023-03-10 21:32:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 67809280. Throughput: 0: 11816.1. Samples: 67795632. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:29,742][1096160] Avg episode reward: [(0, '4861.874')] [2023-03-10 21:32:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000132440_67809280.pth... [2023-03-10 21:32:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000131752_67457024.pth [2023-03-10 21:32:31,203][1096443] Updated weights for policy 0, policy_version 132480 (0.0005) [2023-03-10 21:32:34,709][1096443] Updated weights for policy 0, policy_version 132560 (0.0006) [2023-03-10 21:32:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 67870720. Throughput: 0: 11796.6. Samples: 67865260. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:34,742][1096160] Avg episode reward: [(0, '4861.556')] [2023-03-10 21:32:38,251][1096443] Updated weights for policy 0, policy_version 132640 (0.0006) [2023-03-10 21:32:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11760.4). Total num frames: 67928064. Throughput: 0: 11743.3. Samples: 67899456. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:39,742][1096160] Avg episode reward: [(0, '4864.699')] [2023-03-10 21:32:39,743][1096399] Saving new best policy, reward=4864.699! [2023-03-10 21:32:41,656][1096443] Updated weights for policy 0, policy_version 132720 (0.0005) [2023-03-10 21:32:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 67989504. Throughput: 0: 11791.9. Samples: 67972952. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:44,742][1096160] Avg episode reward: [(0, '4864.633')] [2023-03-10 21:32:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000132792_67989504.pth... [2023-03-10 21:32:44,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000132096_67633152.pth [2023-03-10 21:32:44,948][1096443] Updated weights for policy 0, policy_version 132800 (0.0005) [2023-03-10 21:32:48,451][1096443] Updated weights for policy 0, policy_version 132880 (0.0005) [2023-03-10 21:32:49,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11774.3). Total num frames: 68050944. Throughput: 0: 11828.6. Samples: 68043888. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:49,742][1096160] Avg episode reward: [(0, '4859.901')] [2023-03-10 21:32:51,928][1096443] Updated weights for policy 0, policy_version 132960 (0.0005) [2023-03-10 21:32:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 68108288. Throughput: 0: 11824.0. Samples: 68079608. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:54,742][1096160] Avg episode reward: [(0, '4862.168')] [2023-03-10 21:32:55,363][1096443] Updated weights for policy 0, policy_version 133040 (0.0005) [2023-03-10 21:32:58,569][1096443] Updated weights for policy 0, policy_version 133120 (0.0005) [2023-03-10 21:32:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11774.3). Total num frames: 68169728. Throughput: 0: 11882.3. Samples: 68153344. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:32:59,742][1096160] Avg episode reward: [(0, '4865.098')] [2023-03-10 21:32:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000133144_68169728.pth... [2023-03-10 21:32:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000132440_67809280.pth [2023-03-10 21:32:59,748][1096399] Saving new best policy, reward=4865.098! [2023-03-10 21:33:02,023][1096443] Updated weights for policy 0, policy_version 133200 (0.0005) [2023-03-10 21:33:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11774.3). Total num frames: 68227072. Throughput: 0: 11917.2. Samples: 68222984. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:04,742][1096160] Avg episode reward: [(0, '4861.262')] [2023-03-10 21:33:05,628][1096443] Updated weights for policy 0, policy_version 133280 (0.0004) [2023-03-10 21:33:09,090][1096443] Updated weights for policy 0, policy_version 133360 (0.0005) [2023-03-10 21:33:09,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 68284416. Throughput: 0: 11906.8. Samples: 68259132. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:09,742][1096160] Avg episode reward: [(0, '4859.546')] [2023-03-10 21:33:12,467][1096443] Updated weights for policy 0, policy_version 133440 (0.0005) [2023-03-10 21:33:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 68349952. Throughput: 0: 11871.4. Samples: 68329844. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:14,742][1096160] Avg episode reward: [(0, '4858.998')] [2023-03-10 21:33:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000133496_68349952.pth... [2023-03-10 21:33:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000132792_67989504.pth [2023-03-10 21:33:15,836][1096443] Updated weights for policy 0, policy_version 133520 (0.0005) [2023-03-10 21:33:19,303][1096443] Updated weights for policy 0, policy_version 133600 (0.0005) [2023-03-10 21:33:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 68407296. Throughput: 0: 11952.0. Samples: 68403100. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:19,742][1096160] Avg episode reward: [(0, '4861.481')] [2023-03-10 21:33:22,823][1096443] Updated weights for policy 0, policy_version 133680 (0.0004) [2023-03-10 21:33:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 68464640. Throughput: 0: 11924.1. Samples: 68436040. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:24,742][1096160] Avg episode reward: [(0, '4861.449')] [2023-03-10 21:33:26,197][1096443] Updated weights for policy 0, policy_version 133760 (0.0005) [2023-03-10 21:33:29,567][1096443] Updated weights for policy 0, policy_version 133840 (0.0005) [2023-03-10 21:33:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 68526080. Throughput: 0: 11929.3. Samples: 68509768. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:29,742][1096160] Avg episode reward: [(0, '4859.562')] [2023-03-10 21:33:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000133840_68526080.pth... [2023-03-10 21:33:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000133144_68169728.pth [2023-03-10 21:33:33,014][1096443] Updated weights for policy 0, policy_version 133920 (0.0005) [2023-03-10 21:33:34,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 68587520. Throughput: 0: 11982.6. Samples: 68583108. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:34,742][1096160] Avg episode reward: [(0, '4860.140')] [2023-03-10 21:33:36,359][1096443] Updated weights for policy 0, policy_version 134000 (0.0004) [2023-03-10 21:33:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 68644864. Throughput: 0: 11949.6. Samples: 68617340. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:39,742][1096160] Avg episode reward: [(0, '4858.535')] [2023-03-10 21:33:39,900][1096443] Updated weights for policy 0, policy_version 134080 (0.0005) [2023-03-10 21:33:43,489][1096443] Updated weights for policy 0, policy_version 134160 (0.0004) [2023-03-10 21:33:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 68702208. Throughput: 0: 11840.6. Samples: 68686172. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:44,742][1096160] Avg episode reward: [(0, '4863.379')] [2023-03-10 21:33:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000134184_68702208.pth... [2023-03-10 21:33:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000133496_68349952.pth [2023-03-10 21:33:46,768][1096443] Updated weights for policy 0, policy_version 134240 (0.0005) [2023-03-10 21:33:49,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 68763648. Throughput: 0: 11925.3. Samples: 68759624. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:49,742][1096160] Avg episode reward: [(0, '4862.346')] [2023-03-10 21:33:50,185][1096443] Updated weights for policy 0, policy_version 134320 (0.0005) [2023-03-10 21:33:53,300][1096443] Updated weights for policy 0, policy_version 134400 (0.0004) [2023-03-10 21:33:54,742][1096160] Fps is (10 sec: 12697.7, 60 sec: 12014.9, 300 sec: 11815.9). Total num frames: 68829184. Throughput: 0: 12024.1. Samples: 68800216. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:54,742][1096160] Avg episode reward: [(0, '4857.991')] [2023-03-10 21:33:56,592][1096443] Updated weights for policy 0, policy_version 134480 (0.0005) [2023-03-10 21:33:59,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 12014.9, 300 sec: 11829.8). Total num frames: 68890624. Throughput: 0: 12074.4. Samples: 68873192. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 21:33:59,742][1096160] Avg episode reward: [(0, '4860.624')] [2023-03-10 21:33:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000134552_68890624.pth... [2023-03-10 21:33:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000133840_68526080.pth [2023-03-10 21:33:59,921][1096443] Updated weights for policy 0, policy_version 134560 (0.0005) [2023-03-10 21:34:03,303][1096443] Updated weights for policy 0, policy_version 134640 (0.0004) [2023-03-10 21:34:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11829.8). Total num frames: 68952064. Throughput: 0: 12106.8. Samples: 68947908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:04,742][1096160] Avg episode reward: [(0, '4859.046')] [2023-03-10 21:34:06,722][1096443] Updated weights for policy 0, policy_version 134720 (0.0005) [2023-03-10 21:34:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11829.8). Total num frames: 69009408. Throughput: 0: 12159.5. Samples: 68983216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:09,742][1096160] Avg episode reward: [(0, '4858.755')] [2023-03-10 21:34:10,064][1096443] Updated weights for policy 0, policy_version 134800 (0.0005) [2023-03-10 21:34:13,317][1096443] Updated weights for policy 0, policy_version 134880 (0.0004) [2023-03-10 21:34:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11843.7). Total num frames: 69070848. Throughput: 0: 12182.9. Samples: 69058000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:14,742][1096160] Avg episode reward: [(0, '4856.474')] [2023-03-10 21:34:14,757][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000134912_69074944.pth... [2023-03-10 21:34:14,759][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000134184_68702208.pth [2023-03-10 21:34:16,846][1096443] Updated weights for policy 0, policy_version 134960 (0.0005) [2023-03-10 21:34:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11857.6). Total num frames: 69132288. Throughput: 0: 12114.6. Samples: 69128264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:19,742][1096160] Avg episode reward: [(0, '4862.465')] [2023-03-10 21:34:20,286][1096443] Updated weights for policy 0, policy_version 135040 (0.0005) [2023-03-10 21:34:23,618][1096443] Updated weights for policy 0, policy_version 135120 (0.0004) [2023-03-10 21:34:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11871.5). Total num frames: 69193728. Throughput: 0: 12163.4. Samples: 69164692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:24,742][1096160] Avg episode reward: [(0, '4861.729')] [2023-03-10 21:34:27,079][1096443] Updated weights for policy 0, policy_version 135200 (0.0005) [2023-03-10 21:34:29,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11885.3). Total num frames: 69255168. Throughput: 0: 12203.9. Samples: 69235348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:29,742][1096160] Avg episode reward: [(0, '4858.354')] [2023-03-10 21:34:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000135264_69255168.pth... [2023-03-10 21:34:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000134552_68890624.pth [2023-03-10 21:34:30,502][1096443] Updated weights for policy 0, policy_version 135280 (0.0005) [2023-03-10 21:34:33,713][1096443] Updated weights for policy 0, policy_version 135360 (0.0005) [2023-03-10 21:34:34,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11899.2). Total num frames: 69316608. Throughput: 0: 12274.4. Samples: 69311972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:34,742][1096160] Avg episode reward: [(0, '4852.463')] [2023-03-10 21:34:36,989][1096443] Updated weights for policy 0, policy_version 135440 (0.0005) [2023-03-10 21:34:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 11899.2). Total num frames: 69373952. Throughput: 0: 12188.3. Samples: 69348688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:39,742][1096160] Avg episode reward: [(0, '4851.172')] [2023-03-10 21:34:40,567][1096443] Updated weights for policy 0, policy_version 135520 (0.0004) [2023-03-10 21:34:43,947][1096443] Updated weights for policy 0, policy_version 135600 (0.0004) [2023-03-10 21:34:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12219.8, 300 sec: 11913.1). Total num frames: 69435392. Throughput: 0: 12126.2. Samples: 69418868. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:44,742][1096160] Avg episode reward: [(0, '4851.235')] [2023-03-10 21:34:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000135616_69435392.pth... [2023-03-10 21:34:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000134912_69074944.pth [2023-03-10 21:34:47,266][1096443] Updated weights for policy 0, policy_version 135680 (0.0005) [2023-03-10 21:34:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12219.7, 300 sec: 11913.1). Total num frames: 69496832. Throughput: 0: 12104.8. Samples: 69492624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:49,742][1096160] Avg episode reward: [(0, '4850.407')] [2023-03-10 21:34:50,743][1096443] Updated weights for policy 0, policy_version 135760 (0.0005) [2023-03-10 21:34:53,893][1096443] Updated weights for policy 0, policy_version 135840 (0.0005) [2023-03-10 21:34:54,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11927.0). Total num frames: 69558272. Throughput: 0: 12127.8. Samples: 69528964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:54,742][1096160] Avg episode reward: [(0, '4859.195')] [2023-03-10 21:34:57,124][1096443] Updated weights for policy 0, policy_version 135920 (0.0005) [2023-03-10 21:34:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12151.5, 300 sec: 11940.9). Total num frames: 69619712. Throughput: 0: 12134.6. Samples: 69604056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:34:59,742][1096160] Avg episode reward: [(0, '4855.801')] [2023-03-10 21:34:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000135976_69619712.pth... [2023-03-10 21:34:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000135264_69255168.pth [2023-03-10 21:35:00,491][1096443] Updated weights for policy 0, policy_version 136000 (0.0005) [2023-03-10 21:35:03,792][1096443] Updated weights for policy 0, policy_version 136080 (0.0005) [2023-03-10 21:35:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.5, 300 sec: 11940.9). Total num frames: 69681152. Throughput: 0: 12226.6. Samples: 69678460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:04,742][1096160] Avg episode reward: [(0, '4852.911')] [2023-03-10 21:35:07,344][1096443] Updated weights for policy 0, policy_version 136160 (0.0005) [2023-03-10 21:35:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 11940.9). Total num frames: 69738496. Throughput: 0: 12194.5. Samples: 69713444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:09,742][1096160] Avg episode reward: [(0, '4856.430')] [2023-03-10 21:35:10,882][1096443] Updated weights for policy 0, policy_version 136240 (0.0005) [2023-03-10 21:35:14,312][1096443] Updated weights for policy 0, policy_version 136320 (0.0005) [2023-03-10 21:35:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12151.5, 300 sec: 11940.9). Total num frames: 69799936. Throughput: 0: 12160.5. Samples: 69782568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:14,742][1096160] Avg episode reward: [(0, '4852.208')] [2023-03-10 21:35:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000136328_69799936.pth... [2023-03-10 21:35:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000135616_69435392.pth [2023-03-10 21:35:17,926][1096443] Updated weights for policy 0, policy_version 136400 (0.0004) [2023-03-10 21:35:19,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 12015.0, 300 sec: 11913.1). Total num frames: 69853184. Throughput: 0: 11990.0. Samples: 69851520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:19,742][1096160] Avg episode reward: [(0, '4851.252')] [2023-03-10 21:35:21,555][1096443] Updated weights for policy 0, policy_version 136480 (0.0004) [2023-03-10 21:35:24,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 69910528. Throughput: 0: 11939.5. Samples: 69885964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:24,742][1096160] Avg episode reward: [(0, '4856.804')] [2023-03-10 21:35:25,243][1096443] Updated weights for policy 0, policy_version 136560 (0.0005) [2023-03-10 21:35:28,693][1096443] Updated weights for policy 0, policy_version 136640 (0.0004) [2023-03-10 21:35:29,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11913.1). Total num frames: 69967872. Throughput: 0: 11904.3. Samples: 69954564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:29,742][1096160] Avg episode reward: [(0, '4858.068')] [2023-03-10 21:35:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000136656_69967872.pth... [2023-03-10 21:35:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000135976_69619712.pth [2023-03-10 21:35:32,258][1096443] Updated weights for policy 0, policy_version 136720 (0.0004) [2023-03-10 21:35:34,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11913.1). Total num frames: 70025216. Throughput: 0: 11789.3. Samples: 70023140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:34,742][1096160] Avg episode reward: [(0, '4856.885')] [2023-03-10 21:35:35,724][1096443] Updated weights for policy 0, policy_version 136800 (0.0005) [2023-03-10 21:35:39,127][1096443] Updated weights for policy 0, policy_version 136880 (0.0005) [2023-03-10 21:35:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11927.0). Total num frames: 70086656. Throughput: 0: 11787.6. Samples: 70059408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:39,742][1096160] Avg episode reward: [(0, '4853.715')] [2023-03-10 21:35:42,632][1096443] Updated weights for policy 0, policy_version 136960 (0.0004) [2023-03-10 21:35:44,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11927.0). Total num frames: 70144000. Throughput: 0: 11703.5. Samples: 70130712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:44,742][1096160] Avg episode reward: [(0, '4859.491')] [2023-03-10 21:35:44,765][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000137008_70148096.pth... [2023-03-10 21:35:44,767][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000136328_69799936.pth [2023-03-10 21:35:46,068][1096443] Updated weights for policy 0, policy_version 137040 (0.0005) [2023-03-10 21:35:49,478][1096443] Updated weights for policy 0, policy_version 137120 (0.0005) [2023-03-10 21:35:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11927.0). Total num frames: 70205440. Throughput: 0: 11621.1. Samples: 70201408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:49,742][1096160] Avg episode reward: [(0, '4855.419')] [2023-03-10 21:35:52,681][1096443] Updated weights for policy 0, policy_version 137200 (0.0005) [2023-03-10 21:35:54,742][1096160] Fps is (10 sec: 12697.5, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 70270976. Throughput: 0: 11728.4. Samples: 70241224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:54,742][1096160] Avg episode reward: [(0, '4854.802')] [2023-03-10 21:35:55,949][1096443] Updated weights for policy 0, policy_version 137280 (0.0005) [2023-03-10 21:35:59,110][1096443] Updated weights for policy 0, policy_version 137360 (0.0004) [2023-03-10 21:35:59,741][1096160] Fps is (10 sec: 12697.6, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 70332416. Throughput: 0: 11857.5. Samples: 70316156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:35:59,742][1096160] Avg episode reward: [(0, '4855.727')] [2023-03-10 21:35:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000137368_70332416.pth... [2023-03-10 21:35:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000136656_69967872.pth [2023-03-10 21:36:02,522][1096443] Updated weights for policy 0, policy_version 137440 (0.0005) [2023-03-10 21:36:04,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 70393856. Throughput: 0: 11950.1. Samples: 70389276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:04,742][1096160] Avg episode reward: [(0, '4860.378')] [2023-03-10 21:36:05,937][1096443] Updated weights for policy 0, policy_version 137520 (0.0004) [2023-03-10 21:36:09,374][1096443] Updated weights for policy 0, policy_version 137600 (0.0005) [2023-03-10 21:36:09,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 70451200. Throughput: 0: 11993.5. Samples: 70425672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:09,742][1096160] Avg episode reward: [(0, '4856.751')] [2023-03-10 21:36:12,755][1096443] Updated weights for policy 0, policy_version 137680 (0.0005) [2023-03-10 21:36:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 70512640. Throughput: 0: 12062.1. Samples: 70497360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:14,742][1096160] Avg episode reward: [(0, '4857.353')] [2023-03-10 21:36:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000137720_70512640.pth... [2023-03-10 21:36:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000137008_70148096.pth [2023-03-10 21:36:16,157][1096443] Updated weights for policy 0, policy_version 137760 (0.0004) [2023-03-10 21:36:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 70569984. Throughput: 0: 12094.1. Samples: 70567376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:19,742][1096160] Avg episode reward: [(0, '4858.510')] [2023-03-10 21:36:19,814][1096443] Updated weights for policy 0, policy_version 137840 (0.0005) [2023-03-10 21:36:23,248][1096443] Updated weights for policy 0, policy_version 137920 (0.0004) [2023-03-10 21:36:24,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11968.7). Total num frames: 70631424. Throughput: 0: 12080.4. Samples: 70603028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:24,742][1096160] Avg episode reward: [(0, '4858.122')] [2023-03-10 21:36:26,551][1096443] Updated weights for policy 0, policy_version 138000 (0.0005) [2023-03-10 21:36:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12083.2, 300 sec: 11968.6). Total num frames: 70692864. Throughput: 0: 12116.2. Samples: 70675944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:29,742][1096160] Avg episode reward: [(0, '4860.621')] [2023-03-10 21:36:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000138072_70692864.pth... [2023-03-10 21:36:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000137368_70332416.pth [2023-03-10 21:36:29,989][1096443] Updated weights for policy 0, policy_version 138080 (0.0005) [2023-03-10 21:36:33,455][1096443] Updated weights for policy 0, policy_version 138160 (0.0005) [2023-03-10 21:36:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11968.6). Total num frames: 70750208. Throughput: 0: 12110.8. Samples: 70746396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:34,742][1096160] Avg episode reward: [(0, '4861.236')] [2023-03-10 21:36:36,847][1096443] Updated weights for policy 0, policy_version 138240 (0.0005) [2023-03-10 21:36:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12083.2, 300 sec: 11968.7). Total num frames: 70811648. Throughput: 0: 12043.1. Samples: 70783164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:39,742][1096160] Avg episode reward: [(0, '4862.425')] [2023-03-10 21:36:40,231][1096443] Updated weights for policy 0, policy_version 138320 (0.0005) [2023-03-10 21:36:43,349][1096443] Updated weights for policy 0, policy_version 138400 (0.0004) [2023-03-10 21:36:44,742][1096160] Fps is (10 sec: 13107.1, 60 sec: 12288.0, 300 sec: 12010.3). Total num frames: 70881280. Throughput: 0: 12049.5. Samples: 70858384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:44,742][1096160] Avg episode reward: [(0, '4861.678')] [2023-03-10 21:36:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000138440_70881280.pth... [2023-03-10 21:36:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000137720_70512640.pth [2023-03-10 21:36:46,337][1096443] Updated weights for policy 0, policy_version 138480 (0.0004) [2023-03-10 21:36:49,522][1096443] Updated weights for policy 0, policy_version 138560 (0.0004) [2023-03-10 21:36:49,742][1096160] Fps is (10 sec: 13107.3, 60 sec: 12288.0, 300 sec: 12010.3). Total num frames: 70942720. Throughput: 0: 12209.5. Samples: 70938704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:49,742][1096160] Avg episode reward: [(0, '4856.812')] [2023-03-10 21:36:52,998][1096443] Updated weights for policy 0, policy_version 138640 (0.0004) [2023-03-10 21:36:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 71000064. Throughput: 0: 12202.9. Samples: 70974804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:54,742][1096160] Avg episode reward: [(0, '4860.521')] [2023-03-10 21:36:56,480][1096443] Updated weights for policy 0, policy_version 138720 (0.0005) [2023-03-10 21:36:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12151.4, 300 sec: 12024.2). Total num frames: 71061504. Throughput: 0: 12173.8. Samples: 71045184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:36:59,742][1096160] Avg episode reward: [(0, '4863.312')] [2023-03-10 21:36:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000138792_71061504.pth... [2023-03-10 21:36:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000138072_70692864.pth [2023-03-10 21:36:59,985][1096443] Updated weights for policy 0, policy_version 138800 (0.0004) [2023-03-10 21:37:03,333][1096443] Updated weights for policy 0, policy_version 138880 (0.0004) [2023-03-10 21:37:04,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12151.4, 300 sec: 12024.2). Total num frames: 71122944. Throughput: 0: 12246.2. Samples: 71118456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:04,742][1096160] Avg episode reward: [(0, '4858.895')] [2023-03-10 21:37:06,647][1096443] Updated weights for policy 0, policy_version 138960 (0.0005) [2023-03-10 21:37:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.4, 300 sec: 12024.2). Total num frames: 71180288. Throughput: 0: 12270.8. Samples: 71155216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:09,742][1096160] Avg episode reward: [(0, '4857.373')] [2023-03-10 21:37:10,217][1096443] Updated weights for policy 0, policy_version 139040 (0.0005) [2023-03-10 21:37:13,690][1096443] Updated weights for policy 0, policy_version 139120 (0.0005) [2023-03-10 21:37:14,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 12151.5, 300 sec: 12038.1). Total num frames: 71241728. Throughput: 0: 12206.0. Samples: 71225212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:14,742][1096160] Avg episode reward: [(0, '4859.496')] [2023-03-10 21:37:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000139144_71241728.pth... [2023-03-10 21:37:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000138440_70881280.pth [2023-03-10 21:37:17,181][1096443] Updated weights for policy 0, policy_version 139200 (0.0005) [2023-03-10 21:37:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 12083.2, 300 sec: 12010.3). Total num frames: 71294976. Throughput: 0: 12145.2. Samples: 71292932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:19,742][1096160] Avg episode reward: [(0, '4860.525')] [2023-03-10 21:37:20,872][1096443] Updated weights for policy 0, policy_version 139280 (0.0005) [2023-03-10 21:37:24,547][1096443] Updated weights for policy 0, policy_version 139360 (0.0005) [2023-03-10 21:37:24,742][1096160] Fps is (10 sec: 11059.0, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 71352320. Throughput: 0: 12083.6. Samples: 71326928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:24,742][1096160] Avg episode reward: [(0, '4859.426')] [2023-03-10 21:37:28,063][1096443] Updated weights for policy 0, policy_version 139440 (0.0005) [2023-03-10 21:37:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 71409664. Throughput: 0: 11928.0. Samples: 71395144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:29,742][1096160] Avg episode reward: [(0, '4862.105')] [2023-03-10 21:37:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000139472_71409664.pth... [2023-03-10 21:37:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000138792_71061504.pth [2023-03-10 21:37:31,665][1096443] Updated weights for policy 0, policy_version 139520 (0.0005) [2023-03-10 21:37:34,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 71467008. Throughput: 0: 11649.3. Samples: 71462924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:34,742][1096160] Avg episode reward: [(0, '4862.499')] [2023-03-10 21:37:35,377][1096443] Updated weights for policy 0, policy_version 139600 (0.0004) [2023-03-10 21:37:39,188][1096443] Updated weights for policy 0, policy_version 139680 (0.0005) [2023-03-10 21:37:39,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11810.1, 300 sec: 11968.7). Total num frames: 71520256. Throughput: 0: 11569.2. Samples: 71495416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:39,742][1096160] Avg episode reward: [(0, '4860.616')] [2023-03-10 21:37:42,797][1096443] Updated weights for policy 0, policy_version 139760 (0.0005) [2023-03-10 21:37:44,741][1096160] Fps is (10 sec: 11059.2, 60 sec: 11605.4, 300 sec: 11954.8). Total num frames: 71577600. Throughput: 0: 11469.5. Samples: 71561308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:44,742][1096160] Avg episode reward: [(0, '4856.910')] [2023-03-10 21:37:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000139800_71577600.pth... [2023-03-10 21:37:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000139144_71241728.pth [2023-03-10 21:37:46,156][1096443] Updated weights for policy 0, policy_version 139840 (0.0005) [2023-03-10 21:37:49,485][1096443] Updated weights for policy 0, policy_version 139920 (0.0005) [2023-03-10 21:37:49,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11605.3, 300 sec: 11968.7). Total num frames: 71639040. Throughput: 0: 11485.3. Samples: 71635296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:49,742][1096160] Avg episode reward: [(0, '4854.779')] [2023-03-10 21:37:52,996][1096443] Updated weights for policy 0, policy_version 140000 (0.0005) [2023-03-10 21:37:54,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 11741.9, 300 sec: 11982.5). Total num frames: 71704576. Throughput: 0: 11472.4. Samples: 71671472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:54,742][1096160] Avg episode reward: [(0, '4848.828')] [2023-03-10 21:37:56,148][1096443] Updated weights for policy 0, policy_version 140080 (0.0005) [2023-03-10 21:37:59,555][1096443] Updated weights for policy 0, policy_version 140160 (0.0005) [2023-03-10 21:37:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11673.6, 300 sec: 11982.5). Total num frames: 71761920. Throughput: 0: 11558.2. Samples: 71745332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:37:59,742][1096160] Avg episode reward: [(0, '4851.640')] [2023-03-10 21:37:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000140160_71761920.pth... [2023-03-10 21:37:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000139472_71409664.pth [2023-03-10 21:38:02,945][1096443] Updated weights for policy 0, policy_version 140240 (0.0005) [2023-03-10 21:38:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11996.4). Total num frames: 71823360. Throughput: 0: 11688.7. Samples: 71818920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:38:04,742][1096160] Avg episode reward: [(0, '4850.338')] [2023-03-10 21:38:06,206][1096443] Updated weights for policy 0, policy_version 140320 (0.0004) [2023-03-10 21:38:09,708][1096443] Updated weights for policy 0, policy_version 140400 (0.0004) [2023-03-10 21:38:09,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 11741.9, 300 sec: 11982.5). Total num frames: 71884800. Throughput: 0: 11758.5. Samples: 71856060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:38:09,742][1096160] Avg episode reward: [(0, '4856.601')] [2023-03-10 21:38:12,942][1096443] Updated weights for policy 0, policy_version 140480 (0.0005) [2023-03-10 21:38:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11741.8, 300 sec: 11996.4). Total num frames: 71946240. Throughput: 0: 11879.1. Samples: 71929704. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:14,742][1096160] Avg episode reward: [(0, '4856.596')] [2023-03-10 21:38:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000140520_71946240.pth... [2023-03-10 21:38:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000139800_71577600.pth [2023-03-10 21:38:16,353][1096443] Updated weights for policy 0, policy_version 140560 (0.0005) [2023-03-10 21:38:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11996.4). Total num frames: 72003584. Throughput: 0: 11965.4. Samples: 72001368. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:19,742][1096160] Avg episode reward: [(0, '4850.858')] [2023-03-10 21:38:19,757][1096443] Updated weights for policy 0, policy_version 140640 (0.0005) [2023-03-10 21:38:22,793][1096443] Updated weights for policy 0, policy_version 140720 (0.0005) [2023-03-10 21:38:24,742][1096160] Fps is (10 sec: 12697.7, 60 sec: 12015.0, 300 sec: 12024.2). Total num frames: 72073216. Throughput: 0: 12113.2. Samples: 72040512. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:24,742][1096160] Avg episode reward: [(0, '4862.099')] [2023-03-10 21:38:26,004][1096443] Updated weights for policy 0, policy_version 140800 (0.0005) [2023-03-10 21:38:29,533][1096443] Updated weights for policy 0, policy_version 140880 (0.0005) [2023-03-10 21:38:29,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 12014.9, 300 sec: 12010.3). Total num frames: 72130560. Throughput: 0: 12295.5. Samples: 72114608. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:29,742][1096160] Avg episode reward: [(0, '4863.178')] [2023-03-10 21:38:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000140880_72130560.pth... [2023-03-10 21:38:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000140160_71761920.pth [2023-03-10 21:38:32,988][1096443] Updated weights for policy 0, policy_version 140960 (0.0005) [2023-03-10 21:38:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 12024.2). Total num frames: 72192000. Throughput: 0: 12267.6. Samples: 72187340. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:34,742][1096160] Avg episode reward: [(0, '4860.165')] [2023-03-10 21:38:36,400][1096443] Updated weights for policy 0, policy_version 141040 (0.0005) [2023-03-10 21:38:39,735][1096443] Updated weights for policy 0, policy_version 141120 (0.0005) [2023-03-10 21:38:39,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12219.7, 300 sec: 12038.1). Total num frames: 72253440. Throughput: 0: 12253.0. Samples: 72222856. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:39,742][1096160] Avg episode reward: [(0, '4859.755')] [2023-03-10 21:38:43,322][1096443] Updated weights for policy 0, policy_version 141200 (0.0005) [2023-03-10 21:38:44,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12151.5, 300 sec: 12010.3). Total num frames: 72306688. Throughput: 0: 12195.9. Samples: 72294148. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:44,742][1096160] Avg episode reward: [(0, '4860.230')] [2023-03-10 21:38:44,789][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000141232_72310784.pth... [2023-03-10 21:38:44,790][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000140520_71946240.pth [2023-03-10 21:38:46,852][1096443] Updated weights for policy 0, policy_version 141280 (0.0005) [2023-03-10 21:38:49,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 72364032. Throughput: 0: 12068.5. Samples: 72362004. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:49,742][1096160] Avg episode reward: [(0, '4861.161')] [2023-03-10 21:38:50,450][1096443] Updated weights for policy 0, policy_version 141360 (0.0005) [2023-03-10 21:38:53,969][1096443] Updated weights for policy 0, policy_version 141440 (0.0005) [2023-03-10 21:38:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11982.5). Total num frames: 72425472. Throughput: 0: 12016.4. Samples: 72396800. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:54,742][1096160] Avg episode reward: [(0, '4864.979')] [2023-03-10 21:38:57,256][1096443] Updated weights for policy 0, policy_version 141520 (0.0004) [2023-03-10 21:38:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 72486912. Throughput: 0: 12018.3. Samples: 72470528. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:38:59,742][1096160] Avg episode reward: [(0, '4861.885')] [2023-03-10 21:38:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000141576_72486912.pth... [2023-03-10 21:38:59,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000140880_72130560.pth [2023-03-10 21:39:00,749][1096443] Updated weights for policy 0, policy_version 141600 (0.0004) [2023-03-10 21:39:04,016][1096443] Updated weights for policy 0, policy_version 141680 (0.0004) [2023-03-10 21:39:04,742][1096160] Fps is (10 sec: 12287.4, 60 sec: 12083.1, 300 sec: 11996.4). Total num frames: 72548352. Throughput: 0: 12042.8. Samples: 72543300. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:39:04,743][1096160] Avg episode reward: [(0, '4865.169')] [2023-03-10 21:39:04,743][1096399] Saving new best policy, reward=4865.169! [2023-03-10 21:39:07,619][1096443] Updated weights for policy 0, policy_version 141760 (0.0005) [2023-03-10 21:39:09,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 72601600. Throughput: 0: 11920.9. Samples: 72576952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 21:39:09,742][1096160] Avg episode reward: [(0, '4858.343')] [2023-03-10 21:39:11,156][1096443] Updated weights for policy 0, policy_version 141840 (0.0005) [2023-03-10 21:39:14,607][1096443] Updated weights for policy 0, policy_version 141920 (0.0005) [2023-03-10 21:39:14,742][1096160] Fps is (10 sec: 11469.3, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 72663040. Throughput: 0: 11824.9. Samples: 72646728. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:14,742][1096160] Avg episode reward: [(0, '4859.672')] [2023-03-10 21:39:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000141920_72663040.pth... [2023-03-10 21:39:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000141232_72310784.pth [2023-03-10 21:39:18,068][1096443] Updated weights for policy 0, policy_version 142000 (0.0005) [2023-03-10 21:39:19,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12015.0, 300 sec: 11968.7). Total num frames: 72724480. Throughput: 0: 11777.3. Samples: 72717316. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:19,742][1096160] Avg episode reward: [(0, '4859.293')] [2023-03-10 21:39:21,517][1096443] Updated weights for policy 0, policy_version 142080 (0.0005) [2023-03-10 21:39:24,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11954.8). Total num frames: 72781824. Throughput: 0: 11843.4. Samples: 72755808. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:24,742][1096160] Avg episode reward: [(0, '4857.156')] [2023-03-10 21:39:24,812][1096443] Updated weights for policy 0, policy_version 142160 (0.0004) [2023-03-10 21:39:28,002][1096443] Updated weights for policy 0, policy_version 142240 (0.0004) [2023-03-10 21:39:29,742][1096160] Fps is (10 sec: 12287.7, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 72847360. Throughput: 0: 11911.5. Samples: 72830168. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:29,742][1096160] Avg episode reward: [(0, '4853.717')] [2023-03-10 21:39:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000142280_72847360.pth... [2023-03-10 21:39:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000141576_72486912.pth [2023-03-10 21:39:31,318][1096443] Updated weights for policy 0, policy_version 142320 (0.0005) [2023-03-10 21:39:34,693][1096443] Updated weights for policy 0, policy_version 142400 (0.0005) [2023-03-10 21:39:34,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 72908800. Throughput: 0: 11993.3. Samples: 72901704. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:34,742][1096160] Avg episode reward: [(0, '4854.021')] [2023-03-10 21:39:38,158][1096443] Updated weights for policy 0, policy_version 142480 (0.0004) [2023-03-10 21:39:39,742][1096160] Fps is (10 sec: 11878.6, 60 sec: 11878.4, 300 sec: 11968.6). Total num frames: 72966144. Throughput: 0: 12067.1. Samples: 72939820. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:39,742][1096160] Avg episode reward: [(0, '4852.772')] [2023-03-10 21:39:41,660][1096443] Updated weights for policy 0, policy_version 142560 (0.0005) [2023-03-10 21:39:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 73023488. Throughput: 0: 11986.5. Samples: 73009920. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:44,742][1096160] Avg episode reward: [(0, '4861.254')] [2023-03-10 21:39:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000142624_73023488.pth... [2023-03-10 21:39:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000141920_72663040.pth [2023-03-10 21:39:45,173][1096443] Updated weights for policy 0, policy_version 142640 (0.0005) [2023-03-10 21:39:48,444][1096443] Updated weights for policy 0, policy_version 142720 (0.0005) [2023-03-10 21:39:49,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 73084928. Throughput: 0: 11987.6. Samples: 73082736. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:49,742][1096160] Avg episode reward: [(0, '4857.371')] [2023-03-10 21:39:51,791][1096443] Updated weights for policy 0, policy_version 142800 (0.0004) [2023-03-10 21:39:54,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 73146368. Throughput: 0: 12026.9. Samples: 73118164. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:54,742][1096160] Avg episode reward: [(0, '4856.964')] [2023-03-10 21:39:55,062][1096443] Updated weights for policy 0, policy_version 142880 (0.0005) [2023-03-10 21:39:58,480][1096443] Updated weights for policy 0, policy_version 142960 (0.0004) [2023-03-10 21:39:59,742][1096160] Fps is (10 sec: 12697.4, 60 sec: 12083.2, 300 sec: 11968.6). Total num frames: 73211904. Throughput: 0: 12107.7. Samples: 73191576. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:39:59,742][1096160] Avg episode reward: [(0, '4859.758')] [2023-03-10 21:39:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000142992_73211904.pth... [2023-03-10 21:39:59,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000142280_72847360.pth [2023-03-10 21:40:01,834][1096443] Updated weights for policy 0, policy_version 143040 (0.0005) [2023-03-10 21:40:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12015.0, 300 sec: 11968.6). Total num frames: 73269248. Throughput: 0: 12164.3. Samples: 73264712. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:40:04,742][1096160] Avg episode reward: [(0, '4858.685')] [2023-03-10 21:40:05,303][1096443] Updated weights for policy 0, policy_version 143120 (0.0005) [2023-03-10 21:40:08,765][1096443] Updated weights for policy 0, policy_version 143200 (0.0005) [2023-03-10 21:40:09,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 12083.2, 300 sec: 11954.8). Total num frames: 73326592. Throughput: 0: 12103.2. Samples: 73300452. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:40:09,742][1096160] Avg episode reward: [(0, '4859.278')] [2023-03-10 21:40:12,306][1096443] Updated weights for policy 0, policy_version 143280 (0.0005) [2023-03-10 21:40:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11982.5). Total num frames: 73388032. Throughput: 0: 12015.6. Samples: 73370872. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:40:14,742][1096160] Avg episode reward: [(0, '4859.095')] [2023-03-10 21:40:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000143336_73388032.pth... [2023-03-10 21:40:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000142624_73023488.pth [2023-03-10 21:40:15,640][1096443] Updated weights for policy 0, policy_version 143360 (0.0005) [2023-03-10 21:40:19,090][1096443] Updated weights for policy 0, policy_version 143440 (0.0005) [2023-03-10 21:40:19,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 73449472. Throughput: 0: 12027.6. Samples: 73442944. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:40:19,742][1096160] Avg episode reward: [(0, '4859.479')] [2023-03-10 21:40:22,532][1096443] Updated weights for policy 0, policy_version 143520 (0.0005) [2023-03-10 21:40:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12083.2, 300 sec: 11996.4). Total num frames: 73506816. Throughput: 0: 11970.9. Samples: 73478512. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:40:24,742][1096160] Avg episode reward: [(0, '4858.629')] [2023-03-10 21:40:26,058][1096443] Updated weights for policy 0, policy_version 143600 (0.0006) [2023-03-10 21:40:29,695][1096443] Updated weights for policy 0, policy_version 143680 (0.0005) [2023-03-10 21:40:29,741][1096160] Fps is (10 sec: 11468.7, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 73564160. Throughput: 0: 11950.0. Samples: 73547668. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:40:29,742][1096160] Avg episode reward: [(0, '4857.270')] [2023-03-10 21:40:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000143680_73564160.pth... [2023-03-10 21:40:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000142992_73211904.pth [2023-03-10 21:40:33,085][1096443] Updated weights for policy 0, policy_version 143760 (0.0005) [2023-03-10 21:40:34,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11878.4, 300 sec: 11982.5). Total num frames: 73621504. Throughput: 0: 11890.5. Samples: 73617808. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:40:34,742][1096160] Avg episode reward: [(0, '4859.289')] [2023-03-10 21:40:36,626][1096443] Updated weights for policy 0, policy_version 143840 (0.0004) [2023-03-10 21:40:39,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11996.4). Total num frames: 73682944. Throughput: 0: 11907.9. Samples: 73654020. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:40:39,742][1096160] Avg episode reward: [(0, '4855.930')] [2023-03-10 21:40:39,923][1096443] Updated weights for policy 0, policy_version 143920 (0.0005) [2023-03-10 21:40:43,413][1096443] Updated weights for policy 0, policy_version 144000 (0.0005) [2023-03-10 21:40:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11982.5). Total num frames: 73740288. Throughput: 0: 11889.6. Samples: 73726608. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:40:44,742][1096160] Avg episode reward: [(0, '4855.119')] [2023-03-10 21:40:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000144024_73740288.pth... [2023-03-10 21:40:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000143336_73388032.pth [2023-03-10 21:40:46,811][1096443] Updated weights for policy 0, policy_version 144080 (0.0004) [2023-03-10 21:40:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11968.7). Total num frames: 73801728. Throughput: 0: 11842.2. Samples: 73797612. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:40:49,742][1096160] Avg episode reward: [(0, '4858.480')] [2023-03-10 21:40:50,311][1096443] Updated weights for policy 0, policy_version 144160 (0.0004) [2023-03-10 21:40:53,898][1096443] Updated weights for policy 0, policy_version 144240 (0.0005) [2023-03-10 21:40:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 73859072. Throughput: 0: 11780.7. Samples: 73830584. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:40:54,742][1096160] Avg episode reward: [(0, '4858.266')] [2023-03-10 21:40:57,273][1096443] Updated weights for policy 0, policy_version 144320 (0.0005) [2023-03-10 21:40:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11954.8). Total num frames: 73920512. Throughput: 0: 11850.1. Samples: 73904128. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:40:59,742][1096160] Avg episode reward: [(0, '4862.549')] [2023-03-10 21:40:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000144376_73920512.pth... [2023-03-10 21:40:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000143680_73564160.pth [2023-03-10 21:41:00,661][1096443] Updated weights for policy 0, policy_version 144400 (0.0004) [2023-03-10 21:41:03,857][1096443] Updated weights for policy 0, policy_version 144480 (0.0005) [2023-03-10 21:41:04,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 73981952. Throughput: 0: 11888.3. Samples: 73977920. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:41:04,742][1096160] Avg episode reward: [(0, '4858.969')] [2023-03-10 21:41:07,321][1096443] Updated weights for policy 0, policy_version 144560 (0.0005) [2023-03-10 21:41:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 74039296. Throughput: 0: 11913.4. Samples: 74014612. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:41:09,742][1096160] Avg episode reward: [(0, '4860.616')] [2023-03-10 21:41:10,920][1096443] Updated weights for policy 0, policy_version 144640 (0.0004) [2023-03-10 21:41:14,490][1096443] Updated weights for policy 0, policy_version 144720 (0.0005) [2023-03-10 21:41:14,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.2, 300 sec: 11954.8). Total num frames: 74096640. Throughput: 0: 11899.1. Samples: 74083128. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:41:14,742][1096160] Avg episode reward: [(0, '4857.864')] [2023-03-10 21:41:14,787][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000144728_74100736.pth... [2023-03-10 21:41:14,789][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000144024_73740288.pth [2023-03-10 21:41:17,748][1096443] Updated weights for policy 0, policy_version 144800 (0.0004) [2023-03-10 21:41:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11968.7). Total num frames: 74162176. Throughput: 0: 11916.8. Samples: 74154064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:41:19,742][1096160] Avg episode reward: [(0, '4856.696')] [2023-03-10 21:41:21,137][1096443] Updated weights for policy 0, policy_version 144880 (0.0005) [2023-03-10 21:41:24,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11940.9). Total num frames: 74215424. Throughput: 0: 11929.7. Samples: 74190856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:41:24,742][1096160] Avg episode reward: [(0, '4859.022')] [2023-03-10 21:41:24,775][1096443] Updated weights for policy 0, policy_version 144960 (0.0005) [2023-03-10 21:41:28,317][1096443] Updated weights for policy 0, policy_version 145040 (0.0004) [2023-03-10 21:41:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11954.8). Total num frames: 74276864. Throughput: 0: 11853.5. Samples: 74260016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:41:29,742][1096160] Avg episode reward: [(0, '4858.138')] [2023-03-10 21:41:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000145072_74276864.pth... [2023-03-10 21:41:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000144376_73920512.pth [2023-03-10 21:41:31,725][1096443] Updated weights for policy 0, policy_version 145120 (0.0004) [2023-03-10 21:41:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11940.9). Total num frames: 74334208. Throughput: 0: 11836.3. Samples: 74330244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:41:34,742][1096160] Avg episode reward: [(0, '4856.620')] [2023-03-10 21:41:35,239][1096443] Updated weights for policy 0, policy_version 145200 (0.0004) [2023-03-10 21:41:38,730][1096443] Updated weights for policy 0, policy_version 145280 (0.0004) [2023-03-10 21:41:39,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 74391552. Throughput: 0: 11913.2. Samples: 74366680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:41:39,742][1096160] Avg episode reward: [(0, '4857.442')] [2023-03-10 21:41:42,218][1096443] Updated weights for policy 0, policy_version 145360 (0.0005) [2023-03-10 21:41:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 74452992. Throughput: 0: 11829.2. Samples: 74436440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:41:44,742][1096160] Avg episode reward: [(0, '4859.220')] [2023-03-10 21:41:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000145416_74452992.pth... [2023-03-10 21:41:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000144728_74100736.pth [2023-03-10 21:41:45,761][1096443] Updated weights for policy 0, policy_version 145440 (0.0005) [2023-03-10 21:41:49,099][1096443] Updated weights for policy 0, policy_version 145520 (0.0005) [2023-03-10 21:41:49,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 74510336. Throughput: 0: 11770.9. Samples: 74507612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:41:49,742][1096160] Avg episode reward: [(0, '4861.378')] [2023-03-10 21:41:52,598][1096443] Updated weights for policy 0, policy_version 145600 (0.0005) [2023-03-10 21:41:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 74571776. Throughput: 0: 11744.4. Samples: 74543112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:41:54,743][1096160] Avg episode reward: [(0, '4861.360')] [2023-03-10 21:41:56,056][1096443] Updated weights for policy 0, policy_version 145680 (0.0005) [2023-03-10 21:41:59,484][1096443] Updated weights for policy 0, policy_version 145760 (0.0005) [2023-03-10 21:41:59,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.2, 300 sec: 11885.3). Total num frames: 74629120. Throughput: 0: 11770.7. Samples: 74612808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:41:59,742][1096160] Avg episode reward: [(0, '4856.748')] [2023-03-10 21:41:59,766][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000145768_74633216.pth... [2023-03-10 21:41:59,767][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000145072_74276864.pth [2023-03-10 21:42:02,893][1096443] Updated weights for policy 0, policy_version 145840 (0.0004) [2023-03-10 21:42:04,742][1096160] Fps is (10 sec: 11878.6, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 74690560. Throughput: 0: 11827.8. Samples: 74686316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:42:04,742][1096160] Avg episode reward: [(0, '4856.594')] [2023-03-10 21:42:06,269][1096443] Updated weights for policy 0, policy_version 145920 (0.0005) [2023-03-10 21:42:09,369][1096443] Updated weights for policy 0, policy_version 146000 (0.0005) [2023-03-10 21:42:09,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11899.2). Total num frames: 74752000. Throughput: 0: 11834.2. Samples: 74723392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:42:09,742][1096160] Avg episode reward: [(0, '4858.837')] [2023-03-10 21:42:12,848][1096443] Updated weights for policy 0, policy_version 146080 (0.0005) [2023-03-10 21:42:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11946.7, 300 sec: 11927.0). Total num frames: 74813440. Throughput: 0: 11934.4. Samples: 74797064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:42:14,742][1096160] Avg episode reward: [(0, '4859.841')] [2023-03-10 21:42:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000146120_74813440.pth... [2023-03-10 21:42:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000145416_74452992.pth [2023-03-10 21:42:16,493][1096443] Updated weights for policy 0, policy_version 146160 (0.0005) [2023-03-10 21:42:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11927.0). Total num frames: 74870784. Throughput: 0: 11914.1. Samples: 74866380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:42:19,742][1096160] Avg episode reward: [(0, '4862.466')] [2023-03-10 21:42:20,044][1096443] Updated weights for policy 0, policy_version 146240 (0.0006) [2023-03-10 21:42:23,329][1096443] Updated weights for policy 0, policy_version 146320 (0.0005) [2023-03-10 21:42:24,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11940.9). Total num frames: 74932224. Throughput: 0: 11908.3. Samples: 74902552. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:42:24,742][1096160] Avg episode reward: [(0, '4854.331')] [2023-03-10 21:42:26,457][1096443] Updated weights for policy 0, policy_version 146400 (0.0005) [2023-03-10 21:42:29,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 74993664. Throughput: 0: 12020.3. Samples: 74977352. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:42:29,742][1096160] Avg episode reward: [(0, '4858.168')] [2023-03-10 21:42:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000146472_74993664.pth... [2023-03-10 21:42:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000145768_74633216.pth [2023-03-10 21:42:30,002][1096443] Updated weights for policy 0, policy_version 146480 (0.0005) [2023-03-10 21:42:33,501][1096443] Updated weights for policy 0, policy_version 146560 (0.0005) [2023-03-10 21:42:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 75051008. Throughput: 0: 11985.8. Samples: 75046976. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:42:34,742][1096160] Avg episode reward: [(0, '4855.368')] [2023-03-10 21:42:37,074][1096443] Updated weights for policy 0, policy_version 146640 (0.0004) [2023-03-10 21:42:39,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 75108352. Throughput: 0: 11966.7. Samples: 75081612. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:42:39,742][1096160] Avg episode reward: [(0, '4855.960')] [2023-03-10 21:42:40,575][1096443] Updated weights for policy 0, policy_version 146720 (0.0004) [2023-03-10 21:42:44,094][1096443] Updated weights for policy 0, policy_version 146800 (0.0005) [2023-03-10 21:42:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11968.6). Total num frames: 75169792. Throughput: 0: 11972.3. Samples: 75151564. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:42:44,742][1096160] Avg episode reward: [(0, '4856.404')] [2023-03-10 21:42:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000146816_75169792.pth... [2023-03-10 21:42:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000146120_74813440.pth [2023-03-10 21:42:47,336][1096443] Updated weights for policy 0, policy_version 146880 (0.0005) [2023-03-10 21:42:49,741][1096160] Fps is (10 sec: 12288.2, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 75231232. Throughput: 0: 12013.1. Samples: 75226904. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:42:49,742][1096160] Avg episode reward: [(0, '4855.132')] [2023-03-10 21:42:50,630][1096443] Updated weights for policy 0, policy_version 146960 (0.0004) [2023-03-10 21:42:53,974][1096443] Updated weights for policy 0, policy_version 147040 (0.0004) [2023-03-10 21:42:54,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12015.0, 300 sec: 11968.7). Total num frames: 75292672. Throughput: 0: 12010.1. Samples: 75263848. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:42:54,742][1096160] Avg episode reward: [(0, '4860.575')] [2023-03-10 21:42:57,193][1096443] Updated weights for policy 0, policy_version 147120 (0.0004) [2023-03-10 21:42:59,742][1096160] Fps is (10 sec: 12287.7, 60 sec: 12083.2, 300 sec: 11968.6). Total num frames: 75354112. Throughput: 0: 12014.7. Samples: 75337728. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:42:59,742][1096160] Avg episode reward: [(0, '4856.612')] [2023-03-10 21:42:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000147176_75354112.pth... [2023-03-10 21:42:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000146472_74993664.pth [2023-03-10 21:43:00,611][1096443] Updated weights for policy 0, policy_version 147200 (0.0005) [2023-03-10 21:43:04,067][1096443] Updated weights for policy 0, policy_version 147280 (0.0005) [2023-03-10 21:43:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 75411456. Throughput: 0: 12056.9. Samples: 75408940. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:43:04,742][1096160] Avg episode reward: [(0, '4857.027')] [2023-03-10 21:43:07,524][1096443] Updated weights for policy 0, policy_version 147360 (0.0004) [2023-03-10 21:43:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11954.8). Total num frames: 75472896. Throughput: 0: 12038.6. Samples: 75444288. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:43:09,742][1096160] Avg episode reward: [(0, '4859.313')] [2023-03-10 21:43:11,133][1096443] Updated weights for policy 0, policy_version 147440 (0.0005) [2023-03-10 21:43:14,619][1096443] Updated weights for policy 0, policy_version 147520 (0.0004) [2023-03-10 21:43:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11954.8). Total num frames: 75530240. Throughput: 0: 11948.0. Samples: 75515012. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:43:14,742][1096160] Avg episode reward: [(0, '4857.293')] [2023-03-10 21:43:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000147520_75530240.pth... [2023-03-10 21:43:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000146816_75169792.pth [2023-03-10 21:43:18,161][1096443] Updated weights for policy 0, policy_version 147600 (0.0004) [2023-03-10 21:43:19,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11946.7, 300 sec: 11913.1). Total num frames: 75587584. Throughput: 0: 11917.1. Samples: 75583244. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:43:19,742][1096160] Avg episode reward: [(0, '4859.157')] [2023-03-10 21:43:21,922][1096443] Updated weights for policy 0, policy_version 147680 (0.0006) [2023-03-10 21:43:24,742][1096160] Fps is (10 sec: 11059.3, 60 sec: 11810.1, 300 sec: 11899.2). Total num frames: 75640832. Throughput: 0: 11879.7. Samples: 75616196. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 21:43:24,742][1096160] Avg episode reward: [(0, '4859.451')] [2023-03-10 21:43:25,660][1096443] Updated weights for policy 0, policy_version 147760 (0.0005) [2023-03-10 21:43:29,311][1096443] Updated weights for policy 0, policy_version 147840 (0.0005) [2023-03-10 21:43:29,742][1096160] Fps is (10 sec: 11059.3, 60 sec: 11741.9, 300 sec: 11885.3). Total num frames: 75698176. Throughput: 0: 11783.0. Samples: 75681800. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:43:29,742][1096160] Avg episode reward: [(0, '4860.174')] [2023-03-10 21:43:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000147848_75698176.pth... [2023-03-10 21:43:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000147176_75354112.pth [2023-03-10 21:43:32,833][1096443] Updated weights for policy 0, policy_version 147920 (0.0005) [2023-03-10 21:43:34,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 11871.5). Total num frames: 75755520. Throughput: 0: 11653.7. Samples: 75751324. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:43:34,742][1096160] Avg episode reward: [(0, '4859.954')] [2023-03-10 21:43:36,292][1096443] Updated weights for policy 0, policy_version 148000 (0.0005) [2023-03-10 21:43:39,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 11885.3). Total num frames: 75812864. Throughput: 0: 11633.7. Samples: 75787364. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:43:39,742][1096160] Avg episode reward: [(0, '4858.204')] [2023-03-10 21:43:39,894][1096443] Updated weights for policy 0, policy_version 148080 (0.0005) [2023-03-10 21:43:43,504][1096443] Updated weights for policy 0, policy_version 148160 (0.0005) [2023-03-10 21:43:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11885.3). Total num frames: 75870208. Throughput: 0: 11508.3. Samples: 75855600. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:43:44,742][1096160] Avg episode reward: [(0, '4856.674')] [2023-03-10 21:43:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000148184_75870208.pth... [2023-03-10 21:43:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000147520_75530240.pth [2023-03-10 21:43:47,111][1096443] Updated weights for policy 0, policy_version 148240 (0.0005) [2023-03-10 21:43:49,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11605.3, 300 sec: 11871.5). Total num frames: 75927552. Throughput: 0: 11444.8. Samples: 75923956. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:43:49,742][1096160] Avg episode reward: [(0, '4856.546')] [2023-03-10 21:43:50,492][1096443] Updated weights for policy 0, policy_version 148320 (0.0005) [2023-03-10 21:43:53,874][1096443] Updated weights for policy 0, policy_version 148400 (0.0004) [2023-03-10 21:43:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11871.5). Total num frames: 75988992. Throughput: 0: 11468.8. Samples: 75960384. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:43:54,742][1096160] Avg episode reward: [(0, '4857.119')] [2023-03-10 21:43:57,243][1096443] Updated weights for policy 0, policy_version 148480 (0.0005) [2023-03-10 21:43:59,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11605.3, 300 sec: 11871.5). Total num frames: 76050432. Throughput: 0: 11522.0. Samples: 76033504. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:43:59,742][1096160] Avg episode reward: [(0, '4855.461')] [2023-03-10 21:43:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000148536_76050432.pth... [2023-03-10 21:43:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000147848_75698176.pth [2023-03-10 21:44:00,855][1096443] Updated weights for policy 0, policy_version 148560 (0.0005) [2023-03-10 21:44:04,359][1096443] Updated weights for policy 0, policy_version 148640 (0.0005) [2023-03-10 21:44:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11885.3). Total num frames: 76107776. Throughput: 0: 11547.6. Samples: 76102888. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:44:04,742][1096160] Avg episode reward: [(0, '4858.927')] [2023-03-10 21:44:08,022][1096443] Updated weights for policy 0, policy_version 148720 (0.0005) [2023-03-10 21:44:09,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11871.5). Total num frames: 76165120. Throughput: 0: 11556.5. Samples: 76136240. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:44:09,742][1096160] Avg episode reward: [(0, '4855.777')] [2023-03-10 21:44:11,655][1096443] Updated weights for policy 0, policy_version 148800 (0.0005) [2023-03-10 21:44:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11857.6). Total num frames: 76222464. Throughput: 0: 11632.9. Samples: 76205280. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:44:14,742][1096160] Avg episode reward: [(0, '4859.420')] [2023-03-10 21:44:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000148872_76222464.pth... [2023-03-10 21:44:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000148184_75870208.pth [2023-03-10 21:44:15,014][1096443] Updated weights for policy 0, policy_version 148880 (0.0004) [2023-03-10 21:44:18,425][1096443] Updated weights for policy 0, policy_version 148960 (0.0004) [2023-03-10 21:44:19,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11857.6). Total num frames: 76279808. Throughput: 0: 11678.9. Samples: 76276876. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:44:19,742][1096160] Avg episode reward: [(0, '4860.927')] [2023-03-10 21:44:21,849][1096443] Updated weights for policy 0, policy_version 149040 (0.0005) [2023-03-10 21:44:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 76341248. Throughput: 0: 11672.8. Samples: 76312640. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:44:24,742][1096160] Avg episode reward: [(0, '4859.950')] [2023-03-10 21:44:25,417][1096443] Updated weights for policy 0, policy_version 149120 (0.0005) [2023-03-10 21:44:28,886][1096443] Updated weights for policy 0, policy_version 149200 (0.0005) [2023-03-10 21:44:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 76398592. Throughput: 0: 11703.8. Samples: 76382272. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:44:29,742][1096160] Avg episode reward: [(0, '4860.700')] [2023-03-10 21:44:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000149216_76398592.pth... [2023-03-10 21:44:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000148536_76050432.pth [2023-03-10 21:44:32,467][1096443] Updated weights for policy 0, policy_version 149280 (0.0005) [2023-03-10 21:44:34,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 76455936. Throughput: 0: 11740.0. Samples: 76452256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:44:34,742][1096160] Avg episode reward: [(0, '4859.366')] [2023-03-10 21:44:36,051][1096443] Updated weights for policy 0, policy_version 149360 (0.0006) [2023-03-10 21:44:39,634][1096443] Updated weights for policy 0, policy_version 149440 (0.0005) [2023-03-10 21:44:39,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 76513280. Throughput: 0: 11655.6. Samples: 76484888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:44:39,742][1096160] Avg episode reward: [(0, '4860.851')] [2023-03-10 21:44:43,088][1096443] Updated weights for policy 0, policy_version 149520 (0.0005) [2023-03-10 21:44:44,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 76570624. Throughput: 0: 11603.8. Samples: 76555676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:44:44,742][1096160] Avg episode reward: [(0, '4859.206')] [2023-03-10 21:44:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000149552_76570624.pth... [2023-03-10 21:44:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000148872_76222464.pth [2023-03-10 21:44:46,621][1096443] Updated weights for policy 0, policy_version 149600 (0.0005) [2023-03-10 21:44:49,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 76627968. Throughput: 0: 11588.9. Samples: 76624388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:44:49,742][1096160] Avg episode reward: [(0, '4861.102')] [2023-03-10 21:44:50,075][1096443] Updated weights for policy 0, policy_version 149680 (0.0005) [2023-03-10 21:44:53,597][1096443] Updated weights for policy 0, policy_version 149760 (0.0005) [2023-03-10 21:44:54,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11788.2). Total num frames: 76689408. Throughput: 0: 11663.1. Samples: 76661080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:44:54,742][1096160] Avg episode reward: [(0, '4858.587')] [2023-03-10 21:44:56,916][1096443] Updated weights for policy 0, policy_version 149840 (0.0005) [2023-03-10 21:44:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 76750848. Throughput: 0: 11759.6. Samples: 76734464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:44:59,742][1096160] Avg episode reward: [(0, '4859.766')] [2023-03-10 21:44:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000149904_76750848.pth... [2023-03-10 21:44:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000149216_76398592.pth [2023-03-10 21:45:00,350][1096443] Updated weights for policy 0, policy_version 149920 (0.0005) [2023-03-10 21:45:03,672][1096443] Updated weights for policy 0, policy_version 150000 (0.0005) [2023-03-10 21:45:04,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 76812288. Throughput: 0: 11799.7. Samples: 76807864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:45:04,742][1096160] Avg episode reward: [(0, '4860.421')] [2023-03-10 21:45:07,219][1096443] Updated weights for policy 0, policy_version 150080 (0.0005) [2023-03-10 21:45:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 76869632. Throughput: 0: 11741.9. Samples: 76841024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:45:09,742][1096160] Avg episode reward: [(0, '4861.446')] [2023-03-10 21:45:10,701][1096443] Updated weights for policy 0, policy_version 150160 (0.0005) [2023-03-10 21:45:14,127][1096443] Updated weights for policy 0, policy_version 150240 (0.0004) [2023-03-10 21:45:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 76926976. Throughput: 0: 11789.0. Samples: 76912776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:45:14,742][1096160] Avg episode reward: [(0, '4857.012')] [2023-03-10 21:45:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000150248_76926976.pth... [2023-03-10 21:45:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000149552_76570624.pth [2023-03-10 21:45:17,674][1096443] Updated weights for policy 0, policy_version 150320 (0.0005) [2023-03-10 21:45:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 76988416. Throughput: 0: 11766.5. Samples: 76981748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:45:19,742][1096160] Avg episode reward: [(0, '4854.128')] [2023-03-10 21:45:21,031][1096443] Updated weights for policy 0, policy_version 150400 (0.0005) [2023-03-10 21:45:24,692][1096443] Updated weights for policy 0, policy_version 150480 (0.0005) [2023-03-10 21:45:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 77045760. Throughput: 0: 11837.1. Samples: 77017556. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:45:24,742][1096160] Avg episode reward: [(0, '4859.928')] [2023-03-10 21:45:28,368][1096443] Updated weights for policy 0, policy_version 150560 (0.0006) [2023-03-10 21:45:29,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11673.6, 300 sec: 11788.1). Total num frames: 77099008. Throughput: 0: 11784.2. Samples: 77085964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:45:29,742][1096160] Avg episode reward: [(0, '4860.536')] [2023-03-10 21:45:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000150584_77099008.pth... [2023-03-10 21:45:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000149904_76750848.pth [2023-03-10 21:45:31,907][1096443] Updated weights for policy 0, policy_version 150640 (0.0005) [2023-03-10 21:45:34,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11673.6, 300 sec: 11774.3). Total num frames: 77156352. Throughput: 0: 11742.7. Samples: 77152812. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:45:34,742][1096160] Avg episode reward: [(0, '4857.624')] [2023-03-10 21:45:35,532][1096443] Updated weights for policy 0, policy_version 150720 (0.0006) [2023-03-10 21:45:38,937][1096443] Updated weights for policy 0, policy_version 150800 (0.0005) [2023-03-10 21:45:39,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11673.6, 300 sec: 11774.3). Total num frames: 77213696. Throughput: 0: 11731.3. Samples: 77188988. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:45:39,742][1096160] Avg episode reward: [(0, '4855.714')] [2023-03-10 21:45:42,631][1096443] Updated weights for policy 0, policy_version 150880 (0.0005) [2023-03-10 21:45:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 77275136. Throughput: 0: 11651.2. Samples: 77258768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:45:44,742][1096160] Avg episode reward: [(0, '4861.526')] [2023-03-10 21:45:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000150928_77275136.pth... [2023-03-10 21:45:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000150248_76926976.pth [2023-03-10 21:45:45,723][1096443] Updated weights for policy 0, policy_version 150960 (0.0005) [2023-03-10 21:45:48,967][1096443] Updated weights for policy 0, policy_version 151040 (0.0005) [2023-03-10 21:45:49,742][1096160] Fps is (10 sec: 12697.4, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 77340672. Throughput: 0: 11740.8. Samples: 77336200. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:45:49,742][1096160] Avg episode reward: [(0, '4860.065')] [2023-03-10 21:45:52,468][1096443] Updated weights for policy 0, policy_version 151120 (0.0005) [2023-03-10 21:45:54,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 77398016. Throughput: 0: 11782.1. Samples: 77371216. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:45:54,742][1096160] Avg episode reward: [(0, '4856.158')] [2023-03-10 21:45:55,839][1096443] Updated weights for policy 0, policy_version 151200 (0.0005) [2023-03-10 21:45:59,323][1096443] Updated weights for policy 0, policy_version 151280 (0.0005) [2023-03-10 21:45:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 77459456. Throughput: 0: 11766.4. Samples: 77442264. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:45:59,742][1096160] Avg episode reward: [(0, '4855.213')] [2023-03-10 21:45:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000151288_77459456.pth... [2023-03-10 21:45:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000150584_77099008.pth [2023-03-10 21:46:02,690][1096443] Updated weights for policy 0, policy_version 151360 (0.0006) [2023-03-10 21:46:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 77516800. Throughput: 0: 11842.1. Samples: 77514644. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:04,742][1096160] Avg episode reward: [(0, '4850.837')] [2023-03-10 21:46:06,155][1096443] Updated weights for policy 0, policy_version 151440 (0.0005) [2023-03-10 21:46:09,530][1096443] Updated weights for policy 0, policy_version 151520 (0.0004) [2023-03-10 21:46:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 77578240. Throughput: 0: 11822.7. Samples: 77549576. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:09,742][1096160] Avg episode reward: [(0, '4855.724')] [2023-03-10 21:46:12,828][1096443] Updated weights for policy 0, policy_version 151600 (0.0005) [2023-03-10 21:46:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 77639680. Throughput: 0: 11953.1. Samples: 77623856. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:14,742][1096160] Avg episode reward: [(0, '4856.345')] [2023-03-10 21:46:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000151640_77639680.pth... [2023-03-10 21:46:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000150928_77275136.pth [2023-03-10 21:46:16,270][1096443] Updated weights for policy 0, policy_version 151680 (0.0005) [2023-03-10 21:46:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 77697024. Throughput: 0: 12004.2. Samples: 77693000. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:19,742][1096160] Avg episode reward: [(0, '4854.546')] [2023-03-10 21:46:19,903][1096443] Updated weights for policy 0, policy_version 151760 (0.0005) [2023-03-10 21:46:23,492][1096443] Updated weights for policy 0, policy_version 151840 (0.0005) [2023-03-10 21:46:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 77754368. Throughput: 0: 11942.7. Samples: 77726412. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:24,742][1096160] Avg episode reward: [(0, '4858.714')] [2023-03-10 21:46:27,215][1096443] Updated weights for policy 0, policy_version 151920 (0.0005) [2023-03-10 21:46:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 77811712. Throughput: 0: 11918.3. Samples: 77795092. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:29,742][1096160] Avg episode reward: [(0, '4855.475')] [2023-03-10 21:46:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000151976_77811712.pth... [2023-03-10 21:46:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000151288_77459456.pth [2023-03-10 21:46:30,626][1096443] Updated weights for policy 0, policy_version 152000 (0.0005) [2023-03-10 21:46:34,017][1096443] Updated weights for policy 0, policy_version 152080 (0.0005) [2023-03-10 21:46:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 77873152. Throughput: 0: 11814.9. Samples: 77867872. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:34,742][1096160] Avg episode reward: [(0, '4853.838')] [2023-03-10 21:46:37,595][1096443] Updated weights for policy 0, policy_version 152160 (0.0006) [2023-03-10 21:46:39,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11774.3). Total num frames: 77926400. Throughput: 0: 11791.3. Samples: 77901824. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:39,742][1096160] Avg episode reward: [(0, '4856.118')] [2023-03-10 21:46:41,242][1096443] Updated weights for policy 0, policy_version 152240 (0.0006) [2023-03-10 21:46:44,575][1096443] Updated weights for policy 0, policy_version 152320 (0.0004) [2023-03-10 21:46:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 77987840. Throughput: 0: 11758.0. Samples: 77971376. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:44,742][1096160] Avg episode reward: [(0, '4851.749')] [2023-03-10 21:46:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000152320_77987840.pth... [2023-03-10 21:46:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000151640_77639680.pth [2023-03-10 21:46:47,918][1096443] Updated weights for policy 0, policy_version 152400 (0.0005) [2023-03-10 21:46:49,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 78049280. Throughput: 0: 11781.8. Samples: 78044824. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:49,742][1096160] Avg episode reward: [(0, '4855.497')] [2023-03-10 21:46:51,447][1096443] Updated weights for policy 0, policy_version 152480 (0.0005) [2023-03-10 21:46:54,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 78106624. Throughput: 0: 11768.8. Samples: 78079172. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:54,742][1096160] Avg episode reward: [(0, '4855.676')] [2023-03-10 21:46:54,788][1096443] Updated weights for policy 0, policy_version 152560 (0.0005) [2023-03-10 21:46:58,244][1096443] Updated weights for policy 0, policy_version 152640 (0.0005) [2023-03-10 21:46:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 78168064. Throughput: 0: 11729.4. Samples: 78151680. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:46:59,742][1096160] Avg episode reward: [(0, '4856.595')] [2023-03-10 21:46:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000152672_78168064.pth... [2023-03-10 21:46:59,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000151976_77811712.pth [2023-03-10 21:47:01,744][1096443] Updated weights for policy 0, policy_version 152720 (0.0005) [2023-03-10 21:47:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 78225408. Throughput: 0: 11740.5. Samples: 78221320. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:47:04,742][1096160] Avg episode reward: [(0, '4858.439')] [2023-03-10 21:47:05,221][1096443] Updated weights for policy 0, policy_version 152800 (0.0005) [2023-03-10 21:47:08,659][1096443] Updated weights for policy 0, policy_version 152880 (0.0005) [2023-03-10 21:47:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 78286848. Throughput: 0: 11802.5. Samples: 78257524. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:47:09,742][1096160] Avg episode reward: [(0, '4856.848')] [2023-03-10 21:47:12,108][1096443] Updated weights for policy 0, policy_version 152960 (0.0005) [2023-03-10 21:47:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 78344192. Throughput: 0: 11849.4. Samples: 78328312. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:47:14,742][1096160] Avg episode reward: [(0, '4854.897')] [2023-03-10 21:47:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000153016_78344192.pth... [2023-03-10 21:47:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000152320_77987840.pth [2023-03-10 21:47:15,580][1096443] Updated weights for policy 0, policy_version 153040 (0.0005) [2023-03-10 21:47:19,038][1096443] Updated weights for policy 0, policy_version 153120 (0.0005) [2023-03-10 21:47:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 78405632. Throughput: 0: 11821.4. Samples: 78399836. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:47:19,742][1096160] Avg episode reward: [(0, '4856.660')] [2023-03-10 21:47:22,056][1096443] Updated weights for policy 0, policy_version 153200 (0.0005) [2023-03-10 21:47:24,741][1096160] Fps is (10 sec: 12697.6, 60 sec: 11946.7, 300 sec: 11788.2). Total num frames: 78471168. Throughput: 0: 11968.4. Samples: 78440404. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:47:24,742][1096160] Avg episode reward: [(0, '4858.646')] [2023-03-10 21:47:25,436][1096443] Updated weights for policy 0, policy_version 153280 (0.0005) [2023-03-10 21:47:29,024][1096443] Updated weights for policy 0, policy_version 153360 (0.0005) [2023-03-10 21:47:29,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11774.3). Total num frames: 78524416. Throughput: 0: 12015.6. Samples: 78512076. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:47:29,742][1096160] Avg episode reward: [(0, '4856.090')] [2023-03-10 21:47:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000153368_78524416.pth... [2023-03-10 21:47:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000152672_78168064.pth [2023-03-10 21:47:32,523][1096443] Updated weights for policy 0, policy_version 153440 (0.0005) [2023-03-10 21:47:34,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11788.1). Total num frames: 78585856. Throughput: 0: 11932.1. Samples: 78581768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:47:34,742][1096160] Avg episode reward: [(0, '4856.850')] [2023-03-10 21:47:36,042][1096443] Updated weights for policy 0, policy_version 153520 (0.0005) [2023-03-10 21:47:39,362][1096443] Updated weights for policy 0, policy_version 153600 (0.0005) [2023-03-10 21:47:39,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11788.2). Total num frames: 78647296. Throughput: 0: 11929.8. Samples: 78616012. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:47:39,742][1096160] Avg episode reward: [(0, '4860.448')] [2023-03-10 21:47:42,978][1096443] Updated weights for policy 0, policy_version 153680 (0.0005) [2023-03-10 21:47:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 78704640. Throughput: 0: 11917.4. Samples: 78687964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:47:44,742][1096160] Avg episode reward: [(0, '4857.195')] [2023-03-10 21:47:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000153720_78704640.pth... [2023-03-10 21:47:44,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000153016_78344192.pth [2023-03-10 21:47:46,434][1096443] Updated weights for policy 0, policy_version 153760 (0.0005) [2023-03-10 21:47:49,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 78761984. Throughput: 0: 11925.1. Samples: 78757952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:47:49,742][1096160] Avg episode reward: [(0, '4860.761')] [2023-03-10 21:47:49,950][1096443] Updated weights for policy 0, policy_version 153840 (0.0005) [2023-03-10 21:47:53,581][1096443] Updated weights for policy 0, policy_version 153920 (0.0006) [2023-03-10 21:47:54,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11746.5). Total num frames: 78819328. Throughput: 0: 11859.8. Samples: 78791216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:47:54,742][1096160] Avg episode reward: [(0, '4862.994')] [2023-03-10 21:47:57,065][1096443] Updated weights for policy 0, policy_version 154000 (0.0004) [2023-03-10 21:47:59,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11810.2, 300 sec: 11746.5). Total num frames: 78876672. Throughput: 0: 11867.7. Samples: 78862356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:47:59,742][1096160] Avg episode reward: [(0, '4860.045')] [2023-03-10 21:47:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000154056_78876672.pth... [2023-03-10 21:47:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000153368_78524416.pth [2023-03-10 21:48:00,488][1096443] Updated weights for policy 0, policy_version 154080 (0.0005) [2023-03-10 21:48:03,936][1096443] Updated weights for policy 0, policy_version 154160 (0.0005) [2023-03-10 21:48:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11746.5). Total num frames: 78938112. Throughput: 0: 11867.9. Samples: 78933892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:48:04,742][1096160] Avg episode reward: [(0, '4860.063')] [2023-03-10 21:48:07,658][1096443] Updated weights for policy 0, policy_version 154240 (0.0005) [2023-03-10 21:48:09,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 11732.6). Total num frames: 78991360. Throughput: 0: 11683.2. Samples: 78966148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:48:09,742][1096160] Avg episode reward: [(0, '4863.303')] [2023-03-10 21:48:11,344][1096443] Updated weights for policy 0, policy_version 154320 (0.0005) [2023-03-10 21:48:14,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11741.8, 300 sec: 11732.6). Total num frames: 79048704. Throughput: 0: 11562.4. Samples: 79032384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:48:14,742][1096160] Avg episode reward: [(0, '4858.482')] [2023-03-10 21:48:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000154392_79048704.pth... [2023-03-10 21:48:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000153720_78704640.pth [2023-03-10 21:48:14,940][1096443] Updated weights for policy 0, policy_version 154400 (0.0005) [2023-03-10 21:48:18,718][1096443] Updated weights for policy 0, policy_version 154480 (0.0005) [2023-03-10 21:48:19,741][1096160] Fps is (10 sec: 11059.3, 60 sec: 11605.3, 300 sec: 11732.6). Total num frames: 79101952. Throughput: 0: 11517.4. Samples: 79100052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:48:19,742][1096160] Avg episode reward: [(0, '4858.194')] [2023-03-10 21:48:22,339][1096443] Updated weights for policy 0, policy_version 154560 (0.0005) [2023-03-10 21:48:24,741][1096160] Fps is (10 sec: 11059.3, 60 sec: 11468.8, 300 sec: 11732.6). Total num frames: 79159296. Throughput: 0: 11509.1. Samples: 79133920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:48:24,742][1096160] Avg episode reward: [(0, '4858.345')] [2023-03-10 21:48:25,898][1096443] Updated weights for policy 0, policy_version 154640 (0.0005) [2023-03-10 21:48:29,702][1096443] Updated weights for policy 0, policy_version 154720 (0.0005) [2023-03-10 21:48:29,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11537.1, 300 sec: 11732.6). Total num frames: 79216640. Throughput: 0: 11458.8. Samples: 79203608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:48:29,742][1096160] Avg episode reward: [(0, '4854.332')] [2023-03-10 21:48:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000154720_79216640.pth... [2023-03-10 21:48:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000154056_78876672.pth [2023-03-10 21:48:33,607][1096443] Updated weights for policy 0, policy_version 154800 (0.0004) [2023-03-10 21:48:34,742][1096160] Fps is (10 sec: 11059.1, 60 sec: 11400.5, 300 sec: 11718.7). Total num frames: 79269888. Throughput: 0: 11278.9. Samples: 79265504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:48:34,742][1096160] Avg episode reward: [(0, '4856.894')] [2023-03-10 21:48:37,259][1096443] Updated weights for policy 0, policy_version 154880 (0.0005) [2023-03-10 21:48:39,742][1096160] Fps is (10 sec: 10649.7, 60 sec: 11264.0, 300 sec: 11704.8). Total num frames: 79323136. Throughput: 0: 11258.5. Samples: 79297848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:48:39,742][1096160] Avg episode reward: [(0, '4860.938')] [2023-03-10 21:48:40,818][1096443] Updated weights for policy 0, policy_version 154960 (0.0005) [2023-03-10 21:48:44,159][1096443] Updated weights for policy 0, policy_version 155040 (0.0005) [2023-03-10 21:48:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11718.7). Total num frames: 79384576. Throughput: 0: 11242.9. Samples: 79368288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:48:44,742][1096160] Avg episode reward: [(0, '4855.918')] [2023-03-10 21:48:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000155048_79384576.pth... [2023-03-10 21:48:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000154392_79048704.pth [2023-03-10 21:48:47,556][1096443] Updated weights for policy 0, policy_version 155120 (0.0005) [2023-03-10 21:48:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11400.5, 300 sec: 11718.7). Total num frames: 79446016. Throughput: 0: 11277.7. Samples: 79441388. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:48:49,742][1096160] Avg episode reward: [(0, '4856.187')] [2023-03-10 21:48:51,024][1096443] Updated weights for policy 0, policy_version 155200 (0.0005) [2023-03-10 21:48:51,792][1096399] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000005 [2023-03-10 21:48:54,588][1096443] Updated weights for policy 0, policy_version 155280 (0.0005) [2023-03-10 21:48:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11704.8). Total num frames: 79503360. Throughput: 0: 11310.6. Samples: 79475124. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:48:54,742][1096160] Avg episode reward: [(0, '4858.279')] [2023-03-10 21:48:58,055][1096443] Updated weights for policy 0, policy_version 155360 (0.0005) [2023-03-10 21:48:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11704.8). Total num frames: 79560704. Throughput: 0: 11423.7. Samples: 79546448. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:48:59,742][1096160] Avg episode reward: [(0, '4856.061')] [2023-03-10 21:48:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000155392_79560704.pth... [2023-03-10 21:48:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000154720_79216640.pth [2023-03-10 21:49:01,473][1096443] Updated weights for policy 0, policy_version 155440 (0.0004) [2023-03-10 21:49:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11718.7). Total num frames: 79622144. Throughput: 0: 11512.6. Samples: 79618120. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:49:04,742][1096160] Avg episode reward: [(0, '4856.305')] [2023-03-10 21:49:04,818][1096443] Updated weights for policy 0, policy_version 155520 (0.0005) [2023-03-10 21:49:08,238][1096443] Updated weights for policy 0, policy_version 155600 (0.0005) [2023-03-10 21:49:09,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11537.1, 300 sec: 11732.6). Total num frames: 79683584. Throughput: 0: 11584.7. Samples: 79655232. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:49:09,742][1096160] Avg episode reward: [(0, '4852.283')] [2023-03-10 21:49:11,696][1096443] Updated weights for policy 0, policy_version 155680 (0.0004) [2023-03-10 21:49:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11605.3, 300 sec: 11746.5). Total num frames: 79745024. Throughput: 0: 11664.6. Samples: 79728516. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:49:14,742][1096160] Avg episode reward: [(0, '4857.428')] [2023-03-10 21:49:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000155752_79745024.pth... [2023-03-10 21:49:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000155048_79384576.pth [2023-03-10 21:49:15,024][1096443] Updated weights for policy 0, policy_version 155760 (0.0005) [2023-03-10 21:49:18,530][1096443] Updated weights for policy 0, policy_version 155840 (0.0005) [2023-03-10 21:49:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11732.6). Total num frames: 79802368. Throughput: 0: 11834.8. Samples: 79798068. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:49:19,742][1096160] Avg episode reward: [(0, '4854.800')] [2023-03-10 21:49:22,077][1096443] Updated weights for policy 0, policy_version 155920 (0.0005) [2023-03-10 21:49:24,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11732.6). Total num frames: 79859712. Throughput: 0: 11877.4. Samples: 79832332. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:49:24,742][1096160] Avg episode reward: [(0, '4849.220')] [2023-03-10 21:49:25,486][1096443] Updated weights for policy 0, policy_version 156000 (0.0005) [2023-03-10 21:49:29,024][1096443] Updated weights for policy 0, policy_version 156080 (0.0005) [2023-03-10 21:49:29,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11746.5). Total num frames: 79921152. Throughput: 0: 11895.0. Samples: 79903560. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:49:29,742][1096160] Avg episode reward: [(0, '4855.683')] [2023-03-10 21:49:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000156096_79921152.pth... [2023-03-10 21:49:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000155392_79560704.pth [2023-03-10 21:49:32,510][1096443] Updated weights for policy 0, policy_version 156160 (0.0005) [2023-03-10 21:49:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11746.5). Total num frames: 79978496. Throughput: 0: 11844.7. Samples: 79974400. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:49:34,742][1096160] Avg episode reward: [(0, '4856.088')] [2023-03-10 21:49:36,011][1096443] Updated weights for policy 0, policy_version 156240 (0.0005) [2023-03-10 21:49:39,589][1096443] Updated weights for policy 0, policy_version 156320 (0.0005) [2023-03-10 21:49:39,742][1096160] Fps is (10 sec: 11468.6, 60 sec: 11878.4, 300 sec: 11746.5). Total num frames: 80035840. Throughput: 0: 11860.1. Samples: 80008828. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:49:39,742][1096160] Avg episode reward: [(0, '4853.203')] [2023-03-10 21:49:42,985][1096443] Updated weights for policy 0, policy_version 156400 (0.0005) [2023-03-10 21:49:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 80097280. Throughput: 0: 11863.3. Samples: 80080296. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:49:44,742][1096160] Avg episode reward: [(0, '4850.960')] [2023-03-10 21:49:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000156440_80097280.pth... [2023-03-10 21:49:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000155752_79745024.pth [2023-03-10 21:49:46,478][1096443] Updated weights for policy 0, policy_version 156480 (0.0005) [2023-03-10 21:49:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11746.5). Total num frames: 80154624. Throughput: 0: 11809.4. Samples: 80149544. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:49:49,742][1096160] Avg episode reward: [(0, '4853.467')] [2023-03-10 21:49:50,028][1096443] Updated weights for policy 0, policy_version 156560 (0.0005) [2023-03-10 21:49:53,533][1096443] Updated weights for policy 0, policy_version 156640 (0.0005) [2023-03-10 21:49:54,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 80211968. Throughput: 0: 11756.3. Samples: 80184268. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:49:54,742][1096160] Avg episode reward: [(0, '4851.853')] [2023-03-10 21:49:57,055][1096443] Updated weights for policy 0, policy_version 156720 (0.0005) [2023-03-10 21:49:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11718.7). Total num frames: 80269312. Throughput: 0: 11704.8. Samples: 80255232. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:49:59,742][1096160] Avg episode reward: [(0, '4855.431')] [2023-03-10 21:49:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000156776_80269312.pth... [2023-03-10 21:49:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000156096_79921152.pth [2023-03-10 21:50:00,505][1096443] Updated weights for policy 0, policy_version 156800 (0.0005) [2023-03-10 21:50:03,895][1096443] Updated weights for policy 0, policy_version 156880 (0.0005) [2023-03-10 21:50:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11732.6). Total num frames: 80330752. Throughput: 0: 11744.9. Samples: 80326588. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:04,742][1096160] Avg episode reward: [(0, '4857.133')] [2023-03-10 21:50:07,502][1096443] Updated weights for policy 0, policy_version 156960 (0.0005) [2023-03-10 21:50:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11732.6). Total num frames: 80388096. Throughput: 0: 11737.6. Samples: 80360524. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:09,742][1096160] Avg episode reward: [(0, '4855.856')] [2023-03-10 21:50:11,030][1096443] Updated weights for policy 0, policy_version 157040 (0.0005) [2023-03-10 21:50:14,518][1096443] Updated weights for policy 0, policy_version 157120 (0.0005) [2023-03-10 21:50:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11718.7). Total num frames: 80445440. Throughput: 0: 11698.5. Samples: 80429996. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:14,742][1096160] Avg episode reward: [(0, '4854.006')] [2023-03-10 21:50:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000157120_80445440.pth... [2023-03-10 21:50:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000156440_80097280.pth [2023-03-10 21:50:18,038][1096443] Updated weights for policy 0, policy_version 157200 (0.0004) [2023-03-10 21:50:19,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11718.7). Total num frames: 80502784. Throughput: 0: 11669.7. Samples: 80499536. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:19,742][1096160] Avg episode reward: [(0, '4857.851')] [2023-03-10 21:50:21,585][1096443] Updated weights for policy 0, policy_version 157280 (0.0005) [2023-03-10 21:50:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11746.5). Total num frames: 80564224. Throughput: 0: 11705.2. Samples: 80535560. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:24,742][1096160] Avg episode reward: [(0, '4857.303')] [2023-03-10 21:50:25,098][1096443] Updated weights for policy 0, policy_version 157360 (0.0005) [2023-03-10 21:50:28,702][1096443] Updated weights for policy 0, policy_version 157440 (0.0005) [2023-03-10 21:50:29,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11732.6). Total num frames: 80617472. Throughput: 0: 11629.9. Samples: 80603644. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:29,742][1096160] Avg episode reward: [(0, '4858.241')] [2023-03-10 21:50:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000157464_80621568.pth... [2023-03-10 21:50:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000156776_80269312.pth [2023-03-10 21:50:32,143][1096443] Updated weights for policy 0, policy_version 157520 (0.0005) [2023-03-10 21:50:34,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11746.5). Total num frames: 80678912. Throughput: 0: 11674.1. Samples: 80674880. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:34,742][1096160] Avg episode reward: [(0, '4857.027')] [2023-03-10 21:50:35,702][1096443] Updated weights for policy 0, policy_version 157600 (0.0005) [2023-03-10 21:50:39,048][1096443] Updated weights for policy 0, policy_version 157680 (0.0005) [2023-03-10 21:50:39,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11673.6, 300 sec: 11732.6). Total num frames: 80736256. Throughput: 0: 11644.1. Samples: 80708252. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:39,742][1096160] Avg episode reward: [(0, '4854.172')] [2023-03-10 21:50:42,468][1096443] Updated weights for policy 0, policy_version 157760 (0.0005) [2023-03-10 21:50:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11718.7). Total num frames: 80797696. Throughput: 0: 11719.2. Samples: 80782596. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:44,742][1096160] Avg episode reward: [(0, '4861.237')] [2023-03-10 21:50:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000157808_80797696.pth... [2023-03-10 21:50:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000157120_80445440.pth [2023-03-10 21:50:45,943][1096443] Updated weights for policy 0, policy_version 157840 (0.0004) [2023-03-10 21:50:49,182][1096443] Updated weights for policy 0, policy_version 157920 (0.0005) [2023-03-10 21:50:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11741.9, 300 sec: 11732.6). Total num frames: 80859136. Throughput: 0: 11744.8. Samples: 80855104. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 21:50:49,742][1096160] Avg episode reward: [(0, '4856.718')] [2023-03-10 21:50:52,635][1096443] Updated weights for policy 0, policy_version 158000 (0.0005) [2023-03-10 21:50:54,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11741.9, 300 sec: 11718.7). Total num frames: 80916480. Throughput: 0: 11797.3. Samples: 80891400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:50:54,742][1096160] Avg episode reward: [(0, '4858.976')] [2023-03-10 21:50:56,120][1096443] Updated weights for policy 0, policy_version 158080 (0.0006) [2023-03-10 21:50:59,386][1096443] Updated weights for policy 0, policy_version 158160 (0.0005) [2023-03-10 21:50:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11746.5). Total num frames: 80982016. Throughput: 0: 11827.5. Samples: 80962236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:50:59,745][1096160] Avg episode reward: [(0, '4863.426')] [2023-03-10 21:50:59,748][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000158168_80982016.pth... [2023-03-10 21:50:59,751][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000157464_80621568.pth [2023-03-10 21:51:02,643][1096443] Updated weights for policy 0, policy_version 158240 (0.0004) [2023-03-10 21:51:04,742][1096160] Fps is (10 sec: 12697.6, 60 sec: 11878.4, 300 sec: 11746.5). Total num frames: 81043456. Throughput: 0: 11951.9. Samples: 81037372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:04,742][1096160] Avg episode reward: [(0, '4860.177')] [2023-03-10 21:51:06,135][1096443] Updated weights for policy 0, policy_version 158320 (0.0005) [2023-03-10 21:51:09,558][1096443] Updated weights for policy 0, policy_version 158400 (0.0004) [2023-03-10 21:51:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11732.6). Total num frames: 81100800. Throughput: 0: 11925.3. Samples: 81072200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:09,742][1096160] Avg episode reward: [(0, '4858.578')] [2023-03-10 21:51:13,026][1096443] Updated weights for policy 0, policy_version 158480 (0.0004) [2023-03-10 21:51:14,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11732.6). Total num frames: 81158144. Throughput: 0: 12014.5. Samples: 81144296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:14,742][1096160] Avg episode reward: [(0, '4858.330')] [2023-03-10 21:51:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000158512_81158144.pth... [2023-03-10 21:51:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000157808_80797696.pth [2023-03-10 21:51:16,616][1096443] Updated weights for policy 0, policy_version 158560 (0.0004) [2023-03-10 21:51:19,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 11746.5). Total num frames: 81219584. Throughput: 0: 11971.9. Samples: 81213612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:19,742][1096160] Avg episode reward: [(0, '4858.836')] [2023-03-10 21:51:20,080][1096443] Updated weights for policy 0, policy_version 158640 (0.0004) [2023-03-10 21:51:23,532][1096443] Updated weights for policy 0, policy_version 158720 (0.0004) [2023-03-10 21:51:24,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11746.5). Total num frames: 81276928. Throughput: 0: 12001.5. Samples: 81248320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:24,742][1096160] Avg episode reward: [(0, '4860.236')] [2023-03-10 21:51:26,981][1096443] Updated weights for policy 0, policy_version 158800 (0.0004) [2023-03-10 21:51:29,742][1096160] Fps is (10 sec: 11468.6, 60 sec: 11946.7, 300 sec: 11732.6). Total num frames: 81334272. Throughput: 0: 11955.3. Samples: 81320584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:29,742][1096160] Avg episode reward: [(0, '4860.984')] [2023-03-10 21:51:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000158856_81334272.pth... [2023-03-10 21:51:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000158168_80982016.pth [2023-03-10 21:51:30,616][1096443] Updated weights for policy 0, policy_version 158880 (0.0005) [2023-03-10 21:51:33,987][1096443] Updated weights for policy 0, policy_version 158960 (0.0004) [2023-03-10 21:51:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11760.4). Total num frames: 81395712. Throughput: 0: 11907.3. Samples: 81390932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:34,742][1096160] Avg episode reward: [(0, '4862.772')] [2023-03-10 21:51:37,450][1096443] Updated weights for policy 0, policy_version 159040 (0.0005) [2023-03-10 21:51:39,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 12014.9, 300 sec: 11760.4). Total num frames: 81457152. Throughput: 0: 11883.0. Samples: 81426136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:39,742][1096160] Avg episode reward: [(0, '4862.430')] [2023-03-10 21:51:40,664][1096443] Updated weights for policy 0, policy_version 159120 (0.0005) [2023-03-10 21:51:44,042][1096443] Updated weights for policy 0, policy_version 159200 (0.0004) [2023-03-10 21:51:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11746.5). Total num frames: 81514496. Throughput: 0: 11950.8. Samples: 81500020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:44,742][1096160] Avg episode reward: [(0, '4858.042')] [2023-03-10 21:51:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000159208_81514496.pth... [2023-03-10 21:51:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000158512_81158144.pth [2023-03-10 21:51:47,537][1096443] Updated weights for policy 0, policy_version 159280 (0.0005) [2023-03-10 21:51:49,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11760.4). Total num frames: 81575936. Throughput: 0: 11878.5. Samples: 81571904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:49,742][1096160] Avg episode reward: [(0, '4860.441')] [2023-03-10 21:51:50,908][1096443] Updated weights for policy 0, policy_version 159360 (0.0004) [2023-03-10 21:51:54,399][1096443] Updated weights for policy 0, policy_version 159440 (0.0005) [2023-03-10 21:51:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11746.5). Total num frames: 81633280. Throughput: 0: 11906.0. Samples: 81607968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:51:54,742][1096160] Avg episode reward: [(0, '4861.472')] [2023-03-10 21:51:57,902][1096443] Updated weights for policy 0, policy_version 159520 (0.0005) [2023-03-10 21:51:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 81694720. Throughput: 0: 11867.0. Samples: 81678312. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:51:59,742][1096160] Avg episode reward: [(0, '4858.664')] [2023-03-10 21:51:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000159560_81694720.pth... [2023-03-10 21:51:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000158856_81334272.pth [2023-03-10 21:52:01,219][1096443] Updated weights for policy 0, policy_version 159600 (0.0005) [2023-03-10 21:52:04,674][1096443] Updated weights for policy 0, policy_version 159680 (0.0005) [2023-03-10 21:52:04,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11760.4). Total num frames: 81756160. Throughput: 0: 11940.9. Samples: 81750952. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:04,742][1096160] Avg episode reward: [(0, '4860.213')] [2023-03-10 21:52:08,081][1096443] Updated weights for policy 0, policy_version 159760 (0.0005) [2023-03-10 21:52:09,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 81817600. Throughput: 0: 11963.2. Samples: 81786664. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:09,742][1096160] Avg episode reward: [(0, '4855.050')] [2023-03-10 21:52:11,495][1096443] Updated weights for policy 0, policy_version 159840 (0.0004) [2023-03-10 21:52:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11760.4). Total num frames: 81874944. Throughput: 0: 11958.0. Samples: 81858692. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:14,742][1096160] Avg episode reward: [(0, '4859.564')] [2023-03-10 21:52:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000159912_81874944.pth... [2023-03-10 21:52:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000159208_81514496.pth [2023-03-10 21:52:14,902][1096443] Updated weights for policy 0, policy_version 159920 (0.0005) [2023-03-10 21:52:18,315][1096443] Updated weights for policy 0, policy_version 160000 (0.0005) [2023-03-10 21:52:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.6, 300 sec: 11746.5). Total num frames: 81936384. Throughput: 0: 12003.3. Samples: 81931080. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:19,742][1096160] Avg episode reward: [(0, '4859.090')] [2023-03-10 21:52:21,811][1096443] Updated weights for policy 0, policy_version 160080 (0.0004) [2023-03-10 21:52:24,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11760.4). Total num frames: 81993728. Throughput: 0: 11990.4. Samples: 81965704. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:24,742][1096160] Avg episode reward: [(0, '4858.812')] [2023-03-10 21:52:25,072][1096443] Updated weights for policy 0, policy_version 160160 (0.0005) [2023-03-10 21:52:28,547][1096443] Updated weights for policy 0, policy_version 160240 (0.0005) [2023-03-10 21:52:29,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11760.4). Total num frames: 82055168. Throughput: 0: 11974.1. Samples: 82038856. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:29,742][1096160] Avg episode reward: [(0, '4858.304')] [2023-03-10 21:52:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000160264_82055168.pth... [2023-03-10 21:52:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000159560_81694720.pth [2023-03-10 21:52:31,870][1096443] Updated weights for policy 0, policy_version 160320 (0.0005) [2023-03-10 21:52:34,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 12015.0, 300 sec: 11760.4). Total num frames: 82116608. Throughput: 0: 12014.9. Samples: 82112576. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:34,742][1096160] Avg episode reward: [(0, '4859.843')] [2023-03-10 21:52:35,254][1096443] Updated weights for policy 0, policy_version 160400 (0.0005) [2023-03-10 21:52:38,808][1096443] Updated weights for policy 0, policy_version 160480 (0.0005) [2023-03-10 21:52:39,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 11760.4). Total num frames: 82173952. Throughput: 0: 11975.5. Samples: 82146864. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:39,742][1096160] Avg episode reward: [(0, '4856.389')] [2023-03-10 21:52:42,327][1096443] Updated weights for policy 0, policy_version 160560 (0.0005) [2023-03-10 21:52:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12014.9, 300 sec: 11774.3). Total num frames: 82235392. Throughput: 0: 12006.1. Samples: 82218588. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:44,742][1096160] Avg episode reward: [(0, '4859.867')] [2023-03-10 21:52:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000160616_82235392.pth... [2023-03-10 21:52:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000159912_81874944.pth [2023-03-10 21:52:45,728][1096443] Updated weights for policy 0, policy_version 160640 (0.0005) [2023-03-10 21:52:49,222][1096443] Updated weights for policy 0, policy_version 160720 (0.0004) [2023-03-10 21:52:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 82292736. Throughput: 0: 11950.0. Samples: 82288704. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:49,742][1096160] Avg episode reward: [(0, '4861.688')] [2023-03-10 21:52:52,644][1096443] Updated weights for policy 0, policy_version 160800 (0.0005) [2023-03-10 21:52:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11788.1). Total num frames: 82354176. Throughput: 0: 11970.3. Samples: 82325328. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:54,742][1096160] Avg episode reward: [(0, '4862.237')] [2023-03-10 21:52:56,059][1096443] Updated weights for policy 0, policy_version 160880 (0.0004) [2023-03-10 21:52:59,623][1096443] Updated weights for policy 0, policy_version 160960 (0.0005) [2023-03-10 21:52:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11774.3). Total num frames: 82411520. Throughput: 0: 11945.1. Samples: 82396220. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:52:59,742][1096160] Avg episode reward: [(0, '4863.983')] [2023-03-10 21:52:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000160960_82411520.pth... [2023-03-10 21:52:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000160264_82055168.pth [2023-03-10 21:53:02,957][1096443] Updated weights for policy 0, policy_version 161040 (0.0005) [2023-03-10 21:53:04,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 82472960. Throughput: 0: 11951.0. Samples: 82468872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:04,742][1096160] Avg episode reward: [(0, '4864.444')] [2023-03-10 21:53:06,307][1096443] Updated weights for policy 0, policy_version 161120 (0.0005) [2023-03-10 21:53:09,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 82530304. Throughput: 0: 11963.9. Samples: 82504080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:09,742][1096160] Avg episode reward: [(0, '4862.151')] [2023-03-10 21:53:09,833][1096443] Updated weights for policy 0, policy_version 161200 (0.0005) [2023-03-10 21:53:13,267][1096443] Updated weights for policy 0, policy_version 161280 (0.0004) [2023-03-10 21:53:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11829.8). Total num frames: 82591744. Throughput: 0: 11921.5. Samples: 82575324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:14,742][1096160] Avg episode reward: [(0, '4859.753')] [2023-03-10 21:53:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000161312_82591744.pth... [2023-03-10 21:53:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000160616_82235392.pth [2023-03-10 21:53:16,741][1096443] Updated weights for policy 0, policy_version 161360 (0.0004) [2023-03-10 21:53:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 82649088. Throughput: 0: 11832.9. Samples: 82645056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:19,742][1096160] Avg episode reward: [(0, '4863.291')] [2023-03-10 21:53:20,282][1096443] Updated weights for policy 0, policy_version 161440 (0.0005) [2023-03-10 21:53:23,769][1096443] Updated weights for policy 0, policy_version 161520 (0.0005) [2023-03-10 21:53:24,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 82706432. Throughput: 0: 11873.6. Samples: 82681176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:24,742][1096160] Avg episode reward: [(0, '4863.140')] [2023-03-10 21:53:27,303][1096443] Updated weights for policy 0, policy_version 161600 (0.0004) [2023-03-10 21:53:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 82763776. Throughput: 0: 11781.6. Samples: 82748760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:29,742][1096160] Avg episode reward: [(0, '4860.916')] [2023-03-10 21:53:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000161648_82763776.pth... [2023-03-10 21:53:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000160960_82411520.pth [2023-03-10 21:53:30,732][1096443] Updated weights for policy 0, policy_version 161680 (0.0005) [2023-03-10 21:53:34,198][1096443] Updated weights for policy 0, policy_version 161760 (0.0005) [2023-03-10 21:53:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 82825216. Throughput: 0: 11832.9. Samples: 82821184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:34,742][1096160] Avg episode reward: [(0, '4862.401')] [2023-03-10 21:53:37,629][1096443] Updated weights for policy 0, policy_version 161840 (0.0005) [2023-03-10 21:53:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 82882560. Throughput: 0: 11829.1. Samples: 82857636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:39,742][1096160] Avg episode reward: [(0, '4861.574')] [2023-03-10 21:53:41,212][1096443] Updated weights for policy 0, policy_version 161920 (0.0005) [2023-03-10 21:53:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 82939904. Throughput: 0: 11755.9. Samples: 82925236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:44,742][1096160] Avg episode reward: [(0, '4860.869')] [2023-03-10 21:53:44,755][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000162000_82944000.pth... [2023-03-10 21:53:44,755][1096443] Updated weights for policy 0, policy_version 162000 (0.0005) [2023-03-10 21:53:44,756][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000161312_82591744.pth [2023-03-10 21:53:48,501][1096443] Updated weights for policy 0, policy_version 162080 (0.0005) [2023-03-10 21:53:49,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 82997248. Throughput: 0: 11650.9. Samples: 82993164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:49,742][1096160] Avg episode reward: [(0, '4860.767')] [2023-03-10 21:53:52,098][1096443] Updated weights for policy 0, policy_version 162160 (0.0005) [2023-03-10 21:53:54,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 83054592. Throughput: 0: 11627.8. Samples: 83027332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:54,742][1096160] Avg episode reward: [(0, '4862.767')] [2023-03-10 21:53:55,694][1096443] Updated weights for policy 0, policy_version 162240 (0.0005) [2023-03-10 21:53:59,144][1096443] Updated weights for policy 0, policy_version 162320 (0.0005) [2023-03-10 21:53:59,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 83116032. Throughput: 0: 11589.0. Samples: 83096828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:53:59,742][1096160] Avg episode reward: [(0, '4857.061')] [2023-03-10 21:53:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000162336_83116032.pth... [2023-03-10 21:53:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000161648_82763776.pth [2023-03-10 21:54:02,539][1096443] Updated weights for policy 0, policy_version 162400 (0.0004) [2023-03-10 21:54:04,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 83173376. Throughput: 0: 11649.4. Samples: 83169280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:54:04,742][1096160] Avg episode reward: [(0, '4862.392')] [2023-03-10 21:54:05,886][1096443] Updated weights for policy 0, policy_version 162480 (0.0004) [2023-03-10 21:54:09,265][1096443] Updated weights for policy 0, policy_version 162560 (0.0005) [2023-03-10 21:54:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11741.8, 300 sec: 11829.8). Total num frames: 83234816. Throughput: 0: 11646.4. Samples: 83205264. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:09,742][1096160] Avg episode reward: [(0, '4858.015')] [2023-03-10 21:54:12,819][1096443] Updated weights for policy 0, policy_version 162640 (0.0005) [2023-03-10 21:54:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 83292160. Throughput: 0: 11712.9. Samples: 83275840. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:14,742][1096160] Avg episode reward: [(0, '4861.182')] [2023-03-10 21:54:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000162680_83292160.pth... [2023-03-10 21:54:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000162000_82944000.pth [2023-03-10 21:54:16,250][1096443] Updated weights for policy 0, policy_version 162720 (0.0004) [2023-03-10 21:54:19,648][1096443] Updated weights for policy 0, policy_version 162800 (0.0005) [2023-03-10 21:54:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 83353600. Throughput: 0: 11740.6. Samples: 83349512. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:19,742][1096160] Avg episode reward: [(0, '4858.989')] [2023-03-10 21:54:23,301][1096443] Updated weights for policy 0, policy_version 162880 (0.0005) [2023-03-10 21:54:24,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 83410944. Throughput: 0: 11658.9. Samples: 83382284. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:24,742][1096160] Avg episode reward: [(0, '4861.721')] [2023-03-10 21:54:26,768][1096443] Updated weights for policy 0, policy_version 162960 (0.0006) [2023-03-10 21:54:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 83468288. Throughput: 0: 11718.0. Samples: 83452544. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:29,742][1096160] Avg episode reward: [(0, '4863.603')] [2023-03-10 21:54:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000163024_83468288.pth... [2023-03-10 21:54:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000162336_83116032.pth [2023-03-10 21:54:30,187][1096443] Updated weights for policy 0, policy_version 163040 (0.0005) [2023-03-10 21:54:33,850][1096443] Updated weights for policy 0, policy_version 163120 (0.0005) [2023-03-10 21:54:34,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 83525632. Throughput: 0: 11741.5. Samples: 83521532. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:34,742][1096160] Avg episode reward: [(0, '4859.268')] [2023-03-10 21:54:37,380][1096443] Updated weights for policy 0, policy_version 163200 (0.0005) [2023-03-10 21:54:39,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 83582976. Throughput: 0: 11767.0. Samples: 83556848. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:39,742][1096160] Avg episode reward: [(0, '4861.182')] [2023-03-10 21:54:40,835][1096443] Updated weights for policy 0, policy_version 163280 (0.0004) [2023-03-10 21:54:44,417][1096443] Updated weights for policy 0, policy_version 163360 (0.0005) [2023-03-10 21:54:44,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 83640320. Throughput: 0: 11761.1. Samples: 83626076. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:44,742][1096160] Avg episode reward: [(0, '4860.234')] [2023-03-10 21:54:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000163360_83640320.pth... [2023-03-10 21:54:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000162680_83292160.pth [2023-03-10 21:54:47,839][1096443] Updated weights for policy 0, policy_version 163440 (0.0005) [2023-03-10 21:54:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.8, 300 sec: 11829.8). Total num frames: 83701760. Throughput: 0: 11762.9. Samples: 83698612. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:49,742][1096160] Avg episode reward: [(0, '4860.247')] [2023-03-10 21:54:51,225][1096443] Updated weights for policy 0, policy_version 163520 (0.0005) [2023-03-10 21:54:54,716][1096443] Updated weights for policy 0, policy_version 163600 (0.0005) [2023-03-10 21:54:54,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 83763200. Throughput: 0: 11761.6. Samples: 83734536. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:54,742][1096160] Avg episode reward: [(0, '4864.984')] [2023-03-10 21:54:58,103][1096443] Updated weights for policy 0, policy_version 163680 (0.0005) [2023-03-10 21:54:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 83820544. Throughput: 0: 11767.1. Samples: 83805360. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:54:59,742][1096160] Avg episode reward: [(0, '4861.643')] [2023-03-10 21:54:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000163712_83820544.pth... [2023-03-10 21:54:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000163024_83468288.pth [2023-03-10 21:55:01,553][1096443] Updated weights for policy 0, policy_version 163760 (0.0005) [2023-03-10 21:55:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 83881984. Throughput: 0: 11721.8. Samples: 83876992. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:55:04,742][1096160] Avg episode reward: [(0, '4862.619')] [2023-03-10 21:55:05,036][1096443] Updated weights for policy 0, policy_version 163840 (0.0004) [2023-03-10 21:55:08,547][1096443] Updated weights for policy 0, policy_version 163920 (0.0004) [2023-03-10 21:55:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 83939328. Throughput: 0: 11777.2. Samples: 83912260. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-03-10 21:55:09,742][1096160] Avg episode reward: [(0, '4859.369')] [2023-03-10 21:55:12,138][1096443] Updated weights for policy 0, policy_version 164000 (0.0005) [2023-03-10 21:55:14,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 83996672. Throughput: 0: 11729.3. Samples: 83980360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:14,742][1096160] Avg episode reward: [(0, '4862.090')] [2023-03-10 21:55:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000164056_83996672.pth... [2023-03-10 21:55:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000163360_83640320.pth [2023-03-10 21:55:15,460][1096443] Updated weights for policy 0, policy_version 164080 (0.0005) [2023-03-10 21:55:19,100][1096443] Updated weights for policy 0, policy_version 164160 (0.0004) [2023-03-10 21:55:19,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 84054016. Throughput: 0: 11769.4. Samples: 84051156. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:19,742][1096160] Avg episode reward: [(0, '4861.144')] [2023-03-10 21:55:22,668][1096443] Updated weights for policy 0, policy_version 164240 (0.0005) [2023-03-10 21:55:24,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 84111360. Throughput: 0: 11771.5. Samples: 84086564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:24,742][1096160] Avg episode reward: [(0, '4862.667')] [2023-03-10 21:55:26,112][1096443] Updated weights for policy 0, policy_version 164320 (0.0005) [2023-03-10 21:55:29,594][1096443] Updated weights for policy 0, policy_version 164400 (0.0005) [2023-03-10 21:55:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 84172800. Throughput: 0: 11795.4. Samples: 84156872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:29,742][1096160] Avg episode reward: [(0, '4859.715')] [2023-03-10 21:55:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000164400_84172800.pth... [2023-03-10 21:55:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000163712_83820544.pth [2023-03-10 21:55:33,048][1096443] Updated weights for policy 0, policy_version 164480 (0.0005) [2023-03-10 21:55:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11843.7). Total num frames: 84230144. Throughput: 0: 11727.2. Samples: 84226336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:34,742][1096160] Avg episode reward: [(0, '4860.067')] [2023-03-10 21:55:36,470][1096443] Updated weights for policy 0, policy_version 164560 (0.0005) [2023-03-10 21:55:39,700][1096443] Updated weights for policy 0, policy_version 164640 (0.0005) [2023-03-10 21:55:39,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 84295680. Throughput: 0: 11791.2. Samples: 84265140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:39,742][1096160] Avg episode reward: [(0, '4861.129')] [2023-03-10 21:55:43,050][1096443] Updated weights for policy 0, policy_version 164720 (0.0005) [2023-03-10 21:55:44,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 84353024. Throughput: 0: 11857.9. Samples: 84338968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:44,742][1096160] Avg episode reward: [(0, '4864.482')] [2023-03-10 21:55:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000164752_84353024.pth... [2023-03-10 21:55:44,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000164056_83996672.pth [2023-03-10 21:55:46,500][1096443] Updated weights for policy 0, policy_version 164800 (0.0005) [2023-03-10 21:55:49,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11843.7). Total num frames: 84410368. Throughput: 0: 11795.0. Samples: 84407764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:49,742][1096160] Avg episode reward: [(0, '4862.155')] [2023-03-10 21:55:50,153][1096443] Updated weights for policy 0, policy_version 164880 (0.0005) [2023-03-10 21:55:53,631][1096443] Updated weights for policy 0, policy_version 164960 (0.0006) [2023-03-10 21:55:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11829.8). Total num frames: 84471808. Throughput: 0: 11802.7. Samples: 84443380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:54,742][1096160] Avg episode reward: [(0, '4858.114')] [2023-03-10 21:55:56,982][1096443] Updated weights for policy 0, policy_version 165040 (0.0005) [2023-03-10 21:55:59,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 84533248. Throughput: 0: 11902.8. Samples: 84515988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:55:59,743][1096160] Avg episode reward: [(0, '4859.131')] [2023-03-10 21:55:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000165104_84533248.pth... [2023-03-10 21:55:59,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000164400_84172800.pth [2023-03-10 21:56:00,367][1096443] Updated weights for policy 0, policy_version 165120 (0.0004) [2023-03-10 21:56:04,076][1096443] Updated weights for policy 0, policy_version 165200 (0.0005) [2023-03-10 21:56:04,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 84586496. Throughput: 0: 11837.3. Samples: 84583836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:56:04,742][1096160] Avg episode reward: [(0, '4862.948')] [2023-03-10 21:56:07,399][1096443] Updated weights for policy 0, policy_version 165280 (0.0006) [2023-03-10 21:56:09,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11810.2, 300 sec: 11829.8). Total num frames: 84647936. Throughput: 0: 11870.8. Samples: 84620748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:56:09,742][1096160] Avg episode reward: [(0, '4861.017')] [2023-03-10 21:56:10,780][1096443] Updated weights for policy 0, policy_version 165360 (0.0005) [2023-03-10 21:56:14,226][1096443] Updated weights for policy 0, policy_version 165440 (0.0005) [2023-03-10 21:56:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 84709376. Throughput: 0: 11915.4. Samples: 84693064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:56:14,742][1096160] Avg episode reward: [(0, '4861.945')] [2023-03-10 21:56:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000165448_84709376.pth... [2023-03-10 21:56:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000164752_84353024.pth [2023-03-10 21:56:17,601][1096443] Updated weights for policy 0, policy_version 165520 (0.0005) [2023-03-10 21:56:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 84770816. Throughput: 0: 12007.8. Samples: 84766688. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:56:19,742][1096160] Avg episode reward: [(0, '4864.494')] [2023-03-10 21:56:20,824][1096443] Updated weights for policy 0, policy_version 165600 (0.0005) [2023-03-10 21:56:24,305][1096443] Updated weights for policy 0, policy_version 165680 (0.0005) [2023-03-10 21:56:24,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 12014.9, 300 sec: 11857.6). Total num frames: 84832256. Throughput: 0: 11965.7. Samples: 84803596. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:56:24,742][1096160] Avg episode reward: [(0, '4862.927')] [2023-03-10 21:56:27,722][1096443] Updated weights for policy 0, policy_version 165760 (0.0005) [2023-03-10 21:56:29,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11857.6). Total num frames: 84893696. Throughput: 0: 11909.5. Samples: 84874896. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:56:29,742][1096160] Avg episode reward: [(0, '4862.938')] [2023-03-10 21:56:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000165808_84893696.pth... [2023-03-10 21:56:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000165104_84533248.pth [2023-03-10 21:56:31,060][1096443] Updated weights for policy 0, policy_version 165840 (0.0005) [2023-03-10 21:56:34,448][1096443] Updated weights for policy 0, policy_version 165920 (0.0005) [2023-03-10 21:56:34,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 12014.9, 300 sec: 11843.7). Total num frames: 84951040. Throughput: 0: 12002.3. Samples: 84947868. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:56:34,742][1096160] Avg episode reward: [(0, '4863.369')] [2023-03-10 21:56:38,016][1096443] Updated weights for policy 0, policy_version 166000 (0.0005) [2023-03-10 21:56:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11857.6). Total num frames: 85012480. Throughput: 0: 11983.8. Samples: 84982652. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:56:39,742][1096160] Avg episode reward: [(0, '4864.859')] [2023-03-10 21:56:41,367][1096443] Updated weights for policy 0, policy_version 166080 (0.0004) [2023-03-10 21:56:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 85069824. Throughput: 0: 11983.0. Samples: 85055220. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:56:44,742][1096160] Avg episode reward: [(0, '4864.430')] [2023-03-10 21:56:44,764][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000166160_85073920.pth... [2023-03-10 21:56:44,765][1096443] Updated weights for policy 0, policy_version 166160 (0.0004) [2023-03-10 21:56:44,766][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000165448_84709376.pth [2023-03-10 21:56:48,288][1096443] Updated weights for policy 0, policy_version 166240 (0.0005) [2023-03-10 21:56:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11857.6). Total num frames: 85131264. Throughput: 0: 12056.3. Samples: 85126372. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:56:49,742][1096160] Avg episode reward: [(0, '4860.229')] [2023-03-10 21:56:51,720][1096443] Updated weights for policy 0, policy_version 166320 (0.0005) [2023-03-10 21:56:54,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 85188608. Throughput: 0: 12023.1. Samples: 85161788. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:56:54,742][1096160] Avg episode reward: [(0, '4858.871')] [2023-03-10 21:56:55,337][1096443] Updated weights for policy 0, policy_version 166400 (0.0005) [2023-03-10 21:56:58,628][1096443] Updated weights for policy 0, policy_version 166480 (0.0005) [2023-03-10 21:56:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 85250048. Throughput: 0: 11984.6. Samples: 85232372. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:56:59,742][1096160] Avg episode reward: [(0, '4863.573')] [2023-03-10 21:56:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000166504_85250048.pth... [2023-03-10 21:56:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000165808_84893696.pth [2023-03-10 21:57:01,872][1096443] Updated weights for policy 0, policy_version 166560 (0.0004) [2023-03-10 21:57:04,741][1096160] Fps is (10 sec: 12288.0, 60 sec: 12083.2, 300 sec: 11843.7). Total num frames: 85311488. Throughput: 0: 12012.4. Samples: 85307244. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:57:04,742][1096160] Avg episode reward: [(0, '4864.935')] [2023-03-10 21:57:05,349][1096443] Updated weights for policy 0, policy_version 166640 (0.0005) [2023-03-10 21:57:09,030][1096443] Updated weights for policy 0, policy_version 166720 (0.0005) [2023-03-10 21:57:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11843.7). Total num frames: 85368832. Throughput: 0: 11941.2. Samples: 85340948. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:57:09,742][1096160] Avg episode reward: [(0, '4863.726')] [2023-03-10 21:57:12,491][1096443] Updated weights for policy 0, policy_version 166800 (0.0005) [2023-03-10 21:57:14,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11946.7, 300 sec: 11829.8). Total num frames: 85426176. Throughput: 0: 11888.0. Samples: 85409856. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:57:14,742][1096160] Avg episode reward: [(0, '4860.004')] [2023-03-10 21:57:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000166848_85426176.pth... [2023-03-10 21:57:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000166160_85073920.pth [2023-03-10 21:57:15,934][1096443] Updated weights for policy 0, policy_version 166880 (0.0005) [2023-03-10 21:57:19,520][1096443] Updated weights for policy 0, policy_version 166960 (0.0005) [2023-03-10 21:57:19,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 85483520. Throughput: 0: 11813.8. Samples: 85479488. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:57:19,742][1096160] Avg episode reward: [(0, '4862.174')] [2023-03-10 21:57:23,052][1096443] Updated weights for policy 0, policy_version 167040 (0.0005) [2023-03-10 21:57:24,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11810.2, 300 sec: 11815.9). Total num frames: 85540864. Throughput: 0: 11829.0. Samples: 85514956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:57:24,742][1096160] Avg episode reward: [(0, '4862.146')] [2023-03-10 21:57:26,523][1096443] Updated weights for policy 0, policy_version 167120 (0.0005) [2023-03-10 21:57:29,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 85602304. Throughput: 0: 11789.8. Samples: 85585764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:57:29,742][1096160] Avg episode reward: [(0, '4866.348')] [2023-03-10 21:57:29,747][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000167192_85602304.pth... [2023-03-10 21:57:29,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000166504_85250048.pth [2023-03-10 21:57:29,750][1096399] Saving new best policy, reward=4866.348! [2023-03-10 21:57:29,811][1096443] Updated weights for policy 0, policy_version 167200 (0.0005) [2023-03-10 21:57:33,412][1096443] Updated weights for policy 0, policy_version 167280 (0.0005) [2023-03-10 21:57:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 85659648. Throughput: 0: 11766.9. Samples: 85655884. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:57:34,742][1096160] Avg episode reward: [(0, '4860.024')] [2023-03-10 21:57:37,019][1096443] Updated weights for policy 0, policy_version 167360 (0.0005) [2023-03-10 21:57:39,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 85716992. Throughput: 0: 11761.3. Samples: 85691048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:57:39,742][1096160] Avg episode reward: [(0, '4863.658')] [2023-03-10 21:57:40,486][1096443] Updated weights for policy 0, policy_version 167440 (0.0005) [2023-03-10 21:57:43,877][1096443] Updated weights for policy 0, policy_version 167520 (0.0005) [2023-03-10 21:57:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 85778432. Throughput: 0: 11790.7. Samples: 85762956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:57:44,742][1096160] Avg episode reward: [(0, '4862.061')] [2023-03-10 21:57:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000167536_85778432.pth... [2023-03-10 21:57:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000166848_85426176.pth [2023-03-10 21:57:47,266][1096443] Updated weights for policy 0, policy_version 167600 (0.0005) [2023-03-10 21:57:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 85839872. Throughput: 0: 11736.2. Samples: 85835376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:57:49,742][1096160] Avg episode reward: [(0, '4860.690')] [2023-03-10 21:57:50,715][1096443] Updated weights for policy 0, policy_version 167680 (0.0006) [2023-03-10 21:57:54,212][1096443] Updated weights for policy 0, policy_version 167760 (0.0004) [2023-03-10 21:57:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 85897216. Throughput: 0: 11751.5. Samples: 85869764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:57:54,742][1096160] Avg episode reward: [(0, '4860.603')] [2023-03-10 21:57:57,664][1096443] Updated weights for policy 0, policy_version 167840 (0.0005) [2023-03-10 21:57:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 85954560. Throughput: 0: 11821.3. Samples: 85941816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:57:59,742][1096160] Avg episode reward: [(0, '4861.083')] [2023-03-10 21:57:59,783][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000167888_85958656.pth... [2023-03-10 21:57:59,784][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000167192_85602304.pth [2023-03-10 21:58:01,154][1096443] Updated weights for policy 0, policy_version 167920 (0.0005) [2023-03-10 21:58:04,547][1096443] Updated weights for policy 0, policy_version 168000 (0.0006) [2023-03-10 21:58:04,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 86016000. Throughput: 0: 11831.7. Samples: 86011916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:58:04,742][1096160] Avg episode reward: [(0, '4861.945')] [2023-03-10 21:58:07,880][1096443] Updated weights for policy 0, policy_version 168080 (0.0005) [2023-03-10 21:58:09,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 86077440. Throughput: 0: 11878.5. Samples: 86049488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:58:09,742][1096160] Avg episode reward: [(0, '4865.121')] [2023-03-10 21:58:11,310][1096443] Updated weights for policy 0, policy_version 168160 (0.0005) [2023-03-10 21:58:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.2, 300 sec: 11815.9). Total num frames: 86134784. Throughput: 0: 11852.4. Samples: 86119120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:58:14,742][1096160] Avg episode reward: [(0, '4864.382')] [2023-03-10 21:58:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000168232_86134784.pth... [2023-03-10 21:58:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000167536_85778432.pth [2023-03-10 21:58:14,802][1096443] Updated weights for policy 0, policy_version 168240 (0.0005) [2023-03-10 21:58:18,301][1096443] Updated weights for policy 0, policy_version 168320 (0.0005) [2023-03-10 21:58:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11829.8). Total num frames: 86196224. Throughput: 0: 11897.8. Samples: 86191284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:58:19,742][1096160] Avg episode reward: [(0, '4864.844')] [2023-03-10 21:58:21,499][1096443] Updated weights for policy 0, policy_version 168400 (0.0005) [2023-03-10 21:58:24,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.6, 300 sec: 11843.7). Total num frames: 86257664. Throughput: 0: 11974.5. Samples: 86229900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:58:24,742][1096160] Avg episode reward: [(0, '4860.254')] [2023-03-10 21:58:24,786][1096443] Updated weights for policy 0, policy_version 168480 (0.0005) [2023-03-10 21:58:28,327][1096443] Updated weights for policy 0, policy_version 168560 (0.0005) [2023-03-10 21:58:29,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 86319104. Throughput: 0: 11987.0. Samples: 86302368. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:58:29,742][1096160] Avg episode reward: [(0, '4860.053')] [2023-03-10 21:58:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000168592_86319104.pth... [2023-03-10 21:58:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000167888_85958656.pth [2023-03-10 21:58:31,738][1096443] Updated weights for policy 0, policy_version 168640 (0.0005) [2023-03-10 21:58:34,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 86376448. Throughput: 0: 11970.9. Samples: 86374064. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:58:34,742][1096160] Avg episode reward: [(0, '4861.029')] [2023-03-10 21:58:35,174][1096443] Updated weights for policy 0, policy_version 168720 (0.0005) [2023-03-10 21:58:38,704][1096443] Updated weights for policy 0, policy_version 168800 (0.0005) [2023-03-10 21:58:39,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11946.7, 300 sec: 11843.7). Total num frames: 86433792. Throughput: 0: 11985.8. Samples: 86409124. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:58:39,742][1096160] Avg episode reward: [(0, '4862.492')] [2023-03-10 21:58:42,160][1096443] Updated weights for policy 0, policy_version 168880 (0.0004) [2023-03-10 21:58:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11857.6). Total num frames: 86495232. Throughput: 0: 11933.8. Samples: 86478836. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:58:44,742][1096160] Avg episode reward: [(0, '4860.637')] [2023-03-10 21:58:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000168936_86495232.pth... [2023-03-10 21:58:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000168232_86134784.pth [2023-03-10 21:58:45,719][1096443] Updated weights for policy 0, policy_version 168960 (0.0006) [2023-03-10 21:58:49,229][1096443] Updated weights for policy 0, policy_version 169040 (0.0004) [2023-03-10 21:58:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 86552576. Throughput: 0: 11923.8. Samples: 86548488. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:58:49,742][1096160] Avg episode reward: [(0, '4861.947')] [2023-03-10 21:58:52,547][1096443] Updated weights for policy 0, policy_version 169120 (0.0005) [2023-03-10 21:58:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11857.6). Total num frames: 86614016. Throughput: 0: 11909.3. Samples: 86585408. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:58:54,743][1096160] Avg episode reward: [(0, '4862.782')] [2023-03-10 21:58:56,057][1096443] Updated weights for policy 0, policy_version 169200 (0.0005) [2023-03-10 21:58:59,639][1096443] Updated weights for policy 0, policy_version 169280 (0.0004) [2023-03-10 21:58:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11857.6). Total num frames: 86671360. Throughput: 0: 11909.5. Samples: 86655048. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:58:59,742][1096160] Avg episode reward: [(0, '4862.171')] [2023-03-10 21:58:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000169280_86671360.pth... [2023-03-10 21:58:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000168592_86319104.pth [2023-03-10 21:59:02,999][1096443] Updated weights for policy 0, policy_version 169360 (0.0005) [2023-03-10 21:59:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.6, 300 sec: 11857.6). Total num frames: 86732800. Throughput: 0: 11942.2. Samples: 86728684. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:59:04,742][1096160] Avg episode reward: [(0, '4856.392')] [2023-03-10 21:59:06,390][1096443] Updated weights for policy 0, policy_version 169440 (0.0005) [2023-03-10 21:59:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 86790144. Throughput: 0: 11837.3. Samples: 86762580. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:59:09,742][1096160] Avg episode reward: [(0, '4862.693')] [2023-03-10 21:59:09,867][1096443] Updated weights for policy 0, policy_version 169520 (0.0005) [2023-03-10 21:59:13,384][1096443] Updated weights for policy 0, policy_version 169600 (0.0005) [2023-03-10 21:59:14,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11843.7). Total num frames: 86847488. Throughput: 0: 11794.2. Samples: 86833108. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:59:14,742][1096160] Avg episode reward: [(0, '4860.889')] [2023-03-10 21:59:14,766][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000169632_86851584.pth... [2023-03-10 21:59:14,767][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000168936_86495232.pth [2023-03-10 21:59:16,821][1096443] Updated weights for policy 0, policy_version 169680 (0.0006) [2023-03-10 21:59:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11857.6). Total num frames: 86908928. Throughput: 0: 11792.8. Samples: 86904740. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:59:19,742][1096160] Avg episode reward: [(0, '4859.434')] [2023-03-10 21:59:20,371][1096443] Updated weights for policy 0, policy_version 169760 (0.0006) [2023-03-10 21:59:23,879][1096443] Updated weights for policy 0, policy_version 169840 (0.0004) [2023-03-10 21:59:24,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 86966272. Throughput: 0: 11755.8. Samples: 86938136. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-03-10 21:59:24,742][1096160] Avg episode reward: [(0, '4859.858')] [2023-03-10 21:59:27,372][1096443] Updated weights for policy 0, policy_version 169920 (0.0005) [2023-03-10 21:59:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 87027712. Throughput: 0: 11756.6. Samples: 87007880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:59:29,742][1096160] Avg episode reward: [(0, '4861.042')] [2023-03-10 21:59:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000169976_87027712.pth... [2023-03-10 21:59:29,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000169280_86671360.pth [2023-03-10 21:59:30,786][1096443] Updated weights for policy 0, policy_version 170000 (0.0005) [2023-03-10 21:59:34,327][1096443] Updated weights for policy 0, policy_version 170080 (0.0004) [2023-03-10 21:59:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 87085056. Throughput: 0: 11817.7. Samples: 87080284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:59:34,742][1096160] Avg episode reward: [(0, '4861.457')] [2023-03-10 21:59:37,606][1096443] Updated weights for policy 0, policy_version 170160 (0.0005) [2023-03-10 21:59:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11885.3). Total num frames: 87146496. Throughput: 0: 11820.7. Samples: 87117340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:59:39,742][1096160] Avg episode reward: [(0, '4861.569')] [2023-03-10 21:59:41,115][1096443] Updated weights for policy 0, policy_version 170240 (0.0004) [2023-03-10 21:59:44,621][1096443] Updated weights for policy 0, policy_version 170320 (0.0005) [2023-03-10 21:59:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11871.5). Total num frames: 87203840. Throughput: 0: 11832.7. Samples: 87187520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:59:44,742][1096160] Avg episode reward: [(0, '4857.523')] [2023-03-10 21:59:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000170320_87203840.pth... [2023-03-10 21:59:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000169632_86851584.pth [2023-03-10 21:59:48,039][1096443] Updated weights for policy 0, policy_version 170400 (0.0005) [2023-03-10 21:59:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11871.5). Total num frames: 87265280. Throughput: 0: 11794.9. Samples: 87259456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:59:49,742][1096160] Avg episode reward: [(0, '4861.731')] [2023-03-10 21:59:51,478][1096443] Updated weights for policy 0, policy_version 170480 (0.0005) [2023-03-10 21:59:54,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 87318528. Throughput: 0: 11809.9. Samples: 87294024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:59:54,742][1096160] Avg episode reward: [(0, '4861.710')] [2023-03-10 21:59:55,101][1096443] Updated weights for policy 0, policy_version 170560 (0.0005) [2023-03-10 21:59:58,568][1096443] Updated weights for policy 0, policy_version 170640 (0.0005) [2023-03-10 21:59:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11857.6). Total num frames: 87379968. Throughput: 0: 11788.5. Samples: 87363592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 21:59:59,742][1096160] Avg episode reward: [(0, '4863.698')] [2023-03-10 21:59:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000170664_87379968.pth... [2023-03-10 21:59:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000169976_87027712.pth [2023-03-10 22:00:02,054][1096443] Updated weights for policy 0, policy_version 170720 (0.0005) [2023-03-10 22:00:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 87437312. Throughput: 0: 11744.1. Samples: 87433224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:00:04,742][1096160] Avg episode reward: [(0, '4862.543')] [2023-03-10 22:00:05,561][1096443] Updated weights for policy 0, policy_version 170800 (0.0004) [2023-03-10 22:00:09,141][1096443] Updated weights for policy 0, policy_version 170880 (0.0005) [2023-03-10 22:00:09,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 87494656. Throughput: 0: 11814.9. Samples: 87469804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:00:09,742][1096160] Avg episode reward: [(0, '4859.968')] [2023-03-10 22:00:12,643][1096443] Updated weights for policy 0, policy_version 170960 (0.0005) [2023-03-10 22:00:14,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11857.6). Total num frames: 87552000. Throughput: 0: 11786.0. Samples: 87538248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:00:14,742][1096160] Avg episode reward: [(0, '4859.383')] [2023-03-10 22:00:14,779][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000171008_87556096.pth... [2023-03-10 22:00:14,780][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000170320_87203840.pth [2023-03-10 22:00:16,250][1096443] Updated weights for policy 0, policy_version 171040 (0.0005) [2023-03-10 22:00:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11857.6). Total num frames: 87609344. Throughput: 0: 11679.7. Samples: 87605872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:00:19,742][1096160] Avg episode reward: [(0, '4859.238')] [2023-03-10 22:00:19,885][1096443] Updated weights for policy 0, policy_version 171120 (0.0005) [2023-03-10 22:00:23,500][1096443] Updated weights for policy 0, policy_version 171200 (0.0005) [2023-03-10 22:00:24,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11843.7). Total num frames: 87666688. Throughput: 0: 11580.1. Samples: 87638444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:00:24,742][1096160] Avg episode reward: [(0, '4864.296')] [2023-03-10 22:00:27,201][1096443] Updated weights for policy 0, policy_version 171280 (0.0005) [2023-03-10 22:00:29,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11843.7). Total num frames: 87724032. Throughput: 0: 11559.8. Samples: 87707712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:00:29,742][1096160] Avg episode reward: [(0, '4860.506')] [2023-03-10 22:00:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000171336_87724032.pth... [2023-03-10 22:00:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000170664_87379968.pth [2023-03-10 22:00:30,577][1096443] Updated weights for policy 0, policy_version 171360 (0.0005) [2023-03-10 22:00:34,176][1096443] Updated weights for policy 0, policy_version 171440 (0.0005) [2023-03-10 22:00:34,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11605.4, 300 sec: 11815.9). Total num frames: 87781376. Throughput: 0: 11509.0. Samples: 87777360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:00:34,742][1096160] Avg episode reward: [(0, '4857.172')] [2023-03-10 22:00:37,610][1096443] Updated weights for policy 0, policy_version 171520 (0.0004) [2023-03-10 22:00:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 87842816. Throughput: 0: 11552.7. Samples: 87813896. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:00:39,742][1096160] Avg episode reward: [(0, '4861.456')] [2023-03-10 22:00:41,006][1096443] Updated weights for policy 0, policy_version 171600 (0.0005) [2023-03-10 22:00:44,598][1096443] Updated weights for policy 0, policy_version 171680 (0.0005) [2023-03-10 22:00:44,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 87900160. Throughput: 0: 11644.4. Samples: 87887588. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:00:44,742][1096160] Avg episode reward: [(0, '4862.010')] [2023-03-10 22:00:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000171680_87900160.pth... [2023-03-10 22:00:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000171008_87556096.pth [2023-03-10 22:00:48,117][1096443] Updated weights for policy 0, policy_version 171760 (0.0005) [2023-03-10 22:00:49,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11815.9). Total num frames: 87957504. Throughput: 0: 11600.6. Samples: 87955252. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:00:49,742][1096160] Avg episode reward: [(0, '4861.719')] [2023-03-10 22:00:51,476][1096443] Updated weights for policy 0, policy_version 171840 (0.0005) [2023-03-10 22:00:54,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 88018944. Throughput: 0: 11567.6. Samples: 87990348. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:00:54,742][1096160] Avg episode reward: [(0, '4865.825')] [2023-03-10 22:00:55,046][1096443] Updated weights for policy 0, policy_version 171920 (0.0004) [2023-03-10 22:00:58,565][1096443] Updated weights for policy 0, policy_version 172000 (0.0005) [2023-03-10 22:00:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11605.3, 300 sec: 11829.8). Total num frames: 88076288. Throughput: 0: 11593.8. Samples: 88059968. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:00:59,742][1096160] Avg episode reward: [(0, '4864.466')] [2023-03-10 22:00:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000172024_88076288.pth... [2023-03-10 22:00:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000171336_87724032.pth [2023-03-10 22:01:01,983][1096443] Updated weights for policy 0, policy_version 172080 (0.0005) [2023-03-10 22:01:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11673.6, 300 sec: 11829.8). Total num frames: 88137728. Throughput: 0: 11719.4. Samples: 88133244. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:01:04,742][1096160] Avg episode reward: [(0, '4860.866')] [2023-03-10 22:01:05,321][1096443] Updated weights for policy 0, policy_version 172160 (0.0004) [2023-03-10 22:01:08,857][1096443] Updated weights for policy 0, policy_version 172240 (0.0005) [2023-03-10 22:01:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11673.6, 300 sec: 11815.9). Total num frames: 88195072. Throughput: 0: 11740.5. Samples: 88166768. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:01:09,742][1096160] Avg episode reward: [(0, '4864.193')] [2023-03-10 22:01:12,465][1096443] Updated weights for policy 0, policy_version 172320 (0.0005) [2023-03-10 22:01:14,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 88252416. Throughput: 0: 11757.6. Samples: 88236804. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:01:14,742][1096160] Avg episode reward: [(0, '4863.449')] [2023-03-10 22:01:14,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000172368_88252416.pth... [2023-03-10 22:01:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000171680_87900160.pth [2023-03-10 22:01:15,907][1096443] Updated weights for policy 0, policy_version 172400 (0.0005) [2023-03-10 22:01:19,504][1096443] Updated weights for policy 0, policy_version 172480 (0.0006) [2023-03-10 22:01:19,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11788.1). Total num frames: 88309760. Throughput: 0: 11745.9. Samples: 88305928. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:01:19,742][1096160] Avg episode reward: [(0, '4862.531')] [2023-03-10 22:01:23,048][1096443] Updated weights for policy 0, policy_version 172560 (0.0005) [2023-03-10 22:01:24,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11673.6, 300 sec: 11774.3). Total num frames: 88367104. Throughput: 0: 11715.9. Samples: 88341112. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:01:24,742][1096160] Avg episode reward: [(0, '4861.593')] [2023-03-10 22:01:26,573][1096443] Updated weights for policy 0, policy_version 172640 (0.0005) [2023-03-10 22:01:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 88428544. Throughput: 0: 11619.1. Samples: 88410448. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:01:29,742][1096160] Avg episode reward: [(0, '4862.628')] [2023-03-10 22:01:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000172712_88428544.pth... [2023-03-10 22:01:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000172024_88076288.pth [2023-03-10 22:01:30,014][1096443] Updated weights for policy 0, policy_version 172720 (0.0005) [2023-03-10 22:01:33,373][1096443] Updated weights for policy 0, policy_version 172800 (0.0005) [2023-03-10 22:01:34,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 88489984. Throughput: 0: 11755.4. Samples: 88484248. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:01:34,742][1096160] Avg episode reward: [(0, '4860.914')] [2023-03-10 22:01:36,801][1096443] Updated weights for policy 0, policy_version 172880 (0.0005) [2023-03-10 22:01:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 88547328. Throughput: 0: 11778.8. Samples: 88520396. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:01:39,742][1096160] Avg episode reward: [(0, '4864.061')] [2023-03-10 22:01:40,361][1096443] Updated weights for policy 0, policy_version 172960 (0.0005) [2023-03-10 22:01:43,734][1096443] Updated weights for policy 0, policy_version 173040 (0.0004) [2023-03-10 22:01:44,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 88604672. Throughput: 0: 11792.8. Samples: 88590644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:01:44,742][1096160] Avg episode reward: [(0, '4861.662')] [2023-03-10 22:01:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000173056_88604672.pth... [2023-03-10 22:01:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000172368_88252416.pth [2023-03-10 22:01:47,215][1096443] Updated weights for policy 0, policy_version 173120 (0.0005) [2023-03-10 22:01:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 88666112. Throughput: 0: 11750.8. Samples: 88662028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:01:49,742][1096160] Avg episode reward: [(0, '4865.261')] [2023-03-10 22:01:50,694][1096443] Updated weights for policy 0, policy_version 173200 (0.0005) [2023-03-10 22:01:54,240][1096443] Updated weights for policy 0, policy_version 173280 (0.0004) [2023-03-10 22:01:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 88723456. Throughput: 0: 11765.0. Samples: 88696192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:01:54,742][1096160] Avg episode reward: [(0, '4862.610')] [2023-03-10 22:01:57,653][1096443] Updated weights for policy 0, policy_version 173360 (0.0004) [2023-03-10 22:01:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 88784896. Throughput: 0: 11778.3. Samples: 88766828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:01:59,742][1096160] Avg episode reward: [(0, '4860.406')] [2023-03-10 22:01:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000173408_88784896.pth... [2023-03-10 22:01:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000172712_88428544.pth [2023-03-10 22:02:00,893][1096443] Updated weights for policy 0, policy_version 173440 (0.0004) [2023-03-10 22:02:04,269][1096443] Updated weights for policy 0, policy_version 173520 (0.0005) [2023-03-10 22:02:04,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11810.2, 300 sec: 11788.2). Total num frames: 88846336. Throughput: 0: 11916.5. Samples: 88842168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:04,742][1096160] Avg episode reward: [(0, '4861.603')] [2023-03-10 22:02:07,655][1096443] Updated weights for policy 0, policy_version 173600 (0.0005) [2023-03-10 22:02:09,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11810.2, 300 sec: 11788.2). Total num frames: 88903680. Throughput: 0: 11953.9. Samples: 88879036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:09,742][1096160] Avg episode reward: [(0, '4865.012')] [2023-03-10 22:02:11,088][1096443] Updated weights for policy 0, policy_version 173680 (0.0005) [2023-03-10 22:02:14,593][1096443] Updated weights for policy 0, policy_version 173760 (0.0005) [2023-03-10 22:02:14,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 88965120. Throughput: 0: 11962.0. Samples: 88948736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:14,742][1096160] Avg episode reward: [(0, '4859.411')] [2023-03-10 22:02:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000173760_88965120.pth... [2023-03-10 22:02:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000173056_88604672.pth [2023-03-10 22:02:18,021][1096443] Updated weights for policy 0, policy_version 173840 (0.0005) [2023-03-10 22:02:19,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11946.7, 300 sec: 11815.9). Total num frames: 89026560. Throughput: 0: 11912.7. Samples: 89020320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:19,742][1096160] Avg episode reward: [(0, '4862.352')] [2023-03-10 22:02:21,281][1096443] Updated weights for policy 0, policy_version 173920 (0.0004) [2023-03-10 22:02:24,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11946.7, 300 sec: 11802.0). Total num frames: 89083904. Throughput: 0: 11956.3. Samples: 89058428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:24,742][1096160] Avg episode reward: [(0, '4861.762')] [2023-03-10 22:02:24,766][1096443] Updated weights for policy 0, policy_version 174000 (0.0006) [2023-03-10 22:02:28,078][1096443] Updated weights for policy 0, policy_version 174080 (0.0004) [2023-03-10 22:02:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11946.7, 300 sec: 11815.9). Total num frames: 89145344. Throughput: 0: 12007.3. Samples: 89130972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:29,742][1096160] Avg episode reward: [(0, '4861.191')] [2023-03-10 22:02:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000174112_89145344.pth... [2023-03-10 22:02:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000173408_88784896.pth [2023-03-10 22:02:31,570][1096443] Updated weights for policy 0, policy_version 174160 (0.0005) [2023-03-10 22:02:34,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11946.7, 300 sec: 11829.8). Total num frames: 89206784. Throughput: 0: 12007.9. Samples: 89202384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:34,742][1096160] Avg episode reward: [(0, '4864.237')] [2023-03-10 22:02:34,984][1096443] Updated weights for policy 0, policy_version 174240 (0.0005) [2023-03-10 22:02:38,392][1096443] Updated weights for policy 0, policy_version 174320 (0.0004) [2023-03-10 22:02:39,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.7, 300 sec: 11815.9). Total num frames: 89264128. Throughput: 0: 12031.9. Samples: 89237624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:39,742][1096160] Avg episode reward: [(0, '4865.991')] [2023-03-10 22:02:41,783][1096443] Updated weights for policy 0, policy_version 174400 (0.0004) [2023-03-10 22:02:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11815.9). Total num frames: 89325568. Throughput: 0: 12053.8. Samples: 89309248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:44,742][1096160] Avg episode reward: [(0, '4866.042')] [2023-03-10 22:02:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000174464_89325568.pth... [2023-03-10 22:02:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000173760_88965120.pth [2023-03-10 22:02:45,262][1096443] Updated weights for policy 0, policy_version 174480 (0.0005) [2023-03-10 22:02:48,857][1096443] Updated weights for policy 0, policy_version 174560 (0.0004) [2023-03-10 22:02:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11815.9). Total num frames: 89382912. Throughput: 0: 11926.9. Samples: 89378880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:49,742][1096160] Avg episode reward: [(0, '4863.552')] [2023-03-10 22:02:52,253][1096443] Updated weights for policy 0, policy_version 174640 (0.0004) [2023-03-10 22:02:54,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11829.8). Total num frames: 89444352. Throughput: 0: 11924.7. Samples: 89415648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:54,742][1096160] Avg episode reward: [(0, '4860.248')] [2023-03-10 22:02:55,879][1096443] Updated weights for policy 0, policy_version 174720 (0.0005) [2023-03-10 22:02:59,337][1096443] Updated weights for policy 0, policy_version 174800 (0.0005) [2023-03-10 22:02:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11946.7, 300 sec: 11815.9). Total num frames: 89501696. Throughput: 0: 11895.8. Samples: 89484048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:02:59,742][1096160] Avg episode reward: [(0, '4861.989')] [2023-03-10 22:02:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000174808_89501696.pth... [2023-03-10 22:02:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000174112_89145344.pth [2023-03-10 22:03:02,786][1096443] Updated weights for policy 0, policy_version 174880 (0.0004) [2023-03-10 22:03:04,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 89559040. Throughput: 0: 11888.8. Samples: 89555316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:04,742][1096160] Avg episode reward: [(0, '4862.365')] [2023-03-10 22:03:06,300][1096443] Updated weights for policy 0, policy_version 174960 (0.0005) [2023-03-10 22:03:09,688][1096443] Updated weights for policy 0, policy_version 175040 (0.0005) [2023-03-10 22:03:09,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11946.6, 300 sec: 11815.9). Total num frames: 89620480. Throughput: 0: 11847.1. Samples: 89591548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:09,742][1096160] Avg episode reward: [(0, '4863.349')] [2023-03-10 22:03:13,081][1096443] Updated weights for policy 0, policy_version 175120 (0.0004) [2023-03-10 22:03:14,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11815.9). Total num frames: 89681920. Throughput: 0: 11828.1. Samples: 89663236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:14,742][1096160] Avg episode reward: [(0, '4862.414')] [2023-03-10 22:03:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000175160_89681920.pth... [2023-03-10 22:03:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000174464_89325568.pth [2023-03-10 22:03:16,336][1096443] Updated weights for policy 0, policy_version 175200 (0.0005) [2023-03-10 22:03:19,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 89739264. Throughput: 0: 11842.2. Samples: 89735284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:19,742][1096160] Avg episode reward: [(0, '4861.819')] [2023-03-10 22:03:19,869][1096443] Updated weights for policy 0, policy_version 175280 (0.0005) [2023-03-10 22:03:23,580][1096443] Updated weights for policy 0, policy_version 175360 (0.0005) [2023-03-10 22:03:24,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11878.4, 300 sec: 11788.2). Total num frames: 89796608. Throughput: 0: 11867.7. Samples: 89771672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:24,742][1096160] Avg episode reward: [(0, '4860.527')] [2023-03-10 22:03:27,182][1096443] Updated weights for policy 0, policy_version 175440 (0.0005) [2023-03-10 22:03:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 89853952. Throughput: 0: 11740.6. Samples: 89837576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:29,742][1096160] Avg episode reward: [(0, '4863.022')] [2023-03-10 22:03:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000175496_89853952.pth... [2023-03-10 22:03:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000174808_89501696.pth [2023-03-10 22:03:30,606][1096443] Updated weights for policy 0, policy_version 175520 (0.0005) [2023-03-10 22:03:34,092][1096443] Updated weights for policy 0, policy_version 175600 (0.0005) [2023-03-10 22:03:34,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11788.2). Total num frames: 89911296. Throughput: 0: 11771.5. Samples: 89908596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:34,742][1096160] Avg episode reward: [(0, '4858.993')] [2023-03-10 22:03:37,432][1096443] Updated weights for policy 0, policy_version 175680 (0.0006) [2023-03-10 22:03:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 89972736. Throughput: 0: 11768.5. Samples: 89945232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:39,742][1096160] Avg episode reward: [(0, '4858.560')] [2023-03-10 22:03:40,887][1096443] Updated weights for policy 0, policy_version 175760 (0.0005) [2023-03-10 22:03:44,279][1096443] Updated weights for policy 0, policy_version 175840 (0.0005) [2023-03-10 22:03:44,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 90034176. Throughput: 0: 11859.7. Samples: 90017736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:44,742][1096160] Avg episode reward: [(0, '4865.699')] [2023-03-10 22:03:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000175848_90034176.pth... [2023-03-10 22:03:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000175160_89681920.pth [2023-03-10 22:03:47,829][1096443] Updated weights for policy 0, policy_version 175920 (0.0005) [2023-03-10 22:03:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 90091520. Throughput: 0: 11826.1. Samples: 90087488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:03:49,742][1096160] Avg episode reward: [(0, '4861.549')] [2023-03-10 22:03:51,245][1096443] Updated weights for policy 0, policy_version 176000 (0.0005) [2023-03-10 22:03:54,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11788.2). Total num frames: 90148864. Throughput: 0: 11837.2. Samples: 90124220. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:03:54,742][1096160] Avg episode reward: [(0, '4861.422')] [2023-03-10 22:03:54,830][1096443] Updated weights for policy 0, policy_version 176080 (0.0005) [2023-03-10 22:03:58,238][1096443] Updated weights for policy 0, policy_version 176160 (0.0006) [2023-03-10 22:03:59,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.2, 300 sec: 11788.2). Total num frames: 90210304. Throughput: 0: 11793.2. Samples: 90193928. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:03:59,742][1096160] Avg episode reward: [(0, '4860.047')] [2023-03-10 22:03:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000176192_90210304.pth... [2023-03-10 22:03:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000175496_89853952.pth [2023-03-10 22:04:01,760][1096443] Updated weights for policy 0, policy_version 176240 (0.0006) [2023-03-10 22:04:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 90267648. Throughput: 0: 11742.0. Samples: 90263676. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:04,742][1096160] Avg episode reward: [(0, '4863.969')] [2023-03-10 22:04:05,194][1096443] Updated weights for policy 0, policy_version 176320 (0.0005) [2023-03-10 22:04:08,622][1096443] Updated weights for policy 0, policy_version 176400 (0.0005) [2023-03-10 22:04:09,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 90329088. Throughput: 0: 11751.4. Samples: 90300488. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:09,742][1096160] Avg episode reward: [(0, '4862.872')] [2023-03-10 22:04:12,127][1096443] Updated weights for policy 0, policy_version 176480 (0.0005) [2023-03-10 22:04:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.8, 300 sec: 11788.1). Total num frames: 90386432. Throughput: 0: 11846.7. Samples: 90370680. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:14,742][1096160] Avg episode reward: [(0, '4863.206')] [2023-03-10 22:04:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000176536_90386432.pth... [2023-03-10 22:04:14,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000175848_90034176.pth [2023-03-10 22:04:15,712][1096443] Updated weights for policy 0, policy_version 176560 (0.0005) [2023-03-10 22:04:19,287][1096443] Updated weights for policy 0, policy_version 176640 (0.0006) [2023-03-10 22:04:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.8, 300 sec: 11788.1). Total num frames: 90443776. Throughput: 0: 11796.7. Samples: 90439448. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:19,742][1096160] Avg episode reward: [(0, '4864.287')] [2023-03-10 22:04:22,858][1096443] Updated weights for policy 0, policy_version 176720 (0.0005) [2023-03-10 22:04:24,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 90501120. Throughput: 0: 11724.3. Samples: 90472824. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:24,742][1096160] Avg episode reward: [(0, '4861.562')] [2023-03-10 22:04:26,166][1096443] Updated weights for policy 0, policy_version 176800 (0.0005) [2023-03-10 22:04:29,620][1096443] Updated weights for policy 0, policy_version 176880 (0.0006) [2023-03-10 22:04:29,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 90562560. Throughput: 0: 11743.1. Samples: 90546176. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:29,742][1096160] Avg episode reward: [(0, '4861.511')] [2023-03-10 22:04:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000176880_90562560.pth... [2023-03-10 22:04:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000176192_90210304.pth [2023-03-10 22:04:33,137][1096443] Updated weights for policy 0, policy_version 176960 (0.0005) [2023-03-10 22:04:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11774.3). Total num frames: 90619904. Throughput: 0: 11767.9. Samples: 90617044. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:34,742][1096160] Avg episode reward: [(0, '4860.437')] [2023-03-10 22:04:36,409][1096443] Updated weights for policy 0, policy_version 177040 (0.0005) [2023-03-10 22:04:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 90681344. Throughput: 0: 11760.0. Samples: 90653420. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:39,742][1096160] Avg episode reward: [(0, '4863.912')] [2023-03-10 22:04:39,950][1096443] Updated weights for policy 0, policy_version 177120 (0.0004) [2023-03-10 22:04:43,538][1096443] Updated weights for policy 0, policy_version 177200 (0.0005) [2023-03-10 22:04:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 90738688. Throughput: 0: 11743.3. Samples: 90722376. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:44,742][1096160] Avg episode reward: [(0, '4862.123')] [2023-03-10 22:04:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000177224_90738688.pth... [2023-03-10 22:04:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000176536_90386432.pth [2023-03-10 22:04:47,025][1096443] Updated weights for policy 0, policy_version 177280 (0.0005) [2023-03-10 22:04:49,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 90796032. Throughput: 0: 11740.5. Samples: 90792000. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:49,742][1096160] Avg episode reward: [(0, '4863.789')] [2023-03-10 22:04:50,687][1096443] Updated weights for policy 0, policy_version 177360 (0.0005) [2023-03-10 22:04:54,152][1096443] Updated weights for policy 0, policy_version 177440 (0.0005) [2023-03-10 22:04:54,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 90853376. Throughput: 0: 11667.4. Samples: 90825520. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:04:54,742][1096160] Avg episode reward: [(0, '4862.621')] [2023-03-10 22:04:57,604][1096443] Updated weights for policy 0, policy_version 177520 (0.0005) [2023-03-10 22:04:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 90914816. Throughput: 0: 11721.6. Samples: 90898152. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:04:59,742][1096160] Avg episode reward: [(0, '4864.580')] [2023-03-10 22:04:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000177568_90914816.pth... [2023-03-10 22:04:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000176880_90562560.pth [2023-03-10 22:05:01,110][1096443] Updated weights for policy 0, policy_version 177600 (0.0005) [2023-03-10 22:05:04,575][1096443] Updated weights for policy 0, policy_version 177680 (0.0005) [2023-03-10 22:05:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 90972160. Throughput: 0: 11746.1. Samples: 90968020. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:04,742][1096160] Avg episode reward: [(0, '4862.140')] [2023-03-10 22:05:08,077][1096443] Updated weights for policy 0, policy_version 177760 (0.0005) [2023-03-10 22:05:09,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11788.1). Total num frames: 91029504. Throughput: 0: 11797.3. Samples: 91003704. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:09,742][1096160] Avg episode reward: [(0, '4859.908')] [2023-03-10 22:05:11,509][1096443] Updated weights for policy 0, policy_version 177840 (0.0005) [2023-03-10 22:05:14,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11741.8, 300 sec: 11802.0). Total num frames: 91090944. Throughput: 0: 11759.5. Samples: 91075356. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:14,742][1096160] Avg episode reward: [(0, '4862.163')] [2023-03-10 22:05:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000177912_91090944.pth... [2023-03-10 22:05:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000177224_90738688.pth [2023-03-10 22:05:14,918][1096443] Updated weights for policy 0, policy_version 177920 (0.0005) [2023-03-10 22:05:18,256][1096443] Updated weights for policy 0, policy_version 178000 (0.0005) [2023-03-10 22:05:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 91152384. Throughput: 0: 11800.8. Samples: 91148080. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:19,742][1096160] Avg episode reward: [(0, '4862.229')] [2023-03-10 22:05:21,624][1096443] Updated weights for policy 0, policy_version 178080 (0.0004) [2023-03-10 22:05:24,742][1096160] Fps is (10 sec: 11878.6, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 91209728. Throughput: 0: 11806.5. Samples: 91184712. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:24,742][1096160] Avg episode reward: [(0, '4865.950')] [2023-03-10 22:05:25,165][1096443] Updated weights for policy 0, policy_version 178160 (0.0005) [2023-03-10 22:05:28,712][1096443] Updated weights for policy 0, policy_version 178240 (0.0005) [2023-03-10 22:05:29,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 91267072. Throughput: 0: 11787.0. Samples: 91252792. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:29,742][1096160] Avg episode reward: [(0, '4860.001')] [2023-03-10 22:05:29,771][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000178264_91271168.pth... [2023-03-10 22:05:29,773][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000177568_90914816.pth [2023-03-10 22:05:32,105][1096443] Updated weights for policy 0, policy_version 178320 (0.0005) [2023-03-10 22:05:34,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 91328512. Throughput: 0: 11832.9. Samples: 91324480. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:34,742][1096160] Avg episode reward: [(0, '4862.831')] [2023-03-10 22:05:35,615][1096443] Updated weights for policy 0, policy_version 178400 (0.0005) [2023-03-10 22:05:39,067][1096443] Updated weights for policy 0, policy_version 178480 (0.0005) [2023-03-10 22:05:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 91385856. Throughput: 0: 11896.3. Samples: 91360852. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:39,742][1096160] Avg episode reward: [(0, '4859.099')] [2023-03-10 22:05:42,565][1096443] Updated weights for policy 0, policy_version 178560 (0.0005) [2023-03-10 22:05:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11829.8). Total num frames: 91447296. Throughput: 0: 11836.3. Samples: 91430788. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:44,742][1096160] Avg episode reward: [(0, '4859.660')] [2023-03-10 22:05:44,754][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000178608_91447296.pth... [2023-03-10 22:05:44,757][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000177912_91090944.pth [2023-03-10 22:05:46,095][1096443] Updated weights for policy 0, policy_version 178640 (0.0005) [2023-03-10 22:05:49,580][1096443] Updated weights for policy 0, policy_version 178720 (0.0005) [2023-03-10 22:05:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 91504640. Throughput: 0: 11835.3. Samples: 91500608. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:49,742][1096160] Avg episode reward: [(0, '4861.759')] [2023-03-10 22:05:53,147][1096443] Updated weights for policy 0, policy_version 178800 (0.0005) [2023-03-10 22:05:54,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 91561984. Throughput: 0: 11803.6. Samples: 91534868. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:54,742][1096160] Avg episode reward: [(0, '4863.247')] [2023-03-10 22:05:56,726][1096443] Updated weights for policy 0, policy_version 178880 (0.0005) [2023-03-10 22:05:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 91623424. Throughput: 0: 11815.4. Samples: 91607048. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-03-10 22:05:59,742][1096160] Avg episode reward: [(0, '4859.403')] [2023-03-10 22:05:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000178952_91623424.pth... [2023-03-10 22:05:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000178264_91271168.pth [2023-03-10 22:05:59,948][1096443] Updated weights for policy 0, policy_version 178960 (0.0005) [2023-03-10 22:06:03,407][1096443] Updated weights for policy 0, policy_version 179040 (0.0005) [2023-03-10 22:06:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 91680768. Throughput: 0: 11793.7. Samples: 91678796. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:04,742][1096160] Avg episode reward: [(0, '4862.698')] [2023-03-10 22:06:06,928][1096443] Updated weights for policy 0, policy_version 179120 (0.0005) [2023-03-10 22:06:09,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 91738112. Throughput: 0: 11745.6. Samples: 91713264. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:09,742][1096160] Avg episode reward: [(0, '4860.117')] [2023-03-10 22:06:10,516][1096443] Updated weights for policy 0, policy_version 179200 (0.0005) [2023-03-10 22:06:13,866][1096443] Updated weights for policy 0, policy_version 179280 (0.0005) [2023-03-10 22:06:14,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11829.8). Total num frames: 91799552. Throughput: 0: 11792.9. Samples: 91783472. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:14,742][1096160] Avg episode reward: [(0, '4858.701')] [2023-03-10 22:06:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000179296_91799552.pth... [2023-03-10 22:06:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000178608_91447296.pth [2023-03-10 22:06:17,486][1096443] Updated weights for policy 0, policy_version 179360 (0.0005) [2023-03-10 22:06:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11829.8). Total num frames: 91856896. Throughput: 0: 11740.4. Samples: 91852800. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:19,742][1096160] Avg episode reward: [(0, '4861.804')] [2023-03-10 22:06:20,949][1096443] Updated weights for policy 0, policy_version 179440 (0.0005) [2023-03-10 22:06:24,470][1096443] Updated weights for policy 0, policy_version 179520 (0.0005) [2023-03-10 22:06:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 91914240. Throughput: 0: 11731.6. Samples: 91888776. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:24,742][1096160] Avg episode reward: [(0, '4863.619')] [2023-03-10 22:06:27,965][1096443] Updated weights for policy 0, policy_version 179600 (0.0005) [2023-03-10 22:06:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 91975680. Throughput: 0: 11735.1. Samples: 91958868. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:29,742][1096160] Avg episode reward: [(0, '4860.435')] [2023-03-10 22:06:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000179640_91975680.pth... [2023-03-10 22:06:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000178952_91623424.pth [2023-03-10 22:06:31,348][1096443] Updated weights for policy 0, policy_version 179680 (0.0005) [2023-03-10 22:06:34,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11741.9, 300 sec: 11815.9). Total num frames: 92033024. Throughput: 0: 11746.5. Samples: 92029200. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:34,742][1096160] Avg episode reward: [(0, '4861.882')] [2023-03-10 22:06:34,841][1096443] Updated weights for policy 0, policy_version 179760 (0.0005) [2023-03-10 22:06:38,252][1096443] Updated weights for policy 0, policy_version 179840 (0.0005) [2023-03-10 22:06:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11829.8). Total num frames: 92094464. Throughput: 0: 11797.4. Samples: 92065752. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:39,742][1096160] Avg episode reward: [(0, '4861.553')] [2023-03-10 22:06:41,952][1096443] Updated weights for policy 0, policy_version 179920 (0.0005) [2023-03-10 22:06:44,742][1096160] Fps is (10 sec: 11468.6, 60 sec: 11673.6, 300 sec: 11802.0). Total num frames: 92147712. Throughput: 0: 11684.9. Samples: 92132868. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:44,742][1096160] Avg episode reward: [(0, '4864.123')] [2023-03-10 22:06:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000179976_92147712.pth... [2023-03-10 22:06:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000179296_91799552.pth [2023-03-10 22:06:45,421][1096443] Updated weights for policy 0, policy_version 180000 (0.0005) [2023-03-10 22:06:48,632][1096443] Updated weights for policy 0, policy_version 180080 (0.0005) [2023-03-10 22:06:49,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11829.8). Total num frames: 92213248. Throughput: 0: 11759.3. Samples: 92207964. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:49,742][1096160] Avg episode reward: [(0, '4863.929')] [2023-03-10 22:06:52,041][1096443] Updated weights for policy 0, policy_version 180160 (0.0005) [2023-03-10 22:06:54,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 92270592. Throughput: 0: 11798.1. Samples: 92244180. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:54,742][1096160] Avg episode reward: [(0, '4860.523')] [2023-03-10 22:06:55,613][1096443] Updated weights for policy 0, policy_version 180240 (0.0004) [2023-03-10 22:06:59,244][1096443] Updated weights for policy 0, policy_version 180320 (0.0006) [2023-03-10 22:06:59,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 11802.0). Total num frames: 92327936. Throughput: 0: 11744.5. Samples: 92311976. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:06:59,742][1096160] Avg episode reward: [(0, '4860.391')] [2023-03-10 22:06:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000180328_92327936.pth... [2023-03-10 22:06:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000179640_91975680.pth [2023-03-10 22:07:02,513][1096443] Updated weights for policy 0, policy_version 180400 (0.0004) [2023-03-10 22:07:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11815.9). Total num frames: 92389376. Throughput: 0: 11826.6. Samples: 92384996. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) [2023-03-10 22:07:04,742][1096160] Avg episode reward: [(0, '4860.861')] [2023-03-10 22:07:05,998][1096443] Updated weights for policy 0, policy_version 180480 (0.0005) [2023-03-10 22:07:09,342][1096443] Updated weights for policy 0, policy_version 180560 (0.0005) [2023-03-10 22:07:09,742][1096160] Fps is (10 sec: 12288.1, 60 sec: 11878.4, 300 sec: 11815.9). Total num frames: 92450816. Throughput: 0: 11798.5. Samples: 92419708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:09,742][1096160] Avg episode reward: [(0, '4864.426')] [2023-03-10 22:07:12,683][1096443] Updated weights for policy 0, policy_version 180640 (0.0004) [2023-03-10 22:07:14,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 92508160. Throughput: 0: 11885.4. Samples: 92493712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:14,742][1096160] Avg episode reward: [(0, '4859.670')] [2023-03-10 22:07:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000180680_92508160.pth... [2023-03-10 22:07:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000179976_92147712.pth [2023-03-10 22:07:16,224][1096443] Updated weights for policy 0, policy_version 180720 (0.0005) [2023-03-10 22:07:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.1, 300 sec: 11802.0). Total num frames: 92565504. Throughput: 0: 11828.2. Samples: 92561472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:19,742][1096160] Avg episode reward: [(0, '4861.873')] [2023-03-10 22:07:19,910][1096443] Updated weights for policy 0, policy_version 180800 (0.0005) [2023-03-10 22:07:23,448][1096443] Updated weights for policy 0, policy_version 180880 (0.0004) [2023-03-10 22:07:24,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 92622848. Throughput: 0: 11817.2. Samples: 92597524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:24,742][1096160] Avg episode reward: [(0, '4861.540')] [2023-03-10 22:07:26,938][1096443] Updated weights for policy 0, policy_version 180960 (0.0005) [2023-03-10 22:07:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 92684288. Throughput: 0: 11888.7. Samples: 92667860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:29,742][1096160] Avg episode reward: [(0, '4862.708')] [2023-03-10 22:07:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000181024_92684288.pth... [2023-03-10 22:07:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000180328_92327936.pth [2023-03-10 22:07:30,369][1096443] Updated weights for policy 0, policy_version 181040 (0.0005) [2023-03-10 22:07:33,962][1096443] Updated weights for policy 0, policy_version 181120 (0.0005) [2023-03-10 22:07:34,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 92741632. Throughput: 0: 11755.9. Samples: 92736980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:34,742][1096160] Avg episode reward: [(0, '4857.703')] [2023-03-10 22:07:37,368][1096443] Updated weights for policy 0, policy_version 181200 (0.0005) [2023-03-10 22:07:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 92803072. Throughput: 0: 11751.7. Samples: 92773008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:39,742][1096160] Avg episode reward: [(0, '4860.181')] [2023-03-10 22:07:40,797][1096443] Updated weights for policy 0, policy_version 181280 (0.0005) [2023-03-10 22:07:44,224][1096443] Updated weights for policy 0, policy_version 181360 (0.0005) [2023-03-10 22:07:44,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11878.4, 300 sec: 11788.2). Total num frames: 92860416. Throughput: 0: 11820.1. Samples: 92843880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:44,742][1096160] Avg episode reward: [(0, '4862.061')] [2023-03-10 22:07:44,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000181368_92860416.pth... [2023-03-10 22:07:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000180680_92508160.pth [2023-03-10 22:07:47,589][1096443] Updated weights for policy 0, policy_version 181440 (0.0005) [2023-03-10 22:07:49,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 92921856. Throughput: 0: 11810.1. Samples: 92916448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:49,742][1096160] Avg episode reward: [(0, '4860.437')] [2023-03-10 22:07:51,041][1096443] Updated weights for policy 0, policy_version 181520 (0.0004) [2023-03-10 22:07:54,637][1096443] Updated weights for policy 0, policy_version 181600 (0.0005) [2023-03-10 22:07:54,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 92979200. Throughput: 0: 11832.1. Samples: 92952152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:54,742][1096160] Avg episode reward: [(0, '4857.927')] [2023-03-10 22:07:58,152][1096443] Updated weights for policy 0, policy_version 181680 (0.0005) [2023-03-10 22:07:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11810.2, 300 sec: 11788.2). Total num frames: 93036544. Throughput: 0: 11709.0. Samples: 93020616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:07:59,742][1096160] Avg episode reward: [(0, '4862.968')] [2023-03-10 22:07:59,789][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000181720_93040640.pth... [2023-03-10 22:07:59,791][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000181024_92684288.pth [2023-03-10 22:08:01,514][1096443] Updated weights for policy 0, policy_version 181760 (0.0005) [2023-03-10 22:08:04,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 93097984. Throughput: 0: 11819.9. Samples: 93093368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:04,742][1096160] Avg episode reward: [(0, '4859.632')] [2023-03-10 22:08:05,060][1096443] Updated weights for policy 0, policy_version 181840 (0.0005) [2023-03-10 22:08:08,666][1096443] Updated weights for policy 0, policy_version 181920 (0.0005) [2023-03-10 22:08:09,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 93155328. Throughput: 0: 11760.1. Samples: 93126728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:09,742][1096160] Avg episode reward: [(0, '4858.178')] [2023-03-10 22:08:12,112][1096443] Updated weights for policy 0, policy_version 182000 (0.0005) [2023-03-10 22:08:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.8, 300 sec: 11774.3). Total num frames: 93212672. Throughput: 0: 11763.4. Samples: 93197212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:14,742][1096160] Avg episode reward: [(0, '4863.276')] [2023-03-10 22:08:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000182056_93212672.pth... [2023-03-10 22:08:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000181368_92860416.pth [2023-03-10 22:08:15,650][1096443] Updated weights for policy 0, policy_version 182080 (0.0005) [2023-03-10 22:08:19,160][1096443] Updated weights for policy 0, policy_version 182160 (0.0005) [2023-03-10 22:08:19,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 93270016. Throughput: 0: 11761.2. Samples: 93266236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:19,742][1096160] Avg episode reward: [(0, '4858.852')] [2023-03-10 22:08:22,610][1096443] Updated weights for policy 0, policy_version 182240 (0.0005) [2023-03-10 22:08:24,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 93331456. Throughput: 0: 11772.8. Samples: 93302784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:24,742][1096160] Avg episode reward: [(0, '4863.715')] [2023-03-10 22:08:26,127][1096443] Updated weights for policy 0, policy_version 182320 (0.0005) [2023-03-10 22:08:29,620][1096443] Updated weights for policy 0, policy_version 182400 (0.0005) [2023-03-10 22:08:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11741.9, 300 sec: 11788.1). Total num frames: 93388800. Throughput: 0: 11746.6. Samples: 93372480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:29,742][1096160] Avg episode reward: [(0, '4859.929')] [2023-03-10 22:08:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000182400_93388800.pth... [2023-03-10 22:08:29,750][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000181720_93040640.pth [2023-03-10 22:08:32,925][1096443] Updated weights for policy 0, policy_version 182480 (0.0005) [2023-03-10 22:08:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 93450240. Throughput: 0: 11767.9. Samples: 93446004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:34,742][1096160] Avg episode reward: [(0, '4861.759')] [2023-03-10 22:08:36,276][1096443] Updated weights for policy 0, policy_version 182560 (0.0006) [2023-03-10 22:08:39,540][1096443] Updated weights for policy 0, policy_version 182640 (0.0005) [2023-03-10 22:08:39,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 93511680. Throughput: 0: 11804.7. Samples: 93483364. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:39,742][1096160] Avg episode reward: [(0, '4857.849')] [2023-03-10 22:08:42,923][1096443] Updated weights for policy 0, policy_version 182720 (0.0004) [2023-03-10 22:08:44,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 93573120. Throughput: 0: 11913.8. Samples: 93556736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:44,742][1096160] Avg episode reward: [(0, '4858.853')] [2023-03-10 22:08:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000182760_93573120.pth... [2023-03-10 22:08:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000182056_93212672.pth [2023-03-10 22:08:46,263][1096443] Updated weights for policy 0, policy_version 182800 (0.0005) [2023-03-10 22:08:49,447][1096443] Updated weights for policy 0, policy_version 182880 (0.0005) [2023-03-10 22:08:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11878.4, 300 sec: 11815.9). Total num frames: 93634560. Throughput: 0: 11967.1. Samples: 93631888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:49,742][1096160] Avg episode reward: [(0, '4865.234')] [2023-03-10 22:08:52,881][1096443] Updated weights for policy 0, policy_version 182960 (0.0005) [2023-03-10 22:08:54,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11946.7, 300 sec: 11815.9). Total num frames: 93696000. Throughput: 0: 12009.4. Samples: 93667148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:54,742][1096160] Avg episode reward: [(0, '4862.541')] [2023-03-10 22:08:56,207][1096443] Updated weights for policy 0, policy_version 183040 (0.0005) [2023-03-10 22:08:59,729][1096443] Updated weights for policy 0, policy_version 183120 (0.0005) [2023-03-10 22:08:59,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 12014.9, 300 sec: 11829.8). Total num frames: 93757440. Throughput: 0: 12087.0. Samples: 93741128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:08:59,742][1096160] Avg episode reward: [(0, '4859.635')] [2023-03-10 22:08:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000183120_93757440.pth... [2023-03-10 22:08:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000182400_93388800.pth [2023-03-10 22:09:03,555][1096443] Updated weights for policy 0, policy_version 183200 (0.0005) [2023-03-10 22:09:04,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11878.4, 300 sec: 11802.0). Total num frames: 93810688. Throughput: 0: 12007.9. Samples: 93806592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:09:04,742][1096160] Avg episode reward: [(0, '4861.335')] [2023-03-10 22:09:07,194][1096443] Updated weights for policy 0, policy_version 183280 (0.0005) [2023-03-10 22:09:09,742][1096160] Fps is (10 sec: 10649.6, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 93863936. Throughput: 0: 11925.5. Samples: 93839432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:09:09,742][1096160] Avg episode reward: [(0, '4859.923')] [2023-03-10 22:09:10,896][1096443] Updated weights for policy 0, policy_version 183360 (0.0005) [2023-03-10 22:09:14,726][1096443] Updated weights for policy 0, policy_version 183440 (0.0005) [2023-03-10 22:09:14,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11810.1, 300 sec: 11788.2). Total num frames: 93921280. Throughput: 0: 11844.9. Samples: 93905500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:09:14,742][1096160] Avg episode reward: [(0, '4858.668')] [2023-03-10 22:09:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000183440_93921280.pth... [2023-03-10 22:09:14,746][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000182760_93573120.pth [2023-03-10 22:09:18,218][1096443] Updated weights for policy 0, policy_version 183520 (0.0005) [2023-03-10 22:09:19,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11810.1, 300 sec: 11788.1). Total num frames: 93978624. Throughput: 0: 11729.9. Samples: 93973848. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:09:19,742][1096160] Avg episode reward: [(0, '4859.931')] [2023-03-10 22:09:21,773][1096443] Updated weights for policy 0, policy_version 183600 (0.0006) [2023-03-10 22:09:24,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 94035968. Throughput: 0: 11675.1. Samples: 94008744. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:09:24,742][1096160] Avg episode reward: [(0, '4862.490')] [2023-03-10 22:09:25,295][1096443] Updated weights for policy 0, policy_version 183680 (0.0005) [2023-03-10 22:09:28,780][1096443] Updated weights for policy 0, policy_version 183760 (0.0004) [2023-03-10 22:09:29,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11741.9, 300 sec: 11774.3). Total num frames: 94093312. Throughput: 0: 11596.3. Samples: 94078568. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:09:29,742][1096160] Avg episode reward: [(0, '4861.365')] [2023-03-10 22:09:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000183776_94093312.pth... [2023-03-10 22:09:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000183120_93757440.pth [2023-03-10 22:09:32,376][1096443] Updated weights for policy 0, policy_version 183840 (0.0005) [2023-03-10 22:09:34,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11760.4). Total num frames: 94150656. Throughput: 0: 11438.6. Samples: 94146624. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:09:34,742][1096160] Avg episode reward: [(0, '4861.064')] [2023-03-10 22:09:36,026][1096443] Updated weights for policy 0, policy_version 183920 (0.0005) [2023-03-10 22:09:39,647][1096443] Updated weights for policy 0, policy_version 184000 (0.0005) [2023-03-10 22:09:39,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11760.4). Total num frames: 94208000. Throughput: 0: 11409.6. Samples: 94180580. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:09:39,742][1096160] Avg episode reward: [(0, '4856.500')] [2023-03-10 22:09:43,163][1096443] Updated weights for policy 0, policy_version 184080 (0.0005) [2023-03-10 22:09:44,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11760.4). Total num frames: 94265344. Throughput: 0: 11290.9. Samples: 94249220. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:09:44,742][1096160] Avg episode reward: [(0, '4861.634')] [2023-03-10 22:09:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000184112_94265344.pth... [2023-03-10 22:09:44,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000183440_93921280.pth [2023-03-10 22:09:46,824][1096443] Updated weights for policy 0, policy_version 184160 (0.0006) [2023-03-10 22:09:49,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11746.5). Total num frames: 94318592. Throughput: 0: 11315.2. Samples: 94315776. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:09:49,742][1096160] Avg episode reward: [(0, '4861.081')] [2023-03-10 22:09:50,511][1096443] Updated weights for policy 0, policy_version 184240 (0.0005) [2023-03-10 22:09:53,997][1096443] Updated weights for policy 0, policy_version 184320 (0.0004) [2023-03-10 22:09:54,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11746.5). Total num frames: 94380032. Throughput: 0: 11370.0. Samples: 94351080. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:09:54,742][1096160] Avg episode reward: [(0, '4862.567')] [2023-03-10 22:09:57,367][1096443] Updated weights for policy 0, policy_version 184400 (0.0005) [2023-03-10 22:09:59,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11264.0, 300 sec: 11732.6). Total num frames: 94433280. Throughput: 0: 11384.7. Samples: 94417812. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:09:59,742][1096160] Avg episode reward: [(0, '4860.208')] [2023-03-10 22:09:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000184440_94433280.pth... [2023-03-10 22:09:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000183776_94093312.pth [2023-03-10 22:10:01,376][1096443] Updated weights for policy 0, policy_version 184480 (0.0005) [2023-03-10 22:10:04,741][1096160] Fps is (10 sec: 11059.3, 60 sec: 11332.3, 300 sec: 11732.6). Total num frames: 94490624. Throughput: 0: 11394.5. Samples: 94486600. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:10:04,773][1096160] Avg episode reward: [(0, '4863.085')] [2023-03-10 22:10:04,881][1096443] Updated weights for policy 0, policy_version 184560 (0.0005) [2023-03-10 22:10:08,418][1096443] Updated weights for policy 0, policy_version 184640 (0.0006) [2023-03-10 22:10:09,741][1096160] Fps is (10 sec: 11468.8, 60 sec: 11400.5, 300 sec: 11718.7). Total num frames: 94547968. Throughput: 0: 11431.0. Samples: 94523140. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:10:09,742][1096160] Avg episode reward: [(0, '4862.064')] [2023-03-10 22:10:12,168][1096443] Updated weights for policy 0, policy_version 184720 (0.0006) [2023-03-10 22:10:14,742][1096160] Fps is (10 sec: 11059.0, 60 sec: 11332.3, 300 sec: 11691.0). Total num frames: 94601216. Throughput: 0: 11341.3. Samples: 94588928. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:10:14,742][1096160] Avg episode reward: [(0, '4861.862')] [2023-03-10 22:10:14,806][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000184776_94605312.pth... [2023-03-10 22:10:14,808][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000184112_94265344.pth [2023-03-10 22:10:15,922][1096443] Updated weights for policy 0, policy_version 184800 (0.0006) [2023-03-10 22:10:19,576][1096443] Updated weights for policy 0, policy_version 184880 (0.0005) [2023-03-10 22:10:19,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11691.0). Total num frames: 94658560. Throughput: 0: 11286.8. Samples: 94654528. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-03-10 22:10:19,742][1096160] Avg episode reward: [(0, '4860.856')] [2023-03-10 22:10:23,088][1096443] Updated weights for policy 0, policy_version 184960 (0.0006) [2023-03-10 22:10:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11691.0). Total num frames: 94715904. Throughput: 0: 11285.0. Samples: 94688404. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:10:24,890][1096160] Avg episode reward: [(0, '4862.648')] [2023-03-10 22:10:26,547][1096443] Updated weights for policy 0, policy_version 185040 (0.0004) [2023-03-10 22:10:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11691.0). Total num frames: 94777344. Throughput: 0: 11372.2. Samples: 94760968. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:10:29,742][1096160] Avg episode reward: [(0, '4862.789')] [2023-03-10 22:10:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000185112_94777344.pth... [2023-03-10 22:10:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000184440_94433280.pth [2023-03-10 22:10:30,074][1096443] Updated weights for policy 0, policy_version 185120 (0.0005) [2023-03-10 22:10:33,626][1096443] Updated weights for policy 0, policy_version 185200 (0.0005) [2023-03-10 22:10:34,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11400.5, 300 sec: 11691.0). Total num frames: 94834688. Throughput: 0: 11440.4. Samples: 94830592. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:10:34,742][1096160] Avg episode reward: [(0, '4862.604')] [2023-03-10 22:10:37,091][1096443] Updated weights for policy 0, policy_version 185280 (0.0005) [2023-03-10 22:10:39,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11400.6, 300 sec: 11677.1). Total num frames: 94892032. Throughput: 0: 11410.8. Samples: 94864564. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:10:40,116][1096160] Avg episode reward: [(0, '4863.377')] [2023-03-10 22:10:40,641][1096443] Updated weights for policy 0, policy_version 185360 (0.0005) [2023-03-10 22:10:44,192][1096443] Updated weights for policy 0, policy_version 185440 (0.0005) [2023-03-10 22:10:44,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11400.5, 300 sec: 11677.1). Total num frames: 94949376. Throughput: 0: 11449.8. Samples: 94933056. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:10:44,742][1096160] Avg episode reward: [(0, '4864.150')] [2023-03-10 22:10:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000185448_94949376.pth... [2023-03-10 22:10:44,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000184776_94605312.pth [2023-03-10 22:10:47,604][1096443] Updated weights for policy 0, policy_version 185520 (0.0005) [2023-03-10 22:10:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11537.1, 300 sec: 11691.0). Total num frames: 95010816. Throughput: 0: 11536.9. Samples: 95005760. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:10:49,786][1096160] Avg episode reward: [(0, '4864.413')] [2023-03-10 22:10:50,745][1096443] Updated weights for policy 0, policy_version 185600 (0.0005) [2023-03-10 22:10:54,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11400.5, 300 sec: 11663.2). Total num frames: 95064064. Throughput: 0: 11454.0. Samples: 95038572. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:10:54,742][1096160] Avg episode reward: [(0, '4862.498')] [2023-03-10 22:10:54,967][1096443] Updated weights for policy 0, policy_version 185680 (0.0005) [2023-03-10 22:10:58,666][1096443] Updated weights for policy 0, policy_version 185760 (0.0005) [2023-03-10 22:10:59,742][1096160] Fps is (10 sec: 11059.1, 60 sec: 11468.8, 300 sec: 11663.2). Total num frames: 95121408. Throughput: 0: 11468.5. Samples: 95105012. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:10:59,742][1096160] Avg episode reward: [(0, '4862.345')] [2023-03-10 22:10:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000185784_95121408.pth... [2023-03-10 22:10:59,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000185112_94777344.pth [2023-03-10 22:11:02,075][1096443] Updated weights for policy 0, policy_version 185840 (0.0005) [2023-03-10 22:11:04,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11663.2). Total num frames: 95178752. Throughput: 0: 11559.8. Samples: 95174720. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:11:04,742][1096160] Avg episode reward: [(0, '4862.267')] [2023-03-10 22:11:05,512][1096443] Updated weights for policy 0, policy_version 185920 (0.0005) [2023-03-10 22:11:09,097][1096443] Updated weights for policy 0, policy_version 186000 (0.0005) [2023-03-10 22:11:09,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11649.3). Total num frames: 95236096. Throughput: 0: 11612.3. Samples: 95210956. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:11:09,742][1096160] Avg episode reward: [(0, '4859.594')] [2023-03-10 22:11:12,746][1096443] Updated weights for policy 0, policy_version 186080 (0.0005) [2023-03-10 22:11:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11649.3). Total num frames: 95293440. Throughput: 0: 11499.0. Samples: 95278424. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:11:14,742][1096160] Avg episode reward: [(0, '4861.382')] [2023-03-10 22:11:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000186120_95293440.pth... [2023-03-10 22:11:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000185448_94949376.pth [2023-03-10 22:11:16,202][1096443] Updated weights for policy 0, policy_version 186160 (0.0006) [2023-03-10 22:11:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11537.1, 300 sec: 11649.3). Total num frames: 95350784. Throughput: 0: 11525.3. Samples: 95349232. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:11:19,742][1096160] Avg episode reward: [(0, '4863.018')] [2023-03-10 22:11:19,777][1096443] Updated weights for policy 0, policy_version 186240 (0.0005) [2023-03-10 22:11:23,315][1096443] Updated weights for policy 0, policy_version 186320 (0.0005) [2023-03-10 22:11:24,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11537.1, 300 sec: 11635.4). Total num frames: 95408128. Throughput: 0: 11533.2. Samples: 95383560. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:11:24,742][1096160] Avg episode reward: [(0, '4867.286')] [2023-03-10 22:11:24,745][1096399] Saving new best policy, reward=4867.286! [2023-03-10 22:11:26,856][1096443] Updated weights for policy 0, policy_version 186400 (0.0005) [2023-03-10 22:11:29,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11635.4). Total num frames: 95465472. Throughput: 0: 11465.9. Samples: 95449020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:11:29,742][1096160] Avg episode reward: [(0, '4861.315')] [2023-03-10 22:11:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000186456_95465472.pth... [2023-03-10 22:11:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000185784_95121408.pth [2023-03-10 22:11:30,590][1096443] Updated weights for policy 0, policy_version 186480 (0.0004) [2023-03-10 22:11:33,848][1096443] Updated weights for policy 0, policy_version 186560 (0.0005) [2023-03-10 22:11:34,742][1096160] Fps is (10 sec: 11059.1, 60 sec: 11400.5, 300 sec: 11607.6). Total num frames: 95518720. Throughput: 0: 11400.7. Samples: 95518792. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:11:34,742][1096160] Avg episode reward: [(0, '4862.188')] [2023-03-10 22:11:38,026][1096443] Updated weights for policy 0, policy_version 186640 (0.0005) [2023-03-10 22:11:39,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11635.4). Total num frames: 95580160. Throughput: 0: 11384.8. Samples: 95550888. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:11:39,742][1096160] Avg episode reward: [(0, '4868.005')] [2023-03-10 22:11:39,743][1096399] Saving new best policy, reward=4868.005! [2023-03-10 22:11:41,483][1096443] Updated weights for policy 0, policy_version 186720 (0.0005) [2023-03-10 22:11:44,741][1096160] Fps is (10 sec: 11878.5, 60 sec: 11468.8, 300 sec: 11607.7). Total num frames: 95637504. Throughput: 0: 11462.1. Samples: 95620804. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:11:44,742][1096160] Avg episode reward: [(0, '4864.463')] [2023-03-10 22:11:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000186792_95637504.pth... [2023-03-10 22:11:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000186120_95293440.pth [2023-03-10 22:11:45,040][1096443] Updated weights for policy 0, policy_version 186800 (0.0005) [2023-03-10 22:11:48,705][1096443] Updated weights for policy 0, policy_version 186880 (0.0005) [2023-03-10 22:11:49,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11332.3, 300 sec: 11593.8). Total num frames: 95690752. Throughput: 0: 11419.0. Samples: 95688576. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:11:49,742][1096160] Avg episode reward: [(0, '4861.763')] [2023-03-10 22:11:52,011][1096443] Updated weights for policy 0, policy_version 186960 (0.0005) [2023-03-10 22:11:54,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11607.7). Total num frames: 95752192. Throughput: 0: 11458.7. Samples: 95726596. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:11:54,743][1096160] Avg episode reward: [(0, '4864.901')] [2023-03-10 22:11:55,492][1096443] Updated weights for policy 0, policy_version 187040 (0.0006) [2023-03-10 22:11:59,035][1096443] Updated weights for policy 0, policy_version 187120 (0.0005) [2023-03-10 22:11:59,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11593.8). Total num frames: 95809536. Throughput: 0: 11529.2. Samples: 95797236. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:11:59,743][1096160] Avg episode reward: [(0, '4865.481')] [2023-03-10 22:11:59,754][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000187136_95813632.pth... [2023-03-10 22:11:59,756][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000186456_95465472.pth [2023-03-10 22:12:02,750][1096443] Updated weights for policy 0, policy_version 187200 (0.0005) [2023-03-10 22:12:04,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11468.8, 300 sec: 11579.9). Total num frames: 95866880. Throughput: 0: 11413.9. Samples: 95862856. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:12:04,742][1096160] Avg episode reward: [(0, '4866.036')] [2023-03-10 22:12:06,474][1096443] Updated weights for policy 0, policy_version 187280 (0.0006) [2023-03-10 22:12:09,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11579.9). Total num frames: 95924224. Throughput: 0: 11384.6. Samples: 95895868. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:12:09,742][1096160] Avg episode reward: [(0, '4862.100')] [2023-03-10 22:12:09,992][1096443] Updated weights for policy 0, policy_version 187360 (0.0005) [2023-03-10 22:12:13,587][1096443] Updated weights for policy 0, policy_version 187440 (0.0005) [2023-03-10 22:12:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11579.9). Total num frames: 95981568. Throughput: 0: 11470.5. Samples: 95965192. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:12:14,742][1096160] Avg episode reward: [(0, '4863.776')] [2023-03-10 22:12:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000187464_95981568.pth... [2023-03-10 22:12:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000186792_95637504.pth [2023-03-10 22:12:17,252][1096443] Updated weights for policy 0, policy_version 187520 (0.0005) [2023-03-10 22:12:19,742][1096160] Fps is (10 sec: 11059.3, 60 sec: 11400.5, 300 sec: 11566.0). Total num frames: 96034816. Throughput: 0: 11425.9. Samples: 96032956. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:12:19,742][1096160] Avg episode reward: [(0, '4860.735')] [2023-03-10 22:12:20,904][1096443] Updated weights for policy 0, policy_version 187600 (0.0005) [2023-03-10 22:12:24,653][1096443] Updated weights for policy 0, policy_version 187680 (0.0005) [2023-03-10 22:12:24,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11400.5, 300 sec: 11552.1). Total num frames: 96092160. Throughput: 0: 11432.6. Samples: 96065356. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:12:24,742][1096160] Avg episode reward: [(0, '4863.945')] [2023-03-10 22:12:28,330][1096443] Updated weights for policy 0, policy_version 187760 (0.0005) [2023-03-10 22:12:29,742][1096160] Fps is (10 sec: 11059.1, 60 sec: 11332.3, 300 sec: 11538.2). Total num frames: 96145408. Throughput: 0: 11369.4. Samples: 96132428. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:12:29,742][1096160] Avg episode reward: [(0, '4859.137')] [2023-03-10 22:12:29,762][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000187792_96149504.pth... [2023-03-10 22:12:29,763][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000187136_95813632.pth [2023-03-10 22:12:32,054][1096443] Updated weights for policy 0, policy_version 187840 (0.0005) [2023-03-10 22:12:34,742][1096160] Fps is (10 sec: 11059.3, 60 sec: 11400.5, 300 sec: 11524.3). Total num frames: 96202752. Throughput: 0: 11335.3. Samples: 96198664. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) [2023-03-10 22:12:34,742][1096160] Avg episode reward: [(0, '4865.302')] [2023-03-10 22:12:35,752][1096443] Updated weights for policy 0, policy_version 187920 (0.0005) [2023-03-10 22:12:39,194][1096443] Updated weights for policy 0, policy_version 188000 (0.0005) [2023-03-10 22:12:39,742][1096160] Fps is (10 sec: 11059.3, 60 sec: 11264.0, 300 sec: 11510.5). Total num frames: 96256000. Throughput: 0: 11226.3. Samples: 96231780. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:12:39,945][1096160] Avg episode reward: [(0, '4862.090')] [2023-03-10 22:12:43,190][1096443] Updated weights for policy 0, policy_version 188080 (0.0004) [2023-03-10 22:12:44,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11264.0, 300 sec: 11496.6). Total num frames: 96313344. Throughput: 0: 11106.6. Samples: 96297032. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:12:44,742][1096160] Avg episode reward: [(0, '4861.954')] [2023-03-10 22:12:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000188112_96313344.pth... [2023-03-10 22:12:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000187464_95981568.pth [2023-03-10 22:12:46,600][1096443] Updated weights for policy 0, policy_version 188160 (0.0005) [2023-03-10 22:12:49,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11510.5). Total num frames: 96374784. Throughput: 0: 11285.3. Samples: 96370696. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:12:49,742][1096160] Avg episode reward: [(0, '4861.261')] [2023-03-10 22:12:49,925][1096443] Updated weights for policy 0, policy_version 188240 (0.0005) [2023-03-10 22:12:53,505][1096443] Updated weights for policy 0, policy_version 188320 (0.0005) [2023-03-10 22:12:54,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11264.0, 300 sec: 11496.6). Total num frames: 96428032. Throughput: 0: 11293.5. Samples: 96404076. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:12:54,801][1096160] Avg episode reward: [(0, '4865.147')] [2023-03-10 22:12:57,139][1096443] Updated weights for policy 0, policy_version 188400 (0.0005) [2023-03-10 22:12:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11332.3, 300 sec: 11496.6). Total num frames: 96489472. Throughput: 0: 11288.7. Samples: 96473184. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:12:59,742][1096160] Avg episode reward: [(0, '4865.229')] [2023-03-10 22:12:59,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000188456_96489472.pth... [2023-03-10 22:12:59,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000187792_96149504.pth [2023-03-10 22:13:00,629][1096443] Updated weights for policy 0, policy_version 188480 (0.0005) [2023-03-10 22:13:04,061][1096443] Updated weights for policy 0, policy_version 188560 (0.0005) [2023-03-10 22:13:04,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11332.3, 300 sec: 11496.6). Total num frames: 96546816. Throughput: 0: 11375.3. Samples: 96544844. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:13:05,012][1096160] Avg episode reward: [(0, '4861.458')] [2023-03-10 22:13:07,595][1096443] Updated weights for policy 0, policy_version 188640 (0.0005) [2023-03-10 22:13:09,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11332.3, 300 sec: 11496.6). Total num frames: 96604160. Throughput: 0: 11428.8. Samples: 96579648. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:13:09,742][1096160] Avg episode reward: [(0, '4861.514')] [2023-03-10 22:13:11,130][1096443] Updated weights for policy 0, policy_version 188720 (0.0005) [2023-03-10 22:13:14,557][1096443] Updated weights for policy 0, policy_version 188800 (0.0005) [2023-03-10 22:13:14,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11400.5, 300 sec: 11510.5). Total num frames: 96665600. Throughput: 0: 11512.1. Samples: 96650472. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:13:14,742][1096160] Avg episode reward: [(0, '4865.608')] [2023-03-10 22:13:14,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000188800_96665600.pth... [2023-03-10 22:13:14,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000188112_96313344.pth [2023-03-10 22:13:18,197][1096443] Updated weights for policy 0, policy_version 188880 (0.0005) [2023-03-10 22:13:19,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11468.8, 300 sec: 11496.6). Total num frames: 96722944. Throughput: 0: 11577.3. Samples: 96719644. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:13:20,349][1096160] Avg episode reward: [(0, '4862.309')] [2023-03-10 22:13:21,487][1096443] Updated weights for policy 0, policy_version 188960 (0.0005) [2023-03-10 22:13:24,782][1096443] Updated weights for policy 0, policy_version 189040 (0.0004) [2023-03-10 22:13:24,741][1096160] Fps is (10 sec: 11878.6, 60 sec: 11537.1, 300 sec: 11510.5). Total num frames: 96784384. Throughput: 0: 11643.0. Samples: 96755712. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:13:24,989][1096160] Avg episode reward: [(0, '4861.367')] [2023-03-10 22:13:28,478][1096443] Updated weights for policy 0, policy_version 189120 (0.0005) [2023-03-10 22:13:29,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11496.6). Total num frames: 96841728. Throughput: 0: 11742.8. Samples: 96825460. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:13:29,742][1096160] Avg episode reward: [(0, '4863.506')] [2023-03-10 22:13:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000189144_96841728.pth... [2023-03-10 22:13:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000188456_96489472.pth [2023-03-10 22:13:32,167][1096443] Updated weights for policy 0, policy_version 189200 (0.0006) [2023-03-10 22:13:34,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11482.7). Total num frames: 96899072. Throughput: 0: 11650.7. Samples: 96894976. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:13:34,742][1096160] Avg episode reward: [(0, '4860.688')] [2023-03-10 22:13:35,714][1096443] Updated weights for policy 0, policy_version 189280 (0.0006) [2023-03-10 22:13:39,598][1096443] Updated weights for policy 0, policy_version 189360 (0.0005) [2023-03-10 22:13:39,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 11605.3, 300 sec: 11454.9). Total num frames: 96952320. Throughput: 0: 11641.8. Samples: 96927956. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-10 22:13:39,742][1096160] Avg episode reward: [(0, '4867.084')] [2023-03-10 22:13:42,637][1096443] Updated weights for policy 0, policy_version 189440 (0.0004) [2023-03-10 22:13:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.9, 300 sec: 11468.8). Total num frames: 97017856. Throughput: 0: 11736.7. Samples: 97001336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:13:44,742][1096160] Avg episode reward: [(0, '4863.593')] [2023-03-10 22:13:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000189488_97017856.pth... [2023-03-10 22:13:44,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000188800_96665600.pth [2023-03-10 22:13:46,044][1096443] Updated weights for policy 0, policy_version 189520 (0.0005) [2023-03-10 22:13:49,652][1096443] Updated weights for policy 0, policy_version 189600 (0.0005) [2023-03-10 22:13:49,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11673.6, 300 sec: 11454.9). Total num frames: 97075200. Throughput: 0: 11692.3. Samples: 97071000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:13:49,742][1096160] Avg episode reward: [(0, '4860.267')] [2023-03-10 22:13:53,015][1096443] Updated weights for policy 0, policy_version 189680 (0.0005) [2023-03-10 22:13:54,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11741.9, 300 sec: 11441.0). Total num frames: 97132544. Throughput: 0: 11716.9. Samples: 97106908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:13:54,742][1096160] Avg episode reward: [(0, '4865.230')] [2023-03-10 22:13:56,557][1096443] Updated weights for policy 0, policy_version 189760 (0.0005) [2023-03-10 22:13:59,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11673.6, 300 sec: 11454.9). Total num frames: 97189888. Throughput: 0: 11709.2. Samples: 97177384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:13:59,742][1096160] Avg episode reward: [(0, '4861.076')] [2023-03-10 22:13:59,768][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000189832_97193984.pth... [2023-03-10 22:13:59,770][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000189144_96841728.pth [2023-03-10 22:14:00,086][1096443] Updated weights for policy 0, policy_version 189840 (0.0005) [2023-03-10 22:14:03,782][1096443] Updated weights for policy 0, policy_version 189920 (0.0006) [2023-03-10 22:14:04,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11468.8). Total num frames: 97247232. Throughput: 0: 11650.2. Samples: 97243904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:04,742][1096160] Avg episode reward: [(0, '4864.142')] [2023-03-10 22:14:07,264][1096443] Updated weights for policy 0, policy_version 190000 (0.0005) [2023-03-10 22:14:09,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11741.8, 300 sec: 11482.7). Total num frames: 97308672. Throughput: 0: 11650.7. Samples: 97279996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:09,742][1096160] Avg episode reward: [(0, '4863.847')] [2023-03-10 22:14:10,660][1096443] Updated weights for policy 0, policy_version 190080 (0.0005) [2023-03-10 22:14:14,421][1096443] Updated weights for policy 0, policy_version 190160 (0.0005) [2023-03-10 22:14:14,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11605.4, 300 sec: 11468.8). Total num frames: 97361920. Throughput: 0: 11577.3. Samples: 97346436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:14,742][1096160] Avg episode reward: [(0, '4862.609')] [2023-03-10 22:14:14,770][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000190168_97366016.pth... [2023-03-10 22:14:14,771][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000189488_97017856.pth [2023-03-10 22:14:17,810][1096443] Updated weights for policy 0, policy_version 190240 (0.0005) [2023-03-10 22:14:19,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11673.6, 300 sec: 11482.7). Total num frames: 97423360. Throughput: 0: 11652.5. Samples: 97419336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:19,800][1096160] Avg episode reward: [(0, '4864.578')] [2023-03-10 22:14:21,214][1096443] Updated weights for policy 0, policy_version 190320 (0.0005) [2023-03-10 22:14:24,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11482.7). Total num frames: 97480704. Throughput: 0: 11619.1. Samples: 97450816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:24,852][1096160] Avg episode reward: [(0, '4859.264')] [2023-03-10 22:14:24,964][1096443] Updated weights for policy 0, policy_version 190400 (0.0005) [2023-03-10 22:14:28,568][1096443] Updated weights for policy 0, policy_version 190480 (0.0005) [2023-03-10 22:14:29,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11482.7). Total num frames: 97538048. Throughput: 0: 11563.0. Samples: 97521672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:29,743][1096160] Avg episode reward: [(0, '4861.212')] [2023-03-10 22:14:29,747][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000190504_97538048.pth... [2023-03-10 22:14:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000189832_97193984.pth [2023-03-10 22:14:32,207][1096443] Updated weights for policy 0, policy_version 190560 (0.0005) [2023-03-10 22:14:34,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11605.3, 300 sec: 11482.7). Total num frames: 97595392. Throughput: 0: 11546.6. Samples: 97590596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:34,742][1096160] Avg episode reward: [(0, '4859.626')] [2023-03-10 22:14:35,822][1096443] Updated weights for policy 0, policy_version 190640 (0.0005) [2023-03-10 22:14:39,537][1096443] Updated weights for policy 0, policy_version 190720 (0.0005) [2023-03-10 22:14:39,742][1096160] Fps is (10 sec: 11059.3, 60 sec: 11605.3, 300 sec: 11468.8). Total num frames: 97648640. Throughput: 0: 11490.3. Samples: 97623972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:40,216][1096160] Avg episode reward: [(0, '4864.840')] [2023-03-10 22:14:43,555][1096443] Updated weights for policy 0, policy_version 190800 (0.0004) [2023-03-10 22:14:44,742][1096160] Fps is (10 sec: 10649.5, 60 sec: 11400.5, 300 sec: 11468.8). Total num frames: 97701888. Throughput: 0: 11291.7. Samples: 97685512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:44,742][1096160] Avg episode reward: [(0, '4865.326')] [2023-03-10 22:14:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000190824_97701888.pth... [2023-03-10 22:14:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000190168_97366016.pth [2023-03-10 22:14:47,314][1096443] Updated weights for policy 0, policy_version 190880 (0.0005) [2023-03-10 22:14:49,742][1096160] Fps is (10 sec: 10649.5, 60 sec: 11332.3, 300 sec: 11441.0). Total num frames: 97755136. Throughput: 0: 11297.8. Samples: 97752308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:49,742][1096160] Avg episode reward: [(0, '4860.646')] [2023-03-10 22:14:50,947][1096443] Updated weights for policy 0, policy_version 190960 (0.0005) [2023-03-10 22:14:54,742][1096160] Fps is (10 sec: 9420.9, 60 sec: 11059.2, 300 sec: 11399.4). Total num frames: 97796096. Throughput: 0: 10924.4. Samples: 97771592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:54,742][1096160] Avg episode reward: [(0, '4862.649')] [2023-03-10 22:14:55,952][1096443] Updated weights for policy 0, policy_version 191040 (0.0005) [2023-03-10 22:14:59,098][1096443] Updated weights for policy 0, policy_version 191120 (0.0004) [2023-03-10 22:14:59,742][1096160] Fps is (10 sec: 10649.6, 60 sec: 11195.7, 300 sec: 11427.1). Total num frames: 97861632. Throughput: 0: 10957.6. Samples: 97839528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:14:59,742][1096160] Avg episode reward: [(0, '4862.605')] [2023-03-10 22:14:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000191136_97861632.pth... [2023-03-10 22:15:00,498][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000190504_97538048.pth [2023-03-10 22:15:03,194][1096443] Updated weights for policy 0, policy_version 191200 (0.0005) [2023-03-10 22:15:04,742][1096160] Fps is (10 sec: 11059.2, 60 sec: 10990.9, 300 sec: 11385.5). Total num frames: 97906688. Throughput: 0: 10755.9. Samples: 97903352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:04,742][1096160] Avg episode reward: [(0, '4864.265')] [2023-03-10 22:15:06,705][1096443] Updated weights for policy 0, policy_version 191280 (0.0005) [2023-03-10 22:15:09,742][1096160] Fps is (10 sec: 10649.7, 60 sec: 10991.0, 300 sec: 11413.3). Total num frames: 97968128. Throughput: 0: 10880.3. Samples: 97940428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:09,742][1096160] Avg episode reward: [(0, '4865.577')] [2023-03-10 22:15:10,391][1096443] Updated weights for policy 0, policy_version 191360 (0.0006) [2023-03-10 22:15:13,286][1096443] Updated weights for policy 0, policy_version 191440 (0.0004) [2023-03-10 22:15:14,742][1096160] Fps is (10 sec: 12287.8, 60 sec: 11127.4, 300 sec: 11427.1). Total num frames: 98029568. Throughput: 0: 11010.6. Samples: 98017148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:14,742][1096160] Avg episode reward: [(0, '4864.074')] [2023-03-10 22:15:14,777][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000191472_98033664.pth... [2023-03-10 22:15:14,778][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000190824_97701888.pth [2023-03-10 22:15:16,976][1096443] Updated weights for policy 0, policy_version 191520 (0.0005) [2023-03-10 22:15:20,033][1096443] Updated weights for policy 0, policy_version 191600 (0.0004) [2023-03-10 22:15:19,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11127.5, 300 sec: 11441.0). Total num frames: 98091008. Throughput: 0: 11027.4. Samples: 98086828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:20,591][1096160] Avg episode reward: [(0, '4862.293')] [2023-03-10 22:15:24,152][1096443] Updated weights for policy 0, policy_version 191680 (0.0005) [2023-03-10 22:15:24,742][1096160] Fps is (10 sec: 11059.4, 60 sec: 10990.9, 300 sec: 11399.4). Total num frames: 98140160. Throughput: 0: 10969.5. Samples: 98117600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:25,121][1096160] Avg episode reward: [(0, '4867.553')] [2023-03-10 22:15:28,515][1096443] Updated weights for policy 0, policy_version 191760 (0.0005) [2023-03-10 22:15:29,742][1096160] Fps is (10 sec: 10239.9, 60 sec: 10922.7, 300 sec: 11385.5). Total num frames: 98193408. Throughput: 0: 10946.9. Samples: 98178124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:29,742][1096160] Avg episode reward: [(0, '4865.461')] [2023-03-10 22:15:29,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000191784_98193408.pth... [2023-03-10 22:15:29,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000191136_97861632.pth [2023-03-10 22:15:31,831][1096443] Updated weights for policy 0, policy_version 191840 (0.0005) [2023-03-10 22:15:34,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 10990.9, 300 sec: 11399.4). Total num frames: 98254848. Throughput: 0: 11058.0. Samples: 98249916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:34,742][1096160] Avg episode reward: [(0, '4863.512')] [2023-03-10 22:15:35,479][1096443] Updated weights for policy 0, policy_version 191920 (0.0006) [2023-03-10 22:15:38,824][1096443] Updated weights for policy 0, policy_version 192000 (0.0005) [2023-03-10 22:15:39,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11059.2, 300 sec: 11399.4). Total num frames: 98312192. Throughput: 0: 11390.0. Samples: 98284144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:39,742][1096160] Avg episode reward: [(0, '4864.471')] [2023-03-10 22:15:42,337][1096443] Updated weights for policy 0, policy_version 192080 (0.0005) [2023-03-10 22:15:44,742][1096160] Fps is (10 sec: 11468.7, 60 sec: 11127.5, 300 sec: 11385.5). Total num frames: 98369536. Throughput: 0: 11475.1. Samples: 98355908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:44,742][1096160] Avg episode reward: [(0, '4865.357')] [2023-03-10 22:15:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000192128_98369536.pth... [2023-03-10 22:15:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000191472_98033664.pth [2023-03-10 22:15:45,980][1096443] Updated weights for policy 0, policy_version 192160 (0.0005) [2023-03-10 22:15:49,510][1096443] Updated weights for policy 0, policy_version 192240 (0.0005) [2023-03-10 22:15:49,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11195.7, 300 sec: 11399.4). Total num frames: 98426880. Throughput: 0: 11544.5. Samples: 98422856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:49,742][1096160] Avg episode reward: [(0, '4864.869')] [2023-03-10 22:15:53,042][1096443] Updated weights for policy 0, policy_version 192320 (0.0005) [2023-03-10 22:15:54,742][1096160] Fps is (10 sec: 11468.9, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 98484224. Throughput: 0: 11535.1. Samples: 98459508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:54,742][1096160] Avg episode reward: [(0, '4859.728')] [2023-03-10 22:15:56,404][1096443] Updated weights for policy 0, policy_version 192400 (0.0005) [2023-03-10 22:15:59,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 11400.5, 300 sec: 11413.3). Total num frames: 98545664. Throughput: 0: 11386.2. Samples: 98529528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:15:59,742][1096160] Avg episode reward: [(0, '4864.103')] [2023-03-10 22:15:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000192472_98545664.pth... [2023-03-10 22:15:59,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000191784_98193408.pth [2023-03-10 22:16:00,056][1096443] Updated weights for policy 0, policy_version 192480 (0.0005) [2023-03-10 22:16:03,240][1096443] Updated weights for policy 0, policy_version 192560 (0.0005) [2023-03-10 22:16:04,741][1096160] Fps is (10 sec: 12288.1, 60 sec: 11673.6, 300 sec: 11427.1). Total num frames: 98607104. Throughput: 0: 11469.8. Samples: 98602968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:04,742][1096160] Avg episode reward: [(0, '4864.268')] [2023-03-10 22:16:06,303][1096443] Updated weights for policy 0, policy_version 192640 (0.0004) [2023-03-10 22:16:09,256][1096443] Updated weights for policy 0, policy_version 192720 (0.0004) [2023-03-10 22:16:09,742][1096160] Fps is (10 sec: 12697.7, 60 sec: 11741.9, 300 sec: 11454.9). Total num frames: 98672640. Throughput: 0: 11690.9. Samples: 98643692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:09,742][1096160] Avg episode reward: [(0, '4865.827')] [2023-03-10 22:16:12,907][1096443] Updated weights for policy 0, policy_version 192800 (0.0004) [2023-03-10 22:16:14,742][1096160] Fps is (10 sec: 12288.0, 60 sec: 11673.6, 300 sec: 11454.9). Total num frames: 98729984. Throughput: 0: 11989.6. Samples: 98717656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:14,742][1096160] Avg episode reward: [(0, '4860.631')] [2023-03-10 22:16:14,756][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000192840_98734080.pth... [2023-03-10 22:16:14,757][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000192128_98369536.pth [2023-03-10 22:16:16,534][1096443] Updated weights for policy 0, policy_version 192880 (0.0005) [2023-03-10 22:16:19,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11605.3, 300 sec: 11454.9). Total num frames: 98787328. Throughput: 0: 11887.1. Samples: 98784836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:19,742][1096160] Avg episode reward: [(0, '4860.334')] [2023-03-10 22:16:20,174][1096443] Updated weights for policy 0, policy_version 192960 (0.0004) [2023-03-10 22:16:24,742][1096160] Fps is (10 sec: 9830.4, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 98828288. Throughput: 0: 11454.9. Samples: 98799616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:24,742][1096160] Avg episode reward: [(0, '4864.870')] [2023-03-10 22:16:25,194][1096443] Updated weights for policy 0, policy_version 193040 (0.0005) [2023-03-10 22:16:29,296][1096443] Updated weights for policy 0, policy_version 193120 (0.0004) [2023-03-10 22:16:29,742][1096160] Fps is (10 sec: 9420.8, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 98881536. Throughput: 0: 11316.7. Samples: 98865160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:30,243][1096160] Avg episode reward: [(0, '4864.670')] [2023-03-10 22:16:30,246][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000193144_98889728.pth... [2023-03-10 22:16:30,249][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000192472_98545664.pth [2023-03-10 22:16:32,259][1096443] Updated weights for policy 0, policy_version 193200 (0.0004) [2023-03-10 22:16:34,742][1096160] Fps is (10 sec: 11468.8, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 98942976. Throughput: 0: 11485.5. Samples: 98939704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:34,742][1096160] Avg episode reward: [(0, '4862.045')] [2023-03-10 22:16:35,850][1096443] Updated weights for policy 0, policy_version 193280 (0.0004) [2023-03-10 22:16:39,069][1096443] Updated weights for policy 0, policy_version 193360 (0.0005) [2023-03-10 22:16:39,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 11468.8, 300 sec: 11399.4). Total num frames: 99000320. Throughput: 0: 11471.8. Samples: 98975740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:39,742][1096160] Avg episode reward: [(0, '4864.457')] [2023-03-10 22:16:42,835][1096443] Updated weights for policy 0, policy_version 193440 (0.0005) [2023-03-10 22:16:44,742][1096160] Fps is (10 sec: 11878.5, 60 sec: 11537.1, 300 sec: 11427.1). Total num frames: 99061760. Throughput: 0: 11480.5. Samples: 99046148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:44,742][1096160] Avg episode reward: [(0, '4865.735')] [2023-03-10 22:16:44,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000193480_99061760.pth... [2023-03-10 22:16:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000192840_98734080.pth [2023-03-10 22:16:46,172][1096443] Updated weights for policy 0, policy_version 193520 (0.0005) [2023-03-10 22:16:49,699][1096443] Updated weights for policy 0, policy_version 193600 (0.0005) [2023-03-10 22:16:49,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11605.3, 300 sec: 11427.1). Total num frames: 99123200. Throughput: 0: 11421.5. Samples: 99116936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:49,742][1096160] Avg episode reward: [(0, '4863.913')] [2023-03-10 22:16:52,790][1096443] Updated weights for policy 0, policy_version 193680 (0.0004) [2023-03-10 22:16:54,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 11605.3, 300 sec: 11427.1). Total num frames: 99180544. Throughput: 0: 11357.5. Samples: 99154780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:54,742][1096160] Avg episode reward: [(0, '4861.334')] [2023-03-10 22:16:56,527][1096443] Updated weights for policy 0, policy_version 193760 (0.0005) [2023-03-10 22:16:59,243][1096443] Updated weights for policy 0, policy_version 193840 (0.0004) [2023-03-10 22:16:59,741][1096160] Fps is (10 sec: 12697.8, 60 sec: 11741.9, 300 sec: 11468.8). Total num frames: 99250176. Throughput: 0: 11402.9. Samples: 99230784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:16:59,742][1096160] Avg episode reward: [(0, '4862.224')] [2023-03-10 22:16:59,744][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000193848_99250176.pth... [2023-03-10 22:16:59,747][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000193144_98889728.pth [2023-03-10 22:17:02,732][1096443] Updated weights for policy 0, policy_version 193920 (0.0005) [2023-03-10 22:17:04,741][1096160] Fps is (10 sec: 12697.6, 60 sec: 11673.6, 300 sec: 11468.8). Total num frames: 99307520. Throughput: 0: 11545.5. Samples: 99304384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:04,742][1096160] Avg episode reward: [(0, '4861.933')] [2023-03-10 22:17:06,256][1096443] Updated weights for policy 0, policy_version 194000 (0.0005) [2023-03-10 22:17:09,319][1096443] Updated weights for policy 0, policy_version 194080 (0.0005) [2023-03-10 22:17:09,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11673.6, 300 sec: 11496.6). Total num frames: 99373056. Throughput: 0: 12016.3. Samples: 99340352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:09,742][1096160] Avg episode reward: [(0, '4861.711')] [2023-03-10 22:17:12,892][1096443] Updated weights for policy 0, policy_version 194160 (0.0006) [2023-03-10 22:17:14,742][1096160] Fps is (10 sec: 12287.9, 60 sec: 11673.6, 300 sec: 11510.5). Total num frames: 99430400. Throughput: 0: 12197.2. Samples: 99414032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:14,742][1096160] Avg episode reward: [(0, '4862.094')] [2023-03-10 22:17:14,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000194200_99430400.pth... [2023-03-10 22:17:14,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000193480_99061760.pth [2023-03-10 22:17:16,540][1096443] Updated weights for policy 0, policy_version 194240 (0.0005) [2023-03-10 22:17:19,741][1096160] Fps is (10 sec: 11469.0, 60 sec: 11673.6, 300 sec: 11510.5). Total num frames: 99487744. Throughput: 0: 12089.3. Samples: 99483720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:19,742][1096160] Avg episode reward: [(0, '4859.682')] [2023-03-10 22:17:19,842][1096443] Updated weights for policy 0, policy_version 194320 (0.0005) [2023-03-10 22:17:23,319][1096443] Updated weights for policy 0, policy_version 194400 (0.0005) [2023-03-10 22:17:24,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11524.3). Total num frames: 99545088. Throughput: 0: 12103.3. Samples: 99520388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:24,742][1096160] Avg episode reward: [(0, '4863.938')] [2023-03-10 22:17:26,720][1096443] Updated weights for policy 0, policy_version 194480 (0.0005) [2023-03-10 22:17:29,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11538.2). Total num frames: 99606528. Throughput: 0: 12129.4. Samples: 99591972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:29,742][1096160] Avg episode reward: [(0, '4865.196')] [2023-03-10 22:17:29,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000194544_99606528.pth... [2023-03-10 22:17:29,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000193848_99250176.pth [2023-03-10 22:17:30,335][1096443] Updated weights for policy 0, policy_version 194560 (0.0005) [2023-03-10 22:17:33,871][1096443] Updated weights for policy 0, policy_version 194640 (0.0005) [2023-03-10 22:17:34,741][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11552.1). Total num frames: 99663872. Throughput: 0: 12064.6. Samples: 99659840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:34,742][1096160] Avg episode reward: [(0, '4865.606')] [2023-03-10 22:17:37,200][1096443] Updated weights for policy 0, policy_version 194720 (0.0005) [2023-03-10 22:17:39,742][1096160] Fps is (10 sec: 11878.3, 60 sec: 12083.2, 300 sec: 11566.0). Total num frames: 99725312. Throughput: 0: 12042.7. Samples: 99696704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:39,742][1096160] Avg episode reward: [(0, '4863.767')] [2023-03-10 22:17:40,795][1096443] Updated weights for policy 0, policy_version 194800 (0.0004) [2023-03-10 22:17:44,351][1096443] Updated weights for policy 0, policy_version 194880 (0.0005) [2023-03-10 22:17:44,742][1096160] Fps is (10 sec: 11878.4, 60 sec: 12014.9, 300 sec: 11552.1). Total num frames: 99782656. Throughput: 0: 11870.1. Samples: 99764940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:44,742][1096160] Avg episode reward: [(0, '4865.073')] [2023-03-10 22:17:44,745][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000194888_99782656.pth... [2023-03-10 22:17:44,748][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000194200_99430400.pth [2023-03-10 22:17:47,904][1096443] Updated weights for policy 0, policy_version 194960 (0.0005) [2023-03-10 22:17:49,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11566.0). Total num frames: 99840000. Throughput: 0: 11801.0. Samples: 99835428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:49,742][1096160] Avg episode reward: [(0, '4864.129')] [2023-03-10 22:17:51,451][1096443] Updated weights for policy 0, policy_version 195040 (0.0005) [2023-03-10 22:17:54,741][1096160] Fps is (10 sec: 11468.9, 60 sec: 11946.7, 300 sec: 11552.1). Total num frames: 99897344. Throughput: 0: 11747.8. Samples: 99869000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:54,742][1096160] Avg episode reward: [(0, '4859.233')] [2023-03-10 22:17:54,872][1096443] Updated weights for policy 0, policy_version 195120 (0.0004) [2023-03-10 22:17:58,256][1096443] Updated weights for policy 0, policy_version 195200 (0.0005) [2023-03-10 22:17:59,742][1096160] Fps is (10 sec: 11878.2, 60 sec: 11810.1, 300 sec: 11566.0). Total num frames: 99958784. Throughput: 0: 11740.8. Samples: 99942368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-03-10 22:17:59,742][1096160] Avg episode reward: [(0, '4861.204')] [2023-03-10 22:17:59,746][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000195232_99958784.pth... [2023-03-10 22:17:59,749][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000194544_99606528.pth [2023-03-10 22:18:01,745][1096443] Updated weights for policy 0, policy_version 195280 (0.0005) [2023-03-10 22:18:03,553][1096399] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-10 22:18:03,860][1096399] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 [2023-03-10 22:18:03,861][1096450] Stopping RolloutWorker_w6... [2023-03-10 22:18:03,861][1096444] Stopping RolloutWorker_w1... [2023-03-10 22:18:03,861][1096445] Stopping RolloutWorker_w3... [2023-03-10 22:18:03,861][1096495] Stopping RolloutWorker_w7... [2023-03-10 22:18:03,861][1096449] Stopping RolloutWorker_w4... [2023-03-10 22:18:03,861][1096447] Stopping RolloutWorker_w2... [2023-03-10 22:18:03,861][1096446] Stopping RolloutWorker_w5... [2023-03-10 22:18:03,861][1096444] Loop rollout_proc1_evt_loop terminating... [2023-03-10 22:18:03,861][1096495] Loop rollout_proc7_evt_loop terminating... [2023-03-10 22:18:03,861][1096450] Loop rollout_proc6_evt_loop terminating... [2023-03-10 22:18:03,861][1096445] Loop rollout_proc3_evt_loop terminating... [2023-03-10 22:18:03,861][1096399] Stopping Batcher_0... [2023-03-10 22:18:03,861][1096448] Stopping RolloutWorker_w0... [2023-03-10 22:18:03,861][1096160] Component RolloutWorker_w6 stopped! [2023-03-10 22:18:03,861][1096447] Loop rollout_proc2_evt_loop terminating... [2023-03-10 22:18:03,861][1096446] Loop rollout_proc5_evt_loop terminating... [2023-03-10 22:18:03,861][1096449] Loop rollout_proc4_evt_loop terminating... [2023-03-10 22:18:03,861][1096399] Loop batcher_evt_loop terminating... [2023-03-10 22:18:03,861][1096448] Loop rollout_proc0_evt_loop terminating... [2023-03-10 22:18:03,861][1096160] Component RolloutWorker_w1 stopped! [2023-03-10 22:18:03,862][1096160] Component RolloutWorker_w3 stopped! [2023-03-10 22:18:03,862][1096160] Component RolloutWorker_w4 stopped! [2023-03-10 22:18:03,862][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-10 22:18:03,862][1096160] Component RolloutWorker_w2 stopped! [2023-03-10 22:18:03,862][1096160] Component RolloutWorker_w5 stopped! [2023-03-10 22:18:03,862][1096160] Component RolloutWorker_w7 stopped! [2023-03-10 22:18:03,863][1096160] Component RolloutWorker_w0 stopped! [2023-03-10 22:18:03,863][1096160] Component Batcher_0 stopped! [2023-03-10 22:18:03,864][1096399] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000194888_99782656.pth [2023-03-10 22:18:03,865][1096399] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/reach-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... [2023-03-10 22:18:03,867][1096399] Stopping LearnerWorker_p0... [2023-03-10 22:18:03,868][1096399] Loop learner_proc0_evt_loop terminating... [2023-03-10 22:18:03,868][1096160] Component LearnerWorker_p0 stopped! [2023-03-10 22:18:03,921][1096443] Weights refcount: 2 0 [2023-03-10 22:18:03,922][1096443] Stopping InferenceWorker_p0-w0... [2023-03-10 22:18:03,922][1096443] Loop inference_proc0-0_evt_loop terminating... [2023-03-10 22:18:03,922][1096160] Component InferenceWorker_p0-w0 stopped! [2023-03-10 22:18:03,923][1096160] Waiting for process learner_proc0 to stop... [2023-03-10 22:18:06,388][1096160] Waiting for process inference_proc0-0 to join... [2023-03-10 22:18:06,403][1096160] Waiting for process rollout_proc0 to join... [2023-03-10 22:18:06,408][1096160] Waiting for process rollout_proc1 to join... [2023-03-10 22:18:06,411][1096160] Waiting for process rollout_proc2 to join... [2023-03-10 22:18:06,412][1096160] Waiting for process rollout_proc3 to join... [2023-03-10 22:18:06,412][1096160] Waiting for process rollout_proc4 to join... [2023-03-10 22:18:06,412][1096160] Waiting for process rollout_proc5 to join... [2023-03-10 22:18:06,413][1096160] Waiting for process rollout_proc6 to join... [2023-03-10 22:18:06,413][1096160] Waiting for process rollout_proc7 to join... [2023-03-10 22:18:06,413][1096160] Batcher 0 profile tree view: batching: 17.2663, releasing_batches: 15.0437 [2023-03-10 22:18:06,414][1096160] InferenceWorker_p0-w0 profile tree view: wait_policy: 0.0051 wait_policy_total: 2914.4646 update_model: 107.0092 weight_update: 0.0004 one_step: 0.0007 handle_policy_step: 4810.9336 deserialize: 205.6029, stack: 48.4944, obs_to_device_normalize: 837.7748, forward: 2369.8099, send_messages: 415.4985 prepare_outputs: 525.6058 to_cpu: 78.8770 [2023-03-10 22:18:06,414][1096160] Learner 0 profile tree view: misc: 0.1035, prepare_batch: 79.8003 train: 1029.5858 epoch_init: 0.3686, minibatch_init: 10.9723, losses_postprocess: 12.2165, kl_divergence: 3.9742, after_optimizer: 5.7839 calculate_losses: 419.4263 losses_init: 0.3029, forward_head: 202.3505, bptt_initial: 1.1295, bptt: 1.1582, tail: 103.2040, advantages_returns: 8.2801, losses: 90.2101 update: 562.1959 clip: 51.4604 [2023-03-10 22:18:06,414][1096160] RolloutWorker_w0 profile tree view: wait_for_trajectories: 4.4499, enqueue_policy_requests: 151.9760, env_step: 4309.1282, overhead: 338.8203, complete_rollouts: 3.8055 save_policy_outputs: 376.4760 split_output_tensors: 185.3736 [2023-03-10 22:18:06,414][1096160] RolloutWorker_w7 profile tree view: wait_for_trajectories: 4.2111, enqueue_policy_requests: 150.2908, env_step: 4287.7057, overhead: 333.8058, complete_rollouts: 3.6615 save_policy_outputs: 373.6110 split_output_tensors: 181.5592 [2023-03-10 22:18:06,415][1096160] Loop Runner_EvtLoop terminating... [2023-03-10 22:18:06,415][1096160] Runner profile tree view: main_loop: 8404.3527 [2023-03-10 22:18:06,415][1096160] Collected {0: 100007936}, FPS: 11899.5